Almost all work on texture in the computervision and graphics communities has modeled the texture as tangential, i.e. lying in the tangent plane to the surface. This is equivalent to thinking of the texture as a patt...
详细信息
ISBN:
(纸本)0780342364
Almost all work on texture in the computervision and graphics communities has modeled the texture as tangential, i.e. lying in the tangent plane to the surface. This is equivalent to thinking of the texture as a pattern painted on the surface. Three-dimensional textures, where the elements may point out of the surface, have largely been ignored. We study a special class of 3D textures, perpendicular textures where we can model the elements as being normal to the surface. The perspective projection of perpendicularly textured surfaces results in several interesting phenomena, which do not occur in the much-studied tangential texture cease. These include occlusion, foreshortening and illumination. In this paper, we study the geometry of the problem, modeling the locations of the elements of the texture as being a realization of a spatial point process. Relations between slant and tilt of the surface, density and height of elements and occlusions are derived. Occlusions can now be used as a cue to infer shape, instead of being treated as a source of error.
We present a new approach to the tracking of very non rigid patterns of motion, such as water flowing down a stream. The algorithm is based on a ''disturbance map,'' which is obtained by linearly subtr...
详细信息
ISBN:
(纸本)0780342364
We present a new approach to the tracking of very non rigid patterns of motion, such as water flowing down a stream. The algorithm is based on a ''disturbance map,'' which is obtained by linearly subtracting the temporal average of the previous frames from the new frame. Every local motion creates a disturbance having the form of a wave, with a ''head'' at the present position of the motion and a historical ''tail'' that indicates the previous locations of that motion. These disturbances serve as loci of attraction for ''tracking particles'' that are scattered throughout the image. The algorithm is very fast and can be performed in real time. We provide excellent tracking results on various complex sequences, using both stabilized and moving cameras, showing: a busy ant column, waterfalls. rapids and. flowing streams, shoppers in a mall, and cars in a traffic intersection.
T. Simchony et al. (1990) proposed a semidirect method for computing area-based optical flow, based on the iterative application of a direct Poisson solver. This method is restricted to Dirichlet boundary conditions, ...
详细信息
A common factor in all illusory contour figures is the perception of a surface occluding part of a background. In our previous work, we have shown we could diffuse a proper set of junction hypothesis (what is salient ...
详细信息
ISBN:
(纸本)0818672587
A common factor in all illusory contour figures is the perception of a surface occluding part of a background. In our previous work, we have shown we could diffuse a proper set of junction hypothesis (what is salient or background) to obtain a surface where their boundaries represented illusory contours. Amodal completions emerge at the overlapping surfaces. We address the problem of selecting the best image organization (set of hypothesis). We propose an optimization criteria based on a coherence measure between pairs of junctions (correlation between the diffusion of each pair). A statistical physics approach to select the best organization is applied. The experiments suggest that despite the large number of possible organizations our approach may take only a few steps (in organization space) to select the best one.
An automatic target recognition (ATR) classifier is proposed that uses modularly cascaded vector quantizers (VQs) and multilayer perceptrons (MLPs). A dedicated VQ codebook is constructed for each target class at a sp...
详细信息
ISBN:
(纸本)0818672587
An automatic target recognition (ATR) classifier is proposed that uses modularly cascaded vector quantizers (VQs) and multilayer perceptrons (MLPs). A dedicated VQ codebook is constructed for each target class at a specific range of aspects, which is trained with the K-means algorithm and a modified learning vector quantization (LVQ) algorithm. Each final codebook is expected to give the lowest mean squared error (MSE) for its correct target class at a given range of aspects. These MSEs are then processed by an array of window MLPs and a target MLP consecutively. In the spatial domain target recognition rates of 90.3 and 65.3 percent are achieved for moderately and highly cluttered test sets, respectively. Using the wavelet decomposition with an adaptive and independent codebook per subband, the VQs alone have produced recognition rates of 98.7 and 69.0 percent on more challenging training and test sets, respectively.
This paper summarizes a novel logic-based approach to grouping and perceptual organization, (presented more thoroughly in [2]), and presents novel efficient methods for computing interpretations in this framework. Gro...
详细信息
ISBN:
(纸本)0780342364
This paper summarizes a novel logic-based approach to grouping and perceptual organization, (presented more thoroughly in [2]), and presents novel efficient methods for computing interpretations in this framework. Grouping interpretations are first defined as logical structures, built out of atomic premises (''regularities'') that are derived from considerations of non-accidentalness. These interpretations can then be partially ordered by their degree of regularity or constraint (measured numerically by their codimension). The Genericity Constraint-the principle that interpretations should minimize coincidences in the observed configuration-dictates that the preferred interpretation will be the minimum in this partial order, i.e. the interpretation with maximum codimension. The preferred interpretation, called the qualitative parse, corresponds neatly to the interpretation intuitively preferred ed by human observers. As a side-effect, the ''most salient'' or most structured part of the scene can be identified, as the highest-codimension subtree of the qualitative parse. An efficient (O(n(2))) method for computing the maximum codimension interpretation is presented, along with examples.
We present a neural network-based face detection system. A retinally connected neural network examines small windows of an image, and decides whether each window contains a face. The system arbitrates between multiple...
详细信息
ISBN:
(纸本)0818672587
We present a neural network-based face detection system. A retinally connected neural network examines small windows of an image, and decides whether each window contains a face. The system arbitrates between multiple networks to improve performance over a single network. We use a bootstrap algorithm for training the networks, which adds false detections into the training set as training progresses. This eliminates the difficult task of manually selecting non-face training examples, which must be chosen to span the entire space of non-face images. Comparisons with other state-of-the-art face detection systems are presented;our system has better performance in terms of detection and false-positive rates.
Many presume that parsing the shadows out of an image is a high-level task, because of the global nature of the shadow formation process. But shape-from-shading algorithms are low-level, in the sense that they seek so...
详细信息
ISBN:
(纸本)0818672587
Many presume that parsing the shadows out of an image is a high-level task, because of the global nature of the shadow formation process. But shape-from-shading algorithms are low-level, in the sense that they seek solutions (surface normals or depth values) directly from image intensities. A dilemma arises: since shape-from-shading involves an illumination term, shadows must first be identified. We show that a structure intermediate between intensities and surfaces - the shading flow field - provides a solution to this dilemma. Our analysis is based on the observation that the geometric information that can be derived from images supports different inferences than the photometric information, and our specific goal will be to articulate this geometric structure and to show how shading flow fields can be reliably computed.
作者:
Nair, H.N.Stewart, C.V.Electrical
Computer and Systems Eng. Dept. Rensselaer Polytechnic Institute TroyNY12180-3590 United States Computer Science Dept.
Rensselaer Polytechnic Institute TroyNY12180-3590 United States
Depth maps obtained from focus ranging can have numerous errors and distortions due to edge bleeding, feature shifts, image noise, and field curvature. An improved algorithm that examines an initial high depth-of-fiel...
详细信息
Inspired by the properties of the human visual system, a new active vision system called ESCHeR (Etl Stereo Compact Head For Robot vision) has been recently implemented with foveated wide-angle lenses. The lenses exhi...
详细信息
ISBN:
(纸本)0818672587
Inspired by the properties of the human visual system, a new active vision system called ESCHeR (Etl Stereo Compact Head For Robot vision) has been recently implemented with foveated wide-angle lenses. The lenses exhibit a wide field of view along with a space-varying resolution for facilitating both detection and close observation. However, to handle such optical properties and achieve basic eye movement functions, new calibration methods are needed. Therefore, two novel and online techniques are presented that in one case perform a global identification of the optical process through artificial neural techniques and in the other case compute the physical parameters by using environmental feature-tracking and controlled rotations of the cameras. Self-alignment of the cameras is also achieved using a similar technique.
暂无评论