A generalized neighborhood concept is presented which extends the usual techniques for feature extraction using parameter transforms. Generalized neighborhoods allow operators to use the joint information contained in...
详细信息
ISBN:
(纸本)081861952x
A generalized neighborhood concept is presented which extends the usual techniques for feature extraction using parameter transforms. Generalized neighborhoods allow operators to use the joint information contained in distant portions of the same feature;i.e., to utilize the long-distance correlation present in the image. The generalized neighborhood techniques, by correlating local information over different portions of the image, produce up to two orders of magnitude improvement in accuracy over conventional techniques. Unfortunately, the response also becomes more complicated;false features may be detected due to a peculiar form of correlated noise. A general framework, motivated by connectionist networks, is presented which eliminates this behavior by introducing competitive processes in the parameter spaces. A novel approach to the generation of lateral inhibition links in the networks is proposed which is consistent with generalized neighborhoods. Experiments are provided that show results on range data. Complex surfaces and 3-D surface-intersection curves are reconstructed from the data.
A correspondence method is developed for determining optical flow where the primitive motion tokens to be matched between consecutive time frames are regions. The computation of optical flow consists of three stages: ...
详细信息
ISBN:
(纸本)081861952x
A correspondence method is developed for determining optical flow where the primitive motion tokens to be matched between consecutive time frames are regions. The computation of optical flow consists of three stages: region extraction, region matching, and optical flow smoothing. For region extraction, in each image frame the regions are extracted either from the sign of the bandpass operator or by thresholding the output of morphological image transformations for peak/valley detection. For region matching, a general correspondence approach is applied to region tokens by using an affinity measure based on region features. Optical flow is then identified as the spatial vector displacements among centroids of corresponding regions. The computation is completed by smoothing the initial optical flow, where the sparse velocity data are either a smoothed with a vector median filter or interpolated to obtain dense velocity estimates by using a motion-coherence regularization. The proposed region-based method for optical flow is simple, computationally efficient, and (as experiments on real images indicate) more robust than iterative gradient methods, especially for medium-range motion.
A scheme is developed to match range images in an environment where distinctive features are scarce. When each image overlaps with several other images, the match must also be performed at the global level. This is pa...
详细信息
ISBN:
(纸本)081861952x
A scheme is developed to match range images in an environment where distinctive features are scarce. When each image overlaps with several other images, the match must also be performed at the global level. This is particularly challenging, because of the possibility of bending and compression in range images (i.e., the relative position of data points on the image reference surface may be inaccurate). The primitives used for local matching contours of constant range which are extracted from data are represented by means of a modified chain-code method. All best matches of pairs of contours are considered tentative until their geometrical implications are evaluated and a consistent majority has emerged. To do global matching a cost function is constructed and minimized. Terms contributing to the cost include violation of local matches as well as compression and bending in range images. The present global scheme is valid for any set of multiple overlapping range images, they contain distinctive features or not. This scheme has been used to map the floor of the ocean, where the range data are obtained by a multibeam echo-sounder system.
A unified approach for modeling objects which are imaged by thermal (infrared) and visual cameras is presented. The model supports the generation of both infrared (8-μm-12-μm wavelength) images and monochrome visual...
详细信息
ISBN:
(纸本)081861952x
A unified approach for modeling objects which are imaged by thermal (infrared) and visual cameras is presented. The model supports the generation of both infrared (8-μm-12-μm wavelength) images and monochrome visual images under different viewing and ambient-scene conditions. A modified octree data structure is used for object modeling. The octree serves two different purposes: (1) surface information encoded in boundary nodes and efficient tree-traversal algorithms facilitate the generation of monochrome visual images;and (2) the compact volumetric representation facilitates simulation of heat flow in the object which gives rise to surface temperature variation, which in turn is used to synthesize the thermal image. The detailed object model allows for more accurate prediction of thermal and visual images of objects. It also predicts the values of discriminatory features used in classification. The model developed is designed to be used in a model-based vision system which uses a hypothesize-and-verify strategy to interpret thermal and visual images of scenes. Several blocks-world examples are presented to show typical images generated by the approach.
Interpreting uncertain information, a fundamental requirement of many computer-vision and patternrecognition systems is commonly supported by models of the uncertainty. Evidence theory, also called Dempster-Shafer th...
详细信息
ISBN:
(纸本)081861952x
Interpreting uncertain information, a fundamental requirement of many computer-vision and patternrecognition systems is commonly supported by models of the uncertainty. Evidence theory, also called Dempster-Shafer theory, is particularly useful for representing and combining uncertain information when a single precise uncertainty model is unavailable. A framework is presented for deriving and transforming evidence-theoretic belief representations of uncertain variables that denote numerical quantities. Belief is derived from probabilistic models using relationships between probability bounds and the support and plausibility functions used in evidence theory. This model-based approach to belief representation is illustrated by an algorithm currently used in a vision system to label anomalous high-intensity pixels in imagery. As the uncertain variables are manipulated to form features and object discriminants, the belief representation of the uncertain variables must be transformed accordingly. Belief transformations, analogous to the transformation of probability-density functions in mappings of random variables, are derived to maintain the same rigorous belief representation for computed quantities. The results demonstrate novel ways to address uncertainty in the use of sensor information, and contribute to understanding of the similarities and distinctions of probability theory and evidence theory.
The author analyzes a stereo method which determines lines in space from the intersection of projected planar sheets. Object descriptions are built from information about linear features instead of by points. It is sh...
详细信息
ISBN:
(纸本)081861952x
The author analyzes a stereo method which determines lines in space from the intersection of projected planar sheets. Object descriptions are built from information about linear features instead of by points. It is shown that there are major advantages to accurately determining the orientation of object lines and surfaces using this stereo method. In the absence of errors (apart from the baseline translation error), the measurement of the orientation of lines and surfaces from this stereo method is translation-invariant in the sense that the orientation measurement is completely independent of knowledge of the baseline. computer simulations of realistic imaging configurations show that, even in the presence of errors from other camera parameters, this stereo method is nearly translation-invariant and can far outperform stereo methods based on the absolute correspondence of points. Another advantage of determining the orientation of lines and surfaces from stereo using intersecting planes is that orientation errors do not grow rapidly as the object distance from the baseline increases.
The authors review their pseudo-one-subcycle parallel thinning algorithm. Y.S. Chen and W.H. Hsu present two improved versions of this algorithm, and describe a two-stage structure to realize the one-subcycle parallel...
详细信息
ISBN:
(纸本)081861952x
The authors review their pseudo-one-subcycle parallel thinning algorithm. Y.S. Chen and W.H. Hsu present two improved versions of this algorithm, and describe a two-stage structure to realize the one-subcycle parallel algorithm. The first stage is to produce a perfect 8-curve excluding T-junction thin line. The second is to obtain the isotropic skeleton of an L-shaped pattern. The two-stage structure consists of a thinning table and a control unit. The thinning table is used to provide the attributions for an input 3 × 3 local pattern. The control unit is used to check the removal of the center pixel of this local pattern, and the inputs of the control unit also comprise the outputs of other neighboring thinning tables. This structure can exactly implement the proposed one-subcycle parallel algorithms. The two improved algorithms have been implemented. Experiments confirm that the improved algorithms can produce the desired effective thin line and also show that the structure realized is feasible and practicable.
A novel memory system is proposed for imageprocessing. Latin squares, which are well known combinatorial objects, are used as the skew function of the memory system. A Latin square with desired properties for image-a...
详细信息
A novel memory system is proposed for imageprocessing. Latin squares, which are well known combinatorial objects, are used as the skew function of the memory system. A Latin square with desired properties for image-array access is introduced. The resulting memory system provides access to various subsets of image data (rows, columns, diagonals, main subsquares, etc.) without memory conflict. The memory modules are fully utilized for most frequently used subsets of image data. The address generation can be performed in constant time. This memory system achieves constant-time access to rows, columns, diagonals, and subarrays using a minimum number of memory modules.< >
An automatic and dynamic thresholding based on an illumination-independent contrast measure is proposed. In this algorithm, a local threshold is calculated for each segmented square region in an image. Since the thres...
详细信息
ISBN:
(纸本)081861952x
An automatic and dynamic thresholding based on an illumination-independent contrast measure is proposed. In this algorithm, a local threshold is calculated for each segmented square region in an image. Since the thresholds obtained in regions that include no characters or objects would produce only noise by thresholding, they are modified by regionwise interpolation based on the contrast measure. The contrast measure is introduced by an illumination reflectance image formation model proposed by T. G. Stockham where the gray-level function is expressed as the product of an illumination component and a reflectance component. The contrast measure is expressed by the ratio of the standard derivation to the mean of the object reflectance. It is found that, if the illumination function is smooth, the ratio can be calculated from just an image function and the characteristics of the video camera are without knowing about the shape of the illumination function. The characteristics are determined for two types of video cameras. The experimental results using 100 outdoor scene images for each camera show that (1) almost all the characters included in the images successfully binarized with the same parameters by this method;(2) this algorithm is independent of character or object size and its gray-level values;(3) the contrast measure is approximately independent of illumination;and (4) this method is applicable to real-world scene images.
The authors present a method for texture segmentation that does not assume any prior knowledge about either the type of textures or the number of textured regions present in the image. Local orientation and spatial fr...
详细信息
ISBN:
(纸本)081861952x
The authors present a method for texture segmentation that does not assume any prior knowledge about either the type of textures or the number of textured regions present in the image. Local orientation and spatial frequencies are used as the key parameters for classifying texture. The information is obtained by creating a local multifrequency multiorientation channel decomposition of the image, with the width of each frequency band constant on a logarithmic scale. This decomposition is implemented by applying a set of Gabor-like functions that were modified to have a decreased frequency selectivity when the filter's center frequency increases. This transformation is actually a wavelet transform and not a Gabor transform. The set of filter outputs is then used to create robust texture descriptors. The segmentation algorithm uses the similarity of the descriptors to determine the existence of texture regions and to outline their border rather than concentrating on segregating the textures. First, seed regions are found. Then, texture region borders are extended and refined through an iterative stage. An adaptive threshold based on specific region characteristics is used rather than a fixed threshold. The method has been applied to image containing natural textures, resulting in a good segmentation of the texture regions.
暂无评论