ISBN: 0819410276 (print)
In the photometric stereo method, the surface orientation of an object is determined from multiple images (at least three). The images are obtained by changing the position of a light source, and the surface orientation can be determined only in the area of the object illuminated by all light sources. It is therefore desirable to reduce the number of light sources. We propose a method for determining surface orientation along with reflectance in which only two light sources are used. The object is assumed to be convex and to have a smooth Lambertian surface with locally constant reflectance. We have found that there exists a 'separation line' in the image, along which the surface normal is a linear combination of the two vectors pointing toward the light sources, and that this line separates the surface into two regions. Using the property of the separation line together with the convexity constraint, we can determine the surface orientation of the object whether or not the reflectance is known. Simulations carried out under various conditions yielded satisfactory results.
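For context, the conventional three-light baseline that the abstract contrasts itself with can be sketched as follows. This is not the two-light separation-line method proposed in the paper. It is a minimal illustration of the standard Lambertian model, in which the intensity under light direction l is I = rho * (l . n). The function names and the sample light directions are assumptions made for the example.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def photometric_stereo_pixel(lights, intensities):
    """Recover (albedo, unit normal) at one pixel from three images.

    lights: three unit light-direction vectors (rows of a 3x3 matrix).
    intensities: the three observed intensities; the pixel must be lit
    by all three sources (no shadowing), as the abstract notes.
    Solves L g = I for g = albedo * normal using Cramer's rule.
    """
    d = det3(lights)
    if abs(d) < 1e-12:
        raise ValueError("light directions are (nearly) coplanar")
    g = []
    for col in range(3):
        m = [list(row) for row in lights]
        for r in range(3):
            m[r][col] = intensities[r]
        g.append(det3(m) / d)
    albedo = math.sqrt(sum(c * c for c in g))
    if albedo == 0.0:
        raise ValueError("zero intensity in all images; normal undefined")
    return albedo, [c / albedo for c in g]

# Synthetic check: render a known normal and albedo, then recover them.
true_normal = [0.0, 0.6, 0.8]
true_albedo = 0.5
lights = [[0.0, 0.0, 1.0], [0.6, 0.0, 0.8], [0.0, 0.6, 0.8]]
intensities = [true_albedo * sum(l[i] * true_normal[i] for i in range(3))
               for l in lights]
albedo, normal = photometric_stereo_pixel(lights, intensities)
```

The requirement that all three intensities come from unshadowed pixels is what limits the recoverable area, which is the motivation the abstract gives for reducing the number of light sources.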
ISBN: 0819410276 (print)
We present a technique to guide vergence movements for an active stereo camera system and to construct dense disparity maps. Both processes are described in the same theoretical framework, based on phase differences in complex Gabor filter responses, which model receptive field properties in the visual cortex. While the camera movements are computed from input images at coarse spatial resolution, disparity calculation uses finer resolutions in a scale space. The correspondence problem is solved implicitly by restricting the disparity range around zero disparity to the filter kernel sizes (Panum's area in the human visual system). The method contrasts with matching algorithms, which require an explicit search for correspondence, and with correlation, which needs a maximum detection in the correlation function. The vergence process is interpreted as a mechanism to minimize global disparity, thereby setting a 3-D region of interest for subsequent disparity detection. This small volume is centered around the fixation point, where both optical axes intersect. Additionally, the vergence process produces a scalar distance measure via the vergence angles and the camera base. The disparity map represents smaller local disparities as an important cue for depth perception. The vergence control works in a real-time feedback loop. Quantitative results are presented.
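The phase-difference idea can be illustrated in one dimension. The sketch below is an assumption-laden toy, not the authors' implementation: it filters a left and a right scanline with the same complex Gabor kernel and divides the wrapped phase difference by the filter's tuning frequency. The names `gabor_response` and `phase_disparity`, the kernel parameters, and the test signal are all chosen for the example.

```python
import cmath
import math

def gabor_response(signal, x0, omega, sigma):
    """Complex Gabor filter response of a 1-D signal, centred at index x0.

    omega is the tuning frequency in radians per pixel and sigma the width
    of the Gaussian envelope in pixels; the kernel is truncated at 4 sigma.
    """
    half = int(4 * sigma)
    total = 0j
    for x in range(max(0, x0 - half), min(len(signal), x0 + half + 1)):
        u = x - x0
        envelope = math.exp(-u * u / (2.0 * sigma * sigma))
        total += signal[x] * envelope * cmath.exp(-1j * omega * u)
    return total

def phase_disparity(left, right, x0, omega, sigma):
    """Estimate the shift d, assuming right[x] is approximately left[x - d].

    The phase difference is wrapped to (-pi, pi], so only |d| < pi / omega
    can be recovered: the disparity range is tied to the filter scale and
    no explicit correspondence search is performed.
    """
    r_left = gabor_response(left, x0, omega, sigma)
    r_right = gabor_response(right, x0, omega, sigma)
    dphi = cmath.phase(r_left * r_right.conjugate())
    return dphi / omega

# Toy scanlines: the right one is the left one shifted by 3 pixels.
omega = 2.0 * math.pi / 16.0
left = [math.cos(omega * x + 0.3) for x in range(128)]
right = [math.cos(omega * (x - 3) + 0.3) for x in range(128)]
estimate = phase_disparity(left, right, 64, omega, 8.0)
```

Because the recoverable range shrinks as the tuning frequency grows, a coarse-to-fine arrangement like the one the abstract describes (coarse resolution for vergence, finer resolutions for the disparity map) is a natural fit for this kind of estimator.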
ISBN: 0819410276 (print)
Dynamic identification of temporally changing signals is a key issue in real-time signal processing and understanding. Such changing signals may arise from moving objects in visual images, spoken words, target trajectories, and other kinds of sensor data in a wide variety of applications. An adaptive time-delay neural network (ATNN) is proposed, which dynamically adapts its time delays as well as its synaptic weights. The resulting network is trained to distinguish the temporal properties and spatiotemporal correlations of various input patterns. In biological systems, the delays along axons or at the synapses may vary, as in the ATNN, due to factors such as the length of the axon, insulation (myelin), or the details of the biochemical processes. In this paper, an improved learning algorithm based on gradient descent is derived for both adaptive time delays and synaptic strengths. This adaptation paradigm gives the network more flexibility to attain the optimal time delays and to achieve more accurate pattern mapping and classification than is possible with arbitrary fixed delays, as have been used previously. Noise tolerance was tested in a series of experiments, in which the proposed ATNN showed advantages. Time series prediction was tested with the chaotic Mackey-Glass equation, and the ATNN performed better than training with fixed time delays. The ATNN is suitable for spatiotemporal signal recognition, prediction, and classification.
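For readers unfamiliar with the benchmark, the Mackey-Glass series can be generated as sketched below. The abstract does not state which parameters or integration scheme were used, so the commonly quoted values (beta = 0.2, gamma = 0.1, exponent 10, delay 17), the unit Euler step, and the constant initial history are assumptions. The sketch only produces the test signal; it does not implement the ATNN or its learning rule.

```python
def mackey_glass(n_samples, tau=17, beta=0.2, gamma=0.1, power=10, x0=1.2):
    """Generate n_samples of the Mackey-Glass series with a unit Euler step.

    Discretises dx/dt = beta * x(t - tau) / (1 + x(t - tau)**power) - gamma * x(t).
    The history before t = 0 is held constant at x0 (an assumption).
    """
    history = [x0] * (tau + 1)      # x(-tau) ... x(0)
    series = []
    for _ in range(n_samples):
        delayed = history[-(tau + 1)]   # x(t - tau)
        current = history[-1]           # x(t)
        nxt = current + beta * delayed / (1.0 + delayed ** power) - gamma * current
        history.append(nxt)
        series.append(nxt)
    return series

def delay_embed(series, delays, horizon):
    """Build (inputs, target) pairs from fixed delays for one-step-style prediction.

    Each input is [x(t - d) for d in delays]; the target is x(t + horizon).
    A fixed-delay predictor would be trained on pairs like these, whereas
    the ATNN described above also adapts the delays themselves.
    """
    start = max(delays)
    pairs = []
    for t in range(start, len(series) - horizon):
        pairs.append(([series[t - d] for d in delays], series[t + horizon]))
    return pairs

series = mackey_glass(500)
pairs = delay_embed(series, delays=[0, 6, 12, 18], horizon=6)
```

The delays and horizon passed to `delay_embed` are illustrative values only.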
The proceedings contain 52 papers. The topics discussed include: nonreconstruction approach for road following; applying geometric sensor and scene models for range image understanding; estimation of motion parameters using binocular camera configurations; fusion-based depth estimation from a sequence of monocular images; scene description: interactive computation of stability with friction; planning of an active range sensor structure for pose estimation of 3-D regular objects; and clustering methods for removing outliers from vision-based range estimates.
ISBN: 0819410276 (print)
A computer vision-based system for stop-and-go driving is presented. The horizontal edge characteristics of vehicles are extracted with a model-based filter. Symmetric groups of horizontal lines indicate the centerlines of vehicles in the scene. Discontinuities in the lines indicate the edges of the vehicles. The centerline and width of each vehicle give the bearing and approximate range to it. This approximate range is filtered and used to maintain a safe distance to the vehicle ahead.
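One plausible way to turn an image centerline and width into bearing and range is the pinhole camera model, sketched below. The abstract does not say how the system does this, so the model, the function name `bearing_and_range`, and the assumed physical vehicle width of 1.8 m are all assumptions made for illustration.

```python
import math

def bearing_and_range(center_u, width_px, focal_px, principal_u,
                      vehicle_width_m=1.8):
    """Approximate bearing (radians) and range (metres) to a vehicle.

    center_u: image column of the vehicle centerline, in pixels.
    width_px: apparent vehicle width in the image, in pixels.
    focal_px: camera focal length expressed in pixels.
    principal_u: image column of the optical axis.
    vehicle_width_m: assumed true vehicle width; the range is only as
    good as this guess, which is why it is called approximate.
    """
    if width_px <= 0:
        raise ValueError("apparent width must be positive")
    bearing = math.atan2(center_u - principal_u, focal_px)
    distance = focal_px * vehicle_width_m / width_px
    return bearing, distance

# A vehicle 72 px wide, centred 80 px right of the optical axis,
# seen by a camera with an 800 px focal length.
bearing, distance = bearing_and_range(400.0, 72.0, 800.0, 320.0)
```

Since the range depends on an assumed vehicle width and on noisy edge positions, smoothing it over time before using it for distance keeping, as the abstract describes, is a sensible step.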
ISBN: 0819410276 (print)
This paper describes the location of corresponding lines from plane to space, 3-D information acquisition, and three-view reconstruction. For this purpose, a series of space-geometry calibrations is performed and a set of coordinate systems is established. The intersection of the corresponding points in space can then be computed to acquire the 3-D depth information.
ISBN: 0819410276 (print)
Technical illustrations (TIs) are an effective way to show how to assemble or disassemble a mechanical assembly. Information including not only the shape of each constituent but also the order of operations necessary for assembly or disassembly can be derived from a single TI. However, an additional TI is often required to supplement the insufficient information obtained from a single TI. The paper presents a solution to the problem that arises in augmenting a model description by unifying the results obtained from several TIs.
ISBN: 0819407453 (print)
We consider analog neural network implementations (using VLSI or optical technologies) with limited accuracy and various noise and nonlinearity error sources. Algorithms and techniques to achieve high performance (good recognition rate P_C % and large storage capacity) on such systems are considered. The adaptive clustering neural net (ACNN) and the robust Ho-Kashyap (HK-2) associative processor (AP) are the neural networks considered in detail.
ISBN: 0819410276 (print)
We present a new edge-based matching algorithm that can be applied to unstructured stereo images. We first briefly review major recent developments in establishing stereo correspondences and show that a great deal of research is still needed before stereo techniques can be applied to more real-world problems. We then discuss why we use edges (without simplification and without linking) as the primitives to be matched rather than other primitives (points, regions, ...), and which edge detector is adequate. We argue that whole parts of edges should be matched rather than straight line segments compared. We then discuss the occlusion problem and the defects of the edge detector that may cut edges into several pieces, and argue that partial matching of edges can solve such problems. We then explain the matching criteria used, which comprise both shape and gray-level attributes. These criteria are chosen for their pertinence, for their simplicity of representation and computation, and for their similarity to those probably used in the human visual system. Finally, complete quantitative experimental results are shown for various indoor and outdoor real-world scenes.