The logical structure of robot vision systems is introduced. It can be used as a basis for designing such systems. Both sensor data fusion and knowledge representation in a vision system may be broken down into four h...
详细信息
ISBN:
(纸本)081944300X
The logical structure of robot vision systems is introduced. It can be used as a basis for designing such systems. Both sensor data fusion and knowledge representation in a vision system may be broken down into four hierarchical levels where at each level knowledge representation in the form of partial models and data fusion complement each other. Multi-processor architectures supporting this type of structure are introduced as a particularly efficient basis for robot vision systems. Practical implementation is given.
A new optical arrangement is introduced for such applications of machine vision that need to discriminate structured laser lighting from stray ambient lighting. Instead of using transmissive filters that usually have ...
详细信息
ISBN:
(纸本)081944300X
A new optical arrangement is introduced for such applications of machine vision that need to discriminate structured laser lighting from stray ambient lighting. Instead of using transmissive filters that usually have high attenuation, a double reflective filter arrangement is presented. A combination of dielectric and metallic mirrors achieves steep stopband attenuation especially in the near infrared region where Si detectors are sensitive and common blackbody-like radiators have high spectral density. An additional advantage follows from the compact size of such a Maksutov-type mirror telescope. Compared to a corresponding refractive design, the length is reduced to less than half.
An intelligent robot is a remarkably useful combination of a manipulator, sensors and controls. The use of these machines in factory automation can improve productivity, increase product quality and improve competitiv...
详细信息
ISBN:
(纸本)081944300X
An intelligent robot is a remarkably useful combination of a manipulator, sensors and controls. The use of these machines in factory automation can improve productivity, increase product quality and improve competitiveness. This paper presents a discussion of recent and future technical and economic trends. During the past twenty years the use of industrial robots that are equipped not only with precise motion control systems but also with sensors such as cameras, laser scanners, or tactile sensors that permit adaptation to a changing environment has increased dramatically. intelligent robot products have been developed in many cases for factory automation and for some hospital and home applications. To reach an even higher degree of applications, the addition of learning may be required. Recently, learning theories such as the adaptive critic have been proposed. In this type of learning, a critic provides a grade to the controller of an action module such as a robot. The adaptive critic is a good model for human learning. In general, the critic may be considered to be the human with the teach pendant, plant manager, line supervisor, quality inspector or the consumer. If the ultimate critic is the consumer, then the quality inspector must model the consumer's decision-making process and use this model in the design and manufacturing operations. Can the "adaptive critic" be used to advance intelligentrobots? intelligentrobots have historically taken decades to be developed and reduced to practice. Methods for speeding this development include technology such as rapid prototyping and product development and government, industry and university cooperation.
For robust and secure behavior in natural environment an autonomous vehicle needs an elaborate vision sensor as main source of information. The vision sensor must be adaptable to the external situation, the mission, t...
详细信息
ISBN:
(纸本)081944300X
For robust and secure behavior in natural environment an autonomous vehicle needs an elaborate vision sensor as main source of information. The vision sensor must be adaptable to the external situation, the mission, the capabilities of the vehicle and the knowledge about the external world accumulated up to the present time. In the EMS-vision system, this vision sensor consists of four cameras with different focal lengths mounted on a highly dynamic pan-tilt camera head. Image processing, gaze control and behavior decision interact with each other in a closed loop. The image processing experts specify so-called regions of attention (RoAs) for each object in 3D object coordinates. These RoAs should be visible with a resolution as required by the measurement techniques applied. The behavior decision module specifies the relevance of obstacles like road segments, crossings or landmarks in the situation context. The gaze control unit takes all this information in order to plan, optimize and perform a sequence of smooth pursuits, interrupted by saccades. The sequence with the best information gain is performed. ne information gain depends on the relevance of objects or object parts, the duration of smooth pursuit maneuvers, the quality of perception and the number of saccades. The functioning of the EMS-vision system is demonstrated in a complex and scalable autonomous mission with the UBM test vehicle VAMORS.
vision is a part of a larger information system that converts visual information into knowledge structures. These structures drive vision process, resolving ambiguity and uncertainty via feedback, and provide image un...
详细信息
ISBN:
(纸本)081944300X
vision is a part of a larger information system that converts visual information into knowledge structures. These structures drive vision process, resolving ambiguity and uncertainty via feedback, and provide image understanding, that is an interpretation of visual information in terms of such knowledge models. A computervision system based on such principles requires unifying representation of perceptual and conceptual information. computer simulation models are built on the basis of graphs/networks. The ability of human brain to emulate similar graph/networks models is found. That means a very important shift of paradigm in our knowledge about brain from neural networks to the "cortical software". Starting from the primary visual areas, brain analyzes an image as a graph-type spatial structure. Primary areas provide active fusion of image features on a spatial grid-like structure, where nodes are cortical columns. The spatial combination of different neighbor features cannot be described as a statistical/integral characteristic of the analyzed region, but uniquely characterizes such region itself. Spatial logic and topology naturally present in such structures. Mid-level vision processes like clustering, perceptual grouping, multilevel hierarchical compression, separation of figure from ground, etc. are special kinds of graph/network transformations. They convert low-level image structure into the set of more abstract ones, which represent objects and visual scene, making them easy for analysis by higher-level knowledge structures. Higher-level vision phenomena like shape from shading, occlusion, etc. are results of such analysis. Such approach gives opportunity not only to explain frequently "unexplainable" results of the cognitive science, but also to create intelligentcomputervision systems that simulate perceptional processes in both "what" and "where" visual pathways. Such systems can open new horizons for robotic and computervision industries.
A model of an object is an image consisting of features of die object. The input is a gray scale image from which features are computed. In his doctoral thesis [7] J. L. Chen used a model based approach for object rec...
详细信息
ISBN:
(纸本)081944300X
A model of an object is an image consisting of features of die object. The input is a gray scale image from which features are computed. In his doctoral thesis [7] J. L. Chen used a model based approach for object recognition. His method is based on Rosin's work [26] for extraction of parts. Both model and scene features are contour based properties. Properties of each part such as area, compactness, convexity etc. are computed and used to match the scene image to the model. This paper extends the algorithm in several directions. The contours are improved using two passes over the initial input image. The notion of internal part or base of an object is introduced and used to normalize the part areas. Insignificant parts are merged with neighboring parts to provide a better segmentation of the scene. Interpretation trees are used to match scene to object. The algorithm is tested on simple hand drawn images and also images of buildings obtained from architectural databases.
In this work, a camera with integrated image processing capabilities is discussed. The camera is based on an imager coupled to an FPGA device (Field Programmable Gate Array) which contains an architecture for real-tim...
详细信息
ISBN:
(纸本)081944300X
In this work, a camera with integrated image processing capabilities is discussed. The camera is based on an imager coupled to an FPGA device (Field Programmable Gate Array) which contains an architecture for real-time computervision low-level processing. The architecture can be reprogrammed remotely for application specific purposes. The system is intended for rapid modification and adaptation for inspection and recognition applications, with the flexibility of hardware and software reprogrammability. FPGA reconfiguration allows the same ease of upgrade in hardware as a software upgrade process. The camera is composed of a digital imager coupled to an FPGA device, two memory banks, and a microcontroller. The microcontroller is used for communication tasks and FPGA programming. The system implements a software architecture to handle multiple FPGA architectures in the device, and the possibility to download a software/hardware object from the host computer, into its internal context memory. System advantages are: small size, low power consumption, and a library of hardware/software funcionalities that can be exchanged during run-time. The system has been validated with an edge detection and a motion processing architectures, which will be presented in the paper. Applications targeted are in robotics, mobile robotics, and vision based quality control.
The analysis of the prenatal engineering of the human brain and more specifically that of the human eye may encourage new interpretations and better understanding of 'cortical' processors and lead to better id...
详细信息
ISBN:
(纸本)081944300X
The analysis of the prenatal engineering of the human brain and more specifically that of the human eye may encourage new interpretations and better understanding of 'cortical' processors and lead to better ideas about how to built optical sensors. What human vision at its first processing stages realizes is an adaptive transformation of physical parameters from an 'outer' 4D-spatiotemporal into an 'inner' psychological world or its reciprocal projection and construction of an illusionary (inner or outer) world. The description of some of the most remarkable steps in the development of the human eye before birth, very critical for the optical functionalities in vision, will illustrate the new interpretations.
We have proposed a boundary extraction algorithm for cross-sectional critical dimension (CD) measurements of the scanning electron microscope (SEM) images in machine/computervision. Noisy SEM images have the image di...
详细信息
ISBN:
(纸本)081944300X
We have proposed a boundary extraction algorithm for cross-sectional critical dimension (CD) measurements of the scanning electron microscope (SEM) images in machine/computervision. Noisy SEM images have the image discontinuity of roof edge with peak. A task is to extract a low contrast boundary or edge with about one pixel width in a noisy cross sectional SEM image. Also CD measurements require that the algorithm locate a boundary or an edge at the correct position. Our policy is to detect the edges with high degree of confidence at first, and then to extract the edges with the high probabilities of edge that are calculated by using gray levels of neighboring pixels. Our proposed algorithm integrates the Gaussian pyramid to suppress noise, morphological operations to detect the edges with high degree of confidence, the relaxation method to recover the gaps in the detected edges, and the hierarchical approach. Application of the proposed algorithm to cross sectional SEM images of DRAMs shows its validity.
暂无评论