There has recently been growing interest in exploiting the concept of reasoning about function for object recognition. In a function-based approach to object recognition, recognition of an object means labeling it as ...
详细信息
ISBN:
(纸本)0819413208
There has recently been growing interest in exploiting the concept of reasoning about function for object recognition. In a function-based approach to object recognition, recognition of an object means labeling it as belonging to some category of objects according to the function that it could serve. The few function-based recognition systems which have so far been described in the literature have all assumed that the input to the problem is a pure static shape description. By `pure' shape we mean that the only object property that the systems have reasoned about is their abstract shape. By `static' shape we mean that the systems have reasoned about an object from only a single (assumed rigid) abstract shape instance. This paper discusses some of the issues which must be addressed in extending the function-based approach to handle non-shape properties (such as material properties) and dynamic shape descriptions.
The vision system employed by an intelligent robot must be active; active in the sense that it must be capable of selectively acquiring the minimal amount of relevant information for a given task. An efficient active ...
详细信息
ISBN:
(纸本)0819413208
The vision system employed by an intelligent robot must be active; active in the sense that it must be capable of selectively acquiring the minimal amount of relevant information for a given task. An efficient active vision system architecture that is based loosely upon the parallel-hierarchical (pyramidal) structure of the biological visual pathway is presented in this paper. Although the computational architecture of the proposed pyramidal neuro-vision system is far less sophisticated than the architecture of the biological visual pathway, it does retain some essential features such as the converging multilayered structure of its biological counterpart. In terms of visual information processing, the neuro-vision system is constructed from a hierarchy of several interactive computational levels, whereupon each level contains one or more nonlinear parallel processors. Computationally efficient vision machines can be developed by utilizing both the parallel and serial information processing techniques within the pyramidal computing architecture. A computer simulation of a pyramidal vision system for active scene surveillance is presented.
This paper describes a fuzzy logic controller (FLC) designed and implemented to control the yaw angle of a 10 kW fixed speed teetered-rotor wind turbine presently being commissioned at the University of Texas at El Pa...
详细信息
ISBN:
(纸本)0819413208
This paper describes a fuzzy logic controller (FLC) designed and implemented to control the yaw angle of a 10 kW fixed speed teetered-rotor wind turbine presently being commissioned at the University of Texas at El Paso. The technical challenge of this project is that the wind turbine represents a highly stochastic nonlinear system. The problems associated with the wind turbine yaw control are of a similar nature as those experienced with position control of high inertia equipment like tracking antenna, gun turrets, and overhead cranes. Furthermore, the wind turbine yaw controller must be extremely cost-effective and highly reliable in order to be economically viable compared to the fossil fueled power generators.
Most of the information regarding the shape of polyhedral objects is preserved in the edges and the vertices of these objects. Gray level images of scenes containing such objects are often processed to extract edge an...
详细信息
ISBN:
(纸本)0819413208
Most of the information regarding the shape of polyhedral objects is preserved in the edges and the vertices of these objects. Gray level images of scenes containing such objects are often processed to extract edge and vertex information to produce equivalent line sketches. An accurate line sketch of a scene serves as an effective input to high level vision systems concerned with scene understanding or object recognition. The performance of these systems is therefore greatly dependent on the accuracy of the line sketch. The work reported in this paper addresses the issues associated with generating accurate line sketches from gray level images. The methods described here have been implemented and tested with real and synthetic images and are compared to other vertex or corner detection techniques. The performance of the vertex detector is assessed using simulation runs on images with varied signal-to-noise ratios. The computational performance of this algorithm is evaluated and assessed by operating directly on the gray-scale image.
This paper describes a model based vision system that has been developed which is able to perform model based reasoning at real-time (or near real-time) rates and for which both the hardware and prototyping costs are ...
详细信息
ISBN:
(纸本)0819413208
This paper describes a model based vision system that has been developed which is able to perform model based reasoning at real-time (or near real-time) rates and for which both the hardware and prototyping costs are low. The basic approach taken is to extract a set of useful features from observed models using a library of feature primitive operators. Scale and orientation invariant combinations of these features are used as indices into a hardware lookup table to establish initial correspondence between similar combinations that will be encountered when examining unknown objects. When performing initial recognition of an unknown object, evidence for an object in a particular spatial pose is accumulated, giving rise to an initial set of hypotheses. The strongest hypotheses are then refined by iteratively hypothesizing new (previously uninstantiated) model/object feature matches and computing a confidence measure associated with the current instantiation set. If confidence increases the newly hypothesized instantiation is retained, otherwise it is discarded.
We describe an implemented computer program that recognizes the occurrence of simple spatial motion events in simulated video input. The program receives an animated line-drawing as input and produces as output a sema...
详细信息
ISBN:
(纸本)0819413208
We describe an implemented computer program that recognizes the occurrence of simple spatial motion events in simulated video input. The program receives an animated line-drawing as input and produces as output a semantic representation of the events occurring in that movie. We suggest that the notions of support, contact, and attachment are crucial to specifying many simple spatial motion event types and present a logical notation for describing classes of events that incorporates such notions as primitives. We then suggest that the truth values of such primitives can be recovered from perceptual input by a process of counterfactual simulation, predicting the effect of hypothetical changes to the world on the immediate future. Finally, we suggest that such counterfactual simulation is performed using knowledge of naive physical constraints such as substantiality, continuity, gravity, and ground plane. We describe the algorithms that incorporate these ideas in the program and illustrate the operation of the program on sample input.
In this paper, a novel image analysis technique is proposed, which may be performed prior to coding in order to decide what is the most significant information to encode. In the proposed system, the image to be coded ...
详细信息
ISBN:
(纸本)0819413208
In this paper, a novel image analysis technique is proposed, which may be performed prior to coding in order to decide what is the most significant information to encode. In the proposed system, the image to be coded is first partitioned into a large number of sub-blocks of N*N pixels. The blocks can then be stored into two major classes according to the level of the visual activity present. The classification is based on analyzing the local histogram within each sub-block. In this paper, we initially analyze the image blocks to separate uniform blocks from those that can be classified as non-uniform blocks. Adjacent uniform blocks with the same statistics are merged to form large blocks. These blocks can then be coded by their mean values. It is also shown that the non-uniform blocks may also be classified into three categories with different levels of activity.
In many object recognition problems, the object to be identified is one of a fixed set (library) of objects. The problem of identifying which object is present then shares characteristics of the signal detection and p...
详细信息
ISBN:
(纸本)0819413208
In many object recognition problems, the object to be identified is one of a fixed set (library) of objects. The problem of identifying which object is present then shares characteristics of the signal detection and parameter estimation problem: which signal is present and what are its parameters? The Reciprocal Basis Set/Direction of Arrival (RBS/DOA) technique is a recently developed technique for object pose determination. It uses a single, comprehensive analytic object model representing a suite of views of an object. Object orientation can be directly established from single 2-D views of the object, without a costly search of the pose parameter space, and without need for the views to be related by a geometric image transformation. This paper describes how one can construct reciprocal basis sets to simultaneously determine object identity and pose from a single 2-D image. Results are presented which demonstrate this ability for a single unknown pose parameter using synthetic and camera-acquired images.
Surface mounted technology (SMT) in automated assembly facilities requires the use of automatic surface-mount-device (SMD) placement machines. One of the problems involved in the electronic printed circuit board (PCB)...
详细信息
ISBN:
(纸本)0819413216
Surface mounted technology (SMT) in automated assembly facilities requires the use of automatic surface-mount-device (SMD) placement machines. One of the problems involved in the electronic printed circuit board (PCB) assembly process is the verification of the SMD placement operation within tight tolerances. The high throughput of modern manufacturing lines along with the required accuracy demand the use of automatic inspection systems to verify SMD placement. Image complexity of the board makes the use of machine vision for the inspection process a difficult task. This is complicated by the fact that misclassification errors should be kept to a minimum. Additionally, it is desirable that the inspection results provide enough information to be used for statistical process control (SPC). The strategy adopted to solve this problem was to simplify the complexity of the image by means of special illumination devices. The simplified image was then suitable for analysis by simple processing, segmentation, and detection algorithms that, sequentially applied to the image, met the required repeatability and accuracy specifications for the inspection system. The scope of this paper is to describe the techniques explored by the authors to solve the SMD inspection problem in order to develop a working industrial SMD inspection system.
We have been exploring the hypothesis that vision is an explanatory process, in which causal and functional reasoning about potential motion plays an intimate role in mediating the activity of low-level visual process...
详细信息
ISBN:
(纸本)0819413208
We have been exploring the hypothesis that vision is an explanatory process, in which causal and functional reasoning about potential motion plays an intimate role in mediating the activity of low-level visual processes. In particular, we have explored two of the consequences of this view for the construction of purposeful vision systems: Causal and design knowledge can be used to (1) drive focus of attention, and (2) choose between ambiguous image interpretations. An important result of visual understanding is an explanation of the scene's causal structure: How action is originated, constrained, and prevented, and what will happen in the immediate future. In everyday visual experience, most action takes the form of motion, and most causal analysis takes the form of dynamical analysis. This is even true of static scenes, where much of a scene's interest lies in how possible motions are arrested. This paper describes our progress in developing domain theories and visual processes for the understanding of various kinds of structured scenes, including structures built out of children's constructive toys and simple mechanical devices.
暂无评论