We explore the idea that the visual system uses specialized processes to extract critical information about 3-d motion and structure for visually-guided navigation. We first consider the computation of three essential...
详细信息
We explore the idea that the visual system uses specialized processes to extract critical information about 3-d motion and structure for visually-guided navigation. We first consider the computation of three essential properties: the relative 3-ddirection of heading of the observer, the time-to-collision with approaching object surfaces, and the locations of object boundaries defined by motion discontinuities. We then focus on the heading computation, relating current algorithms to the perception of heading direction.
biological sensor design has long provided inspiration for sensor design in machine vision. However, relatively little attention has been paid to the actual design parameters provided by biological systems, as opposed...
详细信息
biological sensor design has long provided inspiration for sensor design in machine vision. However, relatively little attention has been paid to the actual design parameters provided by biological systems, as opposed to the general nature of biologicalvision architectures. In the present paper, we provide a review of current knowledge of primate spatial visiondesign parameters and present recent experimental and modeling work from our lab which demonstrates that a numerical conformal mapping, which is a refinement of our previous complex logarithmic model, provides the best current summary of this feature of the primate visual system.
Our fds-theory accepts context effects in perception. It takes account of early perception phase, by establishing the concept of 39;Structural Identity39; of a k-norm, 39;SkId39;. Early perception detects the ...
详细信息
Our fds-theory accepts context effects in perception. It takes account of early perception phase, by establishing the concept of 'Structural Identity' of a k-norm, 'SkId'. Early perception detects the 'maximally abstracted' holistic view of an object by representing it in terms of an attributed semantic graph-word. The SkId graph-word, along with the contextual, abstraction-level dependent, description of perceptual characteristics, constitute the 'formal description schema - fds' models of the norm-objects.
There is a growing need for progress in the technology for the management of knowledge derived from sensory data. In this paper we address the structure and operations of a cellular 39;Experiential Knowledge Base - ...
详细信息
There is a growing need for progress in the technology for the management of knowledge derived from sensory data. In this paper we address the structure and operations of a cellular 'Experiential Knowledge Base - E*KB' system, which acts as a cognitive prosthesis to the decision maker. We concern ourselves with the experiential knowledge representation techniques, and with the methods and tools required for the extraction (relevance filtering), the contextual abstraction (data compression and generalization), the classification and storage of real time sensory knowledge in the cellular architecture of the E*KB.
A multi-task dynamic neural network that can be programmed for sorting, processing and encoding spatio-temporal visual information is presented in this paper. This dynamic neural network, called the PN-network, is com...
详细信息
A multi-task dynamic neural network that can be programmed for sorting, processing and encoding spatio-temporal visual information is presented in this paper. This dynamic neural network, called the PN-network, is comprised of numerous densely interconnectedneural subpopulations which reside in one of the two coupled sublayers, P or N. The subpopulations in the P-sublayer transmit an excitatory, or a positive, influence onto all interconnected units, whereas the subpopulations in the N-sublayer transmit an inhibitory, or negative, influence. The dynamical activity generated by each subpopulation is given by a nonlinear first-order system. By varying the coupling strength between these different subpopulations it is possible to generate three distinct modes of dynamical behavior useful for performing vision related tasks. It is postulated that the PN-network can function as a basic programmable processor for novel vision machine systems.
For autonomous machines equipped with vision capabilities and in a controlled environment. 3-d model-based object identification methodologies will, in general, solve rigid body recognition problems. In an uncontrolle...
详细信息
For autonomous machines equipped with vision capabilities and in a controlled environment. 3-d model-based object identification methodologies will, in general, solve rigid body recognition problems. In an uncontrolled environment, however, several factors pose difficulties for correct identification. We have addressed the problem of 3-d object recognition using a number of methods including neural network classifiers and a Bayesian-like classifier for matching image data with model projection-deriveddata. neural network classifiers used began operation as simple feature vector classifiers. However, unmodelled signal behavior was learned with additional samples yielding great improvement in classification rates. The model analysis drastically shortened training time of both classification systems. In an environment where signal behavior is not accurately modelled, two separate forms of learning give the systems the ability to update estimates of this behavior. Required, of course, are sufficient samples to learn this new information. Given sufficient information and a well-controlled environment, identification of 3-d objects from a limited number of classes is indeed possible.
We investigate the use of chromatic information in dense stereo correspondence. Specifically, the chromatic photometric constraint, which is used to specify a mathematical optimality criterion for solving the dense st...
详细信息
We investigate the use of chromatic information in dense stereo correspondence. Specifically, the chromatic photometric constraint, which is used to specify a mathematical optimality criterion for solving the dense stereo correspondence problem, is developed. The result is a theoretical construction for developing dense stereo correspondence algorithms which use chromatic information. The efficacy of using chromatic information via this construction is tested by implementing single-and multi-resolution versions of a stereo correspondence algorithm which uses simulated annealing as a means of solving the optimization problem. Results demonstrate that the use of chromatic information can significantly improve the performance of dense stereo correspondence.
This paper addresses the problems of how to efficiently extract information from different data sources and how to fuse the information to achieve a more complete and accurate interpretation of the underlying 3-d scen...
详细信息
This paper addresses the problems of how to efficiently extract information from different data sources and how to fuse the information to achieve a more complete and accurate interpretation of the underlying 3-d scene. This paper describes an integrated approach to 3-d image interpretation. The approach is to first obtain region information by applying a multi-resolution segmentation technique to a monocular image. The elevation information is then extracted by applying an edge-based matching technique to a stereo pair covering the same scene. The region information, the elevation information, and the a priori knowledge about the scene are then integrated by using a rule-based scheme to classify the various objects in the scene. Results of applying this integrated technique to an overhead visible urban scene are presented.
The reconstruction of curves and surfaces from sparse data is an important task in many applications. In computervision problems the reconstructed curves and surfaces generally represent some physical property of a r...
详细信息
The reconstruction of curves and surfaces from sparse data is an important task in many applications. In computervision problems the reconstructed curves and surfaces generally represent some physical property of a real object in a scene. Thus, the characteristics of the reconstruction process differs from straight forward fitting of smooth curves and surfaces to a set of data. Since the collecteddata is represented in an arbitrarily chosen coordinate system, the reconstruction process should be invariant to the choice of the coordinate system (except for the transformation between the two coordinate systems). In this paper, reconstruction algorithms are presented for reconstructing invariant estimates of both curves and surfaces. The reconstruction problem will be cast as an ill-posed inverse problem which must be stablized using a priori information about the constraint formation. Tikhonov regularization is used to form a well-posed mathematical problem statement. Examples of typical reconstructed objects are also given.
Edge linearization operators are often used in computervision and in neural network models of vision to reconstruct noisy or incomplete edges. Such operators gather evidence for the presence of an edge at various ori...
详细信息
Edge linearization operators are often used in computervision and in neural network models of vision to reconstruct noisy or incomplete edges. Such operators gather evidence for the presence of an edge at various orientations across all image locations and then choose the orientation that best fits the data at each point. One disadvantage of such methods is that they often function in a winner-take-all fashion: the presence of only a single orientation can be represented at any point;multiple edges cannot be represented where they intersect. For example, the neural Boundary Contour System of Grossberg and Mingolla implements a form of winner-take-all competition between orthogonal orientations at each spatial location, to promote sharpening of noisy, uncertain image data. But that competition may produce rivalry, oscillation, instability, or mutual suppression when intersecting edges (e.g., a cross) are present. This 'cross problem' exists for all techniques, including Markov Random Fields, where a representation of a chosen favored orientation suppresses representations of alternate orientations. A new adaptive technique, using both an inhibitory learning rule and an excitatory learning rule, weakens inhibition between neurons representing poorly correlated orientations. It may reasonably be assumed that neurons coding dissimilar orientations are less likely to be coactivated than neurons coding similar orientations. Multiplexing by superposition is ordinarily generated: combinations of intersecting edges become represented by simultaneous activation of multiple neurons, each of which represents a single supported oriented edge. Unsupported or weakly supported orientations are suppressed. The cross problem is thereby solved.
暂无评论