We address the problem of describing, recognizing, and learning generic, free-form objects in real-world scenes. For this purpose, we have developed a hybrid appearance-based approach where objects are encoded as loos...
详细信息
We address the problem of describing, recognizing, and learning generic, free-form objects in real-world scenes. For this purpose, we have developed a hybrid appearance-based approach where objects are encoded as loose collections of parts and relations between neighboring parts. The key features of this approach are: part decomposition based on local structure segmentation derived from multi-scale wavelet filters, flexible and efficient recognition by combining weak structural constraints, and learning and generalization of generic object categories (with possibly large intra-class variability) from real examples.
The process of detecting lines and curves in an image is an important component of many pattern recognition and computervision applications. There are well understood approaches to finding curves from a parametrised ...
详细信息
The process of detecting lines and curves in an image is an important component of many pattern recognition and computervision applications. There are well understood approaches to finding curves from a parametrised space of curves. In many practical cases, however, there exist arbitrary shape curves, and different types of curves in the same image. In addition, local perturbation noise can be added to the images. We propose a nonparametric method to overcome the above mentioned problems. The method is based on local modeling of the data. It does not require the specification of a parametric space of curves. It can be used to detect arbitrary curves and thus has a wider applicability than parametric approaches.
To design the image communication system including human visual system systematically, the objective quality estimation method of the picture based on the model of the human vision is necessary. We develop the objecti...
详细信息
To design the image communication system including human visual system systematically, the objective quality estimation method of the picture based on the model of the human vision is necessary. We develop the objective picture quality scale (PQSvideo) for video coding considering the relation between the physical distortion factors and the psychological picture estimation factors. The obtained cross correlation coefficient between the PQSvideo and the mean opinion score (MOS), becomes 0.978.
作者:
M. RouxDépartement IMA
Ecole Nationale Supérieure des TéIécommunications Paris France
This paper presents a new method for the automatic registration of SPOT images and digitised maps. Direct matching of high level features (urban areas and crossroads) is performed using an hypothesis generation and pr...
详细信息
This paper presents a new method for the automatic registration of SPOT images and digitised maps. Direct matching of high level features (urban areas and crossroads) is performed using an hypothesis generation and propagation scheme. Results are presented for the registration of SPOT images and maps covering the same scene, as well as for the retrieval of partial maps into a large SPOT image.
The paper studies segmentation of moving objects with low texture in a low textured background. We describe an algorithm that resolves the difficulties associated with other approaches by integrating over time the inf...
详细信息
The paper studies segmentation of moving objects with low texture in a low textured background. We describe an algorithm that resolves the difficulties associated with other approaches by integrating over time the information in the video sequence. We motivate and demonstrate our approach by building the background and moving object world images, important constructs in generative video.
The depth perception we present in this paper is based on monocular computervision. The method exploits the physical effect that the imaging properties of an optical system depend upon the acquisition parameters and ...
详细信息
The depth perception we present in this paper is based on monocular computervision. The method exploits the physical effect that the imaging properties of an optical system depend upon the acquisition parameters and the object distance. Our approach is working along the edges. The basic principle is to compare the blur in two defocused images of the same scene taken with different apertures (depth from defocus). To increase speed and precision our algorithm is working only at the exact position of ramp edges, which are determined by a biological model of the visual cortex.
Two species of crabs are Chionoecetes bairdi and C. opilio and their hybrids live in the Bering Sea. The two species differ in generic, morphological and morphometric characteristics. Inter breeding of C. bairdi and C...
详细信息
Two species of crabs are Chionoecetes bairdi and C. opilio and their hybrids live in the Bering Sea. The two species differ in generic, morphological and morphometric characteristics. Inter breeding of C. bairdi and C. opilio results in a hybrid form with intermediate morphological and morphometric characteristics. These species were determined by analyzing the empirical covariance matrices associated with the two species and the hybrid and by taking into account the statistical reliability of the estimated principal components. A modified eigen image classifier is implemented based on the assumption that the data is drawn from multivariate Gaussian distributions with different means and covariance matrices. The clustering technique is introduced to minimize the misclassification rate of the eigen image classifier.
Knowledge-based vision for robots needs a radically new approach. The traditional approach has not made substantial progress for various reasons including the engineering problems of building systems based on a hybrid...
详细信息
Knowledge-based vision for robots needs a radically new approach. The traditional approach has not made substantial progress for various reasons including the engineering problems of building systems based on a hybrid on-line/off-line paradigm. A new situated agent approach is presented. The constraint net model of Y. Zhang and A.K. Mackworth allows the designer to specify the robot's vision, control and motor systems uniformly as on-line systems. If the perceptual and control systems are designed as constraint-satisfying devices then the total robotic system, consisting of the robot symmetrically coupled to the environment, can be proven correct. Examples of this approach are given.
We present a method that integrates local fractal dimension and edge information into a region growing algorithm for the segmentation of natural images. We compare two methods of estimating the local fractal dimension...
详细信息
We present a method that integrates local fractal dimension and edge information into a region growing algorithm for the segmentation of natural images. We compare two methods of estimating the local fractal dimension in the proposed segmentation algorithm. One is a blanket method and the other is a Fourier-wavelet method. We also propose a technique to store the edge information not on a pixel itself but on a boundary between pixels in the region-edge integrating algorithm in order to use the edge information more effectively and to simplify the algorithm.
We present a control scheme for a rate scalable video codec. We describe a wavelet based video codec with motion compensation used to reduce temporal redundancy. The prediction error frames are encoded using an embedd...
详细信息
We present a control scheme for a rate scalable video codec. We describe a wavelet based video codec with motion compensation used to reduce temporal redundancy. The prediction error frames are encoded using an embedded zerotree wavelet (EZW) approach which allows data rate scalability. Since motion compensation is used in the algorithm, the duality of the decoded video may decay due to the propagation of errors in the temporal domain. An adaptive motion compensation scheme is proposed to address this problem. We show that using our control scheme the quality of the decoded video can be maintained at any data rate.
暂无评论