Computational models of visual attention result in considerable data compression by eliminating processing on regions likely to be devoid of meaningful content. While saliency maps in static images is indexed on image...
详细信息
A new robust estimator based on an evolutionary optimization technique is proposed. The general hypothesizeand- verify strategy accelerates the parameter estimation substantially by systematic trial and parallel evalu...
详细信息
A new robust estimator based on an evolutionary optimization technique is proposed. The general hypothesizeand- verify strategy accelerates the parameter estimation substantially by systematic trial and parallel evaluation without the use of prior information. The method is evaluated by estimation of multi-view relations, i.e. the fundamental matrix. Additionally, some results for the trifocal geometry are presented. However, the general methodology could be used for any problem in which relations can be determined from a minimum number of points.
Efficient and comfortable acquisition of large 3D scenes is an important topic for many current and future applications in the field of robotics, factory and office visualization, 3DTV and cultural heritage. In this p...
详细信息
The bag of visual words representation has attracted a lot of attention in the computervision community. In particular, Probabilistic Latent Semantic Analysis (PLSA) has been applied to object recognition as an unsup...
详细信息
The bag of visual words representation has attracted a lot of attention in the computervision community. In particular, Probabilistic Latent Semantic Analysis (PLSA) has been applied to object recognition as an unsupervised technique built on top of the bag of visual words representation. PLSA, however, does not explicitly consider the spatial information of the visual words. In this paper, we propose an iterative technique, where a modified form of PLSA provides location and scale estimates of the foreground object through the estimated latent semantic. In return, the updated location and scale estimates will improve the estimate of the latent semantic. We call this iterative algorithm Semantic-Shift. We show results with significant improvements over PLSA.
Being able to read LCD/LED displays would be a very important step towards greater independence for persons who are blind or have low vision. A fast graphical model based algorithm is proposed for reading 7-segment di...
详细信息
Being able to read LCD/LED displays would be a very important step towards greater independence for persons who are blind or have low vision. A fast graphical model based algorithm is proposed for reading 7-segment digits in LCD/LED displays. The algorithm is implemented for Symbian camera cell phones in Symbian C++. The software reads one display in about 2 seconds by a push of a button on the cell phone (Nokia 6681, 220 MHz ARM CPU).
Several patternrecognition and classification techniques have been applied to the biometrics domain. Among them, an interesting technique is the Scale Invariant Feature Transform (SIFT), originally devised for object...
详细信息
Several patternrecognition and classification techniques have been applied to the biometrics domain. Among them, an interesting technique is the Scale Invariant Feature Transform (SIFT), originally devised for object recognition. Even if SIFT features have emerged as a very powerful image descriptors, their employment in face analysis context has never been systematically investigated. This paper investigates the application of the SIFT approach in the context of face authentication. In order to determine the real potential and applicability of the method, different matching schemes are proposed and tested using the BANCA database and protocol, showing promising results.
Multiscale techniques have been used for many years in computervision. Recently multiscale edges have received attention in spectral graph methods as an important perceptual cue. In this paper multiscale cues are use...
详细信息
Multiscale techniques have been used for many years in computervision. Recently multiscale edges have received attention in spectral graph methods as an important perceptual cue. In this paper multiscale cues are used in the context of max-flow/min-cut energy minimization. We formulate multiscale min-cut versions of three typical computervision applications, namely interactive segmentation, image restoration, and optical flow. We then solve across all scales simultaneously. This use of multiscale models and constraints leads to quantitatively and qualitatively improved experimental results.
A process is described to determine the shot accuracy of an automatic robotic pool playing system. The system comprises a ceiling-mounted gantry robot, a special purpose cue end-effector, a ceiling-mounted camera, and...
详细信息
We propose to use attribute grammars for recognizing normal events and detecting abnormal events in a video. Attribute grammars can describe constraints on features (attributes) in addition to the syntactic structure ...
详细信息
We propose to use attribute grammars for recognizing normal events and detecting abnormal events in a video. Attribute grammars can describe constraints on features (attributes) in addition to the syntactic structure of the input. Events are recognized using an extension of the Earley parser that handles attributes and concurrent event threads. Abnormal events are detected when the input does not follow syntax of the grammar or the attributes do not satisfy the constraints in the attribute grammar to some degree. We demonstrate the effectiveness of our method for the task of recognizing normal events and detecting anomalies in a parking lot.
暂无评论