In this paper, we present our approach to robust background modelling which combines visible and thermal infrared spectrum data. Our work is based on the non-parametric background model describe in 1. We use a pedestr...
详细信息
In this paper, we present our approach to robust background modelling which combines visible and thermal infrared spectrum data. Our work is based on the non-parametric background model describe in 1. We use a pedestrian detection module to prevent erroneous data from becoming part of the background model and this allows us to initialise our bacjground model, even in the presence of foreground objects. Visible and infrared features are use to remove incorrectly detected foreground regions. Allowing our model to quickly recover from ghost regions and rapid lighting changes. An object-based shadow detector also improves our algorithm's performance.
The automotive market puts strict and often conflicting requirements on computervision systems. On the one hand the algorithms require considerable computing power to work reliably in real-time and under a wide range...
详细信息
The automotive market puts strict and often conflicting requirements on computervision systems. On the one hand the algorithms require considerable computing power to work reliably in real-time and under a wide range of lighting conditions. On the other hand, the cost must be kept low, the package size must be small and the power consumption must be low. In addition, automotive qualified parts must be used both to withstand the harsh operating environment and to guarantee long product life. To meet all these conflicting requirements Mobileye developed the EyeQ, a complete ’system on a chip’ (SoC) which has the computing power to support a variety of applications such as lane, vehicle and pedestrian detection. This paper describes the process of designing an ASIC to support a family of vision algorithms.
Dimensionality reduction via feature projection has been widely used in patternrecognition and machine learning. It is often beneficial to derive the projections not only based on the inputs but also on the target va...
详细信息
Dimensionality reduction via feature projection has been widely used in patternrecognition and machine learning. It is often beneficial to derive the projections not only based on the inputs but also on the target values in the training data set. This is of particular importance in predicting multivariate or structured outputs which is an area of growing interest. In this paper we introduce a novel projection framework which is sensitive to both input features and outputs. Based on the derived features prediction accuracy can be greatly improved. We validate our approach in two applications. The first is to model users' preferences on a set of paintings. The second application is concerned with image categorization where each image may belong to multiple categories. The proposed algorithm produces very encouraging results in both settings.
We introduce mixture trees, a tree-based data-structure for modeling joint probability densities using a greedy hierarchical density estimation scheme. We show that the mixture tree models data efficiently at multiple...
详细信息
We introduce mixture trees, a tree-based data-structure for modeling joint probability densities using a greedy hierarchical density estimation scheme. We show that the mixture tree models data efficiently at multiple resolutions, and present fast conditional sampling as one of many possible applications. In particular, the development of this data-structure was spurred by a multi-target tracking application, where memory-based motion modeling calls for fast conditional sampling from large empirical densities. However, it is also suited to applications such as texture synthesis, where conditional densities play a central role. Results are presented for both these applications.
The virtual white cane is a range sensing device based on active triangulation, that can measure distances at a rate of 15 measurements/second. A blind person can use this device for sensing the environment, pointing ...
详细信息
The virtual white cane is a range sensing device based on active triangulation, that can measure distances at a rate of 15 measurements/second. A blind person can use this device for sensing the environment, pointing it as if it was a flashlight. Beside measuring distances, this device can detect surface discontinuities, such as the foot of a wall, a step, or a drop-off. This is obtained by analyzing the range data collected as the user swings the device around, tracking planar patches and finding discontinuities. In this paper we briefly describe the range sensing device, and present an online surface tracking algorithm, based on a Jump-Markov model. We show experimental results proving the robustness of the tracking system in real-world conditions.
A new face recognition algorithm is proposed which is robust to variations in pose, expression and illumination. The framework is similar to the ubiquitous block matching algorithm used for motion estimation in video ...
详细信息
A new face recognition algorithm is proposed which is robust to variations in pose, expression and illumination. The framework is similar to the ubiquitous block matching algorithm used for motion estimation in video compression but has been adapted to compensate for illumination differences. One of the key differentiators of this approach is that unlike traditional face recognition algorithms, the image data representing the face or features extracted from the facial data is not used for classification. Instead, the mapping between the probe and gallery images given by the block matching algorithm is used to classify the faces for recognition. Once the mappings are found for each gallery image, the degree of bijectivity that each mapping produces is used to derive the similarity scores for recognition.
We present a quantitative evaluation of SE-MinCut, a novel segmentation algorithm based on spectral embedding and minimum cut. We use human segmentations from the Berkeley segmentation database as ground truth and pro...
详细信息
We present a quantitative evaluation of SE-MinCut, a novel segmentation algorithm based on spectral embedding and minimum cut. We use human segmentations from the Berkeley segmentation database as ground truth and propose suitable measures to evaluate segmentation quality. With these measures we generate precision/recall curves for SE-MinCut and three of the leading segmentation algorithms: mean-shift, normalized Cuts, and the local variation algorithm. These curves characterize the performance of each algorithm over a range of input parameters. We compare the precision/recall curves for the four algorithms and show segmented images that support the conclusions obtained from the quantitative evaluation.
We present a comprehensive strategy for evaluating image retrieval algorithms. Because automated image retrieval is only meaningful in its service to people, performance characterization must be grounded in human eval...
详细信息
We present a comprehensive strategy for evaluating image retrieval algorithms. Because automated image retrieval is only meaningful in its service to people, performance characterization must be grounded in human evaluation. Thus we have collected a large data set of human evaluations of retrieval results, both for query by image example and query by text. The data is independent of any particular image retrieval algorithm and can be used to evaluate and compare many such algorithms without further data collection. The data and calibration software are available on-line. We develop and validate methods for generating sensible evaluation data, calibrating for disparate evaluators, mapping image retrieval system scores to the human evaluation results, and comparing retrieval systems. We demonstrate the process by providing grounded comparison results for several algorithms.
We present an evaluation of a probabilistic, part-based algorithm designed at The Ohio State University. Our algorithm is robust to errors of precision made by the (automatic) face and facial feature detector and to l...
详细信息
We present an evaluation of a probabilistic, part-based algorithm designed at The Ohio State University. Our algorithm is robust to errors of precision made by the (automatic) face and facial feature detector and to local image changes due to, for example, expression and illumination. Our contributions include the design of a novel face and facial feature detector and the justification of the use of the Mahalanobis cosine distance. We show results on experiments 1 and 4 in the FRGC (Version 2) test/database. Our algorithm includes a new face detector that is used to demonstrate the robustness of our algorithm to small errors of localization.
We present an automotive-grade, real-time, vision-based driver state monitor. Upon detecting and tracking the driver's facial features, the system analyzes eye-closures and head pose to infer his/her fatigue or di...
详细信息
We present an automotive-grade, real-time, vision-based driver state monitor. Upon detecting and tracking the driver's facial features, the system analyzes eye-closures and head pose to infer his/her fatigue or distraction. This information is used to warn the driver and to modulate the actions of other safety systems. The purpose of this monitor is to increase road safety by preventing drivers from falling asleep or from being overly distracted, and to improve the effectiveness of other safety systems.
暂无评论