In this paper we introduce the NDS-Forest data structure, which can be used for the calculation and representation of Maximally Stable Extremal Regions in real-time video. In contrast to the standard MSER algorithm, t...
详细信息
ISBN:
(纸本)1904410146
In this paper we introduce the NDS-Forest data structure, which can be used for the calculation and representation of Maximally Stable Extremal Regions in real-time video. In contrast to the standard MSER algorithm, the NDS-Forest stores information about the extremal regions as they are formed, making it unnecessary to regrow the regions from seed pixels. Using the NDS-Forest structure, we describe a system that uses MSERs in an automobile for face registration, segmentation, and pose estimation of the driver.
This paper presents an efficient and robust paradigm for analysis and query of moving-object interactions in planar homography *** and vehicle activities/interactions are analyzed for situational awareness by using a ...
详细信息
ISBN:
(纸本)1595934960
This paper presents an efficient and robust paradigm for analysis and query of moving-object interactions in planar homography *** and vehicle activities/interactions are analyzed for situational awareness by using a multi-perspective *** homography constraints are exploited to extract view-invariant object features including footage area and velocity of objects on the ground plane. Spatio-temporal relationships between person-and vehicle-tracks are represented by a semantic event grammar. Semantic-level information of the situation is achieved with the anticipation of possible directions of near-future tracks using piecewise velocity history. An efficient query paradigm is proposed by histogram-based approximation of probability density functions of objects and by quad-tree indexing. Experimental data show promising *** framework can be applied to applications for enhanced situational awareness such as disaster prevention,human interactions in structured environments,and crowd movement analysis in wide-view areas. Copyright 2006 ACM.
This paper presents a method for registering multimodal imagery in short range surveillance situations when the differences in object depths preclude any global registration techniques. An analysis of multimodal regis...
详细信息
This paper proposes a concept of panoramic appearance map to perform reidentification of a people who leave the scene and reappear after some time. The map is a compact signature of appearance information of a person ...
详细信息
We propose a new approach to the hand pose estimation problem using only volume information. We describe a thermal and color image-based approach to generate silhouettes of the hand with which voxel images are produce...
详细信息
This paper presents an approach for the registration of multimodal imagery for pedestrian detection when the significant depth differences of objects in the scene precludes a global alignment assumption. Using maximiz...
详细信息
Driver assistance systems have both the potential to alert the driver to critical situations and distract or annoy the driver if the driver is already aware of the situation. As systems attempt to preemptively warn dr...
详细信息
Driver assistance systems have both the potential to alert the driver to critical situations and distract or annoy the driver if the driver is already aware of the situation. As systems attempt to preemptively warn drivers more and more in advance, this problem becomes exacerbated. We present a predictive braking assistance system that identifies not only the need for braking action, but also whether or not a braking action is being planned by the driver. Our system uses a Bayesian framework to determine the criticality of the situation by assessing (1) the probability that braking should be performed given observations of the vehicle and surround and (2) the probability that the driver intends to perform a braking action. We train and evaluate our system using over 22 hours of data collected from real driving scenarios with 28 different drivers
In this paper we present a framework for tracking nonrigid facial landmarks by combining various visual cues at multiple levels of detail. Using a probabilistic framework consisting of a hierarchy of particle filters,...
详细信息
In this paper we present a framework for tracking nonrigid facial landmarks by combining various visual cues at multiple levels of detail. Using a probabilistic framework consisting of a hierarchy of particle filters, we are able to track individual facial landmarks using multiple visual cues at the local level, as well as tracking results at more coarse level of detail. This allows for the fusion of global and local cues in an efficient and robust manner. Testing is performed by tracking and classifying facial action codes obtained from the Cohn-Kanade AU-Coded Facial Expression Database.
This paper presents an approach for the registration of multimodal imagery for pedestrian detection when the significant depth differences of objects in the scene preclude a global alignment assumption. Using maximiza...
详细信息
This paper presents an approach for the registration of multimodal imagery for pedestrian detection when the significant depth differences of objects in the scene preclude a global alignment assumption. Using maximization-of-mutual-information matching techniques and sliding correspondence windows over calibrated image pairs, we demonstrate successful registration of color and thermal data. We develop a robust method using disparity voting for determining the registration of each object in the scene and provide a statistically based measure for evaluating the match confidence. Testing shows successful registration in complex scenes with multiple people at different depths and levels of occlusion
This paper presents a method for registering multimodal imagery in short range surveillance situations when the differences in object depths preclude any global registration techniques. An analysis of multimodal regis...
详细信息
This paper presents a method for registering multimodal imagery in short range surveillance situations when the differences in object depths preclude any global registration techniques. An analysis of multimodal registration approaches gives insight into the limitations of global assumptions and motivates the developed algorithm. Using calibrated stereo imagery, we use maximization of mutual information in sliding correspondence windows that inform a disparity voting scheme to demonstrate successful registration of color and thermal images. Extensive testing of scenes with multiple people at different depths and levels of occlusion shows high rates of successful registration and gives a reliable framework for further processing and analysis of the multimodal imagery.
暂无评论