In this paper a coarse-to-fine system framework for analyzing the head gesture is presented. We discuss several important modules from computervision aspects, including the pose-invariant face detection, face trackin...
详细信息
In this paper a coarse-to-fine system framework for analyzing the head gesture is presented. We discuss several important modules from computervision aspects, including the pose-invariant face detection, face tracking, pose determination and high-resolution image reconstruction for eye pupils detection. Visual cues using intensity images obtained from in-car cameras are explored. A pose-invariant face detection algorithm is used to get the initial face area; afterwards face tracking and validation step is proposed to segment the face region for pose determination. The algorithm is tested on the drivers images under natural driving conditions. Experimental results show that the algorithm is robust to the head pose changes as well as the illumination changes. In this system framework, we propose that when coarse analysis utilizing the head pose alone is not sufficient for driver's behavior analysis, a finer analysis based on the eye gaze tracking is used, which requires images with sufficient resolution. A novel super-resolution reconstruction algorithm is proposed to help reveal more facial details, so as to facilitate the pupil detection. Experiment on the synthesis data shows the effectiveness of the super-resolution reconstruction algorithm
A regression model in the tensorPCA subspace is proposed in this paper for face super-resolution reconstruction. An approximate conditional probability model is used for the tensor subspace coefficients and maximum-li...
详细信息
ISBN:
(纸本)0769525210
A regression model in the tensorPCA subspace is proposed in this paper for face super-resolution reconstruction. An approximate conditional probability model is used for the tensor subspace coefficients and maximum-likelihood estimator gives a linear regression model. The approximation is corrected by adding non-linear component from a RBF-type regressor. Experiments on face images from FERET database validate the algorithm. Although each projection coefficient is estimated by a local estimator, tensorPCA subspace analysis is still a global descriptor, which makes the algorithm have certain ability to deal with partially occluded images
This paper proposes a concept of panoramic appearance map to perform reidentification of a people who leave the scene and reappear after some time. The map is a compact signature of appearance information of a person ...
详细信息
This paper proposes a concept of panoramic appearance map to perform reidentification of a people who leave the scene and reappear after some time. The map is a compact signature of appearance information of a person extracted from multiple cameras. The person is detected and tracked in multiple cameras and triangulation is used to accurately localize the person in 3-D. A virtual cylinder is formed around the person's location and mapped onto an image with the horizontal axis representing the azimuth angle and vertical axis representing the height. Each bin in the map image gets the appearance information from all the cameras which can observe it. The maps between different tracks are matched using a weighted metric. Experimental results showing person matching and reidentification show the effectiveness of the approach.
We propose a new approach to the hand pose estimation problem using only volume information. We describe a thermal and color image-based approach to generate silhouettes of the hand with which voxel images are produce...
详细信息
We propose a new approach to the hand pose estimation problem using only volume information. We describe a thermal and color image-based approach to generate silhouettes of the hand with which voxel images are produced using shape-from-silhouette. We assume a 16-component, 27 degree-of-freedom kinematically constrained Gaussian mixture model, and fit it over the voxel images. We constrain the otherwise freely-arranged components of this model by a system of equations describing joint characteristics parameterized by component centroids and orientations. We use the EM algorithm and steepest descent to estimate the parameters of the model such that they yield the maximum likelihood and kinematically correct pose estimates. We demonstrate the effectiveness of the proposed system on synthesized as well as captured voxel images of the hand, and show that given appropriate initial conditions, the iterative model parameter estimation procedure effectively converges to and tracks the "voxelized" articulated hand.
This paper presents an overview of a novel multimodal system being developed at UC San Diego for vehicle detection and traffic flow analysis. A distributed multimodal array (DiMMA) framework is presented for sensory d...
详细信息
This paper presents an overview of a novel multimodal system being developed at UC San Diego for vehicle detection and traffic flow analysis. A distributed multimodal array (DiMMA) framework is presented for sensory data acquisition, processing, analysis, fusion, and "active" control mechanisms needed to recognize objects, events, and activities which have multi-modal signatures. Current sensing modalities being researched include video, audio, seismic, magnetic, and passive infrared. Feature extraction and data fusion techniques are being investigated to improve robustness and study the advantages and disadvantages of each sensing modality. Preliminary results of this rapidly deployable system are discussed, along with possible future expansions, including laser range scanners, geophones, pneumatic road tubes, and traditional inductive loops
Driver assistance systems that monitor driver intent, warn drivers of lane departures, or assist in vehicle guidance are all being actively research and even put into commercial production. It is therefore important t...
详细信息
This paper presents an overview of investigations into the role of computervision technology in developing safer automobiles. We consider vision systems which can not only look out of the vehicle to detect and track ...
详细信息
In this paper we proposed to solve the eye detection and localization problem under a general statistical model based object detection framework. A binary tree representation is used to discover the objects' under...
详细信息
Subspace analysis has been widely used for head pose estimation. However, such techniques are usually sensitive to data alignment and background noise. In this paper a two-stage approach is proposed to address this is...
详细信息
This paper describes an approach for detecting objects in front of an automobile using wide field of view stereo with a pair of omni cameras. Several configurations are suggested for effective detection of vehicles an...
详细信息
暂无评论