This paper presents a new framework for a multi-stage multi-view approach for human interactions and activity analysis. The analysis is performed in a distributed vision system that synergistically integrate track- an...
详细信息
This paper presents an new framework for homographybased analysis of pedestrian-vehicle activity in crowded scenes. Planar homography constraint is exploited to extract view-invariant object features including footage...
详细信息
Accurate segmentation of foreground objects in video scenes is critical for assuring reliable performance of vision systems for object tracking and situational awareness in outdoor scenes. Most existing techniques for...
详细信息
Seafloor imagery is a rich source of data for the study of biological and geological processes. Among several applications, still images of the ocean floor can be used to build image composites referred to as photo-mo...
详细信息
ISBN:
(纸本)9780933957374
Seafloor imagery is a rich source of data for the study of biological and geological processes. Among several applications, still images of the ocean floor can be used to build image composites referred to as photo-mosaics. Photo-mosaics provide a wide-area visual representation of the benthos, and enable applications as diverse as geological surveys, mapping and detection of temporal changes in the morphology of biodiversity. We present an approach for creating globally aligned photo-mosaics using 3D position estimates provided by navigation sensors available in deep water surveys. Without image registration, such navigation data does not provide enough accuracy to produce useful composite images. Results from a challenging data set of the Lucky Strike vent field at the Mid Atlantic Ridge are reported.
This paper proposes a real-time navigation system for an AUV that takes advantage of the complementary performance of a sensor suite including a DVL, a compass, a depth sensor and altimeter sensors with a feature base...
详细信息
This paper proposes a real-time navigation system for an AUV that takes advantage of the complementary performance of a sensor suite including a DVL, a compass, a depth sensor and altimeter sensors with a feature based motion estimator using vision. To allow for real-time performance of the vision based motion estimator a simple but fast correlation algorithm is used for feature matching. The compass and the depth sensors are used to bound the drift of the heading and depth estimations respectively. The altimeter is required in order to translate the feature displacements measured from the images into the metric displacements of the robot. While the robot must rely on DVL navigation above a certain altitude where vision is useless, DVL measurements can be complemented with higher frequency accurate motion estimates from the vision system when navigating close to the seafloor.
The objective of this paper is classifying images by the object categories they contain, for example motorbikes or dolphins. There are three areas of novelty. First, we introduce a descriptor that represents local ima...
详细信息
ISBN:
(纸本)1595937331
The objective of this paper is classifying images by the object categories they contain, for example motorbikes or dolphins. There are three areas of novelty. First, we introduce a descriptor that represents local image shape and its spatial layout, together with a spatial pyramid kernel. These are designed so that the shape correspondence between two images can be measured by the distance between their descriptors using the kernel. Second, we generalize the spatial pyramid kernel, and learn its level weighting parameters (on a validation set). This significantly improves classification performance. Third, we show that shape and appearance kernels may be combined (again by learning parameters on a validation set). Results are reported for classification on Caltech-101 and retrieval on the TRECVID 2006 data sets. For Caltech-101 it is shown that the class specific optimization that we introduce exceeds the state of the art performance by more than 10%. Copyright 2007 ACM.
Recognizing driver awareness is an important prerequisite for the design of advanced automotive safety systems. Since visual attention is constrained to a driver's field of view, knowing where a driver is looking ...
详细信息
Recognizing driver awareness is an important prerequisite for the design of advanced automotive safety systems. Since visual attention is constrained to a driver's field of view, knowing where a driver is looking provides useful cues about his activity and awareness of the environment. This work presents an identity-and lighting-invariant system to estimate a driver's head pose. The system is fully autonomous and operates online in daytime and nighttime driving conditions, using a monocular video camera sensitive to visible and near-infrared light. We investigate the limitations of alternative systems when operated in a moving vehicle and compare our approach, which integrates Localized Gradient Orientation histograms with support vector machines for regression. We estimate the orientation of the driver's head in two degrees-of-freedom and evaluate the accuracy of our method in a vehicular testbed equipped with a cinematic motion capture system.
This paper discusses the issues and approaches involved in developing a mobile vehicle-mounted system to detect, classify, and log the surrounding vehicles in a database for efficient query-based retrieval. This syste...
详细信息
This paper discusses the issues and approaches involved in developing a mobile vehicle-mounted system to detect, classify, and log the surrounding vehicles in a database for efficient query-based retrieval. This system consists of three components: (1) vehicle sensing, detection, and tracking (2) feature extraction and classification (3) database storage and retrieval. Relevant research in each of these components is described in order to guide the development of such a system. Such a system has applications including traffic analysis from mobile probes, analyzing driver behavior based on surrounding vehicles, as well as surveillance from mobile platform.
We address the problem of sensor fusion for stereo and ultrasound depth measurements for map building for a robot operating in a cluttered environment. In such a situation it's difficult to make useful and realist...
详细信息
We address the problem of sensor fusion for stereo and ultrasound depth measurements for map building for a robot operating in a cluttered environment. In such a situation it's difficult to make useful and realistic assumptions about the sensor or environment statistics. Combinatorial Fusion Analysis is used to develop an approach to fusion with unknown sensor and environment statistics. A metric is proposed that shows when fusion from a set of fusion alternatives will produce a more accurate estimation of depth than either sonar or stereo alone and when not. The metric consists of two criteria: (a) the performance ratio PR(A,B) between sensors A and B, and (b) the diversity d(A,B) between A and B as captured by the rank-score function fA and fB. Experimental results are reported to illustrate that these two CFA criteria are viable predictors to distinguish between positive cases (the combined system performs better than or equal to the individual systems) and negative cases.
This paper presents a new framework for a multi-stage multi-view approach for human interactions and activity analysis. The analysis is performed in a distributed vision system that synergistically integrate track- an...
详细信息
ISBN:
(纸本)9781424431366;9780769527932
This paper presents a new framework for a multi-stage multi-view approach for human interactions and activity analysis. The analysis is performed in a distributed vision system that synergistically integrate track- and body-level representations across multiple cameras. Our system aims at versatile and easily-deployable system that does not require careful camera calibration. Main contributions of the paper are: (1) context-dependent camera handover for occlusion handling, (2) switching the multi-stage analysis between track- and body-level representations, and (3) a hypothesis-verification paradigm for top-down feedback exploiting spatio-temporal constraints inherent in human interaction. Experimental evaluation shows the efficacy of the proposed system for analyzing multi-person interactions. Current implementation uses two views, but extension to more views is straightforward.
暂无评论