This work analyzes the problem of homography estimation for robust target matching in the context of real-time mobile vision. We present a device-friendly implementation of the Gaussian Elimination algorithm and show ...
详细信息
ISBN:
(纸本)9781479943098
This work analyzes the problem of homography estimation for robust target matching in the context of real-time mobile vision. We present a device-friendly implementation of the Gaussian Elimination algorithm and show that our optimized approach can significantly improve the homography estimation step in a hypothesize-and-verify scheme. Experiments are performed on image sequences in which both speed and accuracy are evaluated and compared with conventional homography estimation schemes.
During the performance optimization of a computervision system, developers frequently run into platform-level inefficiencies and bottlenecks that can not be addressed by traditional methods. OpenVX is designed to add...
详细信息
ISBN:
(纸本)9781479943098
During the performance optimization of a computervision system, developers frequently run into platform-level inefficiencies and bottlenecks that can not be addressed by traditional methods. OpenVX is designed to address such system-level issues by means of a graph-based computation model. This approach differs from the traditional acceleration of one-off functions, and exposes optimization possibilities that might not be available or obvious with traditional computervision libraries such as OpenCV.
Recent interest in developing online computervision algorithms is spurred in part by a growth of applications capable of generating large volumes of images and videos. These applications are rich sources of images an...
详细信息
ISBN:
(纸本)9781479943098
Recent interest in developing online computervision algorithms is spurred in part by a growth of applications capable of generating large volumes of images and videos. These applications are rich sources of images and video streams. Online vision algorithms for managing, processing and analyzing these streams need to rely upon streaming concepts, such as pipelines, to ensure timely and incremental processing of data. This paper is a first attempt at defining a formal stream algebra that provides a mathematical description of vision pipelines and describes the distributed manipulation of image and video streams. We also show how our algebra can effectively describe the vision pipelines of two state of the art techniques.
This paper describes a novel methodology for automated recognition of high-level activities. A key aspect of our framework relies on the concept of co-occurring visual words for describing interactions between several...
详细信息
In the past decade, there has been a growing need for machine learning and computervision components (segmentation, classification) in the hyperspectral imaging domain. Due to the complexity and size of hyperspectral...
详细信息
ISBN:
(纸本)9781479943098
In the past decade, there has been a growing need for machine learning and computervision components (segmentation, classification) in the hyperspectral imaging domain. Due to the complexity and size of hyperspectral imagery and the enormous number of wavelength channels, the need for combining compact representations with image segmentation and superpixel estimation has emerged in this area. Here, we present an approach to superpixel estimation in hyperspectral images by adapting the well known UCM approach to hyperspectral volumes. This approach benefits from the channel information at each pixel of the hyperspectral image while obtaining a compact representation of the hyperspectral volume using principal component analysis. Our experimental evaluation demonstrates that the additional information of spectral channels will substantially improve superpixel estimation from a single "monochromatic" channel. Furthermore, superpixel estimation performed on the compact hyperspectral representation outperforms the same when executed on the entire volume.
Small aerial vehicles, like quadrotor, have a high potential to be helpful tools in first response scenarios like earthquakes, landslides and fires. But even simple tasks like holding position and altitude can be chal...
详细信息
ISBN:
(纸本)9781479943098
Small aerial vehicles, like quadrotor, have a high potential to be helpful tools in first response scenarios like earthquakes, landslides and fires. But even simple tasks like holding position and altitude can be challenging to accomplish by a human operator and even more challenging autonomously. When outdoors, using GPS and pressure sensors is feasible, but indoors or in GPS denied environments it is not. Until now, for indoor flight scenarios either a lot of energy consuming sensors and hardware or a perfectly defined surrounding is required. In this approach, the viability of an onboard FPGA based indoor flight navigation system with a pan-tilt camera mount and a single VGA camera is tested. It can be used to either support an operator performing a hold position and altitude task, or act completely autonomously to achieve this task.
Automotive systems provide a unique opportunity for mobile vision technologies to improve road safety by understanding and monitoring the driver. In this work, we propose a real-time framework for early detection of d...
详细信息
ISBN:
(纸本)9781479943098
Automotive systems provide a unique opportunity for mobile vision technologies to improve road safety by understanding and monitoring the driver. In this work, we propose a real-time framework for early detection of driver maneuvers. The implications of this study would allow for better behavior prediction, and therefore the development of more efficient advanced driver assistance and warning systems. Cues are extracted from an array of sensors observing the driver (head, hand, and foot), the environment (lane and surrounding vehicles), and the ego-vehicle state (speed, steering angle, etc.). Evaluation is performed on a real-world dataset with overtaking maneuvers, showing promising results. In order to gain better insight into the processes that characterize driver behavior, temporally discriminative cues are studied and visualized.
In this paper we present a novel approach to detect groups in ego-vision scenarios. People in the scene are tracked through the video sequence and their head pose and 3D location are estimated. Based on the concept of...
详细信息
ISBN:
(纸本)9781479943098
In this paper we present a novel approach to detect groups in ego-vision scenarios. People in the scene are tracked through the video sequence and their head pose and 3D location are estimated. Based on the concept of f-formation, we define with the orientation and distance an inherently social pairwise feature that describes the affinity of a pair of people in the scene. We apply a correlation clustering algorithm that merges pairs of people into socially related groups. Due to the very shifting nature of social interactions and the different meanings that orientations and distances can assume in different contexts, we learn the weight vector of the correlation clustering using Structural SVMs. We extensively test our approach on two publicly available datasets showing encouraging results when detecting groups from first-person camera views.
This paper presents a mobile application for real time facial expression recognition running on a smart phone with a camera. The proposed system uses a set of Support Vector Machines (SVMs) for classifying 6 basic emo...
详细信息
ISBN:
(纸本)9781479943098
This paper presents a mobile application for real time facial expression recognition running on a smart phone with a camera. The proposed system uses a set of Support Vector Machines (SVMs) for classifying 6 basic emotions and neutral expression along with checking mouth status. The facial expression features for emotion recognition are extracted by Active Shape Model (ASM) fitting landmarks on a face and then dynamic features are generated by the displacement between neutral and expression features. We show experimental results with 86% of accuracy with 10 folds cross validation in 309 video samples of the extended Cohn-Kanade (CK+) dataset. Using the same SVM models, the mobile app is running on Samsung Galaxy S3 with 2.4 fps. The accuracy of real-time mobile emotion recognition is about 72% for 6 posed basic emotions and neutral expression by 7 subjects who are not professional actors.
Change detection is one of the most important low-level tasks in video analytics. In 2012, we introduced the changedetection. net (CDnet) benchmark, a video dataset devoted to the evalaution of change and motion detec...
详细信息
ISBN:
(纸本)9781479943098
Change detection is one of the most important low-level tasks in video analytics. In 2012, we introduced the changedetection. net (CDnet) benchmark, a video dataset devoted to the evalaution of change and motion detection approaches. Here, we present the latest release of the CDnet dataset, which includes 22 additional videos (similar to 70,000 pixel-wise annotated frames) spanning 5 new categories that incorporate challenges encountered in many surveillance settings. We describe these categories in detail and provide an overview of the results of more than a dozen methods submitted to the ieee Change Detection Workshop 2014. We highlight strengths and weaknesses of these methods and identify remaining issues in change detection.
暂无评论