We study the challenging problem of maneuvering object tracking with unknown dynamics, i.e., forces or torque. We investigate the underlying causes of object kinematics, and propose a generative model approach that en...
详细信息
This paper addresses human activity recognition based on a new feature descriptor. For a binary human silhouette, an extended radon transform, ℜ transform, is employed to represent low-level features. The advantage of...
详细信息
In this paper, we present a method for photometric self-calibration of a projector-camera system. In addition to the input transfer functions (commonly called gamma functions), we also reconstruct the spatial intensit...
详细信息
This paper presents a new constraint connecting the signals in multiple views of a surface. The constraint arises from a harmonic analysis of the geometry of the imaging process and it gives rise to a new technique fo...
详细信息
We introduce a method of understanding of four musical time patterns and three tempos that are generated by a human conductor of robot orchestra or an operator of computer-based music play system using the hand gestur...
详细信息
ISBN:
(纸本)9781424407835
We introduce a method of understanding of four musical time patterns and three tempos that are generated by a human conductor of robot orchestra or an operator of computer-based music play system using the hand gesture recognition. We use only a stereo vision camera with no extra special devices. We suggest a simple and reliable vision-based hand gesture recognition with two naive features. One is the motion-direction code which is a quantized code for motion directions. The other is the conducting feature point (CFP) where the point of sudden motion changes. The proposed hand gesture recognition system operates as follows: First, it extracts the human band region by segmenting the depth information generated by stereo matching of image sequences. Next, it follows the motion of the center of the gravity(COG) of the extracted hand region and generates the gesture features such as CFP and the direction-code. Finally, we obtain the current timing pattern of beat and tempo of the playing music by the proposed hand gesture recognition using either CFP tracking or motion histogram matching. The experimental results on the test data set show that the musical time pattern and tempo recognition rate is over 86.42% for the motion histogram matching, and 79.75% for the CFP tracking.
In this paper, a generic rule induction framework based on trajectory series analysis is proposed to learn the event rules. First the trajectories acquired by a tracking system are mapped into a set of primitive event...
详细信息
Video cameras are no ionger being used only in their traditional role of providing "Viewable pixels, but are rapidly becoming sources of intelligent information about the world. More recently 3D cameras are being...
详细信息
ISBN:
(纸本)1424411807
Video cameras are no ionger being used only in their traditional role of providing "Viewable pixels, but are rapidly becoming sources of intelligent information about the world. More recently 3D cameras are being developed to directly provide 3D measurements of objects and scenes. Appearance and geometry of objects and scenes, and the temporal dynamics of objects are the key information bearing sources for deriving visual intelligence. This talk will highlight sensor data analysis techniques for creating intelligent representations from 2D and 3D sensors. Intelligent sensor data analytics can be performed for mobile as well as widely distributed static sensor platforms. Applications ranging from 3D video manipulation, 3D situational awareness, wide area surveillance and tracking to video/3D object recognition and fingerprinting will be used to illustrate the work.
Parametric active contours have been used extensively in computervision for different tasks like segmentation and tracking. However, all parametric contours are known to suffer from the problem of frequent bunching a...
详细信息
In this study, we aim to determine if iris recognition accuracy might be improved by correcting for the refractive effects of the human eye when the optical axes of the eye and camera are misaligned. We undertake this...
详细信息
Informative Vector Machine (IVM) is an efficient fast sparse Gaussian processs (GP) method previously suggested for active learning. It greatly reduces the computational cost of GP classification and makes the GP lear...
详细信息
暂无评论