A survey of video databases that can be used within a continuous sign language recognition scenario to measure the performance of head and hand tracking algorithms either w.r.t. a tracking error rate or w.r.t. a word ...
详细信息
Manifold learning is an important topic in patternrecognition and computervision. However, most manifold learning algorithms implicitly assume the data are aligned on a single manifold, which is too strict in actual...
详细信息
ISBN:
(纸本)9781467346498;9780769549057
Manifold learning is an important topic in patternrecognition and computervision. However, most manifold learning algorithms implicitly assume the data are aligned on a single manifold, which is too strict in actual applications. Isometric feature mapping (Isomap), as a promising manifold learning method, fails to work on data which distribute on clusters in a single manifold or manifolds. In this paper, we propose a new multi-manifold learning algorithm (M-Isomap). the algorithm first discovers the data manifolds and then reduces the dimensionality of the manifolds separately. Meanwhile, a skeleton representing the global structure of whole data set is built and kept in low-dimensional space. Secondly, by referring to the low-dimensional representation of the skeleton, the embeddings of the manifolds are relocated to a global coordinate system. Compared with previous methods, these algorithms can keep both of the intra and inter manifolds geodesics faithfully. the features and effectiveness of the proposed multi-manifold learning algorithms are demonstrated and compared through experiments.
the availability of dense motion information in computervision domain allows for the effective application of Lagrangian techniques that have their origin in fluid flow analysis and dynamical systems theory. A well e...
详细信息
ISBN:
(纸本)9780769547978
the availability of dense motion information in computervision domain allows for the effective application of Lagrangian techniques that have their origin in fluid flow analysis and dynamical systems theory. A well established technique that has been proven to be useful in image-based crowd analysis are Finite Time Lyapunov Exponents (FTLE). Based on this, we present a method to detect people carrying object and describe a methodology how to apply established flow field methods onto the problem of describing individuals. Further, we reinterpret Lagrangian features in relation to the underlying motion process and show their applicability towards the appearance modeling of pedestrians. this definition allows to increase performance of state-of-the-art methods and is shown to be robust under varying parameter settings and different optical flow extraction approaches.
Image re-ranking aims at improving the precision of keyword-based image retrieval, mainly by introducing visual features to re-rank. Many existing approaches require offline training for every keyword, which are unsui...
详细信息
We present a novel dataset for evaluation of object matching and recognition methods in surveillance scenarios. Dataset consists of more than 23,000 images, depicting 15 persons and nine vehicles. A ground truth data ...
详细信息
ISBN:
(纸本)9780769547978
We present a novel dataset for evaluation of object matching and recognition methods in surveillance scenarios. Dataset consists of more than 23,000 images, depicting 15 persons and nine vehicles. A ground truth data - the identity of each person or vehicle - is provided, along withthe coordinates of the bounding box in the full camera image. the dataset was acquired from 36 stationary camera views using a variety of surveillance cameras with resolutions ranging from standard VGA to three megapixel. 27 cameras observed the persons and vehicles in an outdoor environment, while the remaining nine observed the same persons indoors. the activity of persons was planned in advance;they drive the cars to the parking lot, exit the cars and walk around the building, through the main entrance, and up the stairs, towards the first floor of the building. the intended use of the dataset is performance evaluation of computervision methods that aim to (re) identify people and objects from many different viewpoints in different environments and under variable conditions. Due to variety of camera locations, vantage points and resolutions, the dataset provides means to adjust the difficulty of the identification task in a controlled and documented manner. An interface for easy use of dataset within Matlab is provided as well, and the data is complemented by baseline results using a basic color histogram-based descriptor. While the cropped images of persons and vehicles represent the primary data in our dataset, we also provide full-frame images and a set of tracklets for each object as a courtesy to the dataset users.
Improving human action recognition in videos is restricted by the inherent limitations of the visual data. In this paper, we take the depth information into consideration and construct a novel dataset of human daily a...
详细信息
ISBN:
(纸本)9783642338687;9783642338670
Improving human action recognition in videos is restricted by the inherent limitations of the visual data. In this paper, we take the depth information into consideration and construct a novel dataset of human daily actions. the proposed ACT4(2) dataset provides synchronized data from 4 views and 2 sources, aiming to facilitate the research of action analysis across multiple views and multiple sources. We also propose a new descriptor of depth information for action representation, which depicts the structural relations of spatiotemporal points within action volume using the distance information in depth data. In experimental validation, our descriptor obtains superior performance to the state-of-the-art action descriptors designed for color information, and more robust to viewpoint variations. the fusion of features from different sources is also discussed, and a simple but efficient method is presented to provide a baseline performance on the proposed dataset.
Despite the strengths and popularity of the log-cromaticity space (LCS), there is still a significant amount of concern regarding its narrow-band assumption (NBA). though not always necessary, this assumption is relat...
详细信息
Textile electrode is flexible, folding, washable and biocompatible with skin. Withthese advantages, the textile electrodes should be an ideal alternative for electromyogram (EMG) recordings in clinical applications. ...
详细信息
Image classification is a challenging problem in computervision. Its performance heavily depends on image features extracted and classifiers to be constructed. In this paper, we present a new support vector machine w...
详细信息
Sparse decomposition has been widely used in numerous applications, such as image processing, patternrecognition, remote sensing and computational biology. Despite plenty of theoretical developments have been propose...
详细信息
暂无评论