Accurate tracking of human movement has the potential to enable many kinds of human-computer interaction (HCI). vision-based methods promise tracking without encumberance by body-mounted apparatus but constitute a sig...
详细信息
Accurate tracking of human movement has the potential to enable many kinds of human-computer interaction (HCI). vision-based methods promise tracking without encumberance by body-mounted apparatus but constitute a significant research challenge. Its many applications both in HCI and other areas such as surveillance have made markerless human tracking a central problem in computervision. Despite this research focus, there are few established mechanisms for evaluating and comparing the performance of reported solutions. Furthermore, very few systems have attempted human-centered evaluation using methods from HCI research. This paper presents a review of evaluation methods employed in the literature of high degree of freedom human movement tracking systems. Detailed full-body tracking and hand tracking systems are included. Suitable evaluation methods employed in other computervision and HCI research are also considered.
In this work we propose a scheme integrating perceptual grouping into stereopsis to reduce the ambiguity of those early processes. We propose a simple perceptual grouping algorithm that - in addition to the geometric ...
详细信息
In this work we propose a scheme integrating perceptual grouping into stereopsis to reduce the ambiguity of those early processes. We propose a simple perceptual grouping algorithm that - in addition to the geometric information makes use of a novel multi-modal affinity measure between local primitives. We then use this group information to 1) disambiguate the stereopsis by enforcing that stereo matches preserve groups; and 2) correct the reconstruction error due to the image pixel sampling using a linear interpolation over the groups. We show quantitative and qualitative demonstrations of those processes on a variety of sequences.
AdaBoost has received considerable attention in the vision and multimedia research community in recent years. It is originally designed for two-class classification problems. To handle multiple classes, many AdaBoost ...
详细信息
AdaBoost has received considerable attention in the vision and multimedia research community in recent years. It is originally designed for two-class classification problems. To handle multiple classes, many AdaBoost extensions have been developed primarily based on various schemes for reducing multiclass classification to multiple two-class problems. From a statistical prospective, AdaBoost can be viewed as a forward stepwise additive model using an exponential loss function. In this paper, we derive a generalized form of AdaBoost for multiclass classification based on a multiclass exponential loss function. To prove its effectiveness, we benchmarked a number of multimedia problems of different nature. Experimental results show that the new boosting algorithm outperforms other multiclass alternatives. In addition, the generalized boosting algorithm can be used to either boost a multiclass classifier, or build a multiclass classifier from a binary one.
Semantic image segmentation aims to partition an image into separate regions, which ideally corresponds to different real-world objects. Many segmentation algorithms have been proposed, exploiting a wide variety of im...
详细信息
Semantic image segmentation aims to partition an image into separate regions, which ideally corresponds to different real-world objects. Many segmentation algorithms have been proposed, exploiting a wide variety of image features and characteristics. It has been shown through empirical studies that segmentation methods that assume a good segmentation partitions an image into different homogeneous regions are likely to fail in non-trivial situations, while methods based on perceptual organization generally generate more favorable segmentations. Yet no formal justification has been provided. In this paper, we propose an information measure for images, the perceptual information, based on human visual perception organization. Using perceptual information, we justify that homogeneity-based segmentation methods are inherently biased, and by incorporating knowledge, perceptual organization can overcome the bias and generate better segmentations.
This paper addresses the problem of human action recognition by introducing a sparse representation of image sequences as a collection of spatiotemporal events that are localized at points that are salient both in spa...
详细信息
This paper addresses the problem of human action recognition by introducing a sparse representation of image sequences as a collection of spatiotemporal events that are localized at points that are salient both in space and time. We detect the spatiotemporal salient points by measuring the variations in the information content of pixel neighborhoods not only in space but also in time. We derive a suitable distance measure between the representations, which is based on the Chamfer distance, and we optimize this measure with respect to a number of temporal and scaling parameters. In this way we achieve invariance against scaling, while at the same time, we eliminate the temporal differences between the representations. We use Relevance Vector Machines (RVM) in order to address the classification problem. We propose new kernels for use by the RVM, which are specifically tailored to the proposed spatiotemporal salient point representation. The basis of these kernels is the optimized Chamfer distance of the previous step. We present results on real image sequences from a small database depicting people performing 19 aerobic exercises.
Existing classification algorithms use a set of training examples to select classification features, which are then used for all future applications of the classifier. A major problem with this approach is the selecti...
详细信息
In this paper, we propose a method to track multiple deformable objects in sequences (with a static camera) in and beyond the visible spectrum by combining Gabor filtering and clustering. In a first step, a set of Gab...
详细信息
In this paper, we propose a method to track multiple deformable objects in sequences (with a static camera) in and beyond the visible spectrum by combining Gabor filtering and clustering. In a first step, a set of Gabor filter banks is used to filter the difference image between two consecutive frames. Then, the moving areas are sampled by randomly positioning particles in high magnitude area of the filtered image. Finally, these points are clustered to obtain one class for each moving object. The novelty in our method is in using cluster information from the previous frame to classify new particles in the current frame. This makes our method robust to occlusions, objects entering and leaving the field of view, objects stopping and starting, and moving objects getting really close to each other.
The core of the traditional RANSAC algorithm and its more recent efficient counterparts is the hypothesis evaluation stage, with the focus on finding the best, outlier free hypothesis. Motivated by a non-parametric en...
详细信息
The core of the traditional RANSAC algorithm and its more recent efficient counterparts is the hypothesis evaluation stage, with the focus on finding the best, outlier free hypothesis. Motivated by a non-parametric ensemble techniques, we demonstrate that it proves advantageous to use the entire set of hypotheses generated in the sampling stage. We show that by studying the residual distribution of each data point with respect to the entire set of hypotheses, the problem of inlier/ outlier identification can be formulated as a classification problem. We present extensive simulations of the approach, which in the presence of a large percentage (> 50%) of outliers, provides a repeatable and, an order of magnitude more efficient method compared to the currently existing techniques. Results on widebaseline matching and fundamental matrix estimation are presented.
A set of microarray images were acquired by a sequence of biological experiments which were scanned via a high resolution scanner. For each spot corresponding to a gene, the ratio of Cy3 and Cy5 fluorescent signal int...
详细信息
暂无评论