This paper proposes a novel method for human detection from static images based on pixel structure of input images. In training stage, all sample images consisting of human images and non-human images are used to cons...
详细信息
ISBN:
(纸本)9781424431618
This paper proposes a novel method for human detection from static images based on pixel structure of input images. In training stage, all sample images consisting of human images and non-human images are used to construct a Hausdorff distance map based on statistically analyzing the difference between the different blocks on each original image. A projection matrix will be created with Linear Discriminant Method (LDM) based on the Hausdorff distance map. This projection matrix will be used to transform multidimensional feature vectors (distance maps of testing images) into a feature in a one-dimensional domain. The decision will be made on the simple one- dimensional feature domain according to a precalculated threshold to distinguish human figures from non-human figures. In comparison with the common method based on Mahalanobis distance maps, the proposed method based on Hausdorff distance maps performs much better. Encouraging experimental results have been obtained using 800 human images and 800 non-human images. .
The high breakdown M-estimator (HBM) is introduced in this paper as an outstanding choice compared to modern high breakdown estimators for epipolar geometry estimation and motion segmentation problems. It is mathemati...
详细信息
ISBN:
(纸本)9781424431618;9780769530673
The high breakdown M-estimator (HBM) is introduced in this paper as an outstanding choice compared to modern high breakdown estimators for epipolar geometry estimation and motion segmentation problems. It is mathematically demonstrated that since HBM automatically guides its cost optimization using an iterative reweighted least square regression (instead of using random sampling like high breakdown RANSAC-based techniques), its computational cost is substantially cheaper than that of modern high breakdown estimators. In a number of experiments involving both synthetic and real image pairs, the performance of HBM, MSSE and pbM-estimator to solve fundamental matrix estimation problems are compared. The results verify that in terms of computational cost, HBM significantly outperforms the modern RANSAC-based high breakdown estimator and the pbM-estimator, while the estimation accuracies of HBM and its tolerances to high fractions of gross or pseudo outliers are similar to the other estimators. For real-time epipolar geometry estimation and multiple motion segmentation problems that involve multi-structured data segmentation (large fractions of outliers), application of the HBM-estimator for fast computation is highly recommended.
This paper describes a rarity-based visual attention model working on both still images and video sequences. Applications of this kind of models are numerous and we focus on a perceptual image representation which enh...
详细信息
This paper describes a rarity-based visual attention model working on both still images and video sequences. Applications of this kind of models are numerous and we focus on a perceptual image representation which enhances the perceptually important areas and uses lower resolution for perceptually less important regions. Our aim is to provide an approximation of human perception by visualizing its gradual discovery of the visual environment. Comparisons with classical methods for visual attention show that the proposed algorithm is well adapted to anisotropic filtering purposes. Moreover, it has a high ability to preserve perceptually important areas as defects or abnormalities from an important loss of information. High accuracy on low-contrast defects and scalable real-time video compression may be some practical applications of the proposed image representation. Copyright (C) 2007 Matei Mancas et al.
We propose an original method of video summarization based on camera motion. It consists in selecting frames according to the succession and the magnitude of camera motions. The method is based on rules to avoid tempo...
详细信息
We propose an original method of video summarization based on camera motion. It consists in selecting frames according to the succession and the magnitude of camera motions. The method is based on rules to avoid temporal redundancy between the selected frames. We also develop a new subjective method to evaluate the proposed summary and to compare different summaries more generally. Subjects were asked to watch a video and to create a summary manually. From the summaries of the different subjects, an "optimal" one is built automatically and is compared to the summaries obtained by different methods. Experimental results show the efficiency of our camera motion-based summary. Copyright (C) 2007 M. Guironnet et al.
The use of wavelets in the broad areas of source coding, communications, and networks is surveyed. Specifically, the impact of wavelets and wavelet theory in image coding, video coding, image interpolation, image-adap...
详细信息
The use of wavelets in the broad areas of source coding, communications, and networks is surveyed. Specifically, the impact of wavelets and wavelet theory in image coding, video coding, image interpolation, image-adaptive lifting transforms, multiple-description coding, and joint source-channel coding is overviewed. Recent contributions in these areas arising in subsequent papers of the present special issue are described. Copyright (C) 2007 J. E. Fowler and B. Pesquet-Popescu.
The fluctuation of quality in time is a problem that exists in motion-compensated-temporal-filtering (MCTF-) based video coding. The goal of this paper is to design a solution for overcoming the distortion fluctuation...
详细信息
The fluctuation of quality in time is a problem that exists in motion-compensated-temporal-filtering (MCTF-) based video coding. The goal of this paper is to design a solution for overcoming the distortion fluctuation challenges faced by wavelet-based video coders. We propose a new technique for determining the number of bits to be allocated to each temporal subband in order to minimize the fluctuation in the quality of the reconstructed video. Also, the wavelet filter properties are explored to design suitable scaling coefficients with the objective of smoothening the temporal PSNR. The biorthogonal 5/3 wavelet filter is considered in this paper and experimental results are presented for 2D+t and t+2D MCTF wavelet coders. Copyright (C) 2007 V. Seran and L. P. Kondi.
Object-based image coding is drawing a great attention for the many opportunities it offers to high-level applications. In terms of rate-distortion performance, however, its value is still uncertain, because the gains...
详细信息
Object-based image coding is drawing a great attention for the many opportunities it offers to high-level applications. In terms of rate-distortion performance, however, its value is still uncertain, because the gains provided by an accurate image segmentation are balanced by the inefficiency of coding objects of arbitrary shape, with losses that depend on both the coding scheme and the object geometry. This work aims at measuring rate-distortion costs and gains for a wavelet-based shape-adaptive encoder similar to the shape-adaptive texture coder adopted inMPEG-4. The analysis of the rate-distortion curves obtained in several experiments provides insight about what performance gains and losses can be expected in various operative conditions and shows the potential of such an approach for image coding. Copyright (C) 2007 Marco Cagnazzo et al.
We present a new multiple-description coding (MDC) method for scalable video, designed for transmission over error-prone networks. We employ a redundant motion-compensated scheme derived from the Haar multiresolution ...
详细信息
We present a new multiple-description coding (MDC) method for scalable video, designed for transmission over error-prone networks. We employ a redundant motion-compensated scheme derived from the Haar multiresolution analysis, in order to build temporally correlated descriptions in a t + 2D video coder. Our scheme presents a redundancy which decreases with the resolution level. This is achieved by additionally subsampling some of the wavelet temporal subbanbds. We present an equivalent four-band lifting implementation leading to simple central and side decoders as well as a packet-based reconstruction strategy in order to cope with random packet losses. Copyright (C) 2007 Christophe Tillier et al.
Object detection is a fundamental task in computervision. Deformable part based model has achieved great success in the past several years, demonstrating very promising performance. Many papers emerge on part based m...
详细信息
ISBN:
(纸本)9781457701221
Object detection is a fundamental task in computervision. Deformable part based model has achieved great success in the past several years, demonstrating very promising performance. Many papers emerge on part based model such as structure learning, learning more discriminative features. To help researchers better understand the existing visual features’ potential for part based object detection and promote the deep research into part based object representation, we propose an evaluation framework to compare various visual features’ performance for part based model. The evaluation is conducted on challenging PASCAL VOC2007 dataset which is widely recognized as a benchmark database. We adopt Average Precision (AP) score to measure each detector’s performance. Finally, the full evaluation results are present and discussed.
暂无评论