Recent work in monocular pedestrian detection is trying to improve the execution time while keeping the accuracy as high as possible.A popular and successful approach for monocular intensity pedestrian detection is ba...
Recent work in monocular pedestrian detection is trying to improve the execution time while keeping the accuracy as high as possible.A popular and successful approach for monocular intensity pedestrian detection is based on the approximation(instead of computation) of image features for multiple scales based on the features computed on set of predefined *** port this idea to the infrared *** contributions reside in the combination of four channel features,namely infrared,histogram of gradient orientations,normalized gradient magnitude and local binary patterns with the objective of detecting pedestrians for night vision applications dealing with far infrared *** scale feature computation is done by feature *** contribution is the study of different formulations for Local Binary patterns like uniform patterns and rotation invariant patterns and their effect on detection *** detection speed is also boosted by the aid of a fast morphological based region of interest *** vary the number of approximated scales per octave and study the impact on execution time and accuracy.A reasonable result hits a speed of 18 fps with a log average miss rate of 39%.
Most pedestrian detection approaches that achieve high accuracy and precision rate and that can be used for real-time applications are based on histograms of gradient orientations. Usually multiscale detection is atta...
详细信息
ISBN:
(纸本)9781479951192
Most pedestrian detection approaches that achieve high accuracy and precision rate and that can be used for real-time applications are based on histograms of gradient orientations. Usually multiscale detection is attained by resizing the image several times and by recomputing the image features or using multiple classifiers for different scales. In this paper we present a pedestrian detection approach that uses the same classifier for all pedestrian scales based on image features computed for a single scale. We go beyond the low level pixel-wise gradient orientation bins and use higher level visual words organized into Word Channels. Boosting is used to learn classification features from the integral Word Channels. The proposed approach is evaluated on multiple datasets and achieves outstanding results on the INRIA and Caltech-USA benchmarks. By using a GPU implementation we achieve a classification rate of over 10 million bounding boxes per second and a 16 FPS rate for multiscale detection in a 640×480 image.
In an automatic face recognition system, it still remains a challenge to improve the robustness to aging. In this paper, we present a novel approach to address age invariant face recognition, by formulating it as a gr...
详细信息
In an automatic face recognition system, it still remains a challenge to improve the robustness to aging. In this paper, we present a novel approach to address age invariant face recognition, by formulating it as a graph matching problem. In contrast to the majority of tasks in the literature that only make use of robust texture features, this method generates a graph from a set of fiducial landmarks of each face, which captures the texture clues that tend to be stable in a period as well as the common facial geometry configuration. The nodes of the graph denote the texture of a face area around a landmark, and the edges correspond to the geometry topology of the face. For each area, the age invariant texture information is extracted by a discriminative and compact feature encoded in the Local Gabor Binary pattern Histogram Sequence (LGBPHS) projected in an LDA subspace. An objective function is then designed to match graphs for the purpose of registration and identification. Experiments are carried out on the FG-NET Aging database, and the results achieved outperform the state of the art ones, which clearly demonstrate the effectiveness and robustness of the proposed method in face recognition across age variations.
In Digital Subtraction Angiography(DSA) image registration algorithm,the precision of the control points as well as their number and the distribution in image determine the accuracy of geometric correction and registr...
详细信息
In Digital Subtraction Angiography(DSA) image registration algorithm,the precision of the control points as well as their number and the distribution in image determine the accuracy of geometric correction and registrationControl points usually adopt the grid points;however,a more effective method is to extract control points adaptively according to the image featureIn this paper,a control point's selection algorithm of DSA images is proposed based on adaptive multi-Scale vascular enhancement,error diffusion and means shift algorithmsExperimental results show that the proposed algorithm can adaptively put the control points to blood vessels and other key image characteristics,and can optimize the number of control points according to practical needs,which will ensure the accuracy of DSA image registration.
This paper proposes a novel method for detecting the moving vehicles in dynamic urban traffic scenes using a stereo camera. Relying on the fact that a set of feature points on a rigid 3D scene object are staying in a ...
详细信息
This paper proposes a novel method for detecting the moving vehicles in dynamic urban traffic scenes using a stereo camera. Relying on the fact that a set of feature points on a rigid 3D scene object are staying in a rigid 3D configuration, we propose to compute the relative motion between the camera and a moving object with an algorithm that follows from the visual odometry based motion estimation methods. Subtracting the camera motion we obtain the absolute object motion. Additionally we create a compact representation of the scene using superpixels computed from intensity and depth information. A graph-like structure is built, having superpixels as nodes and indicating neighboring relationships between adjacent superpixels. Objects are segmented using a fast region growing algorithm that considers as seeds the features used to compute the object motion.
In this paper, we propose a novel bottom-up paradigm for detecting visual saliency. Regarding the boundary as potential background (boundary prior), we firstly transfer the input color image into a graph with addition...
详细信息
ISBN:
(纸本)9781479928941
In this paper, we propose a novel bottom-up paradigm for detecting visual saliency. Regarding the boundary as potential background (boundary prior), we firstly transfer the input color image into a graph with additional four virtual nodes. With a new type of edge called feature edge defined considering both color information and spatial distribution, geodesic saliency measure is used to obtain four saliency maps. Then a combination strategy of four maps is proposed, rendering a uniform saliency map to better suppress background and avoid over-suppression of salient object. Finally, we introduce a way of determining foci of attention based on maximal deviation from norm (MDN) to enhance the quality of saliency map. Experimental results on a benchmark dataset demonstrate the better performance of our proposed approach compared with several state-of-art methods.
Multi-class segmentation assigns a class label to each pixel in an image. It represents a significant task for the semantic understanding of images and has received plentiful attention over the last years. The current...
详细信息
ISBN:
(纸本)9781479936397
Multi-class segmentation assigns a class label to each pixel in an image. It represents a significant task for the semantic understanding of images and has received plentiful attention over the last years. The current state of art is dominated by conditional random field based approaches, defined over pixels or image segments. However, high accuracy segmentation comes at a high computational cost. The best performing methods can barely run at few frames per second and are far from real-time applications. Our goal is to bridge the gap between current state of the art segmentation approaches and real-time applications. In this paper we propose an efficient approach for individual pixel classification. Multiple local descriptors are computed densely and then quantized using visual codebooks. Joint boosting is used to classify each pixel based on the quantized local descriptors. We show that using careful design choices and GPU optimization we can achieve sate of the art segmentation results at over 50 FPS. We also propose a Conditional Random Field (CRF) model defined over superpixels that uses the proposed pixel classifier for the estimation of unary potentials. The CRF based multi-class segmentation can run at over 30 FPS. The proposed approach is validated on the MSRC21 and CamVid multi-class segmentation benchmarks, the former one consisting of urban traffic sequences.
The images captured in fog conditions have degraded contrast,that makes current imageprocessing applications sensitive and error *** propose in this paper an efficient single image enhancement algorithm suitable for ...
详细信息
The images captured in fog conditions have degraded contrast,that makes current imageprocessing applications sensitive and error *** propose in this paper an efficient single image enhancement algorithm suitable for daytime fog conditions and based on an original mathematical model,for computing the atmospheric veil,that takes into account the variation in fog density to the *** model is inspired by the functions that appear in partition of unity in the differential geometry *** observing images captured in fog conditions,usually the fog has a very low density in front of the camera and this density has a non-linear increase with the distance,such that objects are no longer visible at greater *** using our mathematical model we are able to obtain superior reconstructions of the original fog-free image,when comparing to traditional *** advantage of our method is the ability to adapt the model in accordance to the density of the fog.A quantitative and qualitative evaluation is performed on both synthetic and real camera *** evaluation proves that our mathematical model is more suitable for image enhancement in both homogeneous and heterogeneous fog *** algorithm is able to perform image enhancement in real time for both color and gray scale images.
This paper proposes a low energy-consuming cluster-based algorithm to protect data integrity and privacy named ILCCPDA,which can dynamically elect cluster head by LEACH clustering protocol and take the simple cluster ...
详细信息
ISBN:
(纸本)9781479941681
This paper proposes a low energy-consuming cluster-based algorithm to protect data integrity and privacy named ILCCPDA,which can dynamically elect cluster head by LEACH clustering protocol and take the simple cluster fusion approach to reduce the data transmission,thus reducing energy *** can detect data integrity by adding homomorphic message authentication code and take the random key distribution mechanism for data *** can solve the problem of the integrity,privacy and energy consumption in the wireless transmission of sensor data.
This paper addresses the problem of human activity recognition in still images. We propose a novel method that focuses on human-object interaction for feature representation of activities on Riemannian manifolds, and ...
详细信息
ISBN:
(纸本)9781450329255
This paper addresses the problem of human activity recognition in still images. We propose a novel method that focuses on human-object interaction for feature representation of activities on Riemannian manifolds, and exploits underlying Riemannian geometry for classification. The main contributions of the paper include: (a) represent human activity by appearance features from local patches centered at hands containing interacting objects, and by structural features formed from the detected human skeleton containing the head, torso axis and hands;(b) formulate SVM kernel function based on geodesics on Riemannian manifolds under the log-Euclidean metric;(c) apply multi-class SVM classifier on the manifold under the one-against-all strategy. Experiments were conducted on a dataset containing 17196 images in 12 classes of activities from 4 subjects. Test results, evaluations, and comparisons with state-of-the-art methods provide support to the effectiveness of the proposed scheme. Copyright 2014 ACM.
暂无评论