Robustness is an essential issue to computervision and patternrecognition in developing multimedia applications. In this work, we present a robust kernel approach that is highly robust against random noises and intr...
详细信息
In this paper, we focus on face recognition over image sets, where each set is represented by a linear subspace. Linear Discriminant Analysis (LDA) is adopted for discriminative learning. After investigating the relat...
详细信息
In this paper, we focus on face recognition over image sets, where each set is represented by a linear subspace. Linear Discriminant Analysis (LDA) is adopted for discriminative learning. After investigating the relation between regularization on Fisher Criterion and Maximum Margin Criterion, we present a unified framework for regularized LDA. With the framework, the ratio-form maximization of regularized Fisher LDA can be reduced to the difference-form optimization with an additional constraint. By incorporating the empirical loss as the regularization term, we introduce a generalized Square Loss based Regularized LDA (SLR-LDA) with suggestion on parameter setting. Our approach achieves superior performance to the state-of-the-art methods on face recognition. Its effectiveness is also evidently verified in general object and object category recognition experiments.
This paper addresses the challenging issue of target tracking and appearance learning in Forward Looking Infrared (FLIR) sequences. Tracking and appearance learning are formulated as a joint state estimation problem w...
详细信息
ISBN:
(纸本)9781424439942
This paper addresses the challenging issue of target tracking and appearance learning in Forward Looking Infrared (FLIR) sequences. Tracking and appearance learning are formulated as a joint state estimation problem with two parallel inference processes. Specifically, a new adaptive Kalman filter is proposed to learn histogram-based target appearances. A particle filter is used to estimate the target position and size, where the learned appearance plays an important role. Our appearance learning algorithm is compared against two existing methods and experiments on the AMCOM FLIR dataset validate its effectiveness.
In this paper, we explore ways to combine boundary information and region segmentation to estimate regions corresponding to foreground objects. Boundary information is used to generate an object likelihood image which...
详细信息
In this paper, we explore ways to combine boundary information and region segmentation to estimate regions corresponding to foreground objects. Boundary information is used to generate an object likelihood image which encodes the likelihood that each pixel belongs to a foreground object. This is done by combining evidence gathered from a large number of boundary fragments on training images by exploiting the relation between local boundary shape and relative location of the corresponding object region in the image. A region segmentation is used to generate a likely segmentation that is consistent with the boundary fragments out of a set of multiple segmentations. A mutual information criterion is used for selecting a segmentation from a set of multiple segmentations. Object likelihood and region segmentation are combined to yield the final proposed object region(s).
In this paper, we propose a multi-label image annotation framework by incorporating the content and context information of images. Specifically, images are annotated on regional scale. This annotation is independent o...
详细信息
In this paper, we propose a multi-label image annotation framework by incorporating the content and context information of images. Specifically, images are annotated on regional scale. This annotation is independent of the sizes of blocks. Confidences of content based block and image annotation are then obtained. On the other hand, spatial features by combining the block annotation confidence and the spatial context are proposed for main concepts, corresponding to the concepts been annotated, and the auxiliary concepts, corresponding to the concepts that have high co-occurrence with the main concepts in the images. This proposed spatial feature can incorporate the position of the concept and the spatial context between these concepts. Experiments on expanded Corel dataset categories demonstrate the effectiveness of the proposed method.
We propose an adaptive and effective multimodal peripheral-fovea sensor design for real-time targets tracking. This design is inspired by the biological vision systems for achieving real-time target detection and reco...
详细信息
ISBN:
(纸本)9781424439942
We propose an adaptive and effective multimodal peripheral-fovea sensor design for real-time targets tracking. This design is inspired by the biological vision systems for achieving real-time target detection and recognition with a hyperspectral/range fovea and panoramic peripheral view. A realistic scene simulation approach is used to evaluate our sensor design and the related data exploitation algorithms before a real sensor is made. The goal is to reduce development time and system cost while achieving optimal results through an iterative process that incorporates simulation, sensing, processing and evaluation. Important issues such as multimodal sensory component integration, region of interest extraction, target tracking, hyperspectral image analysis and target signature identification are discussed.
Summary form only given. Learning hierarchical representations of object structure in a bottom-up manner faces several difficult issues. First, we are dealing with a very large number of potential feature aggregations...
详细信息
Summary form only given. Learning hierarchical representations of object structure in a bottom-up manner faces several difficult issues. First, we are dealing with a very large number of potential feature aggregations. Furthermore, the set of features the algorithm learns at each layer directly influences the expressiveness of the compositional layers that work on top of them. However, we cannot ensure the usefulness of a particular local feature for object class representation based solely on the local statistics. This can only be done when more global, object-wise information is taken into account. We build on the hierarchical compositional approach (Fidler and Leonardis, 2007) that learns a hierarchy of contour compositions of increasing complexity and specificity. Each composition models spatial relations between its constituent parts.
This paper describes a system for automatically extracting meta-information on people from videos on the web. The system contains multiple modules which automatically track people, including both faces and bodies, and...
详细信息
ISBN:
(纸本)9781424439942
This paper describes a system for automatically extracting meta-information on people from videos on the web. The system contains multiple modules which automatically track people, including both faces and bodies, and clusters the people into distinct groups. We present new technology and significantly modify existing algorithms for body-detection, shot-detection and grouping, tracking, and track-clustering within our system. The system was designed to work effectivity on web content, and thus exhibits robust tracking and clustering behavior over a broad spectrum of professional and semi-professional video content. In order to quantify and evaluate our system we created a large ground-truth data-set of people within video. Finally, we provide actual video examples of our algorithm and find that the results are quite strong over a broad range of content.
There has been significant research into the development of visual feature detectors and descriptors that are robust to a number of image deformations. Some of these methods have emphasized the need to improve on comp...
详细信息
ISBN:
(纸本)9781424439942
There has been significant research into the development of visual feature detectors and descriptors that are robust to a number of image deformations. Some of these methods have emphasized the need to improve on computational speed and compact representations so that they can enable a range of real-time applications with reduced computational requirements. In this paper we present modified detectors and descriptors based on the recently introduced CenSurE [l], and show experimental results that aim to highlight the computational savings that can be made with limited reduction in performance. The developed methods are based on exploiting the concept of sparse sampling which may be of interest to a range of other existing approaches.
In this paper we study the application of hardware fingerprinting based on PRNU noise analysis of biometric fingerprint devices for sensor identification. For each fingerprint sensor, a noise reference pattern is gene...
详细信息
In this paper we study the application of hardware fingerprinting based on PRNU noise analysis of biometric fingerprint devices for sensor identification. For each fingerprint sensor, a noise reference pattern is generated and subsequently correlated with noise residuals extracted from test images. We experiment on three different databases including a total of 20 fingerprint sensors. Our results indicate that fingerprint sensor identification at unit level is attainable with promising prospects. Our analysis indicates that in many cases identification can be performed even when one only has access to a limited number of samples. For two of the three databases one can train on less than 8 images per device and establish sensor identification with little or no misclassification error. On the third database, high levels of identification performance can be achieved when training on similar amounts of images required for other types of sensor identification such as cameras or scanners.
暂无评论