Point matching is an important component of image registration. Recent years, Coherent Point Drift (CPD) method becomes a very popular point matching approach. CPD treats point matching as a probability estimation pro...
详细信息
Point matching is an important component of image registration. Recent years, Coherent Point Drift (CPD) method becomes a very popular point matching approach. CPD treats point matching as a probability estimation problem and speeds up the process of matching a lot. In this method, one set of points are thought to be sampled from a Gaussian Mixture Model (GMM), which is centered by the other set of points. However, CPD is sensitive to outliers and noises, especially when the noise ratio increased or the number of outliers gets much high. To deal with this problem, we introduce shape context into the step of searching for matching points and then improve the form of prior probabilities of GMM in this paper. The main idea of our method is that if the most points in a data set are likely to be matched to a particular centroid, this Gaussian component should be have a more influence to GMM. Therefore, we set prior probability of GMM with the similarity between GMM components and the data set. And the computation of similarity is based on shape context. The experiments on 2D and 3D images show that when noise ratio is low, our method performs as well as CPD does, but as the ratio increased, our method is more robust and satisfactory than CPD.
Aiming at adverse influence of the correlation between measurement and process noise for filtering precision, a new multiple model particle filtering algorithm with correlated measurement noise and process noise is pr...
详细信息
The particle filter offers a general numerical tool to approximate the posterior density function for the state in nonlinear and non-Gaussian filtering problems. While the particle filter is fairly easy to implement a...
详细信息
This paper proposes a three-layer model for full frame video *** is practical for real time full frame processing where the smoothness of intentional camera motion is *** undefined pixels in stabilized frame are fille...
详细信息
This paper proposes a three-layer model for full frame video *** is practical for real time full frame processing where the smoothness of intentional camera motion is *** undefined pixels in stabilized frame are filled by pixels in previous frames *** traditional methods that all neighboring frames are required to be stored and registered with current frame,the proposed algorithm only stores single updated mosaic image for video *** runs significantly faster than previous *** efficacy has been demonstrated by real experiments.
Part-based models have become the mainstream approach for visual object classification and detection. The key tools adopted by the most methods are interest point detectors and descriptors, shared codes for object par...
详细信息
ISBN:
(纸本)9781479952106
Part-based models have become the mainstream approach for visual object classification and detection. The key tools adopted by the most methods are interest point detectors and descriptors, shared codes for object parts (visual codebook) and discriminative learning using positive and negative class examples. Distinction of our method from the existing part-based methods for object detection is the use of sparse class-specific landmarks with semantic meaning. The landmarks are the additional distinguished information of object location in the proposed framework. Additionally, localising semantic and discriminative landmarks (object parts) is significant in other related applications of computer vision, such as facial expression recognition and pose/orientation estimation of objects. Therefore, we propose a model which deviates from the mainstream by the fact that the object parts' appearance and spatial variation, constellation, are explicitly modelled in a generative probabilistic manner. With using only positive examples our method can achieve object detection accuracy comparable to state-of-the-art discriminative method.
In the studying of fibers microstructure of brain white matter,many reconstruction methods have been proposed to interpret the diffusion-weighted signalThose methods can be categorized into modelbased and model-free m...
详细信息
In the studying of fibers microstructure of brain white matter,many reconstruction methods have been proposed to interpret the diffusion-weighted signalThose methods can be categorized into modelbased and model-free methodsIn this paper,the diffusion configuration of water molecules are discussed,and two questions are put forward to analyze the performance of the current algorithms about diffusion configuration.
A number of computer vision problems such as object detection, pose estimation, and face recognition utilise local parts to represent objects, which include the distinguished information of objects. In this work, we i...
详细信息
ISBN:
(纸本)9781479952106
A number of computer vision problems such as object detection, pose estimation, and face recognition utilise local parts to represent objects, which include the distinguished information of objects. In this work, we introduce a novel probabilistic framework which automatically learns class-specific object parts (landmarks) in generative-learning manner. Encouraged by the success in learning and detecting facial landmarks, we employ bio-inspired multi-resolution Gabor features in the proposed framework. Specifically, complex-valued Gabor filter responses are first transformed to landmark specific likelihoods using Gaussian Mixture Models (GMM), and then efficient response matrix shift operations provide detection over orientations and scales. We avoid the undesirable characteristic of generative learning, a large number of training instances, with the novel concept of randomised Gaussian mixture model. Extensive experiments with public benchmarking Caltech-101 and BioID datasets demonstrate the effectiveness of our proposed method for localising object landmarks.
To detect violence in a video, a common video description method is to apply local spatio-temporal description on the query video. Then, the low-level description is further summarized onto the high-level feature base...
详细信息
ISBN:
(纸本)9781479928941
To detect violence in a video, a common video description method is to apply local spatio-temporal description on the query video. Then, the low-level description is further summarized onto the high-level feature based on Bag-of-Words (BoW) model. However, traditional spatio-temporal descriptors are not discriminative enough. Moreover, BoW model roughly assigns each feature vector to only one visual word, therefore inevitably causing quantization error. To tackle the constrains, this paper employs Motion SIFT (MoSIFT) algorithm to extract the low-level description of a query video. To eliminate the feature noise, Kernel Density Estimation (KDE) is exploited for feature selection on the MoSIFT descriptor. In order to obtain the highly discriminative video feature, this paper adopts sparse coding scheme to further process the selected MoSIFTs. Encouraging experimental results are obtained based on two challenging datasets which record both crowded scenes and non-crowded scenes.
In this paper, we try to deal with the problem of shadow detection from static images and video sequences. In instead to considering individual regions separately, we use relative illumination conditions between segme...
详细信息
This paper explores text-independent writer identification by combining Bag of Features (BoF), contour-hinge and SIFT scales feature. The BoF method adopted differs from the common BoF approach for writer identificati...
详细信息
ISBN:
(纸本)9781479936519
This paper explores text-independent writer identification by combining Bag of Features (BoF), contour-hinge and SIFT scales feature. The BoF method adopted differs from the common BoF approach for writer identification in that it extracts SIFT descriptors and uses Locality-constrained Linear Coding to get feature vector of each document. The Locality-constrained Linear Coding (LLC) tries to reconstruct each feature through locality constraint and has much more discriminative power than the common used Vector Quantization (VQ). Contour-hinge feature can capture orientation and curvature of the ink trace. Modification is made to the original contour-hinge to improve the identification rate. Besides, we also use SIFT scale information and integrate these three kinds of features together. Experiments are conducted the challenging ICDAR2013 writer identification contest dataset and dataset for "ICFHR2012 Writer Identification Contest, Challenge 1: Latin Documents". The experiment results show that the proposed BoF approach outperforms the common ones that adopt VQ, and after the integration, our method achieves the best result on the entire ICDAR2013 and ICFHR2012 dataset under soft evaluation.
暂无评论