In this paper we present a new technique to improve the convergence and to reduce the ghosting artifacts based on constant statistics (CS) method. We propose to reduce ghosting artifacts and to speed up the convergenc...
详细信息
ISBN:
(纸本)9783642250842
In this paper we present a new technique to improve the convergence and to reduce the ghosting artifacts based on constant statistics (CS) method. We propose to reduce ghosting artifacts and to speed up the convergence by using enhanced constant statistics method with the motion threshold. The key advantage of the method is based in its capacity for estimate detectors parameters, and then compensate for fixed-pattern noise in a frame by frame basics. The ability of the method to compensate for nonuniformity and reducing ghosting artifacts is demonstrated by employing video sequences of simulated and several infrared video sequences obtained using two infrared cameras.
CBDA is an emerging field in computervision and patternrecognition. In recent technology camera are incorporated to several electronic equipments and are very interesting and thus playing a vital role by replacing s...
详细信息
ISBN:
(纸本)9780819489326
CBDA is an emerging field in computervision and patternrecognition. In recent technology camera are incorporated to several electronic equipments and are very interesting and thus playing a vital role by replacing scanner with hand held imaging devices like Digital Cameras, Mobile phones and gaming devices attached with these camera. The goal of the work is to remove graphics from the document which plays a vital role in recognition of characters from the mobile captured documents. In this paper we have proposed a novel method for separating or removal of graphics like logos, animations other than the text from the document and method to reduce noise and finally textual content skew is estimated and corrected using Hough Transform. The experimental results show the efficacy compared to the result of well known existing methods.
In the context of computervision, matching can be done with similarity measures. This paper presents the classification of these measures into five families. In addition, 18 measures based on robust statistics, previ...
详细信息
In the context of computervision, matching can be done with similarity measures. This paper presents the classification of these measures into five families. In addition, 18 measures based on robust statistics, previously proposed in order to deal with the problem of occlusions, are studied and compared to the state of the art. A new evaluation protocol and new analyses are proposed and the results highlight the most efficient measures, first, near occlusions, the smooth median powered deviation, and second, near discontinuities, a non-parametric transform-based measure, CENSUS. (C) 2011 Elsevier Ltd. All rights reserved.
There is an need for an autonomous system that can organise spherical objects and reposition these objects with six degrees of control. A mechatronic system is intended to partially fulfil the needs of this system by ...
详细信息
ISBN:
(纸本)9781601321916
There is an need for an autonomous system that can organise spherical objects and reposition these objects with six degrees of control. A mechatronic system is intended to partially fulfil the needs of this system by providing the means to position an object's three degrees of rotational freedom. In this paper we present the development of a vision motion tracking algorithm. This algorithm is then to be used for creation of motion models and a visual servo system. The vision algorithm is based around intra-frame blob tracking to determine the motion of a manipulated object. The common axis and magnitude of the motion is determined by tracking randomly placed dots on the object's surface. An intersection of planes approach is utilised to generate a best fit axis of rotation. The performance of the dot tracking and axis rotation algorithms are investigated with simulations by varying variables test against other methodologies.
The popular bag of words approach for action recognition is based on the classifying quantized local features density. This approach focuses excessively on the local features but discards all information about the int...
详细信息
Local space-time features and bag-of-feature (BOF) representation are often used for action recognition in previous approaches. For complicated human activities, however, the limitation of these approaches blows up be...
详细信息
ISBN:
(纸本)9781457701214;9781457701221
Local space-time features and bag-of-feature (BOF) representation are often used for action recognition in previous approaches. For complicated human activities, however, the limitation of these approaches blows up because of the local properties of features and the lack of context. This paper addresses the problem by exploiting the spatio-temporal context information between features. We first define a spatio-temporal context, which combines the scale invariant spatio-temporal neighberhood of local features with the spatio-temporal relationships between them. Then, we introduce a spatio-temporal context kernel (STCK), which not only takes into account the local properties of features but also considers their spatial and temporal context information. STCK has a promising generalization property and can be plugged into SVMs for activities recognition. The experimental results on challenging activity datasets show that, compared to context-free model, the spatio-temporal context kernel improves the recognition performance.
The authors propose to extract local texture features for image-based coin recognition in this study. A set of Gabor wavelets and local binary pattern (LBP) operator are employed to represent texture information. Conc...
详细信息
The authors propose to extract local texture features for image-based coin recognition in this study. A set of Gabor wavelets and local binary pattern (LBP) operator are employed to represent texture information. Concentric ring structure is used to divide the coin image into a number of small sections. Statistics of Gabor coefficients or LBP values within each section is then concatenated into a feature vector to represent the image. A circular shift operator is proposed to make Gabor features robust against rotation variance. Matching between two coin images is done via distance measurement. The nearest-neighbour classifier is used to classify a given test coin. The public MUSCLE database consisting of over 10 000 images is used to test our algorithms;results show that significant improvements over edge distance-based methods have been achieved. The authors have also analysed the performance of the system on recognising unregistered coins and the analysis suggests further improvement could be achieved if physical properties like diameter and thickness are included for feature representation.
In this paper, we present a method for pattern such as graphical symbol and shape recognition and retrieval. It is basically based on dynamic programming for matching the Radon features. The key characteristic of the ...
详细信息
ISBN:
(纸本)9783642236877
In this paper, we present a method for pattern such as graphical symbol and shape recognition and retrieval. It is basically based on dynamic programming for matching the Radon features. The key characteristic of the method is to use DTW algorithm to match corresponding pairs of histograms at every projecting angle. This allows to exploit the Radon property to include both boundary as internal structure of shapes, while avoiding compressing pattern representation into a single vector and thus miss information, thanks to the DTW. Experimental results show that the method is robust to distortion and degradation including affine transformations.
A procedure for adapting the isoline curvature formula to estimate an edge curvature in the images is presented. The curvature estimation is used in the framework of the scale-space algorithm, which makes it possible ...
详细信息
作者:
Wang, LiangWang, YizhouGao, WenPeking Univ
Natl Engn Lab Video Technol Beijing 100871 Peoples R China Peking Univ
Key Lab Machine Percept MoE Sch Elect Engn & Comp Sci Beijing 100871 Peoples R China Harbin Inst Technol
Sch Comp Sci & Technol Harbin 150006 Heilongjiang Peoples R China
We propose a layered-grammar model to represent actions. Using this model, an action is represented by a set of grammar rules. The bottom layer of an action instance's parse tree contains action primitives such as...
详细信息
We propose a layered-grammar model to represent actions. Using this model, an action is represented by a set of grammar rules. The bottom layer of an action instance's parse tree contains action primitives such as spatiotemporal (ST) interest points. At each layer above, we iteratively mine grammar rules and "super rules" that account for the high-order compositional feature structures. The grammar rules are categorized into three classes according to three different ST-relations of their action components, namely the strong relation, weak relation and stochastic relation. These ST-relations characterize different action styles (degree of stiffness), and they are pursued in terms of grammar rules for the purpose of action recognition. By adopting the Emerging pattern (EP) mining algorithm for relation pursuit, the learned production rules are statistically significant and discriminative. Using the learned rules, the parse tree of an action video is constructed by combining a bottom-up rule detection step and a top-down ambiguous rule pruning step. An action instance is recognized based on the discriminative configurations generated by the production rules of its parse tree. Experiments confirm that by incorporating the high-order feature statistics, the proposed method largely improves the recognition performance over the bag-of-words models.
暂无评论