For detecting difficult objects, like hands, we present an algorithm that uses tokens and a grammar. Tokens are found by employing a new scale space edge detector that finds scale invariant features at object boundari...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
For detecting difficult objects, like hands, we present an algorithm that uses tokens and a grammar. Tokens are found by employing a new scale space edge detector that finds scale invariant features at object boundaries. We begin by constructing the scale space. Then we find edges at each scale and flatten the scale space to one edge image. To detect a hand we define a hand pattern grammar using curve tokens for finger tips and wedges, and line tokens. We identify a hand pattern by parsing these tokens using a graph based algorithm. We show and discuss the results of this algorithm on a database of hand images.
In this paper, we present efficient and simple image segmentations based on the solution of two separate Eikonal equations, each originating from a different region. Distance functions from the interior and exterior r...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
In this paper, we present efficient and simple image segmentations based on the solution of two separate Eikonal equations, each originating from a different region. Distance functions from the interior and exterior regions are computed, and final segmentation labels are determined by a competition criterion between the distance functions. We also consider applying a diffusion partial differential equation (PDE) based method to propagate information in a manner inspired by the information propagation feature of the Eikonal equation. Experimental results are presented in a particular medical image segmentation application, and demonstrate the proposed methods:
In this paper we investigate two discriminative classification approaches for frame-based speaker identification and verification, namely Support Vector Machine (SVM) and Sparse Kernel Logistic Regression (SKLR). SVMs...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
In this paper we investigate two discriminative classification approaches for frame-based speaker identification and verification, namely Support Vector Machine (SVM) and Sparse Kernel Logistic Regression (SKLR). SVMs have already shown good results in regression and classification in several fields of pattern recognition as well as in continuous speech recognition. While the non-probabilistic output of the SVM has to be translated into conditional probabilities, the SKLR produces the probabilities directly. In speaker identification and verification experiments both discriminative classification methods outperform the standard Gaussian Mixture Model (GMM) system on the POLYCOST database.
Local patterns in the form of single clusters are of interest in various areas of data mining. However, since the intention of cluster analysis is a global partition of a data set into clusters, it is not suitable to ...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
Local patterns in the form of single clusters are of interest in various areas of data mining. However, since the intention of cluster analysis is a global partition of a data set into clusters, it is not suitable to identify single clusters in a large data set where the majority of the data can not be assigned to meaningful clusters. This paper presents a new objective function-based approach to identify a single good cluster in a data set making use of techniques known from prototype-based, noise and fuzzy clustering. The proposed method can either be applied in order to identify single clusters or to carry out a standard cluster analysis by finding clusters step by step and determining the number of clusters automatically in this way.
Starting with a novel audio analysis and editing paradigm, a set of new and adaptive audio analysis and editing algorithms in the spectrogram are developed and integrated into a smart visual audio editing tool in a &q...
详细信息
In this paper, we reconstruct the corresponding 3D face model using only one 2D image. 3D feature points are obtained by optimally approximating 2D feature points set with defined similarity. Then, a shape model is re...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
In this paper, we reconstruct the corresponding 3D face model using only one 2D image. 3D feature points are obtained by optimally approximating 2D feature points set with defined similarity. Then, a shape model is reconstructed by the warp function algorithm. Finally, a realistic face model is created through texture mapping with registration method such as affine transform. Results show that the models we reconstruct are comparatively realistic, and they can be used for face recognition or computer animation. The computation speed is also satisfying.
Video segmentation is an important phase in video based traffic surveillance applications. The basic task of traffic video segmentation is to classify pixels in the current frame to road background or moving vehicles,...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
Video segmentation is an important phase in video based traffic surveillance applications. The basic task of traffic video segmentation is to classify pixels in the current frame to road background or moving vehicles, and casting shadows should be taken into account if exists. In this paper, a modified online EM procedure is proposed to construct Adaptive-K Gaussian Mixture Model (AKGMM) in which the dimension of the parameter space at each pixel can adaptively reflects the complexity of pattern at the pixel. A heuristic background components selection rule is developed to make pixel classification decision based on the proposed model. Our approach is demonstrated to be more adaptive, accurate and robust than some existing similar pixel modeling approaches through experimental results.
The Modified Quadratic Discriminant Function was first proposed by Kimura et al to improve the performance of Quadratic Discriminant Function, which can be seen as a dot-product method by eigen-decompostion of the cov...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
The Modified Quadratic Discriminant Function was first proposed by Kimura et al to improve the performance of Quadratic Discriminant Function, which can be seen as a dot-product method by eigen-decompostion of the covariance matrix of each class. Therefore, it is possible to expand MQDF to high dimension space by kernel trick. This paper presents a new kernel-based method to pattern recognition, Kernel Modified Quadratic Discriminant Function (KMQDF), based on MQDF and kernel method. The proposed KMQDF is applied in facial expression recognition. JAFFE face database and the AR face database are used to test this algorithm. Experimental results show that the proposed KMQDF with appropriated parameters can outperform 1-NN, QDF, MQDF classifier.
Monitoring abnormal patterns in data streams is an important research area for many applications. In this paper we present a new approach MAPS(Monitoring Abnormal patterns over data Streams) to model and identify the ...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
Monitoring abnormal patterns in data streams is an important research area for many applications. In this paper we present a new approach MAPS(Monitoring Abnormal patterns over data Streams) to model and identify the abnormal patterns over the massive data streams. Compared with other data streams, ICU streaming data have their own features: pseudo-periodicity and polymorphism. MAPS first extracts patterns from the online arriving data streams and then normalizes them according to their pseudo-periodic semantics. Abnormal patterns will be detected if they are satisfied the predicates defined in the clinician-specifying normal patterns. At last, a real application demonstrates that MAPS is efficient and effective in several important aspects.
The highest fidelity representations of realistic real-world materials currently used comprise Bidirectional Texture Functions (BTF). The BTF is a six dimensional function depending on view and illumination directions...
详细信息
ISBN:
(数字)9783540375982
ISBN:
(纸本)354037597X
The highest fidelity representations of realistic real-world materials currently used comprise Bidirectional Texture Functions (BTF). The BTF is a six dimensional function depending on view and illumination directions as well as on planar texture coordinates. The huge size of such measurements, typically in the form of thousands of images covering all possible combinations of illumination and viewing angles, has prohibited their practical exploitation and obviously some compression and modelling method of these enormous BTF data spaces is inevitable. The proposed approach combines BTF spatial clustering with cluster index modelling by means of an efficient Markov random field model. This method allows to generate seamless cluster index of arbitrary size to cover large virtual 3D objects surfaces. The method represents original BTF data using a set of local spatially dependent Bidirectional Reflectance Distribution Function (BRDF) values which are combined according to synthesised cluster index and illumination / viewing directions. BTF data compression using this method is about 1 : 100 and their synthesis is very fast.
暂无评论