In this paper, we propose a method based on the SVM algorithm to recognize dynamic hand gestures. The information of motion trajectory is captured by a leap motion in three-dimension space. A new methodology of featur...
详细信息
In this paper, we propose a method based on the SVM algorithm to recognize dynamic hand gestures. The information of motion trajectory is captured by a leap motion in three-dimension space. A new methodology of feature extracting is proposed to guarantee the length of samples being the same. The elements of feature vectors are ranged according to two different criteria: one is the amplitude of the variation of orientation angles, and the other criterion is the order of the appearance of features. Experimental results show that this method can classify the dynamic hand gestures effectively.
Real-time performance can be greatly improved, if the early recognition is implemented. In this paper, a dynamic hand gesture early recognitionsystem is proposed. The system can recognize the gesture before it is com...
详细信息
ISBN:
(纸本)9781479973989
Real-time performance can be greatly improved, if the early recognition is implemented. In this paper, a dynamic hand gesture early recognitionsystem is proposed. The system can recognize the gesture before it is completed. Our method is based on the Hidden Semi-Markov Models. Three-dimensional information of the gesture trajectory collected by leapmotion is the main data we used. Experiments on the dataset which we established demonstrate the effectiveness of our method.
Recently, constructing a good graph to represent data structures is widely used in machine learning based applications. Some existing trackers have adopted graph construction based classifiers for tracking. However, t...
详细信息
Recently, constructing a good graph to represent data structures is widely used in machine learning based applications. Some existing trackers have adopted graph construction based classifiers for tracking. However, their graph structures are not effective to characterize the inter-class separability and multi-model sample distribution, both of which are very important to successful tracking. In this paper, we propose to use a new graph structure to improve tracking performance without the assistance of learning object subspace generatively as previous work did. Meanwhile, considering the test samples deviate from the distribution of the training samples in tracking applications, we formulate the discriminative learning process, to avoid over fitting, in a semi-supervised fashion as L1-graph based regularizer. In addition, a non-linear variant is extended to adapt to multi-modal sample distribution. Experimental results demonstrate the superior properties of the proposed tracker.
Real-time 3D sensing plays a critical role in robotic navigation, video surveillance and human-computer interaction, etc. When computing 3D structures of dynamic scenes from stereo sequences, spatiotemporal stereo and...
详细信息
Reference-based image classification approach introduces a reference-set for both image representation and dictionary learning. It significantly reduces the dimensionality of represented images and shows outstanding p...
详细信息
ISBN:
(纸本)9781479923427
Reference-based image classification approach introduces a reference-set for both image representation and dictionary learning. It significantly reduces the dimensionality of represented images and shows outstanding performance even with randomly selected reference images and simple distance measure. In this paper, we improve upon existing work with two major contributions. First, we show that a more representative reference-set contributes to better classification accuracy. To this end, we carefully adapt the K-means clustering algorithm in the feature space to select a distinguished reference-set. Second, in the image classification process, we propose to represent each image by measuring its betweenness centrality in a social network composed of the representative reference-set in each class, leading to a more coherent distance measure that considers the overall connectivity between the probe image and the reference-set. Extensive experiment results demonstrate that our proposed scheme achieves better performance than existing methods.
In this paper, we present a novel audio fingerprinting method based on N-grams, which can quickly identify a segment of audio even when the audio signals are seriously distorted. We make use of N peaks in spectrum to ...
详细信息
In this paper, we present a novel audio fingerprinting method based on N-grams, which can quickly identify a segment of audio even when the audio signals are seriously distorted. We make use of N peaks in spectrum to form the audio fingerprint, which accelerates the retrieval speed greatly. We take advantage of the initial robust peaks to calculate the similarity between candidates and the input audio, which improves the retrieval accuracy significantly. The effectiveness of the N-gram method was evaluated on a music database of 10,000 songs. Experimental results show that the proposed approach outperforms two state-of-the-art algorithms (Shazam and Philips Robust Hash) in both effectiveness (in terms of retrieval accuracy) and efficiency (in terms of average retrieval time).
In this paper, a novel sparse feature representation method for object tracking is proposed. The method is on the observation that a tracked object can be dynamically and compactly represented by a few features (spars...
详细信息
In this paper we propose a methodology of multilayer filters based on tempo variation for realizing a query by humming (QBH) system. Firstly the original query clip is used to search for the candidate songs. If the re...
详细信息
ISBN:
(纸本)9781467322164
In this paper we propose a methodology of multilayer filters based on tempo variation for realizing a query by humming (QBH) system. Firstly the original query clip is used to search for the candidate songs. If the results are unreliable, the clip is linearly scaled twice for more candidates. If the results are still unreliable, the clip is scaled more times for retrieval. To sort all the candidates, a new matching algorithm called key transposition recursive alignment (KTRA) is presented, which improves the retrieval accuracy. Experimental results on the 2010 MIREX QBH query corpus show that the proposed method can achieve a relative improvement of 20.9% as well as an acceleration factor of 2.09 simultaneously compared to a state-of-the-art method.
Micro-blog is a new information sharing tool which is more and more popular these days. In very short time there are a large number of messages posted and it is difficult for people to grasp main topics that most peop...
详细信息
Micro-blog is a new information sharing tool which is more and more popular these days. In very short time there are a large number of messages posted and it is difficult for people to grasp main topics that most people are talking about. In this paper, we introduce a new text modeling method, which is called WAF, and propose a feature selection method for topic clustering based on it. In the experiments this method shows promising results, and the subsequent clustering based on it works effectively as well.
This paper proposes a spoken lyric search system, which can help users to find the wanted music based on their input spoken lyric phrases. Using spoken lyric phrases to query a music retrieval system is convenient for...
详细信息
This paper proposes a spoken lyric search system, which can help users to find the wanted music based on their input spoken lyric phrases. Using spoken lyric phrases to query a music retrieval system is convenient for users but challenging for developers. In our proposed system, the spoken query will be converted into text using large vocabulary continuous speech recognition (LVCSR) technique at first, and then a list of candidate lyrics and their corresponding songs will be presented by the text retrieval techniques. To guarantee the performance of the system, we propose a novel approach to improving the current technology. Specifically, a string matching method, which is based on phoneme confusion matrix, is applied to search the most similar lyrics in acoustic respect for the multiple sentence hypotheses of the input voice;and a two-level search strategy is adopted to shorten the time consuming. Experimental results show that the proposed system achieved top-1 accuracy as high as 93.98% when the search is performed within 2,000 songs, and 83.20%, a moderately satisfying accuracy, was obtained when the targeted number of songs increased to 10,000.
暂无评论