Speaker age recognition is an essential technique in automation speech recognition based on the speech wavform parameters in speaker's ***,there are several challenges in speaker age recognition,such as innate dif...
详细信息
Speaker age recognition is an essential technique in automation speech recognition based on the speech wavform parameters in speaker's ***,there are several challenges in speaker age recognition,such as innate differences in speaker's voice,subjective classification fuzzy,*** issue of speaker age based on isolated words is proposed in this paper,including support vector machine(SVM),picking up Mel frequency cepstrum coefficient(MFCC) characteristics of isolated words to distinguish the speaker's *** voicebox of this paper includes 4507 isolated word *** results show that the recognition rate based on isolated word speech can reach 72.93%.Through the experiment towards SVM classifier,we could find that the performance is improved without normalization for MFCC.
Multimedia data is usually represented with different low-level features, and different types of multimedia data, namely multimodal data, often coexist in many data sources. It is interesting and challenging to learn ...
详细信息
Most existing language modeling approaches are based on the term independence hypothesis. To go beyond this assumption, two main directions were investigated. The first one considers the use of the proximity features ...
详细信息
The efficiency and performance of the Twin Support Vector Machines(TWSVM) are better than the traditional support vector machines when it deals with the problems. However, it also has the problem of selecting kernel f...
详细信息
The efficiency and performance of the Twin Support Vector Machines(TWSVM) are better than the traditional support vector machines when it deals with the problems. However, it also has the problem of selecting kernel functions. Generally, TWSVM selects the Gaussian radial basis kernel function. Although it has a strong learning ability, its generalization ability is relatively weak. In a certain extent, this will limit the performance of TWSVM. In order to solve the problem of selecting kernel functions in TWSVM, we propose the twin support vector machines based on the mixed kernel function(MK-TWSVM) in this paper. To make full use of the learning ability of local kernel functions and the excellent generalization ability of global kernel functions, MK-TWSVM selects a global kernel function and a local kernel function to construct a mixed kernel function which has the better performance. The experimental results indicate that the mixed kernel function makes TWSVM have the good learning ability and generalization ability. So it improves the performance of TWSVM.
There is an important relationship between the stability of protein complex and hot region. Research has shown that in protein-protein interaction (PPI), residues are denser around the hot region. Therefore, this pape...
详细信息
ISBN:
(纸本)9781479956708
There is an important relationship between the stability of protein complex and hot region. Research has shown that in protein-protein interaction (PPI), residues are denser around the hot region. Therefore, this paper proposed an algorithm based on Gi statistics, regional division rule and regional amplification principle to form residue dense region (RDR);Then, according to the results of cascade classifier composed of Naive Bayes and Back-Propagation (BP) neural network classifier, non-hotspot residues in RDRs were removed;At length, we used binding free energy change value calculated from Robetta Server to modify predicted hot regions. The experimental results showd that the proposed method can effectively improve the prediction accuracy on hot regions.
According to the problem that conventional laminated paper counting algorithms have some unavoidable shortcomings such as high dependence on the quality of laminated paper and noise sensitivity, we propose a laminated...
详细信息
ISBN:
(纸本)9781479953004
According to the problem that conventional laminated paper counting algorithms have some unavoidable shortcomings such as high dependence on the quality of laminated paper and noise sensitivity, we propose a laminated paper counting algorithm based on compressive sensing (CS) and Hough transform (HT). In the proposed algorithm, the over-complete dictionary which is created by dispersing the Hough transform space of straight lines acts as the sparse matrix. Making use of high degree of sparse nature of the laminated paper image, we can obtain the accurate result through using CS theory. Experimental results on simulation images and laminated paper images have shown that our proposed algorithm can effectively restrain noise of the laminated paper image, and will get accurate experimental results with fewer CS measurements.
We present an online learned framework for multiple target tracking in a crowded scene. The tracking problem is formulated as a detection-based progressive association task. Firstly, reliable tracklets are generated b...
详细信息
ISBN:
(纸本)9781479957521
We present an online learned framework for multiple target tracking in a crowded scene. The tracking problem is formulated as a detection-based progressive association task. Firstly, reliable tracklets are generated by low level constraints among detection responses. Then longer tracklets associations are generated based on online learned Hough forest framework which effectively combines motion and appearance information for discrimination between two tracklets. In online learning scene, the association is formulated as a MAP problem and training examples are collected based on spatial-temporal constraints. In order to alleviate the drifting problem of online learning, Hungarian algorithm is employed to modify associated errors and update the training set. The experimental results show the effectiveness of our approach.
Network coding (NC) is one of the promising techniques to improve network throughput towards the Shannon limit. NC has been proven to be able to achieve the max-flow min-cut bound for single source multicast networks....
详细信息
ISBN:
(纸本)9781479949212
Network coding (NC) is one of the promising techniques to improve network throughput towards the Shannon limit. NC has been proven to be able to achieve the max-flow min-cut bound for single source multicast networks. For multi-information-source multicast networks, however, the question of how to use NC is still to be answered. In this paper we will focus on the simple case of multi-information-source independent network encoding. The region of admissible rate set of this case will be provided. With this rate region obtained, more advanced network encoding algorithms for multi-information source multicast networks can be further developed.
Human action recognition technology has been applied to intelligent security surveillance, content-based image and video retrieval and natural user interface. How to make use of the new type of data, 3D skeleton joint...
Human action recognition technology has been applied to intelligent security surveillance, content-based image and video retrieval and natural user interface. How to make use of the new type of data, 3D skeleton joint position extracted by 3D depth camera, has been a highly active research topic. A posture representation model is proposed, which is invariant to limb length, length ratio between body parts and body orientation. This model contains polar angle and azimuthal angle of each limb in the spherical coordinate system which is established by the features of body joints. Hidden Markov Model(HMM) is exploited for recognition. Skeleton sequences of different body orientation are collected as experimental data. Experimental results demonstrate the effectiveness of our approach.
OvTDM is a new transmission scheme that can obtain high spectral efficiency with very low threshold signal noise ratio (SNR) by using the inter symbol interference (ISI). Fano algorithm (FA) is researched in this pape...
详细信息
ISBN:
(纸本)9781510804166
OvTDM is a new transmission scheme that can obtain high spectral efficiency with very low threshold signal noise ratio (SNR) by using the inter symbol interference (ISI). Fano algorithm (FA) is researched in this paper to implement fast ML decoding at high spectral efficiency for OvTDM. Numerical results show that the complexity of OvTDM with FA has no relationship with the number of state which is lower than 15% of that of Viterbi algorithm and the bit SNR loss is under 2 dB. Moreover, OvTDM with FA can even outperform Shannon limit when spectral is above 6.7 bits/s/Hz.
暂无评论