In this paper, we focus on quantization-indexmodulation (QIM) steganography in low-bit-rate speech codec and contribute to improve its steganalysis resistance. A novel QIM steganography is proposed based on the replac...
详细信息
In this paper, we focus on quantization-indexmodulation (QIM) steganography in low-bit-rate speech codec and contribute to improve its steganalysis resistance. A novel QIM steganography is proposed based on the replacement of quantization index set in linear predictive coding (LPC). In this method, each quantization index set is seen as a point in quantization index space. Steganography is conducted in such space. Comparing with other methods, our algorithm significantly improves the embedding efficiency. One quantization index needs to be changed at most when three binary bits are hidden. The number of alterations introduced by the proposed approach is much lower than that of the current methods with the same embedding rate. Due to the fewer cover changes, the proposed steganography is less detectable. Moreover, a division strategy based on the genetic algorithm is proposed to reduce the additional distortion introduced by replacements. In our experiment, ITU-T G.723.1 is selected as the codec, and the experimental results show that the proposed approach outperforms the state-of-the-art LPC-based approach in low-bit-rate speech codec with respect to both stegano-graphic capacity and steganalysis resistance.
Speaker verification is of great importance, especially in the field of forensics and security. This paper aims at implementing such a system at the hardware level. This system extracts features from the fresh voice s...
详细信息
Speaker verification is of great importance, especially in the field of forensics and security. This paper aims at implementing such a system at the hardware level. This system extracts features from the fresh voice samples and verifies the speaker by comparing those with the ones being stored in the database. The features used here are the linear predictive coding (LPC) Coefficients which are obtained using the Levinson - Durbin (LD) algorithm. This paper proposes to implement Vector Quantization (VQ) to obtain the representative LPC vectors. A simple speaker verification system for a single person is efficiently implemented on FPGA.
The goal of this work is to present an audio signal classification system based on linear predictive coding and Random Forests. We consider the problem of multiclass classification with imbalanced datasets. The signal...
详细信息
ISBN:
(纸本)9781509064977
The goal of this work is to present an audio signal classification system based on linear predictive coding and Random Forests. We consider the problem of multiclass classification with imbalanced datasets. The signals under classification belong to the class of sounds from wildlife intruder detection applications: birds, gunshots, chainsaws, human voice and tractors. The proposed system achieves an overall correct classification rate of 99.25% There is no probability of false alarms in the case of birds or human voices. For the other three classes the probability is low, around 0.3% The false omission rate is also low: around 0.2% for birds and tractors, a little bit higher for chainsaws (0.4%), lower for gunshots (0.14%) and zero for human voices.
We are developing a method of time-frequency analysis on the basis of linear predictive coding (LPC). In contrast to standard LPC, which assumes an infinite number of samples, our method obtains instantaneous frequenc...
详细信息
ISBN:
(纸本)9781538646625
We are developing a method of time-frequency analysis on the basis of linear predictive coding (LPC). In contrast to standard LPC, which assumes an infinite number of samples, our method obtains instantaneous frequencies, instantaneous amplitude vary rates, and instantaneous amplitudes from a small number of samples. The time width for single local analysis can be much narrower than the period of oscillation of a given time series, enabling high resolution time-frequency analysis. We outline our method and use it to analyze electro-magnetic noise in comparison with short time Fourier transform to show its resolution.
In this work we compare different classification algorithms applied on different number of features (linear predictive coding coefficients) in order to detect audio signals from wildlife areas. The final goal is to fi...
详细信息
ISBN:
(纸本)9781538606742
In this work we compare different classification algorithms applied on different number of features (linear predictive coding coefficients) in order to detect audio signals from wildlife areas. The final goal is to find the appropriate number of linear predictive coding coefficients to provide the desired accuracy for a certain framework. The experimental results prove that the best classifier is Logistic Model Trees regardless the number of features, having a constant classification accuracy greater than 95%. In the case of a reduced number of features, both Random Forest and Lazy IBk have good results;the classification accuracy is greater than 98%.
Doppler radar is a cost-effective tool for moving target tracking, which can support a large range of civilian and military applications. A modified linear predictive coding (LPC) approach is proposed to increase the ...
详细信息
Doppler radar is a cost-effective tool for moving target tracking, which can support a large range of civilian and military applications. A modified linear predictive coding (LPC) approach is proposed to increase the target localization accuracy of the Doppler radar. Based on the time-frequency analysis of the received echo, the proposed approach first real-time estimates the noise statistical parameters and constructs an adaptive filter to intelligently suppress the noise interference. Then, a linearpredictive model is applied to extend the available data, which can help improve the resolution of the target localization result. Compared with the traditional LPC method, which empirically decides the extension data length, the proposed approach develops an error array to evaluate the prediction accuracy and thus, adjust the optimum extension data length intelligently. Finally, the prediction error array is superimposed with the predictor output to correct the prediction error. A series of experiments are conducted to illustrate the validity and performance of the proposed techniques. (C) 2016 Society of Photo-Optical Instrumentation Engineers (SPIE)
In this work we compare different classification algorithms applied on different number of features (linear predictive coding coefficients) in order to detect audio signals from wildlife areas. The final goal is to fi...
详细信息
In this work we compare different classification algorithms applied on different number of features (linear predictive coding coefficients) in order to detect audio signals from wildlife areas. The final goal is to find the appropriate number of linear predictive coding coefficients to provide the desired accuracy for a certain framework. The experimental results prove that the best classifier is Logistic Model Trees regardless the number of features, having a constant classification accuracy greater than 95%. In the case of a reduced number of features, both Random Forest and Lazy IBk have good results; the classification accuracy is greater than 98%.
Speech Therapy has become an efficient tool to bring back proper speech for patients suffering from various speech disorders. Patients are more benefitted when the speech therapy session is interactive where there is ...
详细信息
ISBN:
(纸本)9781467395458
Speech Therapy has become an efficient tool to bring back proper speech for patients suffering from various speech disorders. Patients are more benefitted when the speech therapy session is interactive where there is a visible change in the environment. Hence, the proposed system aims at manipulating devices when the user input is correct and also indicates if the user input is incorrect. Speech recognition has been done using the concept if linear predictive coding and Arduino Uno board is used for hardware interface.
In this paper, we propose a Packet Loss Concealment (PLC) method. Our method is based on Pitch Waveform Replication (PWR) and linear predictive coding (LPC). The estimated packet using LPC is better than that using PW...
详细信息
ISBN:
(纸本)9781479952304
In this paper, we propose a Packet Loss Concealment (PLC) method. Our method is based on Pitch Waveform Replication (PWR) and linear predictive coding (LPC). The estimated packet using LPC is better than that using PWR at a near boundary of the lost packet and the received one. On the other hand, the estimated packet using PWR is better than that using LPC at a distant boundary. Therefore, we combine the two estimated packets by considering the merits and demerits of PWR and LPC. Experimental results show that the proposed method provides better Perceptual Evaluation of Speech Quality (PESQ) scores than the conventional methods. Especially PESQ scores of the proposed method are remarkably excellent in the case of male voice.
Formants are able to define basic properties of speech efficiently by using very limited parameter sets;thus they have found important usage area at many applications of speech processing like coding, recognition, syn...
详细信息
ISBN:
(纸本)9781479948741
Formants are able to define basic properties of speech efficiently by using very limited parameter sets;thus they have found important usage area at many applications of speech processing like coding, recognition, synthesis and enhancement. Estimation of formants is harder than simply tracking the peaks of the spectrum;as the output of the vocal tract's spectral peaks are dependent on the shape of vocal tract, excitation and periodicity in a complex way. Because of this reason, a lot of past work was done on formant estimation and their positive and negative properties have been recognized. In this article we analyzed some of the popular formant estimation method's performances and compared them. Among these three compared methods, it's seen that the particle filtering based formant estimation method gives the most successful performance. Furthermore, it's recognized that linear predictive coding method has estimation difficulties with signals with low sampling frequencies and cepstrum method causes excess formants at peak picking.
暂无评论