Speech compression is one area of digital signal processing that focuses on reducing the bit rate of the speech signal for transmission or storage without significant loss of quality. In recent years a new technique c...
详细信息
Speech compression is one area of digital signal processing that focuses on reducing the bit rate of the speech signal for transmission or storage without significant loss of quality. In recent years a new technique called wavelet transform has been proposed for signal analysis. It has been successfully used in image compression application. So far, less attention has been paid to the research in the speech compression using wavelet. This paper attempts to evaluate the wavelet compression technique on speech signals. Different wavelet filters were used to select the best filter suitable for speech signal in providing low bit rate and low computation complexity. Our implementation was evaluated based on PSNR, SNR, NRMSE and compression ratio tested on 8 kHz 8-bit speech signals. This algorithm was also compared to the following speech compression schemes: linear predictive coding (LPC) which reduces the transmitted data by factor of more than twelve, and global system mobile (GSM) which reduces the transmitted data by factor of five As a result from this study, wavelet speech compression gives higher SNR and better speech quality than the other techniques.
Various speech enhancement schemes are analyzed in terms of the conflicting real time requirements of computational delay, robustness and accuracy. A spectral subtraction scheme is found to be implementable in real ti...
详细信息
Various speech enhancement schemes are analyzed in terms of the conflicting real time requirements of computational delay, robustness and accuracy. A spectral subtraction scheme is found to be implementable in real time using the available digital processing board. The tasks such as the computation of the spectral contents of the speech plus noise and the noise, smoothing the estimates, and speech enhancement filtering are implemented in the frequency domain using FFT in view of the computational speed and robustness. A variant of a spectral subtraction scheme is implemented in real time on a DSP board and its performance is evaluated.
The paper presents an artificial vision system able to analyze the image of a car given by a camera, to locate the registration plate and to recognize its registration number. It describes practical problems encounter...
详细信息
The paper presents an artificial vision system able to analyze the image of a car given by a camera, to locate the registration plate and to recognize its registration number. It describes practical problems encountered in implementing this application and the proposed solutions. The system has been designed using a modular approach. Sub-modules can be upgraded and or substituted independently, thus making the system potentially suitable in a large variety of vision applications. Performances of the system, have been evaluated in real situation.
In this work, a new method for estimating the time-varying AR model of speech is presented. Here, the time-varying parameters are modeled as stationary processes. Both the time-varying parameters and their correspondi...
详细信息
In this work, a new method for estimating the time-varying AR model of speech is presented. Here, the time-varying parameters are modeled as stationary processes. Both the time-varying parameters and their corresponding stationary process are modeled through a common Gauss-Markov model whose state-vector can be estimated through the extended Kalman Filter (EKF) algorithm. The proposed algorithm is different from the earlier methods which use the EKF algorithm. Simulation studies are carried out for both voiced and unvoiced speech. It is shown that the proposed method has less mean-square prediction error than that obtained through the LPC method.
Here we consider the problem of providing near optimal performance for a large set of possible models. We adopt the LQR framework in the single-input single-output (SISO) setting, and prove that given a compact set of...
详细信息
Here we consider the problem of providing near optimal performance for a large set of possible models. We adopt the LQR framework in the single-input single-output (SISO) setting, and prove that given a compact set of controllable and observable plant models of a fixed order, we can construct a single linear periodic controller (LPC) which provides near optimal LQR performance.
Basically all conventional digital signal processing techniques can be warped by introducing a simple modification to the system. In this paper, the focus is in warped linear predictive coding techniques with applicat...
详细信息
Basically all conventional digital signal processing techniques can be warped by introducing a simple modification to the system. In this paper, the focus is in warped linear predictive coding techniques with application to speech and audio coding. The performance of warped LPC is compared with a conventional LPC in listening tests and in terms of technical measures. This is done at various sampling rates as a function of the order of the LPC model.
In this paper, a 6.7-kbps vector sum excited linear prediction (VSELP) coder with less computational complexity is presented. A very efficient VSELP codebook with nine basis vectors and a heuristic K-selection method ...
详细信息
In this paper, a 6.7-kbps vector sum excited linear prediction (VSELP) coder with less computational complexity is presented. A very efficient VSELP codebook with nine basis vectors and a heuristic K-selection method (to reduce the search space and complexity) is constructed to obtain the stochastic codebook vector. The nine basis vectors are obtained by optimizing a set of randomly generated basis vectors. During the optimization process, we have trained the basis vectors to give the system apriori knowledge of the characteristics of the input. The coder is implemented on a TMS320C541 digital signal processor. The performance is evaluated by testing the 6.7-kbps VSELP coder with different test speech data taken from different speakers. The quality of the coder is estimated by comparing the performance of the 6.7-kbps VSELP coder with an 8-kbps VSELP speech coder based on the IS-54 standards. (C) 2002 Elsevier Science B.V. All rights reserved.
The reliable communication of FS CELP 10 16 encoded speech over very noisy channels is investigated. Using second-order Markov chains it is shown that over one-quarter of the CELP bits in every frame of speech are red...
详细信息
The reliable communication of FS CELP 10 16 encoded speech over very noisy channels is investigated. Using second-order Markov chains it is shown that over one-quarter of the CELP bits in every frame of speech are redundant. An unequal error protection coding scheme, which exploits this residual redundancy, is proposed for sending the CELP parameters over Gaussian and Rayleigh fading channels. Simulations indicate substantial coding gains over conventional systems.
An efficient method to implement the perceptual posterfilter for the suppression of coding noise in CELP-coded speech is proposed. The method is based on approximating the response of an all-pole filter to the respons...
详细信息
An efficient method to implement the perceptual posterfilter for the suppression of coding noise in CELP-coded speech is proposed. The method is based on approximating the response of an all-pole filter to the response of the pole-zero form postfilter via cepstrum processing. This all-pole postfilter can then be implemented more efficiently than the pole-zero postfilter with less computation and filter memory.
暂无评论