Memory quantization is studied in detail. We propose a framework, power series vector quantization (PSVQ), for analysis and development of memory quantizers. Furthermore, we present an optimization algorithm for the f...
详细信息
Memory quantization is studied in detail. We propose a framework, power series vector quantization (PSVQ), for analysis and development of memory quantizers. Furthermore, we present an optimization algorithm for the framework, and new memory quantization methods are derived from the framework. The new methods are applied to spectrum quantization, and are shown to outperform previous methods.
This paper proposes an adaptive multi-stage Levinson-Durbin algorithm, which is more numerically robust than the conventional Levinson-Durbin algorithm for input signals with high spectral dynamics such as speech or a...
详细信息
This paper proposes an adaptive multi-stage Levinson-Durbin algorithm, which is more numerically robust than the conventional Levinson-Durbin algorithm for input signals with high spectral dynamics such as speech or audio signals. At the same time, the proposed algorithm preserves the computational efficiency of the Levinson-Durbin algorithm. It can be therefore suitable to be used in practical linear prediction coding systems as a replacement of the Levinson-Durbin algorithm for better coding performance.
In this paper we investigate the use of fricatives and stops modelling and synthesis techniques with a spectral envelope reconstruction combined with noise reduction postfilter (SERNR) in mixed voiced-unvoiced multiba...
详细信息
In this paper we investigate the use of fricatives and stops modelling and synthesis techniques with a spectral envelope reconstruction combined with noise reduction postfilter (SERNR) in mixed voiced-unvoiced multiband excitation coders. We perform a comparative analysis amongst a noise excitation approach operating at 1.75 kb/s and a fricatives and stops excitation technique operating at 0.4 kb/s. A novel SERNR postfiltering technique that significantly enhances the decoded speech is proposed and compared with the well-known adaptive spectral enhancement (ASE) filter.
This paper presents a new analysis-by-synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error....
详细信息
This paper presents a new analysis-by-synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient search in the root domain, model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For the ITU G.729 speech codec there is about 1dB of improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. By adding an extra optimization step, the technique can be incorporated into any parametric coder such as LPC, multi-pulse LPC and CLEP-type speech coders in a bit stream compatible manner.
It is necessary to provide priority-based data transfer on HFC networks. In this paper, we propose a scheme that can support priority-based bandwidth contention. The scheme is easy to realize. In addition, the overhea...
详细信息
ISBN:
(纸本)0780374908
It is necessary to provide priority-based data transfer on HFC networks. In this paper, we propose a scheme that can support priority-based bandwidth contention. The scheme is easy to realize. In addition, the overhead in the headend is extremely low. We use simulation to verify that the scheme we proposed can indeed make high-priority bandwidth requests have greater possibility to succeed than low-priority bandwidth requests.
An new approach for computing line spectrum pair (LSP) parameters is proposed. The LSP parameters are proved to be the roots of two 5-degree algebraic equations. The root-solving procedure of such an equation is divid...
详细信息
ISBN:
(纸本)0780374886
An new approach for computing line spectrum pair (LSP) parameters is proposed. The LSP parameters are proved to be the roots of two 5-degree algebraic equations. The root-solving procedure of such an equation is divided into three sections: 1) compute the inflexions and split (-2,2) into 5 subintervals; 2) search three roots in the shortest three subintervals respectively; 3) calculate the other two roots by formulas.
The paper investigates the use of neural networks in recognizing the phonation of the speech sounds. The proposed method classifies the Malay plosive sounds of adults and children based on phonation in a speaker-indep...
详细信息
The paper investigates the use of neural networks in recognizing the phonation of the speech sounds. The proposed method classifies the Malay plosive sounds of adults and children based on phonation in a speaker-independent manner. The proposed method achieves encouraging result with an average accuracy of 98%.
The present paper describes the implementation and experimental results of the DAQP02 system on a chip (SoC) for petroleum pipeline inspection. This integrated circuit is able to read and process information about phy...
详细信息
The present paper describes the implementation and experimental results of the DAQP02 system on a chip (SoC) for petroleum pipeline inspection. This integrated circuit is able to read and process information about physical phenomena inside pipelines. The DAQP02 has two multiplexed analog inputs, one 8-bit A/D converter, an 8-bit data bus and a 22-bit address bus. Due to its small dimensions and low power consumption features, it is an efficient in-line inspection tool. The integrated circuit was manufactured in CMOS 0.8 /spl mu/m double poly, double metal, n-well technology.
A new method for recognizing the start and the end of each word in a Chinese continuous sentence is discussed. We define a new recognition characteristic called periodic gradual change (PGC). A continuous speech sente...
详细信息
ISBN:
(纸本)0780374886
A new method for recognizing the start and the end of each word in a Chinese continuous sentence is discussed. We define a new recognition characteristic called periodic gradual change (PGC). A continuous speech sentence can be separated into many single words by a combination of the new method of PGC and other characteristics such as zero crossing rate (ZCR), instantaneous swing (E characteristic) and linear predictive coding (LPC) parameter. The recognition rate is improved for continuous speech segmentation by the new method.
Vocoders compress speech by estimating model parameters at a given transmission rate over an analysis window, assuming that speech is stationary within this window. In this paper, the limits of this assumption are exp...
详细信息
Vocoders compress speech by estimating model parameters at a given transmission rate over an analysis window, assuming that speech is stationary within this window. In this paper, the limits of this assumption are explored with regard to the spectral envelope parameters in the form of line spectral frequency (LSF) parameters. It is shown that all LSF parameters have considerable variations over time, regardless of LSF vector extraction and transmission rates. LSF track variations are investigated through oversampling and are shown to contain high frequency variations above the frequency corresponding to the LSF vector transmission rate. An anti-aliasing filter with cut-off frequency adequate for the chosen LSF vector transmission rate is proposed to alleviate possible spectral overlapping of the LSF parameter spectra. It is confirmed, through experiments, that the proposed method offers an advantage over the classic LSF extraction method with respect to quantisation shown by bit savings of typically 10 to 15%.
暂无评论