We explore the performance of two dimensional (2-D) prediction based LSF quantization method for both wide-band and telephone-band (narrow-band) speech. The 2-D prediction based method exploits both the inter-frame an...
详细信息
We explore the performance of two dimensional (2-D) prediction based LSF quantization method for both wide-band and telephone-band (narrow-band) speech. The 2-D prediction based method exploits both the inter-frame and intra-frame correlations of LSF parameters. We show that a 4th order 2-D predictor provides optimum prediction gain as well as improved quantization performance at various choices of frame shift for both wide-band and telephone-band speech. Existing one dimensional (1-D) predictive method, exploiting only inter-frame correlation, results in poor performance at larger frame shifts; whereas proposed 2-D predictor provides lower spectral distortion as well as lower number of outliers compared to existing memory-based and memory-less methods.
linear Prediction with Low-frequency Emphasis (LPLE), an all-pole modeling technique which emphasizes the lower frequency range of the input signal, was described by Alku and Backstrom [1]. The method is based on firs...
详细信息
ISBN:
(纸本)9781424410286
linear Prediction with Low-frequency Emphasis (LPLE), an all-pole modeling technique which emphasizes the lower frequency range of the input signal, was described by Alku and Backstrom [1]. The method is based on first interpreting conventional linearpredictive (LP) analyses of successive prediction orders with parallel structures using the concept of symmetric linear prediction. The experiments presented in this work are aimed to show that the LPLE method is well-suited for those speech processing and enhancement applications, where low-order all-pole models with improved modeling of the lowest formants are needed. This is done by replacing the LP block with a LPLE block. Distortion measures using Itakura distance and noise cancellation using a Wiener "type" filter were explored, utilizing the LPLE coefficients instead of LP coefficients.
This paper presents a text independent speaker identification system using multi-band features with artificial neural network. linearpredictive cepstrum coefficients (LPCCs) computed from sub-band signals with higher...
详细信息
ISBN:
(纸本)9781424415502
This paper presents a text independent speaker identification system using multi-band features with artificial neural network. linearpredictive cepstrum coefficients (LPCCs) computed from sub-band signals with higher order statistics (HOS) are employed as the main features to represent the speaker characteristics. The multi-band representation of the speech signal is implemented by empirical mode decomposition (EMD). Dominant feature vectors are derived by applying principal component analysis (PCA) on LPCC space computed from the speech signal. The experimental results show that the proposed system improves the speaker identification performance. The efficiency is also compared for different features with noisy speech signals.
We propose an extension of ADPCM that includes adaptive pre- and post-filtering to achieve spectral shaping of the coding noise. The advantage of this coding scheme is that it allows a realization without algorithmic ...
详细信息
ISBN:
(纸本)9781424407286
We propose an extension of ADPCM that includes adaptive pre- and post-filtering to achieve spectral shaping of the coding noise. The advantage of this coding scheme is that it allows a realization without algorithmic delay by making the filters backwards-adaptive. The measurements we present indicate that the addition of adaptive pre- and post-filtering to ADPCM results in a significant improvement in perceived audio quality. We therefore believe that the proposed system is a viable way to near-transparent lossy audio coding without algorithmic delay.
Data compression techniques have extensive applications in power-con strained digital communication systems, such as in the rapidly-developing domain of wireless sensor network applications. This paper explores energy...
详细信息
ISBN:
(纸本)1424407281
Data compression techniques have extensive applications in power-con strained digital communication systems, such as in the rapidly-developing domain of wireless sensor network applications. This paper explores energy consumption tradeoffs associated with data compression, particularly in the context of lossless compression for acoustic signals. Such signal processing is relevant in a variety of sensor network applications, including surveillance and monitoring. Applying data compression in a sensor node generally reduces the energy consumption of the transceiver at the expense of additional energy expended in the embedded processor due to the computational cost of compression. This paper introduces a methodology for comparing data compression algorithms in sensor networks based on the figure of merit DIE, where D is the amount of data (before compression) that can be transmitted under a given energy budget E for computation and communication. We develop experiments to evaluate, using this figure of merit, different variants of linear predictive coding. We also demonstrate how different models of computation applied to the embedded software design lead to different degrees of processing efficiency, and thereby have significant effect on the targeted figure of merit.
This study combined behavioral performance, eventrelated potential (ERP) and explored the differences between positive expression and negative expression in face domain. A "study-test" pattern was adopted. A...
详细信息
The paper presents a high quality harmonic excitation linearpredictive (HE-LPC) speech coder operating at 2 kb/s based on a harmonic excitation model with two bands. The system incorporates novel features such as: co...
详细信息
ISBN:
(纸本)0780386787
The paper presents a high quality harmonic excitation linearpredictive (HE-LPC) speech coder operating at 2 kb/s based on a harmonic excitation model with two bands. The system incorporates novel features such as: combined pitch detection; residual harmonic matching voicing determination; extraction and interpolation of residual harmonic magnitudes. Subjective listening tests indicate that this coder has the same quality as that of the Federal Standard MELP (mixed excitation linear prediction) coder at 2.4 kb/s, whether the training database is from Chinese or English.
We propose a new weighting function which is computationally simple and an approximation to the theoretically derived optimum weighting function shown in the literature. The proposed weighting function is perceptually...
详细信息
We propose a new weighting function which is computationally simple and an approximation to the theoretically derived optimum weighting function shown in the literature. The proposed weighting function is perceptually motivated and provides improved vector quantization performance compared to several weighting functions proposed so far, for line spectrum frequency (LSF) parameter quantization of both clean and noisy speech data.
暂无评论