检索结果-内蒙古大学图书馆

Multichannel linear prediction method compliant with the MPEG-4 ALS

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES 2008年第3期E91A卷 756-762页

作者： Kamamoto, Yutaka Harada, Noboru Moriya, Takehiro NTT Corp NTT Commun Sci Labs Atsugi Kanagawa 2430198 Japan

A new linear prediction analysis method for multichannel signals was devised. with the goal of enhancing the compression performance of the MPEG-4 Audio Lossless coding (ALS) compliant encoder and decoder. The multichannel coding tool for this standard carries out an adaptively weighted subtraction of the residual signals of the coding channel from those of the reference channel, both of which are produced by independent linear prediction. Our linear prediction method tries to directly minimize the amplitude of the predicted residual signal after subtraction of the signals of the coding channel, and the method has been implemented in the MPEG-4 ALS codec software. The results of a comprehensive evaluation show that this method reduces the size of a compressed file. The maximum improvement of the compression ratio is 14.6% which is achieved at the cost of a small increase in computational complexity at the encoder and without increase in decoding time. This is a practical method because the compressed bitstream remains compliant with the MPEG-4 ALS standard.

关键词： linear predictive coding multichannel signal lossless compression and MPEG-4 ALS

来源：评论

学校读者我要写书评

暂无评论

Digital filter interpolation of decoded LSFs for distributed continuous speech recognition

引用

ELECTRONICS LETTERS 2008年第17期44卷 1039-U50页

作者： de Alencar, V. F. S. Alcaim, A. Catholic Univ CETUC PUC RIO Ctr Telecommun Studies BR-22453 Rio De Janeiro Brazil

A digital filter interpolation of decoded line spectral frequencies (LSFs) that significantly outperforms linear interpolation for large vocabulary distributed continuous speech recognition systems is presented. Experiments were conducted using linear predictive coding (LPC) and LSF-derived speech recognition features, CDHMM acoustic models, triphone units and trigram language models for Brazilian Portuguese.

关键词： acoustic models digital filters linear predictive coding linear interpolation trigram language models triphone units vocabulary distributed continuous speech recognition decoded line spectral frequencies speech recognition features decoding digital filter interpolation speech recognition

来源：评论

学校读者我要写书评

暂无评论

Could formant frequencies of snore signals be an alternative means for the diagnosis of obstructive sleep apnea?

引用

SLEEP MEDICINE 2008年第8期9卷 894-898页

作者： Ng, Andrew Keong Koh, Tong San Baey, Eugene Lee, Teck Hock Abeyratne, Udantha Ranjith Puvanendran, Kathiravelu Nanyang Technol Univ Sch Elect & Elect Engn Singapore 639798 Singapore Respiron Inc Singapore 919191 Singapore Univ Queensland Sch Informat Technol & Elect Engn Brisbane Qld Australia Singapore Gen Hosp Sleep Disorders Unit Singapore 169608 Singapore

Objective: To study the feasibility of using acoustic signatures in snore signals for the diagnosis of obstructive sleep apnea (OSA). Methods: Snoring sounds of 30 apneic snorers (24 males;6 females: apnea-hypopnea index, AHI = 46.9 +/- 25.7 events/h) and 10 benign snorers (6 males;4 females;AHI = 4.6 +/- 3.4 events/h) were captured in a sleep laboratory. The recorded snore signals were preprocessed to remove noise, and subsequently, modeled using a linear predictive coding (LPC) technique. Formant frequencies (F1, F2, and F3) were extracted from the LPC spectrum for analysis. The accuracy of this approach was assessed using receiver operating characteristic curves and notched box plots. The relationship between AHI and F1 was further explored via regression analysis. Results: Quantitative differences in formant frequencies between apneic and benign snores are found in same- or both-gender snorers. Apneic snores exhibit higher formant frequencies than benign snores, especially F1, which can be related to the pathology of OSA. This study yields a sensitivity of 88%, a specificity of 82%, and a threshold value of F1 = 470 Hz that best differentiate apneic snorers from benign snorers (both gender combined). Conclusion: Acoustic signatures in snore signals carry information for OSA diagnosis, and snore-based analysis might potentially be a non-invasive and inexpensive diagnostic approach for mass screening of OSA. (c) 2007 Elsevier B.V. All rights reserved.

关键词： Obstructive sleep apnea Polysomnography Snoring Snore signals Acoustic analysis Formant frequencies linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

Improved Frame Loss Recovery Using Closed-Loop Estimation of Very Low Bit Rate Side Information

Improved Frame Loss Recovery Using Closed-Loop Estimation of...

引用

9th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2008)

作者： Gournay, Philippe Univ Sherbrooke Speech & Audio Res Grp Sherbrooke PQ J1K 2R1 Canada

ISBN: (纸本)9781615673780

In CELP coders, the past excitation signal used to build the adaptive codebook is known to be the main source of error propagation when a frame is lost. This paper presents a novel resynchronization technique using very low bit rate side information to correct the past excitation signal after a frame erasure, the novelty being that the correction is computed in a closed loop fashion, based on the actual error introduced by the concealment. Subjective test results show that this approach is a promising area for future research on frame loss recovery.

关键词： speech codecs linear predictive coding error correction robustness

来源：评论

学校读者我要写书评

暂无评论

The perceptual quality of MELP speech over error tolerant IP networks

The perceptual quality of MELP speech over error tolerant IP...

引用

33rd IEEE International Conference on Acoustics, Speech and Signal Processing

作者： Gavula, Ben Scheets, George Teague, Keith Weber, Justin Oklahoma State Univ Sch Elect & Comp Engn Stillwater OK 74078 USA

ISBN: (纸本)9781424414833

Modifications to IP based packet network protocols are examined that would make the network tolerant of bit errors in packet payloads or headers. These modifications are tested with communication quality MELP voice traffic. As measured by a PESQ score, improvements in the perceptual quality of the speech are noted that are maximized when error checking is disabled for the entire packet.

关键词： vocoders transport protocols internet error analysis linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

Speech enhancement based on double RBF networks

Speech enhancement based on double RBF networks

引用

1st International Congress on Image and Signal Processing

作者： Guo, Jichang Guo, Libin Tianjin Univ Sch Elect Informat Engn Tianjin 300072 Peoples R China

ISBN: (纸本)9780769531199

In this paper, a non-linear spectral estimation for noise reduction is present which is approximated and implemented by double Radial Basis Function (RBF) networks. The simulation results indicate that the method can greatly improve the quality and the intelligibility of speech, and have other advantages such as the widely applicable Signal-to-Noise Ratio (SNR) range, less computation load Particularly the method may maintain the preferable accurate of signal in speech waveform, and the quality of speech signals have been improved obviously.

关键词： Speech enhancement Radial basis function networks linear predictive coding Neural networks Signal processing Noise reduction Signal to noise ratio Flowcharts Frequency Electronic mail

来源：评论

学校读者我要写书评

暂无评论

Delay-free lossy audio coding using shelving pre- and post-filters

Delay-free lossy audio coding using shelving pre- and post-f...

引用

33rd IEEE International Conference on Acoustics, Speech and Signal Processing

作者： Holters, Martin Zoelzer, Udo Helmut Schmidt Univ Hamburg Germany

ISBN: (纸本)9781424414833

A delay-free audio coding scheme based on ADPCM with adaptive pre- and post-filtering is presented. The pre-/post-filters are realized as a cascade of shelving filters, designed to match the characteristics of human perception. The pre- and post-filters are adapted by dynamic compression of the respective sub-bands. The adaption is backward-adaptive, i.e. is fed by the reconstructed signal, which eliminates the need to transmit the filter coefficients and allows delay-free operation. This pre- and post-filtering significantly improves the audio quality compared to a plain ADPCM codec, as underlined by objective measurements. Since the base ADPCM used is also delay-free, the resulting coding system works without any algorithmic delay.

关键词： audio coding linear predictive coding adaptive filters

来源：评论

学校读者我要写书评

暂无评论

Voiced/Unvoiced Classification Recovery in the Speech Decoder Based on GMM

Voiced/Unvoiced Classification Recovery in the Speech Decode...

引用

9th International Conference on Signal Processing

作者： Wei Xuan Dang Xiaoyan Cui Huijuan Tang Kun Tsinghua Univ Dept Elect Engn State Key Lab Microwave & Digital Commun Beijing 100084 Peoples R China

ISBN: (纸本)9781424421787

Voiced/Unvoiced (V/U) classification is an important parameter in low bit-rate speech coding algorithms. An algorithm that recovers the V/U classification from the linear prediction coding (LPC) coefficients and the gain in the speech decoder is proposed. Two Gaussian mixture models (GMM) are employed to model the joint probability of these parameters and to perform the V/U estimation. Experiments show the performance improvements of the proposed algorithm over the V/U classifier used in mixed excitation LPC vocoder (MELP). The proposed algorithm operates only at the receiving end and saves all the bits originally used for V/U quantization.

关键词： Decoding linear predictive coding Speech coding Databases Pattern recognition Quantization Speech synthesis Cepstral analysis Probability distribution Laboratories

来源：评论

学校读者我要写书评

暂无评论

Variable dimension matrix quantization of LSP parameters for very low bit rate vocoder below 300bps

Variable dimension matrix quantization of LSP parameters for...

引用

9th International Conference on Signal Processing

作者： Min Gang Yang Ji-bin Chen Yan-pu Zhang Xiong-wei Institute of communications engineering People''s Liberation Army University of Science and Technology China Xian Communication Institute China

ISBN: (纸本)9781424421787

This paper examines the efficient quantization of LSP parameters for very low bit rate vocoder below 300bps, a new quantization scheme called variable dimension matrix quantization (VDMQ) is presented In the VDMQ scheme, the extracted LSP parameters matrix with variable dimension is quantized directly without dimension conversion. Based on the distance measure definition between low LSP matrices with different dimension, the optimal codeword is deduced Theoretical analysis and experiment results show that the VDMQ scheme performs better than the segment quantization and matrix quantization scheme at very low bit rate. Also, the codebook storage is almost reduced by 90%. The VDMQ scheme provides a new effective approach for efficient LSP parameters quantization at very low bit rate.

关键词： Bit rate Vocoders Speech Vector quantization linear predictive coding Programmable logic arrays Performance analysis Wireless communication Artificial satellites Underwater communication

来源：评论

学校读者我要写书评

暂无评论

New Feature Extraction Methods Using DWT and LPC for Isolated Word Recognition

New Feature Extraction Methods Using DWT and LPC for Isolate...

引用

IEEE Region 10 Conference (TENCON 2008)

作者： Nehe, N. S. Holambe, R. S. SGGS Inst Engn & Technol Nanded MS India

ISBN: (纸本)9781424424085

In this paper a new feature extraction methods, which utilize reduced order linear predictive coding (LPC) coefficients for speech recognition, have been proposed The coefficients have been derived from the speech frames decomposed using Discrete Wavelet Transform (DWT). In the literature it is assumed that the speech frame of size 10 msec to 30 msec is stationary, however, in practice different parts of the speech signal may convey different amount of information (hence may not be perfectly stationary). LPC coefficients derived from subband decomposition of speech frame provide better representation than modeling the frame directly. Experimentally it has been shown that, the proposed approaches provide effective (better recognition rate) and efficient (reduced feature vector dimension) features. The speech recognition system using the continuous Hidden Markov Model (HMM) has been implemented. The proposed algorithms are evaluated using NIST TI-46 isolated-word database.

关键词： Feature Extraction linear predictive coding Discrete Wavelet Transform

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：