检索结果-内蒙古大学图书馆

LPC DISTANCE MEASURES AND STATISTICAL TESTS WITH PARTICULAR REFERENCE TO THE LIKELIHOOD RATIO

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1982年第2期30卷 304-315页

作者： DESOUZA, P THOMSON, PJ UK Scientific Centre IBM United Kingdom Laboratories Limited Winchester UK University of Otago Dunedin New Zealand Department of Mathematics Victoria University of Wellington Wellington New Zealand Department of Mathematics and Statistics Massey University Palmerston North New Zealand

Several LPC distance measures and statistical tests have been proposed for use in speech processing, the most popular of which is Itakura's log likelihood ratio statistic, and some simple variants thereof. In this paper it is shown that these statistics share some undesirable properties. It is argued that there are more tractable and more sensitive measures available including other relevant likelihood ratio statistics. It is also shown that when Itakura's measure is used to compare two estimated LPC vectors, it is not the log likelihood ratio at all, and the true likelihood ratio for these conditions is derived.

关键词： linear predictive coding Particle measurements Testing Statistics Speech processing predictive models Mathematics Random variables Equations Statistical analysis

来源：评论

学校读者我要写书评

暂无评论

Real-time robust formant estimation system using a phase equalization-based autoregressive exogenous model

引用

ACOUSTICAL SCIENCE AND TECHNOLOGY 2015年第6期36卷 478-488页

作者： Oohashi, Hiroki Hiroya, Sadao Mochida, Takemi Nippon Telegraph & Tel Corp Human Informat Sci Lab NTT Commun Sci Labs 3-1 Morinosato Atsugi Kanagawa 2430198 Japan

This paper presents a real-time robust formant tracking system for speech using a real-time phase equalization-based autoregressive exogenous model (PEAR) with electroglottography (EGG). Although linear predictive coding (LPC) analysis is a popular method for estimating formant frequencies, it is known that the estimation accuracy for speech with high fundamental frequency F-0 would be degraded since the harmonic structure of the glottal source spectrum deviates more from the Gaussian noise assumption in LPC as its F-0 increases. In contrast, PEAR, which employs phase equalization and LPC with an impulse train as the glottal source signals, estimates formant frequencies robustly even for speech with high F-0. However, PEAR requires higher computational complexity than LPC. In this study, to reduce this computational complexity, a novel formulation of PEAR was derived, which enabled us to implement PEAR for a real-time robust formant tracking system. In addition, since PEAR requires timings of glottal closures, a stable detection method using EGG was devised. We developed the real-time system on a digital signal processor and showed that, for both the synthesized and natural vowels, the proposed method can estimate formant frequencies more robustly than LPC against a wider range of F-0.

关键词： Formant estimation Online linear predictive coding Phase equalization

来源：评论

学校读者我要写书评

暂无评论

AN ALGORITHM FOR LPC SYNTHESIS GAIN MATCHING

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1983年第6期31卷 1566-1569页

作者： VANHOVE, L Laboratory of Electronics and Metrology University of Ghent Ghent Belgium

A simple method of synthesis gain matching in a linear prediction (LP) vocoder makes use of the LP analysis residual energy. However, poor gain matching is to be expected when a low-frequency formant is in resonance with the voiced excitation impulses. Such large gain errors increase the probability of synthesis filter overflow. A simple improvement of this method is suggested, reducing these large errors substantially. The improvement makes use of the information provided by the derivative of the already synthesized signal. The method can be applied internally or externally to low-complexity real-time speech synthesizers.

关键词： linear predictive coding Speech synthesis Signal synthesis Filters Vocoders Error analysis Frequency Resonance Synthesizers Speech enhancement

来源：评论

学校读者我要写书评

暂无评论

SUBJECTIVE EVALUATION OF A 4.8 KBIT-S RESIDUAL-EXCITED linear PREDICTION CODER

引用

IEEE TRANSACTIONS ON COMMUNICATIONS 1981年第9期29卷 1389-1393页

作者： NAKATSUI, M STEVENSON, DC MERMELSTEIN, P UNIV QUEBEC INRS TELECOMMUNVERDUNQUEBECCANADA BELL NO RES VERDUNQUEBECCANADA

A 4.8 kbit/s residual-excited linear prediction coder (RELP) with two subband coded basebands was systematically evaluated in terms of intelligibility and overall quality. Intelligibility degradation due to RELP-coding is found to be 6 percent without transmission errors, an additional 6.4 percent with 1 percent bit errors, and 9.8 percent in 10 dB SNR acoustic background noise. Quality of the RELP coded speech is midway between those of 3 and 4 bit log-PCM and is significantly higher than that of the pitch-excited linear prediction coder.

关键词： Baseband Background noise linear predictive coding Low pass filters Decoding Speech analysis Frequency Degradation Data communication Communications Society

来源：评论

学校读者我要写书评

暂无评论

REGULAR FORM OF DURBINS RECURSION FOR PROGRAMMABLE SIGNAL PROCESSORS

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1987年第11期35卷 1628-1629页

作者： ACKENHUSEN, JG Speech Processing Department AT and T Bell Laboratories Inc. Murray Hill NJ USA

A new form of Durbin's recursion is described that renders all addressing to be sequential within one iteration of the recursion. Using this technique, Durbin's recursion may be cast into a single repetitively called subroutine with sufficiently simple address arithmetic for single-chip programmable digital signal processors. In a specific implementation, use of this technique reduces program memory by a factor of five while increasing execution time of Durbin's recursion by 50 percent (an increase of 8 to 12 percent of real time), allowing Durbin's recursion to be combined with autocorrelation analysis in a single DSP chip.

关键词： Signal processing Digital signal processors Algorithms linear predictive coding Autocorrelation Digital signal processing chips Vectors Speech processing Digital arithmetic Registers

来源：评论

学校读者我要写书评

暂无评论

Sultan of sound

引用

IEEE SPECTRUM 2005年第5期42卷 44-48页

作者： Perry, TS National Academy of Sciences National Academy of Engineering American Association for the Advancement of Science United States

This paper describes the pioneering research in the field of speech technology by James L. Flanagan, 2005 IEEE Medal of Honor awardee. Flanagan's work with speech coding heralded a series of advances over the years, including a currently favored technique, linear predictive coding. After graduating from Mississippi State as an electrical engineering major, Flanagan accepted a graduate assistantship in MIT's acoustics lab, which led to his seminal research in voice coding. Flanagan then worked at Bell Telephone Laboratories where he would spend the next 33 years. He climbed steadily up the ranks at Bell Labs, eventually becoming director of the Information Principles Research Laboratory. Among the projects that Flanagan was deeply involved in were the development of automatic speech recognition systems, voice mail, artificial larynx, and packet-switched voice technology.

关键词： Laboratories Medals Speech coding linear predictive coding Electrical engineering Acoustics Telephony Automatic speech recognition Voice mail Larynx

来源：评论

学校读者我要写书评

暂无评论

A HYBRID HADAMARD LPC SCHEME FOR PICTURE coding

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1987年第3期35卷 391-394页

作者： CHAKRABORTI, NB MISRA, R Department of Electronics and Electrical Communication Engineering Indian Institute of Technology Kharagpur India

A transform-LPC hybrid system for real-time coding of picture data has been presented. The LPC has been made adaptive by using a correlation cancellation loop. Three different schemes for coding and reconstructing the transform components have been presented and their relative performances have been compared. The MSE and SNR are seen to be comparable to previous work on transform-DPCM hybrid schemes.

关键词： Error correction Error correction codes linear predictive coding Vectors Image reconstruction Data models Signal processing algorithms Real time systems Discrete cosine transforms Prediction theory

来源：评论

学校读者我要写书评

暂无评论

A statistical pattern recognition approach to robust recursive identification of nonstationary AR model of speech production system

引用

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1996年第6期4卷 456-460页

作者： Markovic, MZ Kovacevic, BD Milosavljevic, MM Institute of Applied Mathematics and Electronics Belgrade Serbia Faculty of Electrical Engineering University of Belgrade Belgrade Serbia

We propose a new robust recursive procedure based an WRLS algorithm with VFF and frame-based quadratic classifier for identification of nonstationary AR model of speech. Two versions of the frame-based quadratic classifier design procedure are elaborated upon. Experimental results are obtained in analyzing speech signal on voiced and mixed excitation frames.

关键词： Pattern recognition Robustness linear predictive coding Production systems Speech enhancement Signal analysis Speech analysis Speech coding Least squares methods Gaussian noise

来源：评论

学校读者我要写书评

暂无评论

ON THE IDENTIFICATION OF AR SYSTEMS EXCITED BY PERIODIC SIGNALS OF UNKNOWN PHASE

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1984年第3期32卷 638-641页

作者： DELLER, JR Speech Processing Laboratory Department of Electrical Engineering Illinois Institute of Technology Chicago IL USA

Results pertaining to the short and long term covariance identification of AR models driven by periodic sequences of possibly unknown phase are presented. The work is motivated by problems relating to LPC analysis of voiced speech, but results are formulated in general. Short term and asymptotic effects of such inputs on the invertibility of the covariance matrix are considered. Short term criteria with respect to the input for exact solution are established and an asymptotic bound for the case of inexact solution is developed.

关键词： Signal processing Speech analysis Covariance matrix linear predictive coding Speech processing Stochastic processes State-space methods Random variables Eigenvalues and eigenfunctions Controllability

来源：评论

学校读者我要写书评

暂无评论

SINGLE CHIP LPC SPEECH SYNTHESIZER AND COMPANION 131K BIT ROM

引用

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS 1979年第2期25卷 193-197页

作者： BRANTINGHAM, L Consumer Products Group Texas Instruments Inc. Lubbock TX USA

A recently : developed two integrated circuit speech synthesis system represents a significant advance in large scale integration in both random logic and data storage functions.

关键词： linear predictive coding Speech synthesis Synthesizers Read only memory Logic circuits Digital filters Speech coding Humans Frequency Integrated circuit synthesis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：