Search Results - Inner Mongolia University Library

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: R. Fontana, M. Fox (Department of Electrical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA)

A composite source is an indexed family of random processes (subsources) together with a switch which chooses from among these processes in a stochastic fashion. Such a source has often been proposed as a model for speech and other processes having piece-wise, or quasi, stationary behavior. Until recently, however, very little has been known about such models from either a theoretical or a practical perspective. In this paper, we consider a speaker/isolated word recognition system derived from a composite source model for speech production. In particular, estimates of the underlying subsources are obtained using a modified data compression algorithm. Switch sequences are then derived from these estimates for each utterance. Finally, switch sequences are compared in the time domain (using Levenshtein's metric) and from a statistical point of view (via variation distance). Both modes of comparison are seen to be highly correlated and produce a recognition procedure with very encouraging results.

Keywords: Switches; Stochastic processes; Speech recognition; Data compression; Random processes; Speech processing; Speech analysis; Linear predictive coding; Performance analysis; Production systems
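
The two comparisons named in the abstract above, Levenshtein's metric in the time domain and variation distance in the statistical view, can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes a switch sequence is simply a list of subsource indices, and it assumes the variation distance is taken between first-order empirical distributions of those indices with the common factor of one half. The function names are hypothetical.

```python
from collections import Counter


def levenshtein(a, b):
    """Edit distance between two sequences; insert, delete and substitute cost 1."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,               # delete x
                            curr[j - 1] + 1,           # insert y
                            prev[j - 1] + (x != y)))   # substitute x by y
        prev = curr
    return prev[-1]


def variation_distance(a, b):
    """Total variation distance between the empirical symbol distributions.

    Assumption: first-order statistics only, with the 1/2 normalisation.
    """
    pa, pb = Counter(a), Counter(b)
    symbols = set(pa) | set(pb)
    return 0.5 * sum(abs(pa[s] / len(a) - pb[s] / len(b)) for s in symbols)


# Hypothetical switch sequences for two utterances.
utterance_1 = [0, 0, 1, 2, 2, 1]
utterance_2 = [0, 1, 2, 2, 2, 1]
time_domain_score = levenshtein(utterance_1, utterance_2)
statistical_score = variation_distance(utterance_1, utterance_2)
```

The edit distance is sensitive to the order of switch states, while the variation distance here ignores order entirely, which is one reason a recognizer might report both.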

QD-an algorithm for non-linear inverse filtering

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: P. Hedelin, G. Hult (Department of Information Theory, Chalmers University of Technology, Gothenburg, Sweden)

Statistical methods for parameter estimation are often based on a Gaussian assumption. In this paper we describe a signal model based on an assumption of uniform distributions. This signal model leads to a simple, non-linear algorithm, called QD, for parameter estimation. The QD algorithm can be applied to adaptive, inverse filtering of speech signals. The technique used is to sequentially estimate the reflection coefficients of a lattice structure. Some of the results of one such application are discussed and compared to those obtained with other methods.

Keywords: Filtering algorithms; Parameter estimation; Linear systems; Lattices; Linear predictive coding; Maximum likelihood estimation; Statistical distributions; Gaussian distribution; Filters; Information theory

HIGH-QUALITY PARCOR SPEECH SYNTHESIZER

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1980, vol. 26, no. 3, pp. 353-359

Authors: SAMPEI, T; ASADA, A; NAKATA, K (HITACHI LTD, CENT RES LAB, KOKUBUNJI, TOKYO 185, JAPAN)

A high-quality speech synthesizer system has been developed that consists of three LSI chips: a speech synthesizer, a 128-kbit ROM, and a general-purpose microprocessor. The system is based on the recently developed Partial Autocorrelation (PARCOR) voice compression technique and can generate high-quality speech at a data rate of less than 2400 bits per second. Several new techniques are applied in this system to improve the quality of the generated speech, especially of the female voice. The system has many advantageous features, such as speech speed control and external pitch excitation.

Keywords: Speech synthesis; Synthesizers; Linear predictive coding; Large scale integration; Autocorrelation; Speech analysis; Laboratories; Costs; Application software; Digital filters

ALGORITHM FOR VECTOR QUANTIZER DESIGN

IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, vol. 28, no. 1, pp. 84-95

Authors: LINDE, Y; BUZO, A; GRAY, RM (CODEX CORP, MANSFIELD, MA 02048, USA; SIGNAL TECHNOL INC, SANTA BARBARA, CA, USA)

An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data. The basic properties of the algorithm are discussed and demonstrated by examples. Quite general distortion measures and long blocklengths are allowed, as exemplified by the design of parameter vector quantizers of ten-dimensional vectors arising in linear predictive coded (LPC) speech compression with a complicated distortion measure arising in LPC analysis that does not depend only on the error vector.

Keywords: Algorithm design and analysis; Distortion measurement; Vectors; Clustering algorithms; Speech analysis; Linear predictive coding; Senior members; Sufficient conditions; Quantization; Data communication
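
To make the training-sequence design described in the abstract above more concrete, here is a minimal sketch of an iterative codebook design that alternates nearest-neighbour partitioning with centroid updates. It makes several assumptions the abstract does not state: squared-error distortion stands in for the general distortion measures the paper allows, the codebook is grown by splitting each codeword with a small additive perturbation, and the function names and tolerances are hypothetical.

```python
def sq_dist(u, v):
    """Squared-error distortion between two vectors (assumed measure)."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def nearest(codebook, x):
    """Index of the codeword with minimum distortion to x."""
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i], x))


def lloyd(training, codebook, tol=1e-6, max_iter=100):
    """Alternate partitioning and centroid updates until distortion stalls."""
    prev = float("inf")
    for _ in range(max_iter):
        cells = [[] for _ in codebook]
        for x in training:
            cells[nearest(codebook, x)].append(x)
        # Replace each codeword by its cell centroid; keep it if the cell is empty.
        codebook = [
            [sum(col) / len(cell) for col in zip(*cell)] if cell else cw
            for cell, cw in zip(cells, codebook)
        ]
        dist = sum(sq_dist(codebook[nearest(codebook, x)], x)
                   for x in training) / len(training)
        if prev - dist <= tol * max(dist, 1e-12):
            break
        prev = dist
    return codebook, dist


def lbg(training, size, eps=0.01):
    """Grow a codebook by repeated splitting; size should be a power of two."""
    codebook, dist = lloyd(training, [list(training[0])])
    while len(codebook) < size:
        # Split every codeword into two slightly perturbed copies.
        codebook = [[c + s for c in cw] for cw in codebook for s in (eps, -eps)]
        codebook, dist = lloyd(training, codebook)
    return codebook, dist


# Hypothetical two-dimensional training sequence with two obvious clusters.
training = [[0, 0], [0, 1], [1, 0], [1, 1],
            [10, 10], [10, 11], [11, 10], [11, 11]]
codebook, distortion = lbg(training, 2)
```

Swapping `sq_dist` for another distortion function only works if the centroid step is changed to match, since the arithmetic mean is the minimum-distortion centroid for squared error specifically.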

SPEECH CODING BASED UPON VECTOR QUANTIZATION

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 5, pp. 562-574

Authors: BUZO, A; GRAY, AH; GRAY, RM; MARKEL, JD (STANFORD UNIV, INFORMAT SYST LAB, STANFORD, CA 94305; SIGNAL TECHNOL INC, SANTA BARBARA, CA 93101)

With rare exception, all presently available narrow-band speech coding systems implement scalar quantization (independent quantization) of the transmission parameters (such as reflection coefficients or transformed reflection coefficients in LPC systems). This paper presents a new approach called vector quantization. For very low data rates, realistic experiments have shown that vector quantization can achieve a given level of average distortion with 15 to 20 fewer bits/frame than that required for the optimized scalar quantizing approaches presently in use. The vector quantizing approach is shown to be a mathematically and computationally tractable method which builds upon knowledge obtained in linear prediction analysis studies. This paper introduces the theory in a nonrigorous form, along with practical results to date and an extensive list of research topics for this new area of speech coding.

Keywords: Speech coding; Vector quantization; Distortion measurement; Reflection; Linear predictive coding; Speech processing; Laboratories; Narrowband; Pervasive computing

THE BURG ALGORITHM FOR LPC SPEECH ANALYSIS-SYNTHESIS

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 6, pp. 609-615

Authors: GRAY, AH; WONG, DY (Signal Technology Inc., Santa Barbara, CA, USA)

The performance of the Burg method for speech analysis is compared to that of the autocorrelation and covariance methods. The criteria of goodness are the accuracy of the spectral approximation, filter stability, windowing requirements, data frame length, and spectral resolution. A mathematical comparison is presented for the simple first-order signal. Spectral comparisons are presented for a second-order speech-like signal. Real speech syntheses using the analysis results of the autocorrelation and Burg methods are subjectively compared. The results do not find any justification for preferring the computationally more complex Burg method.

Keywords: Linear predictive coding; Speech analysis; Speech synthesis; Autocorrelation; Entropy; Spectral analysis; Signal processing algorithms; Stability; Reflection; Yield estimation
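
For readers unfamiliar with the two estimators compared in the abstract above, the following sketch computes reflection coefficients by the autocorrelation method (via the Levinson-Durbin recursion) and by Burg's method. These are the usual textbook forms, not code from the paper. It assumes the sign convention in which the inverse filter is A(z) = 1 + a_1 z^-1 + ..., applies no window in the autocorrelation version (one is normally applied in practice), and uses hypothetical function names.

```python
def autocorr_reflection(x, order):
    """Reflection coefficients from autocorrelation lags (Levinson-Durbin)."""
    n = len(x)
    r = [sum(x[i] * x[i - lag] for i in range(lag, n))
         for lag in range(order + 1)]
    a = [1.0]      # predictor polynomial of the current order
    err = r[0]     # prediction error energy of the current order
    ks = []
    for m in range(1, order + 1):
        acc = sum(a[i] * r[m - i] for i in range(m))
        k = -acc / err
        ks.append(k)
        a = [1.0] + [a[i] + k * a[m - i] for i in range(1, m)] + [k]
        err *= 1.0 - k * k
    return ks


def burg_reflection(x, order):
    """Reflection coefficients by Burg's method, computed from the data directly."""
    n = len(x)
    f = list(x)    # forward prediction error
    b = list(x)    # backward prediction error
    ks = []
    for m in range(order):
        num = -2.0 * sum(f[i] * b[i - 1] for i in range(m + 1, n))
        den = sum(f[i] ** 2 + b[i - 1] ** 2 for i in range(m + 1, n))
        k = num / den
        ks.append(k)
        # Update in place from the highest index down, so that b[i - 1]
        # still holds the previous-stage value when it is read.
        for i in range(n - 1, m, -1):
            f[i], b[i] = f[i] + k * b[i - 1], b[i - 1] + k * f[i]
    return ks


# Hypothetical first-order test signal: a decaying exponential.
signal = [0.9 ** i for i in range(50)]
k_autocorr = autocorr_reflection(signal, 2)
k_burg = burg_reflection(signal, 2)
```

Burg's method needs one pass over the error sequences per stage, while the autocorrelation method computes the lags once and then works only on them, which is the source of the extra computational cost the abstract mentions.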

SPECTRAL MISMATCH DUE TO PRE-EMPHASIS IN LPC ANALYSIS-SYNTHESIS

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 2, pp. 263-264

Authors: WONG, DY; HSIAO, CC; MARKEL, JD (UNIV CALIF BERKELEY, DEPT ELECT ENGN, BERKELEY, CA 94720)

The standard first-order preemphasis approach used in linear prediction analysis results in an undesirable low-frequency boost in the synthesis spectrum. A solution is obtained by mismatching the preemphasis and poste...

Keywords: Linear predictive coding; Speech synthesis; Speech analysis; Matched filters; Filtering; Dynamic range; Speech enhancement; Frequency synthesizers; Power harmonic filters; Signal synthesis

KALMAN BACKWARD ADAPTIVE PREDICTOR COEFFICIENT IDENTIFICATION IN ADPCM WITH PCQ

IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, vol. 28, no. 3, pp. 361-371

Authors: GIBSON, JD; BERGLUND, VP; SAUTER, LC (TRW DEF & SPACE SYST GRP, SATELLITE COMMUN SECT, REDONDO BEACH, CA 90278)

Kalman backward adaptive predictor coefficient identification is combined with a modified pitch-compensating quantizer (MPCQ) to produce a high-performance adaptive differential pulse code modulation (ADPCM) system for operation at data rates of 12-16 kbits/s. The Kalman/MPCQ system is compared to an ADPCM system using a Kalman algorithm and robust Jayant quantization and to a system with a fixed-tap predictor and MPCQ. The performance indicators are signal-to-quantization noise ratio (SNR), sound spectrogram analyses, and formal subjective listening tests. The SNR comparisons indicate that the Kalman/MPCQ system has the highest SNR, followed by the fixed-tap/MPCQ system, and then the Kalman/robust Jayant system. Subjective listening test results show that the Kalman/MPCQ system is preferred over the fixed-tap/MPCQ system 100 percent of the time and over the Kalman/robust Jayant system 80 percent of the time. Kalman adaptation thus provides an important perceptual effect not evident in the SNRs. The previously catastrophic effects of transmission errors on backward adaptive prediction are eliminated by simple ADPCM system modifications that do not affect the SNR or subjective quality of the output in the absence of errors for the five sentences studied. The problem of tandeming with a linear predictive coder (LPC) is investigated by using LPC processed speech as input to the three ADPCM systems and by using the output of the three ADPCM systems as input to an LPC analysis algorithm. For the LPC to ADPCM connection, the two systems with the MPCQ produce good quality output speech, while the system with robust Jayant quantization exhibits a fading phenomenon. For the ADPCM into LPC analysis, all three systems produce speech of approximately the same quality, with the fixed-tap system being slightly noisier. Using a distance measure proposed by Itakura, the predictor coefficients computed from the three ADPCM system outputs are compared with the predictor coefficien

Keywords: Kalman filters; Linear predictive coding; Signal to noise ratio; Speech analysis; Pulse modulation; Noise robustness; Acoustic noise; Modulation coding; Spectrogram; Signal analysis

AN INTERPRETATION OF THE LOG LIKELIHOOD RATIO AS A MEASURE OF WAVEFORM CODER PERFORMANCE

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 3, pp. 318-323

Authors: CROCHIERE, RE; TRIBOLET, JM; RABINER, LR (Univ Tecn Lisboa, INST SUPER TECN, LISBON 1, PORTUGAL)

The log likelihood measure has been widely used in speech research for comparing speech signals. Recently, it has been proposed as a measure for assessing the quality of coded speech. In this paper we present an interpretation of the log likelihood ratio measure within the theoretical framework of a waveform coder distortion model. We then discuss the implications of this interpretation and show how it can be applied to the formulation of better objective measures of waveform coder performance.

Keywords: Distortion measurement; Acoustic distortion; Linear predictive coding; Acoustic measurements; Filters; Speech analysis; Length measurement; Design optimization; Speech processing; Noise measurement
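
The abstract above does not spell out the log likelihood ratio measure, so the sketch below uses one commonly quoted form as an assumption: the logarithm of the ratio between the residual energy obtained when the coded signal's LPC inverse filter is applied to the original frame and the residual energy obtained with the original frame's own inverse filter. The direction of the ratio and all function and variable names are assumptions made for illustration.

```python
import math


def residual_energy(a, r):
    """Quadratic form a^T R a, with R the Toeplitz matrix of autocorrelation lags r."""
    p = len(a)
    return sum(a[i] * a[j] * r[abs(i - j)] for i in range(p) for j in range(p))


def log_likelihood_ratio(a_ref, a_test, r_ref):
    """Assumed form: log of test-filter residual energy over reference-filter residual energy.

    a_ref, a_test: inverse filter coefficients [1, a_1, ..., a_p].
    r_ref: autocorrelation lags r[0..p] of the reference (uncoded) frame.
    """
    return math.log(residual_energy(a_test, r_ref) / residual_energy(a_ref, r_ref))


# Hypothetical first-order example with normalised autocorrelation lags.
r_original = [1.0, 0.9]
a_original = [1.0, -0.9]   # optimal first-order inverse filter for these lags
a_coded = [1.0, -0.5]      # inverse filter estimated from a coded version
llr = log_likelihood_ratio(a_original, a_coded, r_original)
```

Because the reference filter minimises the residual energy for its own autocorrelation lags, the measure in this form is zero when the two filters agree and positive otherwise.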

WINDOWLESS TECHNIQUES FOR LPC ANALYSIS

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 4, pp. 421-427

Authors: BARNWELL, TP (School of Electrical Engineering, Georgia Institute of Technology, Atlanta, GA, USA)

The purpose of this work was to study, experimentally, two windowless LPC analysis algorithms for use in speech digitization. The two algorithms are a circular autocorrelation technique, which utilizes the pseudoperiodic nature of voiced speech, and a reflection coefficient estimation technique suggested by J. P. Burg. Both techniques showed considerable promise in the experimental results.

Keywords: Linear predictive coding; Speech analysis; Signal processing algorithms; Algorithm design and analysis; Signal analysis; Vocoders; Feature extraction; Digital filters; Autocorrelation; Transfer functions
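
As a rough illustration of the circular autocorrelation idea mentioned in the abstract above, the sketch below treats a segment of voiced speech as exactly one pitch period and wraps the lagged index around the segment, so no tapering window is needed. The abstract does not give the paper's actual procedure; the one-period assumption, the function name and the sample values are all hypothetical.

```python
def circular_autocorrelation(period, max_lag):
    """Autocorrelation lags 0..max_lag with indices wrapped around the segment.

    Assumption: `period` holds exactly one pitch period of voiced speech,
    so wrapping around mimics the periodic continuation of the signal.
    """
    p = len(period)
    return [
        sum(period[n] * period[(n + lag) % p] for n in range(p))
        for lag in range(max_lag + 1)
    ]


# Hypothetical samples standing in for one pitch period.
one_period = [1, 2, 3, 4]
lags = circular_autocorrelation(one_period, 2)
```

The resulting lags can be passed to a Levinson-Durbin recursion in the same way as ordinary windowed autocorrelation lags.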
