检索结果-内蒙古大学图书馆

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： M. Skoglund J. Skoglund Department of Signals Sensors and Systems Royal Institute of Technology Stockholm Sweden Department of Information Theory Chalmers University of Technology Goteborg Sweden

This paper presents an approach to speech vector quantization of sources exhibiting intervector dependency. We present the optimal decoder based on a collection of received indices. We also present the optimal encoder for such decoding. The optimal decoder can be implemented as a table look-up decoder, however the size of the decoder codebook grows very fast with the size of the collection of utilized indices. This leads us to introduce a method for storing an approximation to the set of optimal decoder vectors, based on linear mapping of a block code vector quantization. In this approach a heavily reduced set of parameters is employed to represent the codebook. Furthermore, we illustrate that the proposed scheme has an interpretation as nonlinear predictive quantization. Numerical results indicate high gain over memoryless coding and memory quantization based on linear predictive coding. The results also show that the sub-optimal approach performs close to the optimal.

关键词： Vector quantization Decoding Signal processing Encoding Block codes Zinc Sensor systems linear predictive coding Modems Speech

来源：评论

学校读者我要写书评

暂无评论

A new discrete all-pole modeling for the multiband excitation coder

A new discrete all-pole modeling for the multiband excitatio...

引用

Mediterranean Electrotechnical Conference (MELECON)

作者： S. Torres-Guijarro F.J. Casajus-Quiros Departmento. Informática UC3M Spain

A new model for the spectral samples obtained in the multiband excitation speech coder (MBE) is introduced. Objective and subjective tests show that it compares favorably with the classical linear prediction (LP) model, specially for high pitched speakers. Strategies for efficiently quantizing the model parameters, suitable for low bit rate implementations of the MBE coder, are also addressed.

关键词： predictive models Autocorrelation Speech Bit rate Vectors Frequency estimation Telecommunication standards Testing linear predictive coding Harmonic analysis

来源：评论

学校读者我要写书评

暂无评论

Speech compression based on exact modeling and structured total least norm optimization

Speech compression based on exact modeling and structured to...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： P. Lemmerling I. Dologlou S. Van Huffel Department of Electrical Engineering Katholieke Universiteit Leuven Leuven Belgium

We present a new speech coding algorithm, based on an all-pole model of the vocal tract. Whereas current autoregressive (AR) based modeling techniques (e.g. CELP, LPC-10) minimize a prediction error, which is considered to be the input to the all-pole model, our approach determines the closest (in L/sub 2/ norm) signal, which exactly satisfies an all-pole model. Each frame is then encoded by storing the parameters of the complex damped exponentials deduced from the all-pole model and its initial conditions. Decoding is performed by adding the complex damped exponentials based on the transmitted parameters. The new algorithm is demonstrated on a speech signal. The quality is compared with that of a standard coding algorithm at comparable compression ratios, by using the segmental signal-to-noise ratio (SNR).

关键词： Mathematical model predictive models linear predictive coding Speech coding Vocoders White noise Vectors Speech synthesis Integrated circuit modeling Laboratories

来源：评论

学校读者我要写书评

暂无评论

Spectral stability based event localizing temporal decomposition

Spectral stability based event localizing temporal decomposi...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： A.C.R. Nandasena M. Akagi Graduate School of Information Science Japan Advanced Institute of Science and Technology Japan

A new approach to temporal decomposition (TD) of speech, called "spectral stability based event localizing temporal decomposition", abbreviated S/sup 2/ BEL-TD, is presented. The original method of TD proposed by Atal (1983) is known to have the drawbacks of high computational cost, and the instability of the number and locations of events. In S/sup 2/ BEL-TD, the event localization is performed based on a maximum spectral stability criterion. This overcomes the instability problem of events of the Atal's method. Also, S/sup 2/ BEL-TD avoids the use of the computationally costly singular value decomposition routine used in the Atal's method, thus resulting in a computationally simpler algorithm of TD. Simulation results show that an average spectral distortion of about 1.5 dB can be achieved with LSF as the spectral parameter. Also, we have shown that the temporal pattern of the speech excitation parameters can also be well described using the S/sup 2/ BEL-TD technique.

关键词： Speech Singular value decomposition Information science Costs Computational modeling linear predictive coding Solids Stability criteria

来源：评论

学校读者我要写书评

暂无评论

How steady are vowel steady-states?

引用

CLINICAL LINGUISTICS & PHONETICS 1998年第5期12卷 405-415页

作者： Blomgren, M Robb, M Univ Connecticut Dept Commun Sci Storrs CT 06269 USA

The duration of vowel steady-states (VSS) was examined acoustically in the speech production of 40 normal young adults. VSS was assessed according to formant frequency changes in sustained /i/ productions and consonant + /i/ + /d/(/Cid/) productions. The duration of the VSS was measured for the first and second formants (F1 and F2) by incorporating a fixed rate-of-change criterion. Results indicated no significant differences in VSS duration according to gender or vowel context. VSS duration based on F1 was significantly longer than F2 VSS duration. The duration of VSS was also found to be correlated to the overall vowel duration in /Cid/ contexts. Discussion focuses on the analysis and application of VSS in acoustic studies of normal and disordered speech production.

关键词： acoustics vowel steady-state formant frequency linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

An embedded scheme for regular pulse excited (RPE) linear predictive coding

An embedded scheme for regular pulse excited (RPE) linear pr...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Shude Zhang G. Lockhart Department of Electronic and Electrical Engineering University of Leeds Leeds UK

The feasibility and performance of an embedded RPE (ERPE) scheme based on multistage coding is investigated. The coding efficiency of second and subsequent stages depends on the spectral envelope difference between the original speech and the error signal at each stage whereas re-use of LPC parameters derived from the original speech depends on the corresponding LPC spectral difference. Suitable measures of spectral difference are defined and simulation shows that both decrease with the perceptual weighting factor. The ERPE system requires little extra coding complexity and can be simplified further by using a partial phase adaptation procedure with marginal loss of SNR performance. The simulated ERPE system shows graceful reduction of reconstructed speech quality for bit rates from 14.8 to 6.4 kb/s in 4.2 kb/s steps.

关键词： linear predictive coding Bit rate Speech coding Decoding Delta modulation Filters predictive coding Performance loss Degradation Communication system control

来源：评论

学校读者我要写书评

暂无评论

LSP analysis and processing for speech coders

引用

ELECTRONICS LETTERS 1997年第9期33卷 743-744页

作者： McLoughlin, IV Chance, RJ School of Electronic and Electrical Engineering The University of Birmingham Birmingham United Kingdom

linear prediction parameters within CELP coders are commonly represented by line spectral pairs (LSP), giving stable filters and efficient coding. However, LSP manipulation can also alter the frequencies of the represented signals. The authors use computationally efficient LSP manipulation to enhance the intelligibility of speech degraded by acoustic interference.

关键词： speech coding linear predictive coding

来源：评论

学校读者我要写书评

暂无评论

On-line signature verification using LPC cepstrum and neural networks

引用

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 1997年第1期27卷 148-153页

作者： Wu, QZ Jou, IC Lee, SY TELECOMMUN LABS CHUNGLITAIWAN

In this paper, an on-line signature verification scheme based on linear Prediction coding (LPC) cepstrum and neural networks is proposed. Cepstral coefficients derived from linear predictor coefficients of the writing trajectories are calculated as the features of the signatures. These coefficients are used as inputs to the neural networks. A number of single-output multilayer perceptrons (MLP's), as many as the number of words in the signature, are equipped for each registered person to verify the input signature. If the summation of output values of all MLP's is larger than verification threshold, the input signature is regarded as a genuine signature;otherwise, the input signature is a forgery. Simulations show that this scheme can detect the genuineness of the input signatures from our test database with an error rate as low as 4%.

关键词： Handwriting recognition linear predictive coding Cepstrum Neural networks Cepstral analysis Writing Trajectory Multilayer perceptrons Forgery Testing

来源：评论

学校读者我要写书评

暂无评论

Noncausal all-pole modeling of voiced speech

引用

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1997年第1期5卷 1-10页

作者： Gardner, WR Rao, BD QUALCOMM INC SAN DIEGO CA USA

This paper introduces noncausal all-pole models that are capable of efficiently capturing both the magnitude and phase information of voiced speech, It is shown that noncausal all-pole filter models are better able to match both magnitude and phase information and are particularly appropriate for voiced speech due to the nature of the glottal excitation. By modeling speech in the frequency domain, the standard difficulties that occur when using noncausal all-pole filters are avoided. Several algorithms for determining the model parameters based on frequency-domain information and the masking effects of the ear are described. Our work suggests that high-quality voiced speech can be produced using a 14th-order noncausal all-pole model.

关键词： predictive models Matched filters Speech coding Transfer functions Information filtering Information filters Frequency domain analysis Ear linear predictive coding Vocoders

来源：评论

学校读者我要写书评

暂无评论

Low-delay subband CELP coding for wideband speech

引用

IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING 1997年第5期144卷 313-316页

作者： Tian, WS Wong, WC Tsao, C Natl Univ Singapore Dept Elect Engn Singapore 119260 Singapore

Low-delay techniques are proposed for coding 7 kHz speech using subband code-excited linear predictive coding (CELP). The use of separate and joint index codebooks is compared. Specifically, the joint-index-subband CELP (JISBC) algorithm is found to provide good quality with processing delay in the range 2.375-3.375 ms at corresponding bit rates of 16-8 k bit/s.

关键词： coding linear predictive coding speech transmission

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：