Search Results - Inner Mongolia University Library

International Journal of Speech Technology, 1999, vol. 2, no. 4, pp. 289-303

Authors: Edler, Bernd (University of Hannover, Laboratorium für Informationstechnologie, Schneiderberg 32, D-30167 Hannover, Germany)

While previous MPEG Audio standards focused mainly on representing audio signals at or near CD quality, the new MPEG-4 Audio standard extends the range of applicability toward significantly lower bit rates. Furthermore, it offers extended functionality for representing natural and even synthetic audio signals in an object-oriented fashion. This paper gives a brief overview of the complete audio part of the MPEG-4 standard and more detailed information on the parts related to speech coding.

Keywords: speech coding

Speech Coding Methods, Standards, and Applications

IEEE Circuits and Systems Magazine, 2005, vol. 5, no. 4, pp. 30-49

Authors: Gibson, Jerry D. (Department of Media Arts and Technology and Electrical and Computer Engineering, University of California, Santa Barbara)

Voice is the preferred method of human communication. Although there have been times when it seemed that the voice communications problem was solved, such as when the PSTN was our primary network or later when digital cellular networks reached maturity, such is not the case today. This paper addresses the challenges and opportunities starting from the basic issues in speech coder design, developing the important speech coding techniques and standards, discussing current and future applications, outlining techniques for evaluating speech coder performance, and identifying research directions. The most prominent speech coding standards are presented and their properties, such as performance, complexity, and coding delay, are analyzed. Particular networks and applications for each standard are included. Further, reflecting upon the issues and developments highlighted in this paper, it becomes evident that there is a diverse set of challenges and opportunities for research and innovation in speech coding and voice communications. © 2005 IEEE.

Keywords: speech coding

Speech coding using Best Tree Encoding (BTE) technique based on LPC and trigonometric features

International Journal of Speech Technology, 2016, vol. 19, no. 1, pp. 33-39

Authors: Abbass, Mohamed Y.; Gody, Amr M.; Shehata, Safy A.; Baraket, Tamer M.; Haggag, Said S. Affiliations: Atom Energy Author, Nucl Res Ctr, Dept Engn, Cairo, Egypt; Fayoum Univ, Dept Elect Engn, Fac Engn, Al Fayyum, Egypt; Atom Energy Author, Nucl Res Ctr, Cairo, Egypt

Over the past several years, considerable attention has been focused on the coding and enhancement of speech signals. This interest has driven the development of new techniques capable of producing good-quality speech at the output. Speech coding is the process of converting human speech into efficient encoded representations that can be decoded to produce a close approximation of the original signal. This paper deals with the problem of speech coding. It proposes a novel approach called Best Tree Encoding (BTE) to encode the wavelet packet Best Tree Structure into a vector of four elements. This research applies BTE to a further problem, speech compression and synthesis. Tree node data coefficients are encoded using LPC filters and trigonometric features. The encoded vector consists of 4 elements from the BTE analysis as well as an LPC and trigonometric vector for each leaf node. The reproduced speech is evaluated for both intelligibility and quality. The quality of the speech signal is measured on the basis of signal-to-noise ratio, log-likelihood ratio, and spectral distortion.
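The first of the objective measures named in the abstract above, signal-to-noise ratio, can be illustrated with a short sketch. This is not the authors' evaluation code: the function name `snr_db` and the sample values are assumptions chosen for illustration, and the abstract does not say whether the paper computes the measure over the whole signal or per frame.

```python
import math


def snr_db(original, reconstructed):
    """Signal-to-noise ratio in dB between a signal and its decoded version.

    Hypothetical helper for illustration only: it compares the energy of the
    original samples with the energy of the reconstruction error.
    """
    signal_energy = sum(x * x for x in original)
    noise_energy = sum((x - y) ** 2 for x, y in zip(original, reconstructed))
    if noise_energy == 0:
        return math.inf  # perfect reconstruction
    return 10.0 * math.log10(signal_energy / noise_energy)


# Made-up samples: every decoded sample is 10% smaller than the original.
original = [1.0, -1.0, 1.0, -1.0]
decoded = [0.9, -0.9, 0.9, -0.9]
snr = snr_db(original, decoded)
```

A higher value means the decoded signal is closer to the original; an error amplitude of one tenth of the signal amplitude corresponds to roughly 20 dB.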

Keywords: speech coding; BTE; WPD

SPEECH CODING BASED UPON VECTOR QUANTIZATION

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, vol. 28, no. 5, pp. 562-574

Authors: BUZO, A; GRAY, AH; GRAY, RM; MARKEL, JD. Affiliations: STANFORD UNIV, INFORMAT SYST LAB, STANFORD, CA 94305; SIGNAL TECHNOL INC, SANTA BARBARA, CA 93101

With rare exception, all presently available narrow-band speech coding systems implement scalar quantization (independent quantization) of the transmission parameters (such as reflection coefficients or transformed reflection coefficients in LPC systems). This paper presents a new approach called vector quantization. For very low data rates, realistic experiments have shown that vector quantization can achieve a given level of average distortion with 15 to 20 fewer bits/frame than that required for the optimized scalar quantizing approaches presently in use. The vector quantizing approach is shown to be a mathematically and computationally tractable method which builds upon knowledge obtained in linear prediction analysis studies. This paper introduces the theory in a nonrigorous form, along with practical results to date and an extensive list of research topics for this new area of speech coding.
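The core idea of vector quantization described above, coding a whole parameter vector with a single codebook index instead of quantizing each parameter independently, can be sketched as follows. This is a minimal illustration and not the paper's method: the toy codebook, the function names, and the use of squared Euclidean distance as the distortion measure are all assumptions made here.

```python
import math


def nearest_codeword(vector, codebook):
    """Return the index of the codeword closest to `vector`.

    Squared Euclidean distance is an assumed stand-in distortion measure.
    """
    best_index, best_dist = 0, math.inf
    for index, codeword in enumerate(codebook):
        dist = sum((v - c) ** 2 for v, c in zip(vector, codeword))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def bits_per_vector(codebook_size):
    """Bits needed to transmit one codebook index."""
    return math.ceil(math.log2(codebook_size))


# Hypothetical 4-entry codebook of 2-dimensional parameter vectors.
codebook = [(0.0, 0.0), (1.0, 1.0), (-1.0, 1.0), (1.0, -1.0)]
frame = (0.9, 0.8)

index = nearest_codeword(frame, codebook)  # encoder sends only this index
decoded = codebook[index]                  # decoder looks the vector up
```

In this toy case the whole two-parameter frame costs `bits_per_vector(len(codebook))` bits, whereas a scalar quantizer would spend bits on each parameter separately.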

Keywords: speech coding; vector quantization; distortion measurement; reflection; linear predictive coding; speech processing; laboratories; narrowband; pervasive computing

Speech coding with wavelets

IEEE Potentials, 1998, vol. 17, no. 2, pp. 38-41

Authors: Litwin, L.R. (Purdue Univ)

Speech coding is currently an active topic for research in the areas of Very Large-Scale Integrated (VLSI) circuit technology and Digital Signal Processing (DSP). Various techniques are being developed to transmit high-quality speech at a low bit rate. The 1/f nature of the speech residual in RELP allows the wavelet transform to efficiently code the residual for transmission. The received speech's bandwidth can be doubled by transmitting only one extra variance value per analysis frame. A prototype 1/f speech coder that employs these techniques has been developed at Drexel University (PA). Various technical issues are being resolved. Work is also being done to adapt these techniques to music coding.

Keywords: speech coding

Speech coding: Applications, challenges and new directions

4th International Symposium on Signal Processing and Its Applications (ISSPA 96)

Authors: Kroon, P (Lucent Technologies, Murray Hill, United States)

ISBN (print): 1864352094

Inexpensive VLSI and an increasing demand for bandwidth efficiency have led to a large increase in the number of applications for speech coding. In the past 5 years, many standards have been defined for use in network and cellular communications systems. This widespread deployment of speech coding technology has created new challenges such as coding of speech with background noise, robust performance over noisy or fading channels, and performance for multiple encodings. In addition, many applications impose constraints on communication delay, cost and power consumption of the implementation. This paper will first review the underlying speech coding principles used in most international and regional standards. This is followed by a closer look at the trade-offs that can be made during the design of a speech coder. The last part of the paper will focus on the technical challenges and new directions of speech coding research for the next five years.

Keywords: speech coding

FOURIER-TRANSFORM VECTOR QUANTIZATION FOR SPEECH CODING

IEEE TRANSACTIONS ON COMMUNICATIONS, 1987, vol. 35, no. 10, pp. 1059-1068

Authors: CHANG, PC; GRAY, RM; MAY, J. Affiliations: IBM CORP, THOMAS J WATSON RES CTR, YORKTOWN HTS, NY 10598 USA; STANFORD UNIV, DEPT ELECT ENGN, INFORMAT SYST LAB, STANFORD, CA 94305 USA; ESL INC, SUNNYVALE, CA 94086 USA

Design algorithms and simulation results are presented for vector quantizers for Fourier transformed data. Transforming the data prior to quantization has two potential advantages. First, each sample in the transform domain depends on many samples in the original domain. Thus, even scalar quantization in the transform domain is a form of vector quantization or block source coding in the original waveform domain and the basic coding theorems of information theory show that such block codes can provide better performance than scalar codes, even for memoryless sources. Second, vector quantization of Fourier transformed speech waveforms provides distinctly better subjective quality than ordinary vector quantization of the waveform using codes of comparable complexity. While the system is, of course, more complicated due to the need to take Fourier transforms, its envisioned application is as a coder for the output of FFT chips currently available or under development. The proposed implementation of a Fourier transform vector quantizer (FTVQ) uses a product code structure, providing different codes for different coefficient vectors corresponding to different frequency bands. This is a form of subband coding and yields a simple means of optimizing bit allocations among the subcodes. Two coding structures with corresponding distortion measures are considered: those that quantize vectors of pairs of real and imaginary coefficients and those that quantize separate vectors of magnitude and phase coefficients. Both structures yield good performance for the given complexity in comparison to waveform vector quantizers. For speech coding, a magnitude-phase FTVQ yields better subjective quality than a real-imaginary FTVQ when the rate allocation is properly chosen.

Keywords: Fourier transforms; vector quantization; speech coding; algorithm design and analysis; source coding; information theory; block codes; product codes; frequency; bit rate

On a code-excited nonlinear predictive speech coding (CENLP) by means of recurrent neural networks

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1998, vol. E81A, no. 8, pp. 1628-1634

Authors: Ma, N; Nishi, T; Wei, G. Affiliations: Kyushu Univ, Dept Comp Sci & Commun Engn, Fukuoka 8128581, Japan; SCUT, Dept Elect & Commun Engn, Guang Zhou 510641, Peoples R China

To improve speech coding quality, in particular the long-term dependency prediction characteristics, we propose a new nonlinear predictor: a fully connected recurrent neural network (FCRNN) in which the hidden units receive feedback not only from themselves but also from the output unit. A comparison of the FCRNN with conventional predictors shows that the FCRNN has a lower prediction error. We apply this FCRNN in place of the previously proposed recurrent neural networks in the code-excited predictive speech coding system (i.e., CELP) and show that our system (FCRNN) requires a lower bit rate per frame and improves speech coding performance.

Keywords: nonlinear prediction; fully connected recurrent neural networks; vector quantization; speech coding

SPEECH CODING AT 4 AND 8 KB/S BASED ON ITERATIVE SEQUENTIAL CELP OPTIMIZATION

IEEE GLOBAL TELECOMMUNICATIONS CONF (GLOBECOM 91)

Authors: LEBLANC, WP; CUPERMAN, V (Syst & Comput Eng Dept, Carleton Univ, Ottawa, Ont, Canada)

ISBN (print): 0879426977

The authors introduce sequential optimization of the parameters of a 4- and 8-kb/s codebook excited linear predictive (CELP) coder. The short-term filter, the long-term adaptive codebook, and the excitation codebook parameters were sequentially optimized to minimize the resulting weighted mean square error. The sequential optimization procedure considers at each subframe the most probable choice of excitation parameters for the next subframe. A simulated annealing algorithm was used to optimize the short-term filter to minimize weighted mean-squared error. The closed-loop algorithm is iterative in nature, and iterations are conducted over the adaptive codebook, the codebook excitation and the short-term filter in an attempt to jointly optimize the corresponding parameters.

Keywords: speech coding

SPEECH CODING WITH TRANSFORM DOMAIN PREDICTION

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Authors: Villemoes, Lars; Klejsa, Janusz; Hedelin, Per (Dolby Sweden AB, Stockholm, Sweden)

We show how model based prediction can be employed in the construction of a speech codec which operates entirely in the frequency domain of a Modified Discrete Cosine Transform (MDCT). The codec tools described in thi...

ISBN (print): 9781538616321

Keywords: speech coding; long term prediction; MDCT; audio coding
