检索结果-内蒙古大学图书馆

Current Objectives in 4-kb/s Wireline-Quality speech coding Standardization

IEEE SIGNAL PROCESSING LETTERS 1994年第11期1卷 157-159页

作者： Dimolitsas, Spiros Ravishankar, Channasandra Schroeder, Gerhard Comsat Labs Clarksburg MD 20871 USA Deutsch Bundespost Telekom Forsch & Technol Zentrum Darmstadt Germany

Recently, the standardization of high-quality speech coding has intensified. In parallel, a number of novel applications are placing new demands on transmission efficiency and quality. In response to such challenges, standardization bodies have begun the definition of requirements for the next generation of very low-rate speech coding. Taking a lead in these activities, ANSI committee T1A1 and the ITU-T initiated the definition of the performance and characteristics of a wireline-quality 4-kb/s speech coding algorithm for network applications. In this letter, this emerging set of requirements is presented.

关键词： speech coding Standardization Telephony Signal processing algorithms Next generation networking Telegraphy Code standards Telecommunication standards Prediction algorithms Standards development

来源：评论

学校读者我要写书评

暂无评论

Alias-and-Separate: Wideband speech coding Using Sub-Nyquist Sampling and speech Separation

引用

IEEE SIGNAL PROCESSING LETTERS 2022年 29卷 2003-2007页

作者： Hwang, Soojoong Lee, Eunkyun Jang, Inseon Shin, Jong Won Gwangju Inst Sci & Technol Sch Elect Engn & Comp Sci Gwangju 61005 South Korea Elect & Telecommun Res Inst Daejeon 34129 South Korea

Decimation of a discrete-time signal below the Nyquist rate without applying an appropriate lowpass filter results in a distortion called aliasing. If wideband speech sampled at 16 kHz is decimated by 2 to result in a signal sampled at 8 kHz with aliasing, the decimated signal would be the summation of two speech-like signals, which are the narrowband speech covering 0-4 kHz and the spectrally flipped aliasing component coming from 8-4 kHz. Recently, the performance of speech separation has been remarkably improved with deep learning-based approaches, implying that the narrowband and aliasing components may be able to be separated. In this letter, we propose a novel method for low-rate wideband speech coding utilizing a standard narrowband codec. Instead of coding wideband speech using a wideband codec with a limited bitrate, we propose to decimate the input wideband speech incurring aliasing, and then encode it with a narrowband codec by allocating all the allowed bitrate to 0-4 kHz. After decoding the encoded bitstream, we apply a speech separation technique to obtain the narrowband and aliasing signals, which are then used to reconstruct the wideband speech by expansion, low/highpass filtering, and summation. Experimental results showed that the proposed method could achieve subjective quality comparable to the speeches coded by wideband codecs at higher bitrates in a subjective MUSHRA test.

关键词： speech coding Bit rate Wideband Encoding Narrowband Decoding speech enhancement Frequency aliasing speech codec audio codec speech separation coded signal enhancement

来源：评论

学校读者我要写书评

暂无评论

VARIABLE FRAME RATE TRANSMISSION - A REVIEW OF METHODOLOGY AND APPLICATION TO NARROW-BAND LPC speech coding

引用

IEEE TRANSACTIONS ON COMMUNICATIONS 1982年第4期30卷 674-686页

作者： VISWANATHAN, VR MAKHOUL, J SCHWARTZ, RM HUGGINS, AWF Bolt Beranek and Newman 슠Inc. Cambridge MA USA

We review the variable frame rate (VFR) transmission methodology that we developed, implemented, and tested during the period 1973-1978 for efficiently transmitting LPC vocoder parameters extracted from the input speech at a fixed frame rate. In the VFR method, parameters are transmitted only when their values have changed sufficiently over the interval since their preceding transmission. We explored two distinct approaches to automatic implementation of the VFR method. The first approach bases the transmission decisions on comparisons of the parameter values of the present frame and the last transmitted frame. The second approach, which is based on a functional perceptual model of speech, compares the parameter values of all the frames that lie in the interval between the present frame and the last transmitted frame against a linear model of parameter variation over that interval. The application of VFR transmission to the design of narrow-band LPC speech coders with average bit rates of 2000-2400 bits/s is also considered. The transmission decisions are made separately for the three sets of LPC parameters, pitch, gain, and spectral parameters, using separate VFR schemes. A formal subjective spccch quality test of six selected LPC coders is described, and the results are presented and analyzed in detail. It is shown that a 2075 bit/s VFR coder produces speech quality equal to or better than that of a 5700 bit/s fixed frame rate coder.

关键词： Narrowband Linear predictive coding speech coding Bit rate Testing Vocoders speech analysis Interpolation Steady-state Acoustics

来源：评论

学校读者我要写书评

暂无评论

Low bit-rate speech coding by perceptually optimized noise excitation modulation

引用

SIGNAL PROCESSING 1997年第1期56卷 77-89页

作者： Tsoukalas, D Mourjopoulos, J Kokkinakis, G UNIV PATRAS WIRE COMMUN LAB PATRAS 26500 GREECE

A novel low bit-rate high-quality speech coding technique is presented based on a perceptually optimized signal reconstruction method. According to this parametric speech model, the signal's spectral envelope is reconstructed from non-linear spectral filtering of an excitation signal, which is a combination of a random broadband noise signal with a number of discrete spectral pulses extracted from the original speech using a perceptual model. This general coding platform allows variable bit-rate implementations, starting from 1.9 kbit/s, at which sufficient intelligibility (more than 92%) was measured, while at higher bit-rates (2.8 kbit/s) intelligibility scores were better than 94% with sufficient naturalness in the coded speech. In all cases, the complexity of the proposed system is very low. (C) 1997 Elsevier Science B.V.

关键词： speech speech coding parametric representation

来源：评论

学校读者我要写书评

暂无评论

A PIPELINED ADAPTIVE DIFFERENTIAL VECTOR QUANTIZER FOR LOW-POWER speech coding APPLICATIONS

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS 1993年第5期40卷 347-349页

作者： SHANBHAG, NR PARHI, KK Department of Electrical Engineering University of Minnesota Minneapolis MN USA

A fine-grain pipelined adaptive differential vector quantizer architecture is proposed for low-power speech coding applications. The pipelined architecture is developed by employing the relaxed look-ahead technique. The hardware overhead due to pipelining is only the pipelining latches. Simulations with speech sampled at 8 Khz show that, for a vector dimension of 8, the degradation in the signal-to-noise ratio (SNR) due to pipelining is negligible. Furthermore, this degradation is independent of the level of pipelining. Thus the proposed architecture is attractive from an integrated circuit implementation point of view.

关键词： speech coding Finite impulse response filter Signal processing algorithms Digital filters Circuits Pipeline processing Intersymbol interference Nonlinear filters Algorithm design and analysis Simulated annealing

来源：评论

学校读者我要写书评

暂无评论

speech coding technology in CDMA mobile system and its implementation

Speech coding technology in CDMA mobile system and its imple...

引用

APOC 2003: Asia-Pacific Optical and Wireless Communications - Mobile Service and Application

作者： Liu, Lan Wu, Wei College of Information Engineering Wuhan University of Technology Wuhan China School of Science Wuhan University of Technology Wuhan China

The fundamental of QCELP speech coding technology is introduced. According to the features of TMS320C54X family DSP of TI Inc., the implementation approach of QCELP speech coding with fixed-point DSP (digital signal p... 详细信息

关键词： speech coding

来源：评论

学校读者我要写书评

暂无评论

Very low bit rate speech coding using a diphone-based recognition and synthesis approach

引用

ELECTRONICS LETTERS 1998年第9期34卷 859-860页

作者： Felici, M Borgatti, M Guerrieri, R Univ Bologna Dept Elect I-40136 Bologna Italy

High compression rates of speech signals may be achieved by coding schemes based on relevant linguistic segments. A system is described that relies on a diphone recogniser as the coder and on a speech synthesiser reproducing speech starting from a diphone codebook as the decoder. The spoken message is encoded in textual (phoneme labels) plus prosody representation. This speech coding technique may be used for voice mail or phone communication over low bit rate channels.

关键词： low bit rate channel data compression prosody representation textual representation phone communication speech coding Codes phoneme label linguistic segment speech and audio signal processing speech recognition speech synthesis compression rate voice mail diphone recognition

来源：评论

学校读者我要写书评

暂无评论

CELP BASED MIXED-SOURCE MODEL FOR VERY LOW BIT-RATE speech coding

引用

ELECTRONICS LETTERS 1993年第2期29卷 156-157页

作者： KWON, CH UN, CK Korea Adv. Inst. of Sci. & Technol. Daejeon South Korea

A CELP based mixed-source model is described. It uses a mixed excitation which combines a lowpass-filtered adaptive source and a highpass-filtered stochastic source. In addition, one more stochastic source is newly employed for more natural sounding speech. In informal listening tests, the proposed model at 3 kbit/s shows very good performance both in speech quality and intelligibility.

关键词： speech coding MODELING

来源：评论

学校读者我要写书评

暂无评论

LOW-DELAY HYBRID VECTOR EXCITATION LINEAR PREDICTIVE speech coding

引用

ELECTRONICS LETTERS 1993年第25期29卷 2164-2165页

作者： CHEN, H WONG, WC KO, CC Dept. of Electr. Eng. Nat. Univ. of Singapore Singapore

A hybrid approach in determining the excitation vector in a low-delay code excited linear predictive coder is proposed. By a judicious division of the composite excitation vector into long-term and short-term componen... 详细信息

关键词： LINEAR PREDICTIVE coding speech coding

来源：评论

学校读者我要写书评

暂无评论

Very low rate speech coding using temporal decomposition

引用

ELECTRONICS LETTERS 1999年第6期35卷 456-457页

作者： Ghaemmaghami, S Sridharan, S Queensland Univ Technol Sch Elect Elect & Syst Engn Speech Res Lab Brisbane Qld 4001 Australia Sharif Univ Technol Elect Res Ctr Tehran Iran

A method for encoding the spectral characteristics of speech, at rates below 180 bit/s, using hierarchical temporal decomposition (HTD) is proposed. A set of the log-area-ratio (LAR) parameters, extracted from a given block of speech, are approximated through Gaussian interpolation between the most-steady frames detected by the HTD. This results in a smaller set of parameters which are encoded using vector quantisation. It is shown that the same spectral distortion is obtained with the new coder at a rate of 180 bit/s as that using a scalar quantisation, TD-based coder, at 600 bit/s.

关键词： interpolation log-area-ratio parameters vector quantisation Gaussian interpolation speech coding Codes spectral distortion hierarchical temporal decomposition speech and audio coding Interpolation and function approximation (numerical analysis) most-steady frames spectral characteristics 0 to 180 bit/s Gaussian processes

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：