Search Results - Inner Mongolia University Library

COMPARISON OF 1-BIT ADAPTIVE QUANTIZERS FOR SPEECH CODING

ELECTRONICS LETTERS, 1989, Vol. 25, No. 9, pp. 586-588

Authors: HALL, SC; BRADLOW, HS. Affiliation: UNIV WOLLONGONG, DEPT ELECT & COMP ENGN, WOLLONGONG, NSW 2500, AUSTRALIA

A 1-bit version of the recently proposed generalised hybrid adaptive quantiser (GHAQ) is compared with two other 1-bit adaptive quantisers, in terms of the signal-to-noise ratio obtained when coding speech in delta modulators. The GHAQ is found to have a clear advantage, particularly when the speech is acquired by a telephone microphone. It is also shown that the optimum coefficients of the predictor can be influenced substantially by the design of the adaptive quantiser, and a technique for finding suitable coefficients in this context is described.

Keywords: optimum coefficients; speech analysis and processing; generalised hybrid adaptive quantiser; adaptive systems; signal-to-noise ratio; encoding; telephone microphone; delta modulation; speech coding; Signal processing and detection; speech and audio signal processing; delta modulators; analogue-digital conversion; predictor; Modulation and coding methods; 1-bit adaptive quantisers; filtering and prediction theory
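
The abstract above compares 1-bit adaptive quantisers inside delta modulators by signal-to-noise ratio. The record does not specify the GHAQ itself, so the following is only a rough sketch of the general setting. It assumes a generic one-bit-memory step-size rule (similar in spirit to the adaptive delta modulation usually attributed to Jayant), a first-order predictor, and a synthetic tone in place of speech. All function names, parameter values, and the test signal are illustrative assumptions, not taken from the paper.

```python
import math

def _next_step(step, bit, prev_bit, multiplier, step_min, step_max):
    """Grow the step when two successive bits agree, shrink it when they differ."""
    if prev_bit is None:
        return step
    step = step * multiplier if bit == prev_bit else step / multiplier
    return min(max(step, step_min), step_max)

def adm_encode(samples, coeff=0.95, step=0.02, multiplier=1.5,
               step_min=0.002, step_max=0.5):
    """Encode each sample as one bit; also return the encoder's local reconstruction."""
    bits, recon = [], []
    estimate, prev_bit = 0.0, None
    for x in samples:
        prediction = coeff * estimate        # first-order predictor (assumed)
        bit = 1 if x >= prediction else 0    # 1-bit quantiser: sign of prediction error
        step = _next_step(step, bit, prev_bit, multiplier, step_min, step_max)
        estimate = prediction + (step if bit else -step)
        bits.append(bit)
        recon.append(estimate)
        prev_bit = bit
    return bits, recon

def adm_decode(bits, coeff=0.95, step=0.02, multiplier=1.5,
               step_min=0.002, step_max=0.5):
    """Rebuild the signal from the bit stream alone by mirroring the encoder state."""
    recon, estimate, prev_bit = [], 0.0, None
    for bit in bits:
        step = _next_step(step, bit, prev_bit, multiplier, step_min, step_max)
        estimate = coeff * estimate + (step if bit else -step)
        recon.append(estimate)
        prev_bit = bit
    return recon

def snr_db(reference, reconstruction):
    """Signal-to-noise ratio in dB of a reconstruction against its reference."""
    signal = sum(x * x for x in reference)
    noise = sum((x - y) ** 2 for x, y in zip(reference, reconstruction))
    return float("inf") if noise == 0 else 10 * math.log10(signal / noise)

# A synthetic 200 Hz tone sampled at 8 kHz stands in for speech (assumption).
test_signal = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(800)]
bits, encoder_recon = adm_encode(test_signal)
decoder_recon = adm_decode(bits)
print(f"SNR: {snr_db(test_signal, decoder_recon):.1f} dB")
```

Because the step size is derived only from the transmitted bits, the decoder can track it without side information. Changing `coeff` while holding the adaptation rule fixed, and vice versa, is one simple way to observe the interaction between predictor and quantiser that the abstract mentions.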

LPC analysis incorporating spectral interpolation for speech coding

ELECTRONICS LETTERS, 1999, Vol. 35, No. 3, pp. 200-201

Authors: Lee, MS; Kim, HK; Lee, HS. Affiliations: Korea Adv Inst Sci & Technol, Dongdaemun Gu, Seoul 130012, South Korea; SK Telekom Cent R&D Ctr, Yusong Gu, Taejon 305348, South Korea

A linear predictive coding (LPC) analysis scheme which is applicable to speech coding is proposed. The analysis method, called interpolative LPC (ILPC) analysis, estimates the spectral envelope by incorporating the interpolation characteristics into the LPC analysis. The ILPC analysis reduces average spectral distortion and the percentage of outlier frames, compared with the conventional LPC analysis followed by linear interpolation.

Keywords: LPC analysis; speech processing techniques; interpolation; linear predictive coding; speech coding; spectral envelope estimation; speech and audio coding; Interpolation and function approximation (numerical analysis); Signal processing theory; spectral interpolation; interpolative LPC

Development of custom vector accelerator for high-performance speech coding

ELECTRONICS LETTERS, 2004, Vol. 40, No. 24, pp. 1559-1561

Authors: Chouliaras, V; Nunez, JL; Koutsomyti, K; Parr, SR; Mulvaney, DJ; Datta, S. Affiliation: Loughborough Univ Technol, Dept Elect & Elect Engn, Loughborough LE11 3TU, Leics, England

The addition of custom vector instructions to the G.729A speech coding algorithm is shown to significantly reduce its computational complexity. The identified vector extensions are implemented in the form of a configurable vector accelerator, tightly coupled to a 32 bit Sparc V8-compliant reduced instruction set (RISC) processor. Architectural simulation demonstrates that a reduction in complexity of up to 60%, for a vector length of sixteen 16 bit elements, is achievable in current VLSI technology.

Keywords: Computer architecture; Microprocessor chips; high performance speech coding; Microprocessors and microcomputers; speech coding; architectural simulation; code standards; custom vector accelerator; Sparc V8-compliant reduced instruction set processor; VLSI technology; G.729A speech coding algorithm; microprocessor chips; speech processing techniques; VLSI; reduced instruction set computing; RISC processor; Computational complexity; speech and audio coding; computational complexity

Fast vector-sum codebook search method for low bit rate speech coding

ELECTRONICS LETTERS, 1997, Vol. 33, No. 6, pp. 451-452

Authors: Choi, YS; Park, SW; Youn, DH. Affiliation (as listed in the catalog record): Laboratory of Chromatography, DEPg. Fac. Quimica, Universidad Nacional Autonoma de Mexico, Circuito interior, Cd Universitaria, CP 04510, Mexico D.F., Mexico

A fast vector-sum codebook search method for low bit rate speech coding is presented. In this method, the codebook search is simplified by designing a vector-sum codebook that consists of orthonormal regular pulse basis vectors. A further simplification is achieved by adopting backward filtering. The method proposed has significantly reduced computational complexity, compared with the conventional VSELP, without producing any additional degradation in the quality of the synthesised speech.

Keywords: speech coding

VECTOR-ADAPTIVE VECTOR QUANTIZATION WITH APPLICATION TO SPEECH CODING

IEEE TRANSACTIONS ON COMMUNICATIONS, 1991, Vol. 39, No. 6, pp. 958-962

Authors: KOU, WD; MARK, JW. Affiliations: UNIV WATERLOO, DEPT ELECT & COMP ENGN, WATERLOO N2L 3G1, ONTARIO, CANADA; UNIV WATERLOO, DEPT ELECT ENGN, WATERLOO N2L 3G1, ONTARIO, CANADA

A vector-adaptive vector quantization (VAVQ) scheme, which may be viewed as a generalization of gain-adaptive vector quantization, is described. The proposed scheme adjusts each component of the encoding signal vector according to a statistical estimate of the signal characteristics. The VAVQ scheme can cope with large input dynamic ranges, can be used in either the time domain or the transform domain, and exhibits approximately 4 dB improvement in segmental SNR over the fixed VQ.

Keywords: Vector quantization; speech coding; Encoding; Decoding; Algorithm design and analysis; speech processing; Dynamic range; Shape; Probability density function; Block codes

Current Objectives in 4-kb/s Wireline-Quality Speech Coding Standardization

IEEE SIGNAL PROCESSING LETTERS, 1994, Vol. 1, No. 11, pp. 157-159

Authors: Dimolitsas, Spiros; Ravishankar, Channasandra; Schroeder, Gerhard. Affiliations: Comsat Labs, Clarksburg, MD 20871, USA; Deutsch Bundespost Telekom, Forsch & Technol Zentrum, Darmstadt, Germany

Recently, the standardization of high-quality speech coding has intensified. In parallel, a number of novel applications are placing new demands on transmission efficiency and quality. In response to such challenges, standardization bodies have begun the definition of requirements for the next generation of very low-rate speech coding. Taking a lead in these activities, ANSI committee T1A1 and the ITU-T initiated the definition of the performance and characteristics of a wireline-quality 4-kb/s speech coding algorithm for network applications. In this letter, this emerging set of requirements is presented.

Keywords: speech coding; Standardization; Telephony; Signal processing algorithms; Next generation networking; Telegraphy; Code standards; Telecommunication standards; Prediction algorithms; Standards development

Alias-and-Separate: Wideband Speech Coding Using Sub-Nyquist Sampling and Speech Separation

IEEE SIGNAL PROCESSING LETTERS, 2022, Vol. 29, pp. 2003-2007

Authors: Hwang, Soojoong; Lee, Eunkyun; Jang, Inseon; Shin, Jong Won. Affiliations: Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju 61005, South Korea; Elect & Telecommun Res Inst, Daejeon 34129, South Korea

Decimation of a discrete-time signal below the Nyquist rate without applying an appropriate lowpass filter results in a distortion called aliasing. If wideband speech sampled at 16 kHz is decimated by 2 to result in a signal sampled at 8 kHz with aliasing, the decimated signal would be the summation of two speech-like signals, which are the narrowband speech covering 0-4 kHz and the spectrally flipped aliasing component coming from 4-8 kHz. Recently, the performance of speech separation has been remarkably improved with deep learning-based approaches, implying that the narrowband and aliasing components may be separable. In this letter, we propose a novel method for low-rate wideband speech coding utilizing a standard narrowband codec. Instead of coding wideband speech using a wideband codec with a limited bitrate, we propose to decimate the input wideband speech incurring aliasing, and then encode it with a narrowband codec by allocating all the allowed bitrate to 0-4 kHz. After decoding the encoded bitstream, we apply a speech separation technique to obtain the narrowband and aliasing signals, which are then used to reconstruct the wideband speech by expansion, low/highpass filtering, and summation. Experimental results showed that the proposed method could achieve quality comparable to that of speech coded by wideband codecs at higher bitrates in a subjective MUSHRA test.

Keywords: speech coding; Bit rate; Wideband; Encoding; Narrowband; Decoding; speech enhancement; Frequency aliasing; speech codec; audio codec; speech separation; coded signal enhancement
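
The aliasing effect described in the abstract above can be checked numerically. The sketch below is not the authors' codec. It only illustrates, under simple assumptions, that decimating a 16 kHz signal by 2 without a lowpass filter folds content from the 4-8 kHz band into 0-4 kHz with the spectrum flipped. Pure tones stand in for speech, and the helper names and the naive DFT are illustrative choices.

```python
import cmath
import math

def tone(freq_hz, sample_rate, num_samples):
    """Generate a unit-amplitude sine tone."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(num_samples)]

def decimate_without_filter(signal, factor=2):
    """Keep every `factor`-th sample with no anti-aliasing lowpass filter."""
    return signal[::factor]

def dominant_frequency(signal, sample_rate):
    """Return the frequency (Hz) of the strongest DFT bin in [0, sample_rate / 2]."""
    n = len(signal)
    best_bin, best_mag = 0, 0.0
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(signal))
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n

FS_WIDE = 16000           # wideband sampling rate from the abstract
FS_NARROW = FS_WIDE // 2  # rate after decimation by 2

for f_in in (6000, 7000):
    aliased = decimate_without_filter(tone(f_in, FS_WIDE, 320), 2)
    f_out = dominant_frequency(aliased, FS_NARROW)
    # The tone reappears at 8000 - f_in, so higher inputs land lower: a flipped band.
    print(f"{f_in} Hz at 16 kHz -> {f_out:.0f} Hz at 8 kHz")
```

In the scheme the abstract describes, this folded component is kept on purpose, carried through a narrowband codec, and later separated from the genuine 0-4 kHz content.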

VARIABLE FRAME RATE TRANSMISSION - A REVIEW OF METHODOLOGY AND APPLICATION TO NARROW-BAND LPC SPEECH CODING

IEEE TRANSACTIONS ON COMMUNICATIONS, 1982, Vol. 30, No. 4, pp. 674-686

Authors: VISWANATHAN, VR; MAKHOUL, J; SCHWARTZ, RM; HUGGINS, AWF. Affiliation: Bolt Beranek and Newman Inc., Cambridge, MA, USA

We review the variable frame rate (VFR) transmission methodology that we developed, implemented, and tested during the period 1973-1978 for efficiently transmitting LPC vocoder parameters extracted from the input speech at a fixed frame rate. In the VFR method, parameters are transmitted only when their values have changed sufficiently over the interval since their preceding transmission. We explored two distinct approaches to automatic implementation of the VFR method. The first approach bases the transmission decisions on comparisons of the parameter values of the present frame and the last transmitted frame. The second approach, which is based on a functional perceptual model of speech, compares the parameter values of all the frames that lie in the interval between the present frame and the last transmitted frame against a linear model of parameter variation over that interval. The application of VFR transmission to the design of narrow-band LPC speech coders with average bit rates of 2000-2400 bits/s is also considered. The transmission decisions are made separately for the three sets of LPC parameters (pitch, gain, and spectral parameters), using separate VFR schemes. A formal subjective speech quality test of six selected LPC coders is described, and the results are presented and analyzed in detail. It is shown that a 2075 bit/s VFR coder produces speech quality equal to or better than that of a 5700 bit/s fixed frame rate coder.

Keywords: Narrowband; Linear predictive coding; speech coding; Bit rate; Testing; Vocoders; speech analysis; Interpolation; Steady-state; Acoustics
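
The first VFR approach in the abstract above (send a frame only when its parameters differ enough from the last transmitted frame) can be sketched in a few lines. This is a minimal illustration, not the authors' scheme. The maximum absolute difference used as the distance measure, the threshold value, and the toy parameter vectors are all assumptions made for the example.

```python
def select_frames_to_transmit(frames, threshold):
    """Return indices of frames whose parameters moved more than `threshold`
    away from the most recently transmitted frame (assumed distance measure:
    maximum absolute difference across parameters)."""
    if not frames:
        return []
    transmitted = [0]          # the first frame is always sent
    last_sent = frames[0]
    for index in range(1, len(frames)):
        frame = frames[index]
        distance = max(abs(a - b) for a, b in zip(frame, last_sent))
        if distance > threshold:
            transmitted.append(index)
            last_sent = frame  # later frames are compared against this one
    return transmitted

# Toy two-parameter frames extracted at a fixed frame rate (made-up values).
frames = [
    [0.10, 0.50],
    [0.11, 0.50],
    [0.12, 0.51],
    [0.30, 0.50],
    [0.31, 0.49],
    [0.31, 0.80],
]
sent = select_frames_to_transmit(frames, threshold=0.05)
print(sent)  # [0, 3, 5]
```

A receiver would need some rule for filling in the skipped frames, for example holding or interpolating between transmitted values. The abstract states that decisions are made separately for pitch, gain, and spectral parameters, which would correspond to running a selector like this once per parameter set.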

Low bit-rate speech coding by perceptually optimized noise excitation modulation

SIGNAL PROCESSING, 1997, Vol. 56, No. 1, pp. 77-89

Authors: Tsoukalas, D; Mourjopoulos, J; Kokkinakis, G. Affiliation: UNIV PATRAS, WIRE COMMUN LAB, PATRAS 26500, GREECE

A novel low bit-rate high-quality speech coding technique is presented based on a perceptually optimized signal reconstruction method. According to this parametric speech model, the signal's spectral envelope is reconstructed from non-linear spectral filtering of an excitation signal, which is a combination of a random broadband noise signal with a number of discrete spectral pulses extracted from the original speech using a perceptual model. This general coding platform allows variable bit-rate implementations, starting from 1.9 kbit/s, at which sufficient intelligibility (more than 92%) was measured, while at higher bit-rates (2.8 kbit/s) intelligibility scores were better than 94% with sufficient naturalness in the coded speech. In all cases, the complexity of the proposed system is very low. (C) 1997 Elsevier Science B.V.

Keywords: speech; speech coding; parametric representation

Very low bit rate speech coding using a diphone-based recognition and synthesis approach

ELECTRONICS LETTERS, 1998, Vol. 34, No. 9, pp. 859-860

Authors: Felici, M; Borgatti, M; Guerrieri, R. Affiliation: Univ Bologna, Dept Elect, I-40136 Bologna, Italy

High compression rates of speech signals may be achieved by coding schemes based on relevant linguistic segments. A system is described that relies on a diphone recogniser as the coder and on a speech synthesiser reproducing speech starting from a diphone codebook as the decoder. The spoken message is encoded in textual (phoneme labels) plus prosody representation. This speech coding technique may be used for voice mail or phone communication over low bit rate channels.

Keywords: low bit rate channel; data compression; prosody representation; textual representation; phone communication; speech coding; Codes; phoneme label; linguistic segment; speech and audio signal processing; speech recognition; speech synthesis; compression rate; voice mail; diphone recognition
