检索结果-内蒙古大学图书馆

A voicing-driven packet loss recovery algorithm for analysis-by-synthesis predictive speech coders over Internet

IEEE TRANSACTIONS ON MULTIMEDIA 2001年第1期3卷 98-107页

作者： Wang, JF Wang, JC Yang, JF Wang, JJ Natl Cheng Kung Univ Dept Elect Engn Tainan 70101 Taiwan

In this paper, a novel voicing-driven adaptive packet loss recovery algorithm is proposed to lessen the possible voice degradation and error propagation for analysis-by-synthesis speech coders in Internet applications. After voicing classification, we adaptively adopt random noise generation, multiresolution excitation generation, or pulse tracking procedure to recover the lost packets. By applying the algorithm to the G.723.1 coder, simulation results show that the proposed algorithm is superior to the recovery algorithm embedded in the G.723.1 standard through the subjective evaluation.

关键词： analysis-by-synthesis CELP packet loss recovery speech coder voice over IP

来源：评论

学校读者我要写书评

暂无评论

speech CODING FOR TELECOMMUNICATIONS

引用

ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL 1992年第5期4卷 273-283页

作者： BOYD, I BT Laboratories BT plc Ipswich UK

The function of a speech coding algorithm is to convert an analogue speech signal into a digital form for efficient transmission over a digital path, or efficient storage on a digital storage medium, and to perform the complementary function of converting a received digital signal back to analogue form. The article reviews those speech coding techniques which are already being extensively used in telecommunications applications. As well as explaining the basic principles employed by these speech coding algorithms to achieve efficient digital encoding, examples of telecommunications services which use these algorithms are presented.

关键词： speech coder digital signal digital storage medium speech coding algorithm digital encoding telecommunications services Codes speech coding speech and audio signal processing telecommunication services analogue speech signal digital transmission Telecommunication applications telecommunications applications

来源：评论

学校读者我要写书评

暂无评论

VoIP: State of art for global connectivity-A critical review

引用

JOURNAL OF NETWORK AND COMPUTER APPLICATIONS 2014年第1期37卷 365-379页

作者： Singh, Harjit Pal Singh, Sarabjeet Singh, J. Khan, S. A. Dr BR Ambedkar Natl Inst Technol Dept Phys Jalandhar India MD Colllge Dept Comp Sci Sri Ganganagar India SSIET Dept Elect Technol Amritsar Punjab India

The Internet has revolutionized the telecommunication systems by supporting new applications and services. Voice over Internet Protocol (VoIP) is one of the most prominent telecommunication services based on the Internet Protocol (IP). The signal quality of the VoIP system depends on several factors such as networking conditions, coding processes, speech content and error correction schemes. The work in the present paper reviewed these issues, used for providing toll-quality communication service to the users over VoIP system. From the very beginning of transferring the voice data over packet switched networks, the journey of the packet based communications to modern VoIP and advancements to improve the service of the VoIP system has been summarized in this work. (C) 2013 Elsevier Ltd. All rights reserved.

关键词： VoIP IP telephony speech coder Voice quality Digital signal processing (DSP)

来源：评论

学校读者我要写书评

暂无评论

speech recognition using quantized LSP parameters and their transformations in digital communication

引用

speech COMMUNICATION 2000年第4期30卷 223-233页

作者： Choi, SH Kim, HK Lee, HS Korea Adv Inst Sci & Technol Dept Elect Engn Yusong Gu Taejon 305701 South Korea Samsung Adv Inst Technol Human & Comp Interact Lab Yongin 449712 Kyungki Do South Korea AT&T Labs Res Florham Pk NJ 07932 USA SK Telecom Cent Res Lab Pundang Gu Songnam 463020 Kyungki Do South Korea

In digital communication networks, speech recognition systems conventionally first reconstruct speech and then extract feature parameters. In this paper, we consider a useful approach of incorporating speech coding parameters into the speech recognizer. Most speech coders employed in digital communication networks use line spectrum pairs (LSPs) as spectral parameters. We introduce two ways to improve the recognition performance of the LSP-based speech recognizer. One is to devise weighted distance measures of LSPs and the other is to transform LSPs into a new feature set, named pseudo-cepstrum (PCEP). The speaker-independent connected-digit recognition experiments based on the discrete hidden Markov model showed that the weighted distance measures provide better recognition accuracy than unweighted ones do. Additionally, a mel-scale PCEP gives an even better performance than the weighted distance measures do. To clarify the performance improvement of the proposed methods, a significance test is introduced. As a result, the proposed methods achieved higher performances in recognition accuracy, compared with the conventional methods employing mel-frequency cepstral coefficients. (C) 2000 Elsevier Science B.V. All rights reserved.

关键词： speech recognition speech coder digital communication line spectrum pairs pseudo-cepstrum

来源：评论

学校读者我要写书评

暂无评论

Improved optimisation of excitation sequences in speech and audio coders

引用

ELECTRONICS LETTERS 2004年第8期40卷 515-517页

作者： Riera-Palou, F den Brinker, AC Gerrits, AJ Sluijter, RJ Philips Res Labs NL-5656 AA Eindhoven Netherlands

A new optimisation criterion for computing excitation sequences, typically used in analysis by synthesis linear prediction coders, is presented. This criterion not only relies on the error over the processed frame but also includes the excitation effects induced in future frames. This technique is effective in removing ticks that are otherwise audible.

关键词： optimisation audio coder speech coder audio coding linear predictive coding Optimisation techniques speech coding vocoders excitation sequence speech and audio coding synthesis linear prediction coder excitation effects speech recognition and synthesis speech synthesis

来源：评论

学校读者我要写书评

暂无评论

EVRC-wideband: The new 3GPP2 wideband vocoder standard

EVRC-wideband: The new 3GPP2 wideband vocoder standard

引用

32nd IEEE International Conference on Acoustics, speech and Signal Processing

作者： Krishnan, Venkatesh Rajendran, Vivek Kandhadai, Ananthapadmanabhan Manjunath, Sharath QUALCOMM Inc 5775 Morehouse Dr San Diego CA 92121 USA

ISBN: (纸本)1424407281

In this paper, the latest wideband vocoder standard adopted by the cdma2000 standardization body, 3GPP2, is described. Christened Enhanced Variable Rate Codec- Wideband (EVRC-WB), the proposed codec encodes wideband speech (16 KHz sampling frequency) at a maximum bit-rate of 8.55 kbit/s. EVRC-WB is based on a split band coding paradigm in which two different coding models are used for the signal components in the low frequency (LF) (0-4 KHz) and the high frequency (HF) (3.5-7 KHz) bands. The coding model used for the former is based on the EVRC-B narrowband (0-4 KHz) codec, modified to encode the LF band signal at a maximum bitrate of 7.75 kbit/s. The HF band coding model is a LPC based coding scheme where the excitation is derived from the coded LF band excitation using non-linear processing. Mean opinion scores from 3GPP2 characterization tests are provided to demonstrate that the EVRC-WB codee (8.55 kbit/s, max.) performs statistically significantly better than the Adaptive Multirate Wideband (12.65 kbit/s, max.).

关键词： wideband speech coder non-linear processing

来源：评论

学校读者我要写书评

暂无评论

Information Hiding in Line spectrum pair feature of non-voice part of speech signal

Information Hiding in Line spectrum pair feature of non-voic...

引用

International Conference On Smart Technologies For Smart Nation (SmartTechCon)

作者： Tahilramani, Nikunj Bhatt, Ninad Uka Tarsadia Univ ECC Dept Silver Oak Coll Engn & Tech Ahmadabad Gujarat India EC Dept Ahmadabad Gujarat India CK Pithawala Coll Engn ECC Dept Surat Gujarat India

ISBN: (纸本)9781538605691

Transmission of data in Voice over Internet Protocol (VoIP) must be made secure and robust such that data should not be easily attacked by intruders. Main objective of the proposed system is to hide the secret information in the silence part of speech signal for secure communication. Voice Activity Detection (VAD) Algorithm of ITU-T G.729B speech coder is performed to detect silence part of speech signal which is followed by Steganography for embedding and extraction of secret information. In order to evaluate the performance parameters for data hiding capacity in speech signal and for speech quality, the parameters like Perceptual Evaluation of speech Quality (PESQ), Absolute error (ABS), Root Mean Square Error (RAISE), Mean Square Error (MSE), Mean Optimum Score (MOS) are explored. Robustness for proposed hiding scheme is performed by introducing compression attack and resampling of speech signal.

关键词： Voice Activity Detection speech coder Steganography PESQ ABS RMSE MSE MOS

来源：评论

学校读者我要写书评

暂无评论

Next Generation of Mixed Excited Linear Prediction speech Quality

Next Generation of Mixed Excited Linear Prediction Speech Qu...

引用

International Conference on Green Computing Communication and Electrical Engineering (ICGCCEE)

作者： Miyani, Haresh Mehta, Aalay Nai, Pratik Patel, Harshad Bhagwan Mahavir Coll Engn & Technol Dept Elect Engn Surat 395017 India

ISBN: (纸本)9781479949816

Nowadays the number of mobile subscribers is increasing all over the world, so the system for the communication has to be improved. Mixed Excited Linear Prediction (MELP) algorithm is developed for reducing the bandwidth of the signal as well as transmit more data on a single channel. This results in increase in channel capacity. MELP is basically a speech coding method, relying on a speech Encoder and speech Decoder. The MELP speech coder reduces the redundancy of the signal and compresses it, which is represented by the MELP code. speech Decoder includes a Linear Predictive Coding (LPC) filter providing a synthesized speech at its output side in response to voice and unvoiced. MELP also reduces jitter voice. The bit rate of MELP is reducing the reserves of the code book and calculation complexity. This paper describes the bit rates of MELP coder can be reduced to as low as 2.4kbps without apparent damage to the synthetic speech quality.

关键词： MELP Channel capacity speech coder Bandwidth Synthetic speech

来源：评论

学校读者我要写书评

暂无评论

Gender-dependent and speaker-dependent speech enhancement

Gender-dependent and speaker-dependent speech enhancement

引用

IEEE International Conference on Acoustics, speech, and Signal Processing

作者： Potamitis, I Fakotakis, N Kokkinakis, G Univ Patras Dept Elect & Comp Engn Wire Commun Lab GR-26110 Patras Greece

ISBN: (纸本)0780374029

Our work introduces a speech enhancement technique that can explicitly incorporate prior information about the gender or speaker time-frequency characteristics in its formalism. We approximate the multimodal, clean speech linear spectrum magnitude with a mixture of Gaussians pdfs using the Expectation-Maximization algorithm (EM). Subsequently, we apply the Bayesian inference framework to the degraded spectral coefficients and by employing Minimum Mean Square Error Estimation (MMSE) we derive a closed form solution for the spectral magnitude estimation task adapted to the spectral characteristics and noise variance of each band. We suggest that 2-3 minutes of phonetically balanced non-degraded gender or speaker dependent speech is adequate to tune our algorithm. We demonstrate the benefit of using an enhancement technique tailored to a specific gender or speaker and propose its use in cases where message ambiguity is of critical importance. We evaluate of our algorithm using Lynx helicopter and White Gaussian noise on the task of improving the quality of speech and in combination with a speech coder and demonstrate its robustness at very low SNRs. Implementation code is available at: http://***/potamitis/***.

关键词： speech enhancement least mean squares methods white noise Gaussian noise acoustic noise iterative methods Bayes methods gender-dependent speech enhancement speaker-dependent speech enhancement prior information speaker time-frequency characteristics multimodal clean speech linear spectrum magnitude Gaussian pdfs expectation-maximization algorithm Bayesian inference framework degraded spectral coefficients minimum mean square error estimation MMSE closed form solution spectral magnitude estimation task phonetically balanced nondegraded speech Lynx helicopter noise white Gaussian noise quality of speech speech coder

来源：评论

学校读者我要写书评

暂无评论

CELP coder Modification for the Voice Conversion

CELP Coder Modification for the Voice Conversion

引用

SCIEI 2014 Milano Conference

作者： Abdelkader Guerid Amrane Houacine LCPTS Faculty of Electronics and InformaticsUSTHB

Voice Conversion(VC) consists in modifying a source voice to a target speaker voice. In our approach, we modified only the Code excited linear Predictive(CELP) coder by introducing a pre-processing before the coder for the voice conversion. The decoder part of CELP was not modified. This allows maintaining the transmission rate. Our approach for conversion consists in separating the voiced and unvoiced frames, and thus two different conversion functions are associated. The Spectral Frequency Parameters LSF parameters are adopted to represent the vocal tract and Gaussian Mixture Models(GMM) are used to calculate the conversion functions. The pitch for the voiced frames is transformed by linear conversion. The model was tested for conversions between male and female voices.

关键词： Voice conversion Gaussian Mixture Model(GMM) CELP LPC speech coder

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：