检索结果-内蒙古大学图书馆

Phase-spread voicing analysis in parametric speech coders

ELECTRONICS LETTERS 2006年第11期42卷 665-666页

作者： Edwards, R. Sturt, C. Villette, S. Kondoz, A. Univ Surrey Ctr Commun Syst Res Guildford GU2 7XH Surrey England

Background: Traditional time synchronous (TS) parametric speech coders [1-3] Cannot Currently produce speech of toll quality owing to the inaccurate modelling of perceptually important speech transitions and lack of accurate speech parameter analysis. To improve their performance pitch synchronous (PS) parametric speech coders such as the PS SB-LPC [4] and 1-MELP [5] have been developed which operate off a pitch cycle waveform (PCW) basis. The differences between TS and PS coder types are demonstrated in Fig. I when applied to the voicing classification Of input speech. In Fig. I the TS coder may be incorrect in its classification (Fig. 1d) of the voicing content of the speech signal at points 3 and 4. At point 3 the segment is too short compared to window length. at point 4 the TS method does not provide the necessary transition time accuracy. Because of its smaller analysis window the PS method (Fig. 1c) should be able to capture the finer detail and with an appropriate voicing classification scheme be able to classify the first and third speech segments as voiced. Many of the standard metrics used for analysis in TS coders are based off periodicity such as autocorrelation and AMDF: as these techniques require several cycles of speech they cannot be used with great accuracy for voicing analysis of single cycles. Non-periodic metrics such as peakiness and zero crossing can be applied to single cycles. Peakiness is a measure of how the energy of a signal is spread. We decided to employ peakiness since it was found that its phase behaviour changes predictively when the signal is hand limited. Also. as we utilise phase rather than Periodicity. the technique is more robust to the effects of incorrect pitch detection and irregular pitch variation.

关键词： speech coding parametric speech coders phase-spread voicing analysis pitchcycles speech signal voicing content

来源：评论

学校读者我要写书评

暂无评论

Joint optimization of model and excitation in parametric speech coders

Joint optimization of model and excitation in parametric spe...

引用

IEEE International Conference on Acoustics, speech, and Signal Processing

作者： Lashkari, K Miki, T DoCoMo USA Labs Inc San Jose CA 95110 USA

ISBN: (纸本)0780374029

This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient search in the root domain, model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For multipulse LPC, there is a 0.5-1 dB improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. Listening tests and objective MOS scores confirm the improved speech quality. By adding an extra optimization step, the technique can be incorporated into the LPC, multi-pulse LPC and CELP-type speech coders.

关键词： speech coding vocoders linear predictive coding optimisation speech enhancement speech synthesis minimisation gradient methods search problems joint optimization parametric speech coders analysis-by-synthesis technique excitation parameters model parameters closed loop synthesis error minimization gradient search multipulse LPC segmental SNR listening tests objective MOS scores improved speech quality CELP-type speech coders

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：