Recognizing GSM digital speech

Authors: Gallardo-Antolín, A.; Peláez-Moreno, C.; Díaz-de-María, F.

Affiliation: Univ Carlos III Madrid, Signal Theory & Commun Dept, Leganes 28911, Madrid, Spain

Publication: IEEE Transactions on Speech and Audio Processing (IEEE Trans Speech Audio Process)

Year, volume, issue: 2005, Vol. 13, No. 6

Pages: 1186-1205

Funding: Spanish Government (TIC2002-02025)

Keywords: coding distortion; Global System for Mobile (GSM) networks; speech coding; speech recognition; tandeming; transmission errors; wireless networks

Abstract: The Global System for Mobile (GSM) environment poses three main problems for automatic speech recognition (ASR) systems: noisy scenarios, source coding distortion, and transmission errors. The first has already received much attention; however, source coding distortion and transmission errors must be explicitly addressed. In this paper, we propose an alternative front-end for speech recognition over GSM networks. This front-end is specially conceived to be effective against source coding distortion and transmission errors. Specifically, we suggest extracting the recognition feature vectors directly from the encoded speech (i.e., the bitstream) instead of decoding it and subsequently extracting the feature vectors. This approach offers two significant advantages. First, the recognition system is only affected by the quantization distortion of the spectral envelope. Thus, we avoid the influence of other sources of distortion that result from the encoding-decoding process. Second, when transmission errors occur, our front-end becomes more effective since it is not affected by errors in bits allocated to the excitation signal. We have considered the half-rate and full-rate standard codecs and compared the proposed front-end with the conventional approach in two ASR tasks, namely, speaker-independent isolated digit recognition and speaker-independent continuous speech recognition. In general, our approach outperforms the conventional procedure for a variety of simulated channel conditions. Furthermore, the disparity increases as the network conditions worsen.
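
The abstract does not say how feature vectors are computed from the bitstream. As a rough illustration only, the sketch below assumes the spectral envelope parameters carried in the bitstream have already been dequantized and converted to linear-prediction coefficients a_1..a_p of A(z) = 1 + sum a_k z^-k, and it applies the textbook LPC-to-cepstrum recursion to the all-pole model 1/A(z). The function name `lpc_to_cepstrum` and this choice of features are assumptions for illustration, not the authors' front-end.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Cepstral coefficients c_1..c_n_ceps of the all-pole model 1/A(z).

    a: list [a_1, ..., a_p] with A(z) = 1 + sum_k a_k z^-k
       (assumed to be already recovered from the codec parameters).
    """
    p = len(a)
    c = []
    for n in range(1, n_ceps + 1):
        # Direct term: -a_n while n <= p, zero beyond the model order.
        acc = -a[n - 1] if n <= p else 0.0
        # Recursive term: -(1/n) * sum_{k=1}^{n-1} k * c_k * a_{n-k}
        for k in range(1, n):
            if n - k <= p:
                acc -= (k / n) * c[k - 1] * a[n - k - 1]
        c.append(acc)
    return c


if __name__ == "__main__":
    # Single-pole check: A(z) = 1 - 0.5 z^-1 has cepstrum c_n = 0.5**n / n.
    print(lpc_to_cepstrum([-0.5], 4))
```

Because the excitation bits never enter this computation, errors in them cannot alter the resulting features, which is the property the abstract attributes to bitstream-based extraction.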
