咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >BLIND SIGNAL DEREVERBERATION F... 收藏
arXiv

BLIND SIGNAL DEREVERBERATION FOR MACHINE SPEECH RECOGNITION

作     者:Sadhu, Samik Hermansky, Hynek 

作者机构:Center for Language and Speech Processing Johns Hopkins University United States Human Language Technology Center of Excellence Johns Hopkins University United States 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2022年

核心收录:

主  题:Reverberation 

摘      要:We present a method to remove unknown convolutive noise introduced to speech by reverberations of recording environments, utilizing some amount of training speech data from the reverberant environment, and any available non-reverberant speech data. Using Fourier transform computed over long temporal windows, which ideally cover the entire room impulse response, we convert room induced convolution to additions in the log spectral domain. Next, we compute a spectral normalization vector from statistics gathered over reverberated as well as over clean speech in the log spectral domain. During operation, this normalization vectors are used to alleviate reverberations from complex speech spectra recorded under the same reverberant conditions. Such dereverberated complex speech spectra are used to compute complex FDLP-spectrograms for use in automatic speech recognition. © 2022, CC BY.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分