Author affiliations: Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210, USA; Mitsubishi Elect Res Labs, Cambridge, MA 02139, USA; Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210, USA
Publication: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING (IEEE ACM Trans. Audio Speech Lang. Process.)
Year/Volume: 2021, Vol. 29
Pages: 2001-2014
Core indexing:
Subject classification: 0808 [Engineering - Electrical Engineering]; 08 [Engineering]; 0702 [Science - Physics]
Funding: NIDCD [R01 DC012048]; NSF [ECCS-1808932]; Ohio Supercomputer Center
Keywords: Geometry; Array signal processing; Speech processing; Microphone arrays; Covariance matrices; Deep learning; Training; Complex spectral mapping; Speaker separation; Microphone array processing; Deep learning
Abstract: We propose multi-microphone complex spectral mapping, a simple way of applying deep learning for time-varying non-linear beamforming, for speaker separation in reverberant conditions. We aim at both speaker separation and dereverberation. Our study first investigates offline utterance-wise speaker separation and then extends to block-online continuous speech separation (CSS). Assuming a fixed array geometry between training and testing, we train deep neural networks (DNNs) to predict the real and imaginary (RI) components of target speech at a reference microphone from the RI components of multiple microphones. We then integrate multi-microphone complex spectral mapping with minimum variance distortionless response (MVDR) beamforming and post-filtering to further improve separation, and combine it with frame-level speaker counting for block-online CSS. Although our system is trained on simulated room impulse responses (RIRs) based on a fixed number of microphones arranged in a given geometry, it generalizes well to a real array with the same geometry. State-of-the-art separation performance is obtained on the simulated two-talker SMS-WSJ corpus and the real-recorded LibriCSS dataset.
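The MVDR beamforming step mentioned in the abstract can be sketched with NumPy. This is a minimal illustration of the standard trace-normalized MVDR formulation computed from per-frequency spatial covariance matrices, not the paper's exact implementation; the function name `mvdr_weights` and its interface are assumptions for illustration.

```python
import numpy as np

def mvdr_weights(phi_s, phi_n, ref=0):
    """Compute per-frequency MVDR beamforming weights.

    phi_s: (F, M, M) target-speech spatial covariance matrices
    phi_n: (F, M, M) noise/interference spatial covariance matrices
    ref:   index of the reference microphone
    Returns w with shape (F, M); the beamformed output per
    frequency is y = w^H x for a multi-channel STFT vector x.
    """
    F, M, _ = phi_s.shape
    u = np.zeros(M)
    u[ref] = 1.0  # one-hot selector for the reference microphone
    w = np.zeros((F, M), dtype=complex)
    for f in range(F):
        # Phi_n^{-1} Phi_s via a linear solve (avoids explicit inversion)
        num = np.linalg.solve(phi_n[f], phi_s[f])
        # Trace-normalized MVDR: w = (Phi_n^{-1} Phi_s / tr(Phi_n^{-1} Phi_s)) u
        w[f] = (num / (np.trace(num) + 1e-8)) @ u
    return w
```

In a DNN-supported pipeline, `phi_s` and `phi_n` would typically be estimated from the network's separated target and interference spectra; the beamformed output is then distortionless with respect to the target at the chosen reference microphone.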