Search results - Inner Mongolia University Library

Speaker front-back disambiguity using multi-channel speech signals

ELECTRONICS LETTERS, 2022, vol. 58, no. 25, pp. 1012-1015

Authors: Qian, Xinyuan; Yang, Jichen; Brutti, Alessio (Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China; Guangdong Polytech Normal Univ, Sch Cyber Secur, Guangzhou, Peoples R China; Ctr Informat Technol, Fdn Bruno Kessler, Povo, Italy)

This paper tackles the front-back disambiguity problem in speaker localization when the audio signals are captured by a symmetric microphone array. To this end, a deep neural network is proposed with an attention-based mechanism designed to assign different weights to features obtained from individual microphones. For support, a real dataset with synchronized multichannel audio signals captured by a large linear microphone array is introduced, along with manual annotations. The experimental results demonstrate the effectiveness of the proposed method over the other approaches. In particular, more than 50% reduction in Equal Error Rate (EER) is achieved when comparing with the single-channel case. The designed multi-channel self-attention mechanism also brings further improvements. The dataset and source code will be released.

Keywords: acoustic signal processing; individual microphones; synchronized multichannel audio signals; disambiguity problem; linear microphone array; audio signal processing; microphone arrays; speaker recognition; speech processing; deep learning (artificial intelligence); single-channel case; symmetric microphone array; feature extraction; attention-based mechanism; Equal Error Rate; designed multichannel self-attention mechanism; speaker front-back disambiguity; microphones; deep neural network; multichannel speech signals; manual annotations; speaker localization
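The abstract above reports its headline result as a reduction in Equal Error Rate (EER). For readers unfamiliar with the metric, the following is a minimal sketch of how an EER can be estimated from classifier scores. It assumes the front/back decision is scored as a binary problem (label 1 for one class, 0 for the other, higher score meaning class 1). The function name `equal_error_rate` and the sample scores are hypothetical and are not taken from the paper or its released code.

```python
def equal_error_rate(scores, labels):
    """Estimate the EER by sweeping a threshold over the observed scores.

    scores: list of floats, higher means "more likely class 1" (assumption).
    labels: list of 0/1 ground-truth values, same length as scores.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one example of each class")

    best_gap, eer = None, None
    for t in sorted(set(scores)):
        # False acceptance rate: class-0 items scored at or above the threshold.
        far = sum(s >= t for s in neg) / len(neg)
        # False rejection rate: class-1 items scored below the threshold.
        frr = sum(s < t for s in pos) / len(pos)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Illustrative, made-up scores (not data from the paper).
demo_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
demo_labels = [1, 1, 0, 1, 1, 0, 0, 0]
demo_eer = equal_error_rate(demo_scores, demo_labels)
```

With a finite set of scores the two error rates may never coincide exactly, so this sketch averages them at the closest threshold; interpolating the error curves is a common alternative.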


SPATIAL-FEATURE-BASED ACOUSTIC SCENE ANALYSIS USING DISTRIBUTED MICROPHONE ARRAY

23rd European Signal Processing Conference (EUSIPCO)

Authors: Imoto, Keisuke; Ono, Nobutaka (SOKENDAI, Hayama, Kanagawa, Japan; Natl Inst Informat, Tokyo, Japan)

ISBN: (print) 9780992862633

In this paper we propose a robust and efficient method to utilize the spatial information provided by a distributed microphone array for acoustic scene analysis. In our approach, similarly to the cepstrum, which is widely used as a spectral feature, the logarithm of the amplitude in multichannel observation is converted to a feature vector by a linear orthogonal transformation. Then, the spatial information of the acoustic scene is represented in the spatial feature space. This approach does not require the positions of the microphones and is not sensitive to the synchronization mismatch of channels, both of which make the method suitable for use with a distributed microphone array. Experimental results using real-life environmental sounds show the validity of our approach even when a smaller feature dimension than the original one is used.

Keywords: Acoustic scene analysis; distributed microphone array; spatial cepstrum; symmetric microphone array; isotropic sound field
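The abstract above describes taking the logarithm of the multichannel amplitudes and mapping it to a feature vector with a linear orthogonal transformation. The sketch below illustrates that idea under one loud assumption: the orthogonal transform is obtained by eigendecomposition (PCA) of the covariance of the log-amplitude vectors. The abstract does not state how the transform is chosen, so this should be read as one plausible instance rather than the authors' method. The function name `spatial_features` and the random input are hypothetical.

```python
import numpy as np


def spatial_features(amplitudes, n_components=None, eps=1e-12):
    """Map per-channel amplitudes to decorrelated spatial features.

    amplitudes: array of shape (n_frames, n_channels), non-negative.
    n_components: keep only the leading components (optional).
    Returns (features, basis), where basis has orthonormal columns.
    """
    # Log-amplitude vector per frame, one entry per microphone channel.
    log_amp = np.log(np.asarray(amplitudes, dtype=float) + eps)
    centered = log_amp - log_amp.mean(axis=0, keepdims=True)

    # Assumed choice of orthogonal transform: eigenvectors of the
    # channel-by-channel covariance of the log amplitudes (PCA).
    cov = centered.T @ centered / len(centered)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]       # reorder to descending variance
    basis = eigvecs[:, order]
    if n_components is not None:
        basis = basis[:, :n_components]

    # Project each frame onto the orthonormal basis.
    return centered @ basis, basis


# Illustrative random amplitudes (not data from the paper):
# 200 frames from a hypothetical 6-microphone array, reduced to 3 dimensions.
rng = np.random.default_rng(0)
amps = rng.uniform(0.1, 1.0, size=(200, 6))
feats, basis = spatial_features(amps, n_components=3)
```

Nothing in this sketch uses microphone positions, and truncating `basis` gives a feature dimension smaller than the number of channels, the setting the abstract says was evaluated.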


SPATIAL-FEATURE-BASED ACOUSTIC SCENE ANALYSIS USING DISTRIBUTED MICROPHONE ARRAY

European Signal Processing Conference

Authors: Keisuke Imoto; Nobutaka Ono (SOKENDAI (The Graduate University for Advanced Studies); National Institute of Informatics)

ISBN: (print) 9781479988518

Keywords: Acoustic scene analysis; Distributed microphone array; Spatial cepstrum; symmetric microphone array; Isotropic sound field
