We propose to use enhanced EO-trajectories, which are extracted using shift-autocorrelation (shift-ACF), for multiple speaker detection in audio monitoring scenarios. After introducing spectral shift-ACF features, the...
详细信息
ISBN:
(纸本)9780992862633
We propose to use enhanced EO-trajectories, which are extracted using shift-autocorrelation (shift-ACF), for multiple speaker detection in audio monitoring scenarios. After introducing spectral shift-ACF features, their performance in multiple F0-extraction in the presence of different noise types is estimated for synthetic signal scenarios. Afterwards, a novel method for F0-supertrajectory extraction is proposed and evaluated for multiple speaker detection in the presence of background noises that typically occur in audio monitoring. It turns out that due to their improved sharpness in representing harmonic components, spectral shift-ACF features outperform classical features in many cases.
We propose to use enhanced F0-trajectories, which are extracted using shift-autocorrelation (shift-ACF), for multiple speaker detection in audio monitoring scenarios. After introducing spectral shift-ACF features, the...
详细信息
ISBN:
(纸本)9781479988518
We propose to use enhanced F0-trajectories, which are extracted using shift-autocorrelation (shift-ACF), for multiple speaker detection in audio monitoring scenarios. After introducing spectral shift-ACF features, their performance in multiple F0-extraction in the presence of different noise types is estimated for synthetic signal scenarios. Afterwards, a novel method for F0-supertrajectory extraction is proposed and evaluated for multiple speaker detection in the presence of background noises that typically occur in audio monitoring. It turns out that due to their improved sharpness in representing harmonic components, spectral shift-ACF features outperform classical features in many cases.
Overlapping speaker localization approaches generally require a binary detector which performs the source/noise classification of the location estimates. This is mainly due to the unknown time-varying number of source...
详细信息
ISBN:
(纸本)9781479968084
Overlapping speaker localization approaches generally require a binary detector which performs the source/noise classification of the location estimates. This is mainly due to the unknown time-varying number of sources, and to the presence of noise and reverberation. In this paper, we firstly introduce an online implementation of a previously developed offline multiplespeaker detector. This classifier is then extended to include new detection features. More precisely, the proposed approach uses the classified location estimates as labelled data to train new classification models for different potential features. The resulting models are then integrated into the online classifier to improve the classification performance. In particular, this paper investigates three different classification history-based models, namely, the location, the kurtosis and the probabilistic steered response power features. Experiments conducted on the AV16.3 corpus show the effectiveness of the proposed approach.
暂无评论