Author affiliation: Univ Udine, Dept Math Comp Sci & Phys, I-33100 Udine, Italy
Publication: IEEE Signal Processing Letters (IEEE Signal Process Lett)
Year and volume: 2022, vol. 29
Pages: 1828-1832
Subjects: Location awareness; Reverberation; Mathematical models; Spatial resolution; Signal processing algorithms; Phased arrays; Microphone arrays; Acoustic source localization; delay-and-sum beamformer; geometrically sampled grid; max-pooling; microphone array; phase transform; steered response power
Abstract: The steered response power phase transform (SRP-PHAT) is a well-known algorithm for acoustic source localization with microphone arrays. It computes the generalized cross-correlation (GCC) for each microphone pair and coherently sums the GCC values over a search grid. Several improvements based on a volumetric grid have been proposed to make the spatial resolution scalable and to reduce the computational cost by using a coarser grid. In general, the problem with volumetric methods is that noise and reverberation are projected into the search space, since all GCC information is used to build the acoustic map. A volumetric-grid SRP-PHAT algorithm based on the geometrically sampled grid (GSG) is therefore proposed; it incorporates a max-pooling (MP) operation into the volume accumulation of the GCC values in order to improve localization performance. The MP is the solution of a minimization-maximization problem that aims to minimize the deleterious effect of noise and reverberation and to maximize the accuracy of the GCC values related to the target sound source. Simulations and real-world experiments demonstrate the efficiency of the proposed SRP-GSG-MP algorithm in adverse conditions.
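The two ingredients named in the abstract, GCC-PHAT per microphone pair and accumulation of GCC values over the lags covered by a grid volume, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names (`dft`, `gcc_phat`, `srp_volume`), the circular integer delays, and the hand-picked lag intervals for each volume are assumptions made for the example. In a real system the lag intervals would follow from the array geometry and the grid, and an FFT library would replace the naive DFT.

```python
import cmath
import math
import random

def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for a short demo."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out

def gcc_phat(x1, x2, eps=1e-12):
    """GCC-PHAT of two equal-length frames.

    Returns a list r where r[lag % N] is the correlation at that lag;
    a peak at lag d means x2 is delayed by d samples relative to x1.
    """
    X1, X2 = dft(x1), dft(x2)
    cross = [b * a.conjugate() for a, b in zip(X1, X2)]
    # PHAT weighting: keep only the phase of the cross-spectrum.
    whitened = [c / abs(c) if abs(c) > eps else 0.0 for c in cross]
    return [v.real for v in dft(whitened, inverse=True)]

def srp_volume(gccs, lag_ranges, pool="max"):
    """Accumulate GCC values over the lag interval of each pair for one volume.

    pool="sum": every GCC value in the interval is added (plain volumetric sum).
    pool="max": only the largest value per pair is kept (max-pooling).
    """
    total = 0.0
    for r, (lo, hi) in zip(gccs, lag_ranges):
        vals = [r[lag % len(r)] for lag in range(lo, hi + 1)]
        total += max(vals) if pool == "max" else sum(vals)
    return total

# Hypothetical test signal: white noise at mic 1, circularly delayed copies
# at mic 2 (3 samples) and mic 3 (5 samples).
rng = random.Random(0)
n = 32
x1 = [rng.gauss(0.0, 1.0) for _ in range(n)]
x2 = x1[-3:] + x1[:-3]
x3 = x1[-5:] + x1[:-5]

gccs = [gcc_phat(x1, x2), gcc_phat(x1, x3)]  # pairs (1,2) and (1,3)

# Hypothetical volumes, each given as one inclusive lag interval per pair.
vol_a = [(2, 4), (4, 6)]    # contains the true delays 3 and 5
vol_b = [(-4, -2), (0, 2)]  # contains neither

for name, vol in (("A", vol_a), ("B", vol_b)):
    print(name, round(srp_volume(gccs, vol, "max"), 3),
          round(srp_volume(gccs, vol, "sum"), 3))
```

With these clean signals both pooling modes rank volume A first. The difference appears with noisy or reverberant input: the sum adds every spurious GCC value that falls inside the interval, whereas max-pooling keeps a single value per pair.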