Refine search results

Document type

  • 2 journal articles
  • 1 conference paper

Collection scope

  • 3 electronic documents
  • 0 print holdings

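Subject classification

  • 3 articles: Engineering
    • 3 articles: Electrical Engineering
    • 1 article: Control Science and Engineering
    • 1 article: Computer Science and Technology...
  • 2 articles: Science
    • 2 articles: Physics
  • 1 article: Medicine
    • 1 article: Basic Medicine (may confer medical...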

Topic

  • 3 articles: active speaker d...
  • 2 articles: microphone array
  • 1 article: hidden markov mo...
  • 1 article: features extract...
  • 1 article: transfer learnin...
  • 1 article: self-supervised ...
  • 1 article: acoustics
  • 1 article: multichannel aud...
  • 1 article: robots
  • 1 article: visualization
  • 1 article: language acquisi...
  • 1 article: face
  • 1 article: cognitive system...
  • 1 article: detectors
  • 1 article: multichannel
  • 1 article: cognitive system...

Institution

  • 1 article: univ surrey ctr ...
  • 1 article: ntnu norwegian u...
  • 1 article: univ southern ca...
  • 1 article: univ surrey ctr ...
  • 1 article: kth royal inst t...

Author

  • 2 articles: jackson philip j...
  • 2 articles: berghi davide
  • 1 article: salvi giampiero
  • 1 article: beskow jonas
  • 1 article: stefanov kalin

Language

  • 3 articles: English

Search query: "subject term=active speaker detection and localization"
3 records; results 1-10 shown below
Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, vol. 32, pp. 984-995
Authors: Berghi, Davide; Jackson, Philip J. B. Affiliation: Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey GU2 7XH, England
Conventional audio-visual approaches for active speaker detection (ASD) typically rely on visually pre-extracted face tracks and the corresponding single-channel audio to find the speaker in a video. Therefore, they t...
AUDIO INPUTS FOR ACTIVE SPEAKER DETECTION AND LOCALIZATION VIA MICROPHONE ARRAY
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
Authors: Berghi, Davide; Jackson, Philip J. B. Affiliation: Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford, Surrey, England
This study considers the problem of detecting and locating an active talker's horizontal position from multichannel audio captured by a microphone array. We refer to this as active speaker detection and localizati...
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, vol. 12, no. 2, pp. 250-259
Authors: Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero. Affiliations: Univ Southern Calif, Inst Creat Technol, Los Angeles, CA 90089, USA; KTH Royal Inst Technol, Dept Speech Mus & Hearing, S-10044 Stockholm, Sweden; NTNU Norwegian Univ Sci & Technol, Dept Elect Syst, N-7491 Trondheim, Norway
This paper presents a self-supervised method for visual detection of the active speaker in a multiperson spoken interaction scenario. Active speaker detection is a fundamental prerequisite for any artificial cognitive...