Refine search results

Document type

  • 29 conference papers
  • 24 journal articles
  • 1 thesis

Collection scope

  • 54 electronic documents
  • 0 print holdings

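Subject classification

  • 50 Engineering
    • 47 Computer Science and Technology...
    • 20 Electrical Engineering
    • 10 Software Engineering
    • 8 Information and Communication Engineering
    • 4 Electronic Science and Technology (...
    • 3 Control Science and Engineering
  • 8 Science
    • 7 Physics
    • 1 Biology
  • 6 Medicine
    • 6 Clinical Medicine
  • 3 Education
    • 3 Psychology (...
  • 1 Management
    • 1 Management Science and Engineering (...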

Topics

  • 54 audio-visual lea...
  • 5 multi-modal lear...
  • 5 visualization
  • 4 task analysis
  • 4 self-supervised ...
  • 4 cross-modal retr...
  • 3 multimodal learn...
  • 3 representation l...
  • 3 deep learning
  • 3 event localizati...
  • 3 sound source loc...
  • 3 contrastive lear...
  • 3 location awarene...
  • 3 action recogniti...
  • 3 feature extracti...
  • 2 spiking neural n...
  • 2 individual diffe...
  • 2 audio-visual cor...
  • 2 transformer
  • 2 zero-shot learni...

Institutions

  • 3 univ tubingen tu...
  • 2 shanghai ai lab ...
  • 2 univ surrey guil...
  • 2 hefei univ techn...
  • 2 beijing inst tec...
  • 1 fudan univ sch c...
  • 1 univ amsterdam
  • 1 baidu inc people...
  • 1 univ paris 05 un...
  • 1 univ geneva fac ...
  • 1 univ las palmas ...
  • 1 univ michigan an...
  • 1 chinese inst bra...
  • 1 beijing univ pos...
  • 1 univ elect sci &...
  • 1 chinese acad sci...
  • 1 czech tech univ ...
  • 1 sichuan univ col...
  • 1 int inst informa...
  • 1 postech dept ele...

Authors

  • 3 koepke a. sophia
  • 3 wang meng
  • 3 mercea otniel-bo...
  • 3 guo dan
  • 3 zhou jinxing
  • 3 akata zeynep
  • 2 wang jing
  • 2 liu miao
  • 2 zeng donghuo
  • 2 kim junsik
  • 2 yin jianqin
  • 2 hummel thomas
  • 2 zhong yiran
  • 2 ikeda kazushi
  • 2 mei xinhao
  • 2 kweon in so
  • 2 xie xiang
  • 2 tian yapeng
  • 2 senocak arda
  • 2 li wenrui

Language

  • 54 English
Search query: subject term = "Audio-Visual Learning"
54 records; showing 51-60
Integrating Audio-Visual Contexts with Refinement for Segmentation
33rd International Conference on Artificial Neural Networks and Machine Learning (ICANN)
Authors: Geng, Qingwei; Gu, Xiaodong (Fudan Univ, Dept Elect Engn, Shanghai 200438, Peoples R China)
A more fine-grained video spatial localization task, audio-visual segmentation (AVS), has recently been proposed, which aims to generate the masks of the sounding objects in the given videos. In this paper, we...

FOLEYGEN: VISUALLY-GUIDED AUDIO GENERATION
34th International Workshop on Machine Learning for Signal Processing
Authors: Mei, Xinhao; Nagaraj, Varun; Le Lant, Gael; Ni, Zhaoheng; Chang, Ernie; Shi, Yangyang; Chandrakumar, Vikas (Meta, Menlo Pk, CA 94025, USA; Univ Surrey, Guildford, Surrey, England)
Recent advancements in audio generation tasks, such as text-to-audio and text-to-music generation, have been spurred by the evolution of deep learning models and large-scale datasets. However, the task of video-to-aud...

T-VSL: Text-Guided Visual Sound Source Localization in Mixtures
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Mahmud, Tanvir; Tian, Yapeng; Marculescu, Diana (Univ Texas Austin, Austin, TX 78712, USA; Univ Texas Dallas, Dallas, TX 75080, USA)
Visual sound source localization poses a significant challenge in identifying the semantic region of each sounding source within a video. Existing self-supervised and weakly supervised source localization methods stru...

Individual differences in the acquisition of non-linguistic audio-visual associations in 5 year olds
Developmental Science, 2020, Vol. 23, No. 4, e12913
Authors: Altarelli, Irene; Dehaene-Lambertz, Ghislaine; Bavelier, Daphne (Univ Paris Sud, Univ Paris Saclay, NeuroSpin Ctr, INSERM, CEA DRF Inst Joliot, Cognit Neuroimaging Unit U992, Gif Sur Yvette, France; Univ Geneva, Fac Psychol & Educ Sci, Geneva, Switzerland; Univ Paris 05, Univ Paris, Lab Psychol Child Dev & Educ LaPsyDE, CNRS UMR 8240, Paris, France)
Audio-visual associative learning - at least when linguistic stimuli are employed - is known to rely on core linguistic skills such as phonological awareness. Here we ask whether this would also be the case in a task ...