咨询与建议

限定检索结果

文献类型

  • 11 篇 会议
  • 10 篇 期刊文献

馆藏范围

  • 21 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 20 篇 工学
    • 17 篇 计算机科学与技术...
    • 11 篇 电气工程
    • 4 篇 软件工程
    • 3 篇 信息与通信工程
    • 1 篇 机械工程
    • 1 篇 动力工程及工程热...
    • 1 篇 控制科学与工程
    • 1 篇 化学工程与技术
  • 2 篇 理学
    • 2 篇 物理学
  • 2 篇 医学
    • 2 篇 临床医学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 21 篇 audio-visual seg...
  • 6 篇 visualization
  • 5 篇 transformers
  • 5 篇 feature extracti...
  • 4 篇 location awarene...
  • 4 篇 semantics
  • 3 篇 semantic segment...
  • 2 篇 task analysis
  • 2 篇 image segmentati...
  • 2 篇 multi-modal lear...
  • 2 篇 decoding
  • 2 篇 correlation
  • 2 篇 avsbench
  • 2 篇 computer vision
  • 2 篇 audio-visual lea...
  • 2 篇 attention mechan...
  • 1 篇 graphical neural...
  • 1 篇 representation l...
  • 1 篇 white noise
  • 1 篇 transformer

机构

  • 2 篇 shanghai ai lab ...
  • 2 篇 netease fuxi ai ...
  • 2 篇 hefei univ techn...
  • 2 篇 beihang univ peo...
  • 2 篇 australian natl ...
  • 2 篇 nvidia santa cla...
  • 1 篇 univ queensland ...
  • 1 篇 beijing inst tec...
  • 1 篇 univ queensland ...
  • 1 篇 matrixverse ai n...
  • 1 篇 csiro data61 mar...
  • 1 篇 csiro math & inf...
  • 1 篇 fudan univ sch c...
  • 1 篇 univ oxford oxfo...
  • 1 篇 dalian universit...
  • 1 篇 pathpartner tech...
  • 1 篇 zhejiang univ pe...
  • 1 篇 wuhan univ sci &...
  • 1 篇 alibaba grp alib...
  • 1 篇 csiro data61 can...

作者

  • 3 篇 yu xin
  • 3 篇 liu chen
  • 3 篇 wang dadong
  • 3 篇 li lincheng
  • 2 篇 li peike patrick
  • 2 篇 zhang hu
  • 2 篇 kong lingpeng
  • 2 篇 wang meng
  • 2 篇 zhong yiran
  • 2 篇 guo dan
  • 2 篇 birchfield stan
  • 2 篇 zhou jinxing
  • 2 篇 zhang jing
  • 2 篇 wang jianyuan
  • 2 篇 zhang jiayi
  • 2 篇 sun weixuan
  • 1 篇 gan zhenye
  • 1 篇 wang lijun
  • 1 篇 li jiahao
  • 1 篇 tan zhentao

语言

  • 21 篇 英文
检索条件"主题词=Audio-visual segmentation"
21 条 记 录,以下是1-10 订阅
排序:
audio-visual segmentation with Semantics
收藏 引用
INTERNATIONAL JOURNAL OF COMPUTER VISION 2025年 第4期133卷 1644-1664页
作者: Zhou, Jinxing Shen, Xuyang Wang, Jianyuan Zhang, Jiayi Sun, Weixuan Zhang, Jing Birchfield, Stan Guo, Dan Kong, Lingpeng Wang, Meng Zhong, Yiran Hefei Univ Technol Hefei Peoples R China Shanghai AI Lab Shanghai Peoples R China Univ Oxford Oxford England Beihang Univ Beijing Peoples R China Australian Natl Univ Canberra Australia Nvidia Santa Clara CA USA Univ Hong Kong Hong Kong Peoples R China
We propose a new problem called audio-visual segmentation (AVS), in which the goal is to output a pixel-level map of the object(s) that produce sound at the time of the image frame. To facilitate this research, we con... 详细信息
来源: 评论
Cross-Modal Cognitive Consensus Guided audio-visual segmentation
收藏 引用
IEEE TRANSACTIONS ON MULTIMEDIA 2025年 27卷 209-223页
作者: Shi, Zhaofeng Wu, Qingbo Meng, Fanman Xu, Linfeng Li, Hongliang Univ Elect Sci & Technol China Sch Informat & Commun Engn Chengdu 611731 Peoples R China
audio-visual segmentation (AVS) aims to extract the sounding object from a video frame, which is represented by a pixel-wise segmentation mask for application scenarios such as multi-modal video editing, augmented rea... 详细信息
来源: 评论
Transformer-Prompted Network: Efficient audio-visual segmentation via Transformer and Prompt Learning
收藏 引用
IEEE SIGNAL PROCESSING LETTERS 2025年 32卷 516-520页
作者: Wang, Yusen Qian, Xiaohong Zhou, Wujie Zhejiang Univ Sci & Technol Sch Informat & Elect Engn Hangzhou 310023 Peoples R China
audio-visual segmentation (AVS) is a challenging task that focuses on segmenting sound-producing objects within video frames by leveraging audio signals. Existing convolutional neural networks (CNNs) and Transformer-b... 详细信息
来源: 评论
Consistency-Queried Transformer for audio-visual segmentation
收藏 引用
IEEE TRANSACTIONS ON IMAGE PROCESSING 2025年 34卷 2616-2627页
作者: Lv, Ying Liu, Zhi Chang, Xiaojun Shanghai Univ Shanghai Inst Adv Commun & Data Sci Sch Commun & Informat Engn Joint Int Res Lab Specialty Fiber Opt & Adv Commun Shanghai 200444 Peoples R China Shanghai Univ Wenzhou Inst Wenzhou 325000 Peoples R China Univ Technol Sydney Australian Artificial Intelligence Inst Fac Engn & Informat Technol Sydney NSW 2007 Australia
audio-visual segmentation (AVS) aims to segment objects in audio-visual content. The effective interaction between audio and visual features has garnered significant attention from the multimodal domain. Despite signi... 详细信息
来源: 评论
Bootstrapping audio-visual Video segmentation by Strengthening audio Cues
收藏 引用
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2025年 第3期35卷 2398-2409页
作者: Chen, Tianxiang Tan, Zhentao Gong, Tao Chu, Qi Wu, Yue Liu, Bin Yu, Nenghai Lu, Le Ye, Jieping Univ Sci & Technol China Sch Cyber Sci & Technol Hefei 230026 Peoples R China Chinese Acad Sci Key Lab Electromagnet Space Informat Hefei 230022 Peoples R China Alibaba Grp Alibaba Cloud Hangzhou 310024 Peoples R China Alibaba Grp DAMO Acad New York NY 10014 USA
How to effectively interact audio with vision has garnered considerable interest within the multi-modality research field. Recently, a novel audio-visual video segmentation (AVS) task has been proposed, aiming to segm... 详细信息
来源: 评论
audio-visual segmentation based on robust principal component analysis
收藏 引用
EXPERT SYSTEMS WITH APPLICATIONS 2024年 256卷
作者: Fang, Shun Zhu, Qile Wu, Qi Wu, Shiqian Xie, Shoulie Wuhan Univ Sci & Technol Sch Informat Sci & Engn Wuhan Peoples R China Wuhan Univ Sci & Technol Inst Robot & Intelligent Syst Wuhan Peoples R China Jiangxi Univ Finance & Econ Sch Software & Internet things Engn Nanchang Peoples R China Henan Acad Sci Inst Adv Displays & Imaging Zhengzhou Peoples R China RF & Opt Dept Inst Infocomm Res A STAR Signal Proc Singapore Singapore
audio-visual segmentation (AVS) aims to extract the sounding objects from a video. The current learning- based AVS methods are often supervised, which rely on specific task data annotations and expensive model trainin... 详细信息
来源: 评论
audio-visual segmentation by Exploring Cross-Modal Mutual Semantics  23
Audio-Visual Segmentation by Exploring Cross-Modal Mutual Se...
收藏 引用
31st ACM International Conference on Multimedia (MM)
作者: Liu, Chen Li, Peike Patrick Qi, Xingqun Zhang, Hu Li, Lincheng Wang, Dadong Yu, Xin Univ Queensland Brisbane Qld Australia Univ Technol Sydney Sydney NSW Australia Matrix Verse Sydney NSW Australia Netease Fuxi AI Lab Hangzhou Peoples R China CSIRO DATA61 Marsfield Australia
The audio-visual segmentation (AVS) task aims to segment sounding objects from a given video. Existing works mainly focus on fusing audio and visual features of a given video to achieve sounding object masks. However,... 详细信息
来源: 评论
audio-visual segmentation via Unlabeled Frame Exploitation
Audio-Visual Segmentation via Unlabeled Frame Exploitation
收藏 引用
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
作者: Liu, Jinxiang Liu, Yikun Zhang, Fei Ju, Chen Zhang, Ya Wang, Yanfeng Shanghai Jiao Tong Univ Cooperat Medianet Innovat Ctr Shanghai Peoples R China Shanghai AI Lab Shanghai Peoples R China
audio-visual segmentation (AVS) aims to segment the sounding objects in video frames. Although great progress has been witnessed, we experimentally reveal that current methods reach marginal performance gain within th... 详细信息
来源: 评论
audio-visual segmentation  17th
Audio-Visual Segmentation
收藏 引用
17th European Conference on Computer Vision (ECCV)
作者: Zhou, Jinxing Wang, Jianyuan Zhang, Jiayi Sun, Weixuan Zhang, Jing Birchfield, Stan Guo, Dan Kong, Lingpeng Wang, Meng Zhong, Yiran Hefei Univ Technol Hefei Peoples R China SenseTime Res Hangzhou Peoples R China Australian Natl Univ Canberra ACT Australia Beihang Univ Beijing Peoples R China NVIDIA Santa Clara CA USA Univ Hong Kong Pok Fu Lam Hong Kong Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China
We propose to explore a new problem called audio-visual segmentation (AVS), in which the goal is to output a pixel-level map of the object(s) that produce sound at the time of the image frame. To facilitate this resea... 详细信息
来源: 评论
Enhance audio-visual segmentation with hierarchical encoder and audio guidance
收藏 引用
NEUROCOMPUTING 2024年 594卷
作者: Guo, Cunhan Huang, Heyan Zhou, Yanghao Univ Chinese Acad Sci Sch Emergency Management Sci & Engn 1 Yanqihu East Rd Beijing 101400 Peoples R China Beijing Inst Technol Southeast Acad informat Technol 1998 Licheng Middle Ave Putian 351100 Fujian Peoples R China Beijing Inst Technol Sch Comp Sci & Technol 5 Zhongguancun South St Beijing 101400 Peoples R China
As one of the pivotal technologies leading towards embodied intelligence, audio-visual segmentation is geared towards achieving precise segmentation of sounding objects, offering vast application prospects in scenario... 详细信息
来源: 评论