咨询与建议

限定检索结果

文献类型

  • 9 篇 会议
  • 4 篇 期刊文献

馆藏范围

  • 13 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 12 篇 工学
    • 8 篇 计算机科学与技术...
    • 3 篇 电气工程
    • 3 篇 软件工程
    • 1 篇 仪器科学与技术
    • 1 篇 信息与通信工程
    • 1 篇 控制科学与工程
    • 1 篇 建筑学
    • 1 篇 土木工程
    • 1 篇 生物医学工程(可授...
  • 4 篇 理学
    • 4 篇 物理学
  • 3 篇 医学
    • 2 篇 临床医学
    • 1 篇 基础医学(可授医学...

主题

  • 13 篇 audio spectrogra...
  • 2 篇 adapters
  • 2 篇 audio classifica...
  • 1 篇 encoder-decoder
  • 1 篇 respiratory soun...
  • 1 篇 fault diagnosis
  • 1 篇 construction noi...
  • 1 篇 self-attention
  • 1 篇 sequential infer...
  • 1 篇 expandable dual-...
  • 1 篇 soft mixture of ...
  • 1 篇 continual learni...
  • 1 篇 videomae
  • 1 篇 noise estimation
  • 1 篇 dynamic convolut...
  • 1 篇 auto-encoder
  • 1 篇 lora
  • 1 篇 noise control
  • 1 篇 gas-insulated sw...
  • 1 篇 dynamic relu

机构

  • 2 篇 fdn bruno kessle...
  • 2 篇 univ trento tren...
  • 1 篇 johannes kepler ...
  • 1 篇 seoul natl univ ...
  • 1 篇 nanyang technol ...
  • 1 篇 smartsound
  • 1 篇 inst construct &...
  • 1 篇 univ tartu tartu...
  • 1 篇 tech univ munich...
  • 1 篇 nanyang technol ...
  • 1 篇 worcester polyte...
  • 1 篇 doshisha univ de...
  • 1 篇 univ illinois be...
  • 1 篇 faculty of elect...
  • 1 篇 imperial coll lo...
  • 1 篇 johannes kepler ...
  • 1 篇 state grid shang...
  • 1 篇 beijing inst tec...
  • 1 篇 concordia univ m...
  • 1 篇 iiit delhi delhi

作者

  • 2 篇 brutti alessio
  • 2 篇 cappellazzo umbe...
  • 2 篇 falavigna daniel...
  • 1 篇 kot alex
  • 1 篇 gowda karthik
  • 1 篇 imoto keisuke
  • 1 篇 ohta takezo
  • 1 篇 islam bashima
  • 1 篇 kong adams
  • 1 篇 jang seongju
  • 1 篇 wang zhihua
  • 1 篇 koutini khaled
  • 1 篇 he qianhua
  • 1 篇 behera swarup ra...
  • 1 篇 hu bin
  • 1 篇 yun se-young
  • 1 篇 chi seokho
  • 1 篇 cho won-yang
  • 1 篇 shen bingquan
  • 1 篇 si yongjie

语言

  • 13 篇 英文
检索条件"主题词=Audio Spectrogram Transformer"
13 条 记 录,以下是1-10 订阅
排序:
A Sequential audio spectrogram transformer for Real-Time Sound Event Detection  32
A Sequential Audio Spectrogram Transformer for Real-Time Sou...
收藏 引用
32nd European Signal Processing Conference (EUSIPCO)
作者: Ohta, Takezo Bando, Yoshiaki Imoto, Keisuke Onishi, Masaki Univ Tsukuba Grad Sch Syst & Informat Engn Tsukuba Ibaraki Japan Natl Inst Adv Ind Sci & Technol Tokyo Japan Doshisha Univ Dept Informat Syst Design Kyoto Japan
In this paper, we propose an audio spectrogram transformer (AST) for sequential inference and evaluate its real-time performance. ASTs are pre-trained in a self-supervised manner, such as masked autoencoding, and the ... 详细信息
来源: 评论
FastAST: Accelerating audio spectrogram transformer via Token Merging and Cross-Model Knowledge Distillation  25
FastAST: Accelerating Audio Spectrogram Transformer via Toke...
收藏 引用
25th Interspeech Conference
作者: Behera, Swarup Ranjan Dhiman, Abhishek Gowda, Karthik Narayani, Aalekhya Satya Reliance Jio AICoE Hyderabad India
audio classification models, particularly the audio spectrogram transformer (AST), play a crucial role in efficient audio analysis. However, optimizing their efficiency without compromising accuracy remains a challeng... 详细信息
来源: 评论
Patch-Mix Contrastive Learning with audio spectrogram transformer on Respiratory Sound Classification  24
Patch-Mix Contrastive Learning with Audio Spectrogram Transf...
收藏 引用
Interspeech Conference
作者: Bae, Sangmin Kim, June-Woo Cho, Won-Yang Baek, Hyerim Son, Soyoun Lee, Byungjo Ha, Changwan Tae, Kyongpil Kim, Sungnyun Yun, Se-Young KAIST AI Daejeon South Korea Kyungpook Natl Univ Dept AI Daegu South Korea SmartSound Seoul South Korea Dongguk Univ Seoul South Korea MODULABS Seoul South Korea
Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscop... 详细信息
来源: 评论
Adapter Incremental Continual Learning of Efficient audio spectrogram transformers  24
Adapter Incremental Continual Learning of Efficient Audio Sp...
收藏 引用
Interspeech Conference
作者: Selvaraj, Nithish Muthuchamy Guo, Xiaobao Kong, Adams Shen, Bingquan Kot, Alex Nanyang Technol Univ Rapid Rich Object Search ROSE Lab Singapore Singapore Nanyang Technol Univ Sch Comp Sci & Engn Singapore Singapore DSO Natl Labs Singapore Singapore
Efficient tuning of neural networks for continual learning with minimal computational resources remains a challenge. In this paper, we propose continual learning of audio classifiers with parameter and compute efficie... 详细信息
来源: 评论
PARAMETER-EFFICIENT TRANSFER LEARNING OF audio spectrogram transformerS  34
PARAMETER-EFFICIENT TRANSFER LEARNING OF AUDIO SPECTROGRAM T...
收藏 引用
34th International Workshop on Machine Learning for Signal Processing
作者: Cappellazzo, Umberto Falavigna, Daniele Brutti, Alessio Ravanelli, Mirco Univ Trento Trento Italy Fdn Bruno Kessler Trento Italy Concordia Univ Montreal PQ Canada
Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to the standard full fine-tuning approach. They only train a few extra parameters for each downstream task, without sacrificing ... 详细信息
来源: 评论
Efficient Fine-tuning of audio spectrogram transformers via Soft Mixture of Adapters  25
Efficient Fine-tuning of Audio Spectrogram Transformers via ...
收藏 引用
25th Interspeech Conference
作者: Cappellazzo, Umberto Falavigna, Daniele Brutti, Alessio Univ Trento Trento Italy Fdn Bruno Kessler Trento Italy
Mixture of Experts (MoE) architectures have recently started burgeoning due to their ability to scale model's capacity while maintaining the computational cost affordable, leading to state-of-the-art results in nu... 详细信息
来源: 评论
Dynamic Convolutional Neural Networks as Efficient Pre-Trained audio Models
收藏 引用
IEEE-ACM TRANSACTIONS ON audio SPEECH AND LANGUAGE PROCESSING 2024年 32卷 2227-2241页
作者: Schmid, Florian Koutini, Khaled Widmer, Gerhard Johannes Kepler Univ Linz Inst Computat Percept CP JKU A-4040 Linz Austria Johannes Kepler Univ Linz LIT Artificial Intelligence Lab A-4040 Linz Austria
The introduction of large-scale audio datasets, such as audioSet, paved the way for transformers to conquer the audio domain and replace CNNs as the state-of-the-art neural network architecture for many tasks. audio S... 详细信息
来源: 评论
Sound Tagging in Infant-centric Home Soundscapes
Sound Tagging in Infant-centric Home Soundscapes
收藏 引用
9th IEEE/ACM International Conference on Connected Health - Applications, Systems and Engineering Technologies (CHASE)
作者: Khan, Mohammad Nur Hossain Li, Jialu McElwain, Nancy L. Hasegawa-Johnson, Mark Islam, Bashima Worcester Polytech Inst Dept Elect & Comp Engn Worcester MA 01609 USA Univ Illinois Dept Elect & Comp Engn Champaign IL USA Univ Illinois Dept Human Dev & Family Studies Champaign IL USA Univ Illinois Beckman Inst Adv Sci & Technol Champaign IL USA
Certain environmental noises have been associated with negative developmental outcomes for infants and young children. Though classifying or tagging sound events in a domestic environment is an active research area, p... 详细信息
来源: 评论
AVR: Synergizing Foundation Models for audio-Visual Humor Detection  25
AVR: Synergizing Foundation Models for Audio-Visual Humor De...
收藏 引用
25th Interspeech Conference
作者: Sharma, Sarthak Phukan, Orchid Chetia Singh, Drishti Buduru, Arun Balaji Sharma, Rajesh IIIT Delhi Delhi India Univ Tartu Tartu Estonia
In this work, we present, AVR application for audio-visual humor detection. While humor detection has traditionally centered around textual analysis, recent advancements have spotlighted multimodal approaches. However... 详细信息
来源: 评论
Fully Few-shot Class-incremental audio Classification Using Expandable Dual-embedding Extractor  25
Fully Few-shot Class-incremental Audio Classification Using ...
收藏 引用
25th Interspeech Conference
作者: Si, Yongjie Li, Yanxiong Li, Jialong Tan, Jiaxin He, Qianhua South China Univ Technol Sch Elect & Informat Engn Guangzhou Peoples R China
It's assumed that training data is sufficient in base session of few-shot class-incremental audio classification. However, it's difficult to collect abundant samples for model training in base session in some ... 详细信息
来源: 评论