Search query: "Subject term = Attention-based encoder-decoder"
30 records; showing 1-10

Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, vol. 32, pp. 1636-1649
Authors: Chen, Zhengyang; Han, Bing; Wang, Shuai; Qian, Yanmin
Affiliations: Shanghai Jiao Tong Univ Dept Comp Sci & Engn Auditory Cognit & Computat Acoust Lab Shanghai 200240 Peoples R China; Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai 200240 Peoples R China; Chinese Univ Hong Kong Shenzhen Res Inst Big Data Shenzhen 518172 Peoples R China
Deep neural network-based systems have significantly improved the performance of speaker diarization tasks. However, end-to-end neural diarization (EEND) systems often struggle to generalize to scenarios with an unsee...

HYBRID ATTENTION-BASED ENCODER-DECODER MODEL FOR EFFICIENT LANGUAGE MODEL ADAPTATION
2024 Spoken Language Technology Workshop
Authors: Ling, Shaoshi; Ye, Guoli; Zhao, Rui; Gong, Yifan
Affiliations: Microsoft Cloud & AI Redmond WA 98052 USA
The attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created chal...

Investigating Methods to Improve Language Model Integration for Attention-Based Encoder-Decoder ASR Models
Interspeech Conference
Authors: Zeineldeen, Mohammad; Glushko, Aleksandr; Michel, Wilfried; Zeyer, Albert; Schlueter, Ralf; Ney, Hermann
Affiliations: Rhein Westfal TH Aachen Comp Sci Dept Human Language Technol & Pattern Recognit D-52074 Aachen Germany; AppTek GmbH D-52062 Aachen Germany
Attention-based encoder-decoder (AED) models learn an implicit internal language model (ILM) from the training transcriptions. The integration with an external LM trained on much more unpaired text usually leads to be...

Enhancing E-commerce recommendations with sentiment analysis using MLA-EDTCNet and collaborative filtering
SCIENTIFIC REPORTS, 2025, vol. 15, no. 1, pp. 1-16
Authors: Krishna, E. S. Phalguna; Ramu, T. Bhargava; Chaitanya, R. Krishna; Ram, M. Sitha; Balayesu, Narasimhula; Gandikota, Hari Prasad; Jagadesh, B. N.
Affiliations: GITAM Univ GITAM Sch Technol Dept Comp Sci & Engn Bengaluru Campus Bengaluru India; MLR Inst Technol Dept Elect & Elect Engn Hyderabad 500043 Telangana India; SRKR Engn Coll Dept ECE Bhimavaram India; SRM Univ Sch Engn & Sci Dept Comp Sci & Engn Amaravati Andhra Pradesh India; Vasireddy Venkatadri Inst Technol Dept Comp Sci & Engn AIML Guntur India; Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Hyderabad 500075 Telangana India; VIT AP Univ Sch Comp Sci & Engn Vijayawada 522237 India
The rapid growth of e-commerce has made product recommendation systems essential for enhancing customer experience and driving business success. This research proposes an advanced recommendation framework that integra...

Joint Beam Search Integrating CTC, Attention, and Transducer Decoders
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2025, vol. 33, pp. 598-612
Authors: Sudo, Yui; Shakeel, Muhammad; Fukumoto, Yosuke; Yan, Brian; Shi, Jiatong; Peng, Yifan; Watanabe, Shinji
Affiliations: Honda Res Inst Japan Co Ltd Res Div Wako Saitama 3510188 Japan; Carnegie Mellon Univ Language Technol Inst Pittsburgh PA 15213 USA; Carnegie Mellon Univ Dept Elect & Comp Engn Pittsburgh PA 15213 USA
End-to-end automatic speech recognition (E2E-ASR) can be classified by its decoder architectures, such as connectionist temporal classification (CTC), recurrent neural network transducer (RNN-T), attention-based encod...

An End-to-End Transformer-based Automatic Speech Recognition for Qur’an Reciters
Computers, Materials & Continua, 2023, vol. 74, no. 2, pp. 3471-3487
Authors: Mohammed Hadwan; Hamzah A. Alsayadi; Salah AL-Hagree
Affiliations: Department of Information Technology, College of Computer, Qassim University, Buraydah 51452, Saudi Arabia; Department of Computer Science, College of Applied Sciences, Taiz University, Taiz 6803, Yemen; Computer Science Department, Faculty of Computer and Information Sciences, Ain Shams University, Cairo 11566, Egypt; Computer Science Department, Faculty of Sciences, Ibb University, Yemen; Department of Computer Sciences & Information, Ibb University, Yemen
The attention-based encoder-decoder technique, known as the transformer, is used to enhance the performance of end-to-end automatic speech recognition (ASR). This research focuses on applying ASR end-to-end transformer-ba...

Dynamic Network Slice Scaling Assisted by Attention-Based Prediction in 5G Core Network
IEEE ACCESS, 2022, vol. 10, pp. 72955-72972
Authors: Chien-Nguyen Nhu; Park, Minho
Affiliations: Soongsil Univ Dept Informat Commun Convergence Technol Seoul 156743 South Korea; Soongsil Univ Sch Elect Engn Seoul 156743 South Korea
Network slicing is a key technology in fifth-generation (5G) networks that allows network operators to create multiple logical networks over a shared physical infrastructure to meet the requirements of diverse use cas...

A Human-Inspired Recognition System for Pre-Modern Japanese Historical Documents
IEEE ACCESS, 2019, vol. 7, pp. 84163-84169
Authors: Ann Duc Le; Clanuwat, Tarin; Kitamoto, Asanobu
Affiliations: Duy Tan Univ Inst Res & Dev Da Nang 550000 Vietnam; Ctr Open Data Humanities Tokyo 1018430 Japan
Recognition of historical documents is a challenging problem due to the noised, damaged characters, and background. However, in Japanese historical documents, not only contains the mentioned problems, pre-modern Japan...

An End-to-End Network for Continuous Human Motion Recognition via Radar Radios
IEEE SENSORS JOURNAL, 2021, vol. 21, no. 5, pp. 6487-6496
Authors: Zhao, Running; Ma, Xiaolin; Liu, Xinhua; Liu, Jian
Affiliations: Wuhan Univ Technol Sch Informat Engn Hubei Key Lab Broadband Wireless Commun & Sensor Wuhan 430070 Peoples R China; Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN 37996 USA
Micro-Doppler-based continuous human motion recognition (HMR) has gained considerable attention recently. However, existing methods mainly rely on individual recurrent neural network or sliding-window-based approaches...

Alignment Knowledge Distillation for Online Streaming Attention-Based Speech Recognition
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, vol. 31, pp. 1371-1385
Authors: Inaguma, Hirofumi; Kawahara, Tatsuya
Affiliations: Kyoto Univ Grad Sch Informat Kyoto 6068501 Japan
This article describes an efficient training method for online streaming attention-based encoder-decoder (AED) automatic speech recognition (ASR) systems. AED models have achieved competitive performance in offline sc...