咨询与建议

限定检索结果

文献类型

  • 18 篇 会议
  • 11 篇 期刊文献

馆藏范围

  • 29 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 27 篇 工学
    • 20 篇 计算机科学与技术...
    • 15 篇 电气工程
    • 8 篇 软件工程
    • 5 篇 信息与通信工程
    • 1 篇 机械工程
    • 1 篇 仪器科学与技术
    • 1 篇 控制科学与工程
    • 1 篇 网络空间安全
  • 11 篇 理学
    • 11 篇 物理学
  • 8 篇 医学
    • 8 篇 临床医学
  • 2 篇 文学
    • 1 篇 外国语言文学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 29 篇 attention-based ...
  • 8 篇 speech recogniti...
  • 5 篇 end-to-end
  • 3 篇 language model
  • 3 篇 connectionist te...
  • 2 篇 speaker adaptati...
  • 2 篇 monotonic chunkw...
  • 2 篇 deep learning
  • 2 篇 end-to-end speec...
  • 2 篇 predictive model...
  • 2 篇 decoding
  • 2 篇 recurrent neural...
  • 2 篇 phone synchronou...
  • 2 篇 multichannel end...
  • 2 篇 data models
  • 2 篇 e2e
  • 2 篇 training
  • 1 篇 encoder-decoder
  • 1 篇 long short-term ...
  • 1 篇 language modelin...

机构

  • 3 篇 microsoft corp r...
  • 2 篇 sony grp corp
  • 1 篇 microsoft cloud ...
  • 1 篇 ping an technol ...
  • 1 篇 department of co...
  • 1 篇 department of in...
  • 1 篇 mitsubishi elect...
  • 1 篇 van lang univ sc...
  • 1 篇 shanghai jiao to...
  • 1 篇 kyoto univ grad ...
  • 1 篇 srm univ sch eng...
  • 1 篇 pre univ kota sa...
  • 1 篇 soongsil univ sc...
  • 1 篇 univ malaysia sa...
  • 1 篇 computer science...
  • 1 篇 computer science...
  • 1 篇 microsoft res as...
  • 1 篇 mitsubishi elect...
  • 1 篇 doshisha univ gr...
  • 1 篇 ctr open data hu...

作者

  • 4 篇 gaur yashesh
  • 3 篇 zhao rui
  • 3 篇 kanda naoyuki
  • 3 篇 gong yifan
  • 3 篇 watanabe shinji
  • 3 篇 meng zhong
  • 3 篇 li jinyu
  • 2 篇 chen xie
  • 2 篇 kashiwagi yosuke
  • 2 篇 inaguma hirofumi
  • 2 篇 kawahara tatsuya
  • 2 篇 tsunoo emiru
  • 2 篇 sun eric
  • 2 篇 ochiai tsubasa
  • 2 篇 parthasarathy sa...
  • 2 篇 lu liang
  • 2 篇 woodland philip ...
  • 1 篇 hamzah a.alsayad...
  • 1 篇 zhao running
  • 1 篇 shigeru katagiri

语言

  • 29 篇 英文
检索条件"主题词=attention-based encoder-decoder"
29 条 记 录,以下是11-20 订阅
排序:
A Comparative Analysis of Generative Neural attention-based Service Chatbot
收藏 引用
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2022年 第8期13卷 742-751页
作者: Suhaili, Sinarwati Mohamad Salim, Naomie Jambli, Mohamad Nazim Pre Univ Kota Samarahan Sarawak Malaysia Univ Teknol Malaysia Fac Comp Skudai 81310 Johor Malaysia Univ Teknol Malaysia Ibnu Sina Inst Sci & Ind Res UTM Big Data Ctr Skudai 81310 Johor Malaysia Univ Malaysia Sarawak Fac Comp Sci & Informat Technol Kota Samarahan Sarawak Malaysia
Companies constantly rely on customer support to deliver pre-and post-sale services to their clients through websites, mobile devices or social media platforms such as Twitter. In assisting customers, companies employ... 详细信息
来源: 评论
Residual Language Model for End-to-end Speech Recognition  23
Residual Language Model for End-to-end Speech Recognition
收藏 引用
Interspeech Conference
作者: Tsunoo, Emiru Kashiwagi, Yosuke Narisetty, Chaitanya Watanabe, Shinji Sony Grp Corp Tokyo Japan Carnegie Mellon Univ Pittsburgh PA 15213 USA
End-to-end automatic speech recognition suffers from adaptation to unknown target domain speech despite being trained with a large amount of paired audio-text data. Recent studies estimate a linguistic bias of the mod... 详细信息
来源: 评论
Review of methods of end-to-end automatic recognition of Kazakh speech
收藏 引用
Procedia Computer Science 2024年 251卷 615-620页
作者: Yerlan Karabaliyev Kateryna Kolesnikova Nurkhan Batyrkhan International IT University 34/1 Manas str. Almaty Kazakhstan
This paper provides a comprehensive review of end-to-end automatic speech recognition methods for the Kazakh language, which is considered a low-resource language with unique phonetic and grammatical features. These f... 详细信息
来源: 评论
An End-to-End Network for Continuous Human Motion Recognition via Radar Radios
收藏 引用
IEEE SENSORS JOURNAL 2021年 第5期21卷 6487-6496页
作者: Zhao, Running Ma, Xiaolin Liu, Xinhua Liu, Jian Wuhan Univ Technol Sch Informat Engn Hubei Key Lab Broadband Wireless Commun & Sensor Wuhan 430070 Peoples R China Univ Tennessee Dept Elect Engn & Comp Sci Knoxville TN 37996 USA
Micro-Doppler-based continuous human motion recognition (HMR) has gained considerable attention recently. However, existing methods mainly rely on individual recurrent neural network or sliding-window-based approaches... 详细信息
来源: 评论
INTERNAL LANGUAGE MODEL TRAINING FOR DOMAIN-ADAPTIVE END-TO-END SPEECH RECOGNITION
INTERNAL LANGUAGE MODEL TRAINING FOR DOMAIN-ADAPTIVE END-TO-...
收藏 引用
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
作者: Meng, Zhong Kanda, Naoyuki Gaur, Yashesh Parthasarathy, Sarangarajan Sun, Eric Lu, Liang Chen, Xie Li, Jinyu Gong, Yifan Microsoft Corp Redmond WA 98052 USA
The efficacy of external language model (LM) integration with existing end-to-end (E2E) automatic speech recognition (ASR) systems can be improved significantly using the internal language model estimation (ILME) meth... 详细信息
来源: 评论
STREAMING END-TO-END SPEECH RECOGNITION WITH JOINTLY TRAINED NEURAL FEATURE ENHANCEMENT
STREAMING END-TO-END SPEECH RECOGNITION WITH JOINTLY TRAINED...
收藏 引用
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
作者: Kim, Chanwoo Garg, Abhinav Gowda, Dhananjaya Mun, Seongkyu Han, Changwoo Samsung Res Seoul South Korea
In this paper, we present a streaming end-to-end speech recognition model based on Monotonic Chunkwise attention (MoCha) jointly trained with enhancement layers. Even though the MoCha attention enables streaming speec... 详细信息
来源: 评论
Streaming End-to-End Speech Recognition for Hybrid RNN-T/attention Architecture  22
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Att...
收藏 引用
Interspeech Conference
作者: Moriya, Takafumi Tanaka, Tomohiro Ashihara, Takanori Ochiai, Tsubasa Sato, Hiroshi Ando, Atsushi Masumura, Ryo Delcroix, Marc Asami, Taichi NTT Corp Chiyoda City Tokyo Japan
We present a novel architecture with its decoding approach for improving recurrent neural network-transducer (RNN-T) performance. RNN-T is promising for building time-synchronous automatic speech recognition (ASR) sys... 详细信息
来源: 评论
INTERNAL LANGUAGE MODEL ESTIMATION FOR DOMAIN-ADAPTIVE END-TO-END SPEECH RECOGNITION
INTERNAL LANGUAGE MODEL ESTIMATION FOR DOMAIN-ADAPTIVE END-T...
收藏 引用
IEEE Spoken Language Technology Workshop (SLT)
作者: Meng, Zhong Parthasarathy, Sarangarajan Sun, Eric Gaur, Yashesh Kanda, Naoyuki Lu, Liang Chen, Xie Zhao, Rui Li, Jinyu Gong, Yifan Microsoft Corp Redmond WA 98052 USA
The external language models (LM) integration remains a challenging task for end-to-end (E2E) automatic speech recognition (ASR) which has no clear division between acoustic and language models. In this work, we propo... 详细信息
来源: 评论
END-TO-END SILENT SPEECH RECOGNITION WITH ACOUSTIC SENSING
END-TO-END SILENT SPEECH RECOGNITION WITH ACOUSTIC SENSING
收藏 引用
IEEE Spoken Language Technology Workshop (SLT)
作者: Luo, Jian Wang, Jianzong Cheng, Ning Jiang, Guilin Xiao, Jing Ping An Technol Shenzhen Co Ltd Shenzhen Peoples R China
Silent speech interfaces (SSI) has been an exciting area of recent interest. In this paper, we present a non-invasive silent speech interface that uses inaudible acoustic signals to capture people's lip movements ... 详细信息
来源: 评论
TREE-CONSTRAINED POINTER GENERATOR FOR END-TO-END CONTEXTUAL SPEECH RECOGNITION
TREE-CONSTRAINED POINTER GENERATOR FOR END-TO-END CONTEXTUAL...
收藏 引用
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
作者: Sun, Guangzhi Zhang, Chao Woodland, Philip C. Univ Cambridge Engn Dept Trumpington St Cambridge CB2 1PZ England
Contextual knowledge is important for real-world automatic speech recognition (ASR) applications. In this paper, a novel tree-constrained pointer generator (TCPGen) component is proposed that incorporates such knowled... 详细信息
来源: 评论