咨询与建议

限定检索结果

文献类型

  • 153 篇 会议
  • 85 篇 期刊文献

馆藏范围

  • 238 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 164 篇 工学
    • 120 篇 计算机科学与技术...
    • 106 篇 软件工程
    • 46 篇 信息与通信工程
    • 14 篇 控制科学与工程
    • 12 篇 电气工程
    • 11 篇 电子科学与技术(可...
    • 9 篇 生物工程
    • 8 篇 机械工程
    • 8 篇 生物医学工程(可授...
    • 6 篇 化学工程与技术
    • 5 篇 光学工程
    • 2 篇 力学(可授工学、理...
    • 2 篇 动力工程及工程热...
  • 109 篇 理学
    • 77 篇 物理学
    • 37 篇 数学
    • 19 篇 统计学(可授理学、...
    • 13 篇 系统科学
    • 11 篇 生物学
    • 6 篇 化学
  • 19 篇 管理学
    • 12 篇 图书情报与档案管...
    • 5 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 3 篇 公共管理
  • 9 篇 医学
    • 8 篇 临床医学
    • 5 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
    • 2 篇 药学(可授医学、理...
  • 4 篇 法学
    • 4 篇 社会学
  • 4 篇 教育学
    • 2 篇 心理学(可授教育学...
  • 1 篇 哲学
    • 1 篇 哲学
  • 1 篇 经济学
  • 1 篇 文学
  • 1 篇 历史学
  • 1 篇 农学
  • 1 篇 艺术学

主题

  • 53 篇 speech recogniti...
  • 15 篇 training
  • 14 篇 hidden markov mo...
  • 13 篇 neural machine t...
  • 12 篇 machine translat...
  • 12 篇 transducers
  • 11 篇 computer aided l...
  • 11 篇 decoding
  • 9 篇 error analysis
  • 9 篇 recurrent neural...
  • 8 篇 speech
  • 8 篇 feature extracti...
  • 8 篇 neural network
  • 7 篇 modelling langua...
  • 7 篇 humans
  • 6 篇 signal processin...
  • 6 篇 vocabulary
  • 6 篇 handwriting reco...
  • 5 篇 hierarchical sys...
  • 5 篇 modeling languag...

机构

  • 40 篇 human language t...
  • 38 篇 apptek gmbh aach...
  • 32 篇 human language t...
  • 20 篇 human language t...
  • 10 篇 human language t...
  • 9 篇 human language t...
  • 8 篇 computer science...
  • 8 篇 human language t...
  • 7 篇 spoken language ...
  • 7 篇 apptek gmbh aach...
  • 6 篇 human language t...
  • 6 篇 apptek gmbh
  • 6 篇 human language t...
  • 5 篇 human language t...
  • 5 篇 human language t...
  • 4 篇 human language t...
  • 3 篇 human language t...
  • 3 篇 informatics inst...
  • 3 篇 rwth aachen univ...
  • 3 篇 limsi cnrs spoke...

作者

  • 141 篇 ney hermann
  • 58 篇 schlüter ralf
  • 37 篇 hermann ney
  • 16 篇 zeyer albert
  • 16 篇 zhou wei
  • 14 篇 gao yingbo
  • 14 篇 ralf schlüter
  • 14 篇 ralf schluter
  • 12 篇 mansour saab
  • 12 篇 zeineldeen moham...
  • 12 篇 michel wilfried
  • 12 篇 zens richard
  • 11 篇 herold christian
  • 10 篇 bahar parnia
  • 10 篇 peitz stephan
  • 9 篇 peter jan-thorst...
  • 9 篇 schluter ralf
  • 9 篇 freitag markus
  • 9 篇 yang zijian
  • 9 篇 wang weiyue

语言

  • 236 篇 英文
  • 2 篇 其他
检索条件"机构=Human Language Technology and Pattern Recognition Group Computer Science Department"
238 条 记 录,以下是181-190 订阅
排序:
Acoustic data-driven subword modeling for end-to-end speech recognition
arXiv
收藏 引用
arXiv 2021年
作者: Zhou, Wei Zeineldeen, Mohammad Zheng, Zuoyun Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
Subword units are commonly used for end-to-end automatic speech recognition (ASR), while a fully acoustic-oriented subword modeling approach is somewhat missing. We propose an acoustic data-driven subword modeling (AD... 详细信息
来源: 评论
MONOTONIC SEGMENTAL ATTENTION FOR AUTOMATIC SPEECH recognition
arXiv
收藏 引用
arXiv 2022年
作者: Zeyer, Albert Schmitt, Robin Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52062 Germany AppTek GmbH Aachen52062 Germany
We introduce a novel segmental-attention model for automatic speech recognition. We restrict the decoder attention to segments to avoid quadratic runtime of global attention, better generalize to long sequences, and e... 详细信息
来源: 评论
Librispeech transducer model with internal language model prior correction
arXiv
收藏 引用
arXiv 2021年
作者: Zeyer, Albert Merboldt, André Michel, Wilfried Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52062 Germany AppTek GmbH Aachen52062 Germany
We present our transducer model on Librispeech. We study variants to include an external language model (LM) with shallow fusion and subtract an estimated internal LM. This is justified by a Bayesian interpretation wh... 详细信息
来源: 评论
EFFICIENT SEQUENCE TRAINING OF ATTENTION MODELS USING APPROXIMATIVE RECOMBINATION
arXiv
收藏 引用
arXiv 2021年
作者: Wynands, Nils-Philipp Michel, Wilfried Rosendahl, Jan Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52062 Germany AppTek GmbH Aachen52062 Germany
Sequence discriminative training is a great tool to improve the performance of an automatic speech recognition system. It does, however, necessitate a sum over all possible word sequences, which is intractable to comp... 详细信息
来源: 评论
LSTM language models for LVCSR in first-pass decoding and lattice-rescoring
arXiv
收藏 引用
arXiv 2019年
作者: Beck, Eugen Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
LSTM based language models are an important part of modern LVCSR systems as they significantly improve performance over traditional backoff language models. Incorporating them efficiently into decoding has been notori... 详细信息
来源: 评论
ON THE RELATION BETWEEN INTERNAL language MODEL AND SEQUENCE DISCRIMINATIVE TRAINING FOR NEURAL TRANSDUCERS
arXiv
收藏 引用
arXiv 2023年
作者: Yang, Zijian Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
Internal language model (ILM) subtraction has been widely applied to improve the performance of the RNN-Transducer with external language model (LM) fusion for speech recognition. In this work, we show that sequence d... 详细信息
来源: 评论
A new training pipeline for an improved neural transducer
arXiv
收藏 引用
arXiv 2020年
作者: Zeyer, Albert Merboldt, André Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52062 Germany AppTek GmbH Aachen52062 Germany
The RNN transducer is a promising end-to-end model candidate. We compare the original training criterion with the full marginalization over all alignments, to the commonly used maximum approximation, which simplifies,... 详细信息
来源: 评论
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention - w/o Data Augmentation
arXiv
收藏 引用
arXiv 2019年
作者: Lüscher, Christoph Beck, Eugen Irie, Kazuki Kitza, Markus Michel, Wilfried Zeyer, Albert Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descript... 详细信息
来源: 评论
On language model integration for RNN transducer based speech recognition
arXiv
收藏 引用
arXiv 2021年
作者: Zhou, Wei Zheng, Zuoyun Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
The mismatch between an external language model (LM) and the implicitly learned internal LM (ILM) of RNN-Transducer (RNN-T) can limit the performance of LM integration such as simple shallow fusion. A Bayesian interpr... 详细信息
来源: 评论
Generating synthetic audio data for attention-based speech recognition systems
arXiv
收藏 引用
arXiv 2019年
作者: Rossenbach, Nick Zeyer, Albert Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Germany AppTek GmbH 52074 Aachen Aachen52062 Germany
Recent advances in text-to-speech (TTS) led to the development of flexible multi-speaker end-to-end TTS systems. We extend state-of-the-art attention-based automatic speech recognition (ASR) systems with synthetic aud... 详细信息
来源: 评论