咨询与建议

限定检索结果

文献类型

  • 146 篇 会议
  • 68 篇 期刊文献

馆藏范围

  • 214 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 151 篇 工学
    • 111 篇 计算机科学与技术...
    • 98 篇 软件工程
    • 44 篇 信息与通信工程
    • 13 篇 控制科学与工程
    • 12 篇 电气工程
    • 11 篇 电子科学与技术(可...
    • 8 篇 机械工程
    • 6 篇 生物工程
    • 5 篇 化学工程与技术
    • 5 篇 生物医学工程(可授...
    • 4 篇 光学工程
    • 2 篇 动力工程及工程热...
  • 101 篇 理学
    • 75 篇 物理学
    • 38 篇 数学
    • 19 篇 统计学(可授理学、...
    • 12 篇 系统科学
    • 7 篇 生物学
    • 5 篇 化学
    • 1 篇 地球物理学
  • 17 篇 管理学
    • 11 篇 图书情报与档案管...
    • 4 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 公共管理
  • 4 篇 医学
    • 4 篇 临床医学
    • 2 篇 基础医学(可授医学...
    • 2 篇 公共卫生与预防医...
  • 3 篇 法学
    • 2 篇 社会学
    • 1 篇 法学
  • 1 篇 经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 体育学
  • 1 篇 农学

主题

  • 51 篇 speech recogniti...
  • 15 篇 hidden markov mo...
  • 15 篇 training
  • 13 篇 neural machine t...
  • 12 篇 machine translat...
  • 12 篇 transducers
  • 11 篇 computer aided l...
  • 11 篇 decoding
  • 9 篇 recurrent neural...
  • 8 篇 speech
  • 8 篇 feature extracti...
  • 8 篇 neural network
  • 8 篇 error analysis
  • 7 篇 modelling langua...
  • 6 篇 vocabulary
  • 6 篇 optimization
  • 6 篇 handwriting reco...
  • 6 篇 humans
  • 6 篇 automatic speech...
  • 5 篇 hierarchical sys...

机构

  • 40 篇 human language t...
  • 37 篇 apptek gmbh aach...
  • 32 篇 human language t...
  • 20 篇 human language t...
  • 10 篇 human language t...
  • 9 篇 human language t...
  • 8 篇 computer science...
  • 8 篇 human language t...
  • 7 篇 spoken language ...
  • 7 篇 apptek gmbh aach...
  • 6 篇 human language t...
  • 6 篇 human language t...
  • 6 篇 human language t...
  • 5 篇 human language t...
  • 4 篇 human language t...
  • 3 篇 human language t...
  • 3 篇 rwth aachen univ...
  • 3 篇 limsi cnrs spoke...
  • 3 篇 human language t...
  • 2 篇 computer vision ...

作者

  • 141 篇 ney hermann
  • 55 篇 schlüter ralf
  • 36 篇 hermann ney
  • 16 篇 zeyer albert
  • 16 篇 zhou wei
  • 14 篇 gao yingbo
  • 14 篇 ralf schluter
  • 12 篇 ralf schlüter
  • 12 篇 mansour saab
  • 12 篇 zeineldeen moham...
  • 12 篇 michel wilfried
  • 12 篇 zens richard
  • 11 篇 herold christian
  • 10 篇 bahar parnia
  • 10 篇 peitz stephan
  • 9 篇 peter jan-thorst...
  • 9 篇 schluter ralf
  • 9 篇 freitag markus
  • 9 篇 wang weiyue
  • 8 篇 wuebker joern

语言

  • 214 篇 英文
检索条件"机构=Human Language Technology and Pattern Recognition Group Computer Science"
214 条 记 录,以下是181-190 订阅
排序:
A new training pipeline for an improved neural transducer
arXiv
收藏 引用
arXiv 2020年
作者: Zeyer, Albert Merboldt, André Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52062 Germany AppTek GmbH Aachen52062 Germany
The RNN transducer is a promising end-to-end model candidate. We compare the original training criterion with the full marginalization over all alignments, to the commonly used maximum approximation, which simplifies,... 详细信息
来源: 评论
LSTM language models for LVCSR in first-pass decoding and lattice-rescoring
arXiv
收藏 引用
arXiv 2019年
作者: Beck, Eugen Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
LSTM based language models are an important part of modern LVCSR systems as they significantly improve performance over traditional backoff language models. Incorporating them efficiently into decoding has been notori... 详细信息
来源: 评论
Generating synthetic audio data for attention-based speech recognition systems
arXiv
收藏 引用
arXiv 2019年
作者: Rossenbach, Nick Zeyer, Albert Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Germany AppTek GmbH 52074 Aachen Aachen52062 Germany
Recent advances in text-to-speech (TTS) led to the development of flexible multi-speaker end-to-end TTS systems. We extend state-of-the-art attention-based automatic speech recognition (ASR) systems with synthetic aud... 详细信息
来源: 评论
ON THE RELATION BETWEEN INTERNAL language MODEL AND SEQUENCE DISCRIMINATIVE TRAINING FOR NEURAL TRANSDUCERS
arXiv
收藏 引用
arXiv 2023年
作者: Yang, Zijian Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
Internal language model (ILM) subtraction has been widely applied to improve the performance of the RNN-Transducer with external language model (LM) fusion for speech recognition. In this work, we show that sequence d... 详细信息
来源: 评论
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention - w/o Data Augmentation
arXiv
收藏 引用
arXiv 2019年
作者: Lüscher, Christoph Beck, Eugen Irie, Kazuki Kitza, Markus Michel, Wilfried Zeyer, Albert Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descript... 详细信息
来源: 评论
On language model integration for RNN transducer based speech recognition
arXiv
收藏 引用
arXiv 2021年
作者: Zhou, Wei Zheng, Zuoyun Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
The mismatch between an external language model (LM) and the implicitly learned internal LM (ILM) of RNN-Transducer (RNN-T) can limit the performance of LM integration such as simple shallow fusion. A Bayesian interpr... 详细信息
来源: 评论
ENHANCING AND ADVERSARIAL: IMPROVE ASR WITH SPEAKER LABELS
arXiv
收藏 引用
arXiv 2022年
作者: Zhou, Wei Wu, Haotian Xu, Jingjing Zeineldeen, Mohammad Lüscher, Christoph Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
ASR can be improved by multi-task learning (MTL) with domain enhancing or domain adversarial training, which are two opposite objectives with the aim to increase/decrease domain variance towards domain-aware/agnostic ... 详细信息
来源: 评论
Equivalence of segmental and neural transducer modeling: A proof of concept
arXiv
收藏 引用
arXiv 2021年
作者: Zhou, Wei Zeyer, Albert Merboldt, André Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
With the advent of direct models in automatic speech recognition (ASR), the formerly prevalent frame-wise acoustic modeling based on hidden Markov models (HMM) diversified into a number of modeling architectures like ... 详细信息
来源: 评论
MORPHEME-BASED FEATURE-RICH language MODELS USING DEEP NEURAL NETWORKS FOR LVCSR OF EGYPTIAN ARABIC
MORPHEME-BASED FEATURE-RICH LANGUAGE MODELS USING DEEP NEURA...
收藏 引用
IEEE International Conference on Acoustics, Speech, and Signal Processing
作者: Amr El-Desoky Mousa Hong-Kwang Jeff Kuo Lidia Mangu Hagen Soltau Human Language Technology and Pattern Recognition - Computer Science Department RWTH Aachen University IBM T. J. Watson Research Center
Egyptian Arabic (EA) is a colloquial version of Arabic. It is a low-resource morphologically rich language that causes problems in Large Vocabulary Continuous Speech recognition (LVCSR). Building LMs on morpheme level... 详细信息
来源: 评论
Cumulative adaptation for BLSTM acoustic models
arXiv
收藏 引用
arXiv 2019年
作者: Kitza, Markus Golik, Pavel Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
This paper addresses the robust speech recognition problem as an adaptation task. Specifically, we investigate the cumulative application of adaptation methods. A bidirectional Long Short-Term Memory (BLSTM) based neu... 详细信息
来源: 评论