咨询与建议

限定检索结果

文献类型

  • 146 篇 会议
  • 68 篇 期刊文献

馆藏范围

  • 214 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 151 篇 工学
    • 111 篇 计算机科学与技术...
    • 98 篇 软件工程
    • 44 篇 信息与通信工程
    • 13 篇 控制科学与工程
    • 12 篇 电气工程
    • 11 篇 电子科学与技术(可...
    • 8 篇 机械工程
    • 6 篇 生物工程
    • 5 篇 化学工程与技术
    • 5 篇 生物医学工程(可授...
    • 4 篇 光学工程
    • 2 篇 动力工程及工程热...
  • 101 篇 理学
    • 75 篇 物理学
    • 38 篇 数学
    • 19 篇 统计学(可授理学、...
    • 12 篇 系统科学
    • 7 篇 生物学
    • 5 篇 化学
    • 1 篇 地球物理学
  • 17 篇 管理学
    • 11 篇 图书情报与档案管...
    • 4 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 公共管理
  • 4 篇 医学
    • 4 篇 临床医学
    • 2 篇 基础医学(可授医学...
    • 2 篇 公共卫生与预防医...
  • 3 篇 法学
    • 2 篇 社会学
    • 1 篇 法学
  • 1 篇 经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 体育学
  • 1 篇 农学

主题

  • 51 篇 speech recogniti...
  • 15 篇 hidden markov mo...
  • 15 篇 training
  • 13 篇 neural machine t...
  • 12 篇 machine translat...
  • 12 篇 transducers
  • 11 篇 computer aided l...
  • 11 篇 decoding
  • 9 篇 recurrent neural...
  • 8 篇 speech
  • 8 篇 feature extracti...
  • 8 篇 neural network
  • 8 篇 error analysis
  • 7 篇 modelling langua...
  • 6 篇 vocabulary
  • 6 篇 optimization
  • 6 篇 handwriting reco...
  • 6 篇 humans
  • 6 篇 automatic speech...
  • 5 篇 hierarchical sys...

机构

  • 40 篇 human language t...
  • 37 篇 apptek gmbh aach...
  • 32 篇 human language t...
  • 20 篇 human language t...
  • 10 篇 human language t...
  • 9 篇 human language t...
  • 8 篇 computer science...
  • 8 篇 human language t...
  • 7 篇 spoken language ...
  • 7 篇 apptek gmbh aach...
  • 6 篇 human language t...
  • 6 篇 human language t...
  • 6 篇 human language t...
  • 5 篇 human language t...
  • 4 篇 human language t...
  • 3 篇 human language t...
  • 3 篇 rwth aachen univ...
  • 3 篇 limsi cnrs spoke...
  • 3 篇 human language t...
  • 2 篇 computer vision ...

作者

  • 141 篇 ney hermann
  • 55 篇 schlüter ralf
  • 36 篇 hermann ney
  • 16 篇 zeyer albert
  • 16 篇 zhou wei
  • 14 篇 gao yingbo
  • 14 篇 ralf schluter
  • 12 篇 ralf schlüter
  • 12 篇 mansour saab
  • 12 篇 zeineldeen moham...
  • 12 篇 michel wilfried
  • 12 篇 zens richard
  • 11 篇 herold christian
  • 10 篇 bahar parnia
  • 10 篇 peitz stephan
  • 9 篇 peter jan-thorst...
  • 9 篇 schluter ralf
  • 9 篇 freitag markus
  • 9 篇 wang weiyue
  • 8 篇 wuebker joern

语言

  • 214 篇 英文
检索条件"机构=Human Language Technology and Pattern Recognition Group Computer Science"
214 条 记 录,以下是191-200 订阅
排序:
Improved training of end-to-end attention models for speech recognition
arXiv
收藏 引用
arXiv 2018年
作者: Zeyer, Albert Irie, Kazuki Schlüter, Ralf Ney, Hermann Computer Science Department Rwth Aachen University Human Language Technology and Pattern Recognition Aachen52062 Germany AppTek United States Nnaisense Switzerland
Sequence-to-sequence attention-based models on subword units allow simple open-vocabulary end-to-end speech recognition. In this work, we show that such models can achieve competitive results on the Switchboard 300h a... 详细信息
来源: 评论
The rwth asr system for ted-lium release 2: improving hybrid hmm with specaugment
arXiv
收藏 引用
arXiv 2020年
作者: Zhou, Wei Michel, Wilfried Irie, Kazuki Kitza, Markus Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
We present a complete training pipeline to build a state-of-the-art hybrid HMM-based ASR system on the 2nd release of the TED-LIUM corpus. Data augmentation using SpecAugment is successfully applied to improve perform... 详细信息
来源: 评论
CONFORMER-BASED HYBRID ASR SYSTEM FOR SWITCHBOARD DATASET
arXiv
收藏 引用
arXiv 2021年
作者: Zeineldeen, Mohammad Xu, Jingjing Lüscher, Christoph Michel, Wilfried Gerstenberger, Alexander Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
The recently proposed conformer architecture has been successfully used for end-to-end automatic speech recognition (ASR) architectures achieving state-of-the-art performance on different datasets. To our best knowled... 详细信息
来源: 评论
RADMM: RECURRENT ADAPTIVE MIXTURE MODEL WITH APPLICATIONS TO DOMAIN ROBUST language MODELING
RADMM: RECURRENT ADAPTIVE MIXTURE MODEL WITH APPLICATIONS TO...
收藏 引用
IEEE International Conference on Acoustics, Speech and Signal Processing
作者: Kazuki Irie Shankar Kumar Michael Nirschl Hank Liao Human Language Technology and Pattern Recognition Group Computer Science Department RWTH Aachen University D-52056 Aachen Germany Google Inc. New York NY 10011 USA
We present a new architecture and a training strategy for an adaptive mixture of experts with applications to domain robust language modeling. The proposed model is designed to benefit from the scenario where the trai... 详细信息
来源: 评论
Robust Beam Search for Encoder-Decoder Attention Based Speech recognition without Length Bias
arXiv
收藏 引用
arXiv 2020年
作者: Zhou, Wei Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
As one popular modeling approach for end-to-end speech recognition, attention-based encoder-decoder models are known to suffer the length bias and corresponding beam problem. Different approaches have been applied in ... 详细信息
来源: 评论
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
arXiv
收藏 引用
arXiv 2020年
作者: Zeineldeen, Mohammad Zeyer, Albert Zhou, Wei Ng, Thomas Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University 52062 Aachen Germany AppTek GmbH Aachen52062 Germany
Following the rationale of end-to-end modeling, CTC, RNN-T or encoder-decoder-attention models for automatic speech recognition (ASR) use graphemes or grapheme-based subword units based on e.g. byte-pair encoding (BPE... 详细信息
来源: 评论
Comparing the benefit of synthetic training data for various automatic speech recognition architectures
arXiv
收藏 引用
arXiv 2021年
作者: Rossenbach, Nick Zeineldeen, Mohammad Hilmes, Benedikt Schlüter, Ralf Ney, Hermann Human Language Technology and Pattern Recognition Computer Science Department RWTH Aachen University Aachen52074 Germany AppTek GmbH Aachen52062 Germany
Recent publications on automatic-speech-recognition (ASR) have a strong focus on attention encoder-decoder (AED) architectures which tend to suffer from over-fitting in low resource scenarios. One solution to tackle t... 详细信息
来源: 评论
Confidence scores for acoustic model adaptation
Confidence scores for acoustic model adaptation
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Christian Gollan Michiel Bacchiani Human Language Technology and Pattern Recognition Computer Science Department 6 RWTH Aachen University Germany Google Inc. New York NY USA
This paper focuses on confidence scores for use in acoustic model adaptation. Frame-based confidence estimates are used in linear transform (CMLLR and MLLR) and MAP adaptation. We show that adaptation approaches with ... 详细信息
来源: 评论
ROBUST KNOWLEDGE DISTILLATION FROM RNN-T MODELS WITH NOISY TRAINING LABELS USING FULL-SUM LOSS
arXiv
收藏 引用
arXiv 2023年
作者: Zeineldeen, Mohammad Audhkhasi, Kartik Baskar, Murali Karthick Ramabhadran, Bhuvana Human Language Technology and Pattern Recognition Computer Science Department Rwth Aachen University Aachen52074 Germany Google Llc New York United States
This work studies knowledge distillation (KD) and addresses its constraints for recurrent neural network transducer (RNNT) models. In hard distillation, a teacher model transcribes large amounts of unlabelled speech t... 详细信息
来源: 评论
Warp that smile on your face: Optimal and smooth deformations for face recognition
Warp that smile on your face: Optimal and smooth deformation...
收藏 引用
International Conference on Automatic Face and Gesture recognition
作者: Tobias Gass Leonid Pishchulin Philippe Dreuw Hermann Ney Human Language Technology and Pattern Recognition Group RWTH Aachen University Germany Computer Vision Laboratory ETH Zurich Switzerland Computer Vision and Multimodal Computing MPI Informatics Saarbruecken Germany
In this work, we present novel warping algorithms for full 2D pixel-grid deformations for face recognition. Due to high variation in face appearance, face recognition is considered a very difficult task, especially if... 详细信息
来源: 评论