咨询与建议

限定检索结果

文献类型

  • 328 篇 会议
  • 129 篇 期刊文献

馆藏范围

  • 457 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 320 篇 工学
    • 241 篇 计算机科学与技术...
    • 210 篇 软件工程
    • 98 篇 信息与通信工程
    • 27 篇 生物工程
    • 18 篇 控制科学与工程
    • 17 篇 化学工程与技术
    • 16 篇 电气工程
    • 14 篇 电子科学与技术(可...
    • 13 篇 仪器科学与技术
    • 11 篇 生物医学工程(可授...
    • 7 篇 机械工程
    • 7 篇 建筑学
    • 6 篇 安全科学与工程
    • 5 篇 土木工程
    • 5 篇 农业工程
  • 170 篇 理学
    • 122 篇 物理学
    • 58 篇 数学
    • 32 篇 生物学
    • 22 篇 统计学(可授理学、...
    • 17 篇 化学
    • 10 篇 系统科学
  • 78 篇 管理学
    • 69 篇 图书情报与档案管...
    • 6 篇 管理科学与工程(可...
  • 15 篇 医学
    • 13 篇 基础医学(可授医学...
    • 13 篇 临床医学
    • 8 篇 药学(可授医学、理...
    • 6 篇 公共卫生与预防医...
  • 9 篇 法学
    • 7 篇 社会学
  • 8 篇 文学
    • 6 篇 中国语言文学
    • 5 篇 外国语言文学
  • 6 篇 教育学
  • 5 篇 农学
  • 1 篇 经济学

主题

  • 47 篇 speech recogniti...
  • 31 篇 speech
  • 30 篇 training
  • 18 篇 acoustics
  • 14 篇 machine translat...
  • 13 篇 decoding
  • 12 篇 social networkin...
  • 12 篇 speaker recognit...
  • 11 篇 hidden markov mo...
  • 11 篇 computational mo...
  • 11 篇 semantics
  • 10 篇 conferences
  • 9 篇 speech processin...
  • 9 篇 computational li...
  • 9 篇 feature extracti...
  • 9 篇 embeddings
  • 8 篇 training data
  • 8 篇 natural language...
  • 8 篇 pipelines
  • 7 篇 lattices

机构

  • 88 篇 human language t...
  • 54 篇 human language t...
  • 43 篇 center for langu...
  • 21 篇 center for langu...
  • 20 篇 human language t...
  • 20 篇 human language t...
  • 18 篇 center for langu...
  • 15 篇 human language t...
  • 13 篇 center for langu...
  • 12 篇 human language t...
  • 11 篇 human language t...
  • 10 篇 johns hopkins un...
  • 9 篇 johns hopkins un...
  • 8 篇 human language t...
  • 7 篇 human language t...
  • 7 篇 department of co...
  • 7 篇 xiaomi corp.
  • 6 篇 computer and inf...
  • 6 篇 xiaomi corporati...
  • 6 篇 center for langu...

作者

  • 64 篇 dredze mark
  • 50 篇 khudanpur sanjee...
  • 43 篇 van durme benjam...
  • 30 篇 dehak najim
  • 27 篇 sanjeev khudanpu...
  • 21 篇 post matt
  • 20 篇 mcnamee paul
  • 20 篇 hermansky hynek
  • 20 篇 callison-burch c...
  • 19 篇 villalba jesús
  • 18 篇 povey daniel
  • 16 篇 duh kevin
  • 16 篇 mayfield james
  • 15 篇 zelasko piotr
  • 15 篇 daniel povey
  • 15 篇 watanabe shinji
  • 14 篇 wiesner matthew
  • 14 篇 andrews nicholas
  • 13 篇 paul michael j.
  • 13 篇 mccree alan

语言

  • 447 篇 英文
  • 10 篇 其他
检索条件"机构=Human Language Technology Center of Excellence and Center for Language and Speech Processing"
457 条 记 录,以下是131-140 订阅
排序:
PYCHAIN: A fully parallelized pytorch implementation of LF-MMI for end-to-end ASR
arXiv
收藏 引用
arXiv 2020年
作者: Shao, Yiwen Wang, Yiming Povey, Daniel Khudanpur, Sanjeev Center for Language and Speech Processing Johns Hopkins University BaltimoreMD United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Xiaomi Inc. Beijing China
We present PYCHAIN, a fully parallelized PyTorch implementation of end-to-end lattice-free maximum mutual information (LF-MMI) training for the so-called chain models in the Kaldi automatic speech recognition (ASR) to... 详细信息
来源: 评论
Study of pre-processing defenses against adversarial attacks on state-of-the-art speaker recognition systems
arXiv
收藏 引用
arXiv 2021年
作者: Joshi, Sonal Villalba, Jesús Zelasko, Piotr Moro-Velázquez, Laureano Dehak, Najim Johns Hopkins University BaltimoreMD21218 United States The Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD21218 United States
Adversarial examples to speaker recognition (SR) systems are generated by adding a carefully crafted noise to the speech signal to make the system fail while being imperceptible to humans. Such attacks pose severe sec... 详细信息
来源: 评论
A call for prudent choice of subword merge operations in neural machine translation
arXiv
收藏 引用
arXiv 2019年
作者: Ding, Shuoyang Renduchintala, Adithya Duh, Kevin Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University
Most neural machine translation systems are built upon subword units extracted by methods such as Byte-Pair Encoding (BPE) or wordpiece. However, the choice of number of merge operations is generally made by following... 详细信息
来源: 评论
Machine Translation System Selection from Bandit Feedback
arXiv
收藏 引用
arXiv 2020年
作者: Naradowsky, Jason Zhang, Xuan Duh, Kevin Preferred Networks Johns Hopkins University Human Language Technology Center of Excellence
Adapting machine translation systems in the real world is a difficult problem. In contrast to offline training, users cannot provide the type of fine-grained feedback typically used for improving the system. Moreover,... 详细信息
来源: 评论
Wake Word Detection with Alignment-Free Lattice-Free MMI
arXiv
收藏 引用
arXiv 2020年
作者: Wang, Yiming Lv, Hang Povey, Daniel Xie, Lei Khudanpur, Sanjeev Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Xiaomi Inc. Beijing China ASLP@NPU School of Computer Science Northwestern Polytechnical University Xi’an China
Always-on spoken language interfaces, e.g. personal digital assistants, rely on a wake word to start processing spoken input. We present novel methods to train a hybrid DNN/HMM wake word detection system from partiall... 详细信息
来源: 评论
Software in the natural world: A computational approach to hierarchical emergence
arXiv
收藏 引用
arXiv 2024年
作者: Rosas, Fernando E. Geiger, Bernhard C. Luppi, Andrea I. Seth, Anil K. Polani, Daniel Gastpar, Michael Mediano, Pedro A.M. Department of Informatics University of Sussex United Kingdom Sussex Centre for Consciousness Science and Sussex AI University of Sussex United Kingdom Center for Psychedelic Research and Centre for Complexity Science Department of Brain Science Imperial College London United Kingdom Center for Eudaimonia and Human Flourishing University of Oxford United Kingdom Know-Center GmbH Graz Austria Signal Processing and Speech Communication Laboratory Graz University of Technology Graz Austria Montreal Neurological Institute McGill University Canada Department of Computer Science University of Hertfordshire Hatfield United Kingdom School of Computer and Communication Sciences EPFL Lausanne Switzerland Department of Computing Imperial College London United Kingdom Division of Psychology and Language Sciences University College London United Kingdom
Understanding the functional architecture of complex systems is crucial to illuminate their inner workings and enable effective methods for their prediction and control. Recent advances have introduced tools to charac... 详细信息
来源: 评论
That sounds familiar: An analysis of phonetic representations transfer across languages
arXiv
收藏 引用
arXiv 2020年
作者: Zelasko, Piotr Velazquez, Laureano Moro Johnson, Mark Hasegawa Scharenborg, Odette Dehak, Najim Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Ece Department and Beckman Institute University of Illinois Urbana-Champaign United States Multimedia Computing Group Delft University of Technology Delft Netherlands
Only a handful of the worlds languages are abundant with the resources that enable practical applications of speech processing technologies. One of the methods to overcome this problem is to use the resources existing... 详细信息
来源: 评论
Jhu-HLTCOE System for the Voxsrc Speaker Recognition Challenge
Jhu-HLTCOE System for the Voxsrc Speaker Recognition Challen...
收藏 引用
IEEE International Conference on Acoustics, speech and Signal processing
作者: Daniel Garcia-Romero Alan McCree David Snyder Gregory Sell Human Language Technology Center of Excellence Johns Hopkins University Baltimore MD USA
The VoxSRC speaker recognition challenge comprises data obtained from YouTube videos of celebrity interviews in a wide range of recording environments. The challenge provides FIXED and OPEN training conditions to allo...
来源: 评论
Script identification using across-and within-image distribution estimation  15
Script identification using across-and within-image distribu...
收藏 引用
15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
作者: Sell, Gregory Etter, David Garcia-Romero, Daniel McCree, Alan Human Language Technology Center of Excellence Johns Hopkins University Baltimore United States
In this paper, we apply several modifications to script identification, several of which inspired by techniques from the similar audio task of spoken language recognition. Specifically, we alter the architecture of a ... 详细信息
来源: 评论
How phonotactics affect multilingual and zero-shot ASR performance
arXiv
收藏 引用
arXiv 2020年
作者: Feng, Siyuan Zelasko, Piotr Moro-Velázquez, Laureano Abavisani, Ali Hasegawa-Johnson, Mark Scharenborg, Odette Dehak, Najim Multimedia Computing Group Delft University of Technology Delft Netherlands Center for Language and Speech Processing United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign IL United States
The idea of combining multiple languages’ recordings to train a single automatic speech recognition (ASR) model brings the promise of the emergence of universal speech representation. Recently, a Transformer encoder-... 详细信息
来源: 评论