咨询与建议

限定检索结果

文献类型

  • 328 篇 会议
  • 129 篇 期刊文献

馆藏范围

  • 457 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 320 篇 工学
    • 241 篇 计算机科学与技术...
    • 210 篇 软件工程
    • 98 篇 信息与通信工程
    • 27 篇 生物工程
    • 18 篇 控制科学与工程
    • 17 篇 化学工程与技术
    • 16 篇 电气工程
    • 14 篇 电子科学与技术(可...
    • 13 篇 仪器科学与技术
    • 11 篇 生物医学工程(可授...
    • 7 篇 机械工程
    • 7 篇 建筑学
    • 6 篇 安全科学与工程
    • 5 篇 土木工程
    • 5 篇 农业工程
  • 170 篇 理学
    • 122 篇 物理学
    • 58 篇 数学
    • 32 篇 生物学
    • 22 篇 统计学(可授理学、...
    • 17 篇 化学
    • 10 篇 系统科学
  • 78 篇 管理学
    • 69 篇 图书情报与档案管...
    • 6 篇 管理科学与工程(可...
  • 15 篇 医学
    • 13 篇 基础医学(可授医学...
    • 13 篇 临床医学
    • 8 篇 药学(可授医学、理...
    • 6 篇 公共卫生与预防医...
  • 9 篇 法学
    • 7 篇 社会学
  • 8 篇 文学
    • 6 篇 中国语言文学
    • 5 篇 外国语言文学
  • 6 篇 教育学
  • 5 篇 农学
  • 1 篇 经济学

主题

  • 47 篇 speech recogniti...
  • 31 篇 speech
  • 30 篇 training
  • 18 篇 acoustics
  • 14 篇 machine translat...
  • 13 篇 decoding
  • 12 篇 social networkin...
  • 12 篇 speaker recognit...
  • 11 篇 hidden markov mo...
  • 11 篇 computational mo...
  • 11 篇 semantics
  • 10 篇 conferences
  • 9 篇 speech processin...
  • 9 篇 computational li...
  • 9 篇 feature extracti...
  • 9 篇 embeddings
  • 8 篇 training data
  • 8 篇 natural language...
  • 8 篇 pipelines
  • 7 篇 lattices

机构

  • 88 篇 human language t...
  • 54 篇 human language t...
  • 43 篇 center for langu...
  • 21 篇 center for langu...
  • 20 篇 human language t...
  • 20 篇 human language t...
  • 18 篇 center for langu...
  • 15 篇 human language t...
  • 13 篇 center for langu...
  • 12 篇 human language t...
  • 11 篇 human language t...
  • 10 篇 johns hopkins un...
  • 9 篇 johns hopkins un...
  • 8 篇 human language t...
  • 7 篇 human language t...
  • 7 篇 department of co...
  • 7 篇 xiaomi corp.
  • 6 篇 computer and inf...
  • 6 篇 xiaomi corporati...
  • 6 篇 center for langu...

作者

  • 64 篇 dredze mark
  • 50 篇 khudanpur sanjee...
  • 43 篇 van durme benjam...
  • 30 篇 dehak najim
  • 27 篇 sanjeev khudanpu...
  • 21 篇 post matt
  • 20 篇 mcnamee paul
  • 20 篇 hermansky hynek
  • 20 篇 callison-burch c...
  • 19 篇 villalba jesús
  • 18 篇 povey daniel
  • 16 篇 duh kevin
  • 16 篇 mayfield james
  • 15 篇 zelasko piotr
  • 15 篇 daniel povey
  • 15 篇 watanabe shinji
  • 14 篇 wiesner matthew
  • 14 篇 andrews nicholas
  • 13 篇 paul michael j.
  • 13 篇 mccree alan

语言

  • 448 篇 英文
  • 9 篇 其他
检索条件"机构=Human Language Technology Center of Excellence and Center for Language and Speech Processing"
457 条 记 录,以下是111-120 订阅
排序:
CopyPaste: An augmentation method for speech emotion recognition
arXiv
收藏 引用
arXiv 2020年
作者: Pappagari, Raghavendra Villalba, Jesús Zelasko, Piotr Moro-Velazquez, Laureano Dehak, Najim Center for Language and Speech Processing United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States
Data augmentation is a widely used strategy for training robust machine learning models. It partially alleviates the problem of limited data for tasks like speech emotion recognition (SER), where collecting data is ex... 详细信息
来源: 评论
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocab...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Xiaohui Zhang Daniel Povey Sanjeev Khudanpur Facebook AI US Center for Language and Speech Processing & Human Language Technology Center of Excellence The Johns Hopkins University Baltimore MD US
In this paper, we investigate out-of-vocabulary (OOV) word recovery in hybrid automatic speech recognition (ASR) systems, with emphasis on dynamic vocabulary expansion for both Weight Finite State Transducer (WFST)-ba...
来源: 评论
Multi-class spectral clustering with overlaps for speaker diarization
arXiv
收藏 引用
arXiv 2020年
作者: Raj, Desh Huang, Zili Khudanpur, Sanjeev Center for Language and Speech Processing United States Human Language Technology Center of Excellence The Johns Hopkins University BaltimoreMD21218 United States
This paper describes a method for overlap-aware speaker diarization. Given an overlap detector and a speaker embedding extractor, our method performs spectral clustering of segments informed by the output of the overl... 详细信息
来源: 评论
The attentional bias of gelotophobes towards emotion words containing the Chinese character for ‘laugh’: An eye-tracking approach
收藏 引用
Current Psychology 2023年 第19期42卷 16330-16343页
作者: Lee, Yen-Lin Chen, Hsueh-Chih Chan, Yu-Chen Institute of Learning Sciences and Technologies National Tsing Hua University Hsinchu Taiwan Department of Educational Psychology and Counseling National Taiwan Normal University Taipei Taiwan Institute for Research Excellence in Learning Sciences National Taiwan Normal University Taipei Taiwan Chinese Language and Technology Center National Taiwan Normal University Taipei Taiwan MOST AI Biomedical Research Center Taipei Taiwan Department of Educational Psychology and Counseling National Tsing Hua University 101 Sec. 2 Kuang Fu Road Hsinchu 30013 Taiwan Cognitive and Human Affective Neuroscience Laboratory CHAN Lab NTHU Hsinchu Taiwan Research Center for Education and Mind Sciences NTHU Hsinchu Taiwan
Gelotophobes are typically characterized by the fear of laughter, social withdrawal, and humorlessness, possibly related to negative experiences of being laughed at in the past. The present study seeks to expand our u... 详细信息
来源: 评论
Eye movement patterns are similar during accurate multiple-target tracking
Eye movement patterns are similar during accurate multiple-t...
收藏 引用
International Conference on Cognitive Infocommunications (CogInfoCom)
作者: Kamyar Bagha Shiva Kamkar Hamid Abrishami Moghaddam Lauri Oksama Jie Li Jukka Hyönä Computer Engineering Department Khatam University Tehran Iran Machine Vision and Medical Image Processing (MVMIP) Laboratory Faculty of Electrical Engineering K.N.Toosi University of Technology Tehran Iran Center for International Scientific Studies and Collaboration (CISSC) Tehran Iran Department of Psychology and Speech-Language Pathology University of Turku Turku Finland Center for Cognition and Brain Disorders Hangzhou Normal University Hangzhou China
Understanding how the brain works is a base of cognitive info-communication. To this aim we focus on multiple target tracking (MTT) as a key task that involves two important cognitive factors, attention and memory. Hu... 详细信息
来源: 评论
MMMORRF: Multimodal Multilingual MOdularized Reciprocal Rank Fusion
arXiv
收藏 引用
arXiv 2025年
作者: Samuel, Saron Degenaro, Dan Guallar-Blasco, Jimena Sanders, Kate Eisape, Oluwaseun Spendlove, Tanner Reddy, Arun Martin, Alexander Yates, Andrew Yang, Eugene Carpenter, Cameron Etter, David Kayi, Efsun Wiesner, Matthew Murray, Kenton Kriz, Reno Stanford University StanfordCA United States Georgetown University WashingtonDC United States University of California Berkeley BerkeleyCA United States Johns Hopkins University United States Applied Physics Laboratory BaltimoreMD United States Human Language Technology Center of Excellence BaltimoreMD United States
Videos inherently contain multiple modalities, including visual events, text overlays, sounds, and speech, all of which are important for retrieval. However, state-of-the-art multimodal language models like VAST and L... 详细信息
来源: 评论
Discovering Phonetic Inventories with Crosslingual Automatic speech Recognition
arXiv
收藏 引用
arXiv 2022年
作者: Żelasko, Piotr Feng, Siyuan Velázquez, Laureano Moro Abavisani, Ali Bhati, Saurabhchand Scharenborg, Odette Hasegawa-Johnson, Mark Dehak, Najim Center of Language and Speech Processing The Johns Hopkins University 3400 North Charles Street BaltimoreMD21218 United States Human Language Technology Center of Excellence The Johns Hopkins University 810 Wyman Park Drive BaltimoreMD21218 United States Multimedia Computing Group Delft University of Technology Van Mourik Broekmanweg 6 Delft2628 XE Netherlands Department of Electrical and Computer Engineering University of Illinois 405 N Mathews UrbanaIL61801 United States
The high cost of data acquisition makes Automatic speech Recognition (ASR) model training problematic for most existing languages, including languages that do not even have a written script, or for which the phone inv... 详细信息
来源: 评论
An asynchronous wfst-based decoder for automatic speech recognition
arXiv
收藏 引用
arXiv 2021年
作者: Lv, Hang Chen, Zhehuai Xu, Hainan Povey, Daniel Xie, Lei Khudanpur, Sanjeev School of Computer Science Northwestern Polytechnical University Xi'an China Center of Language and Speech Processing United States Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States Xiaomi Corporation Beijing China SpeechLab Department of Computer Science and Engineering Shanghai Jiao Tong University China
We introduce asynchronous dynamic decoder, which adopts an efficient A∗ algorithm to incorporate big language models in the onepass decoding for large vocabulary continuous speech recognition. Unlike standard one-pass... 详细信息
来源: 评论
Frustratingly easy noise-aware training of acoustic models
arXiv
收藏 引用
arXiv 2020年
作者: Raj, Desh Villalba, Jesús Povey, Daniel Khudanpur, Sanjeev Center for Language and Speech Processing & Human Language Technology Center of Excellence The Johns Hopkins University BaltimoreMD21218 United States Xiaomi Corp. Beijing China
Environmental noises and reverberation have a detrimental effect on the performance of automatic speech recognition (ASR) systems. Multi-condition training of neural network-based acoustic models is used to deal with ... 详细信息
来源: 评论
Mixture of speaker-type PLDAs for children's speech diarization
arXiv
收藏 引用
arXiv 2020年
作者: Xie, Jiamin Sia, Suzanna García, Paola Povey, Daniel Khudanpur, Sanjeev Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD21218 United States Xiaomi Corp. Beijing China
In diarization, the PLDA is typically used to model an inference structure which assumes the variation in speech segments be induced by various speakers. The speaker variation is then learned from the training data. H... 详细信息
来源: 评论