咨询与建议

限定检索结果

文献类型

  • 328 篇 会议
  • 129 篇 期刊文献

馆藏范围

  • 457 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 320 篇 工学
    • 241 篇 计算机科学与技术...
    • 210 篇 软件工程
    • 98 篇 信息与通信工程
    • 27 篇 生物工程
    • 18 篇 控制科学与工程
    • 17 篇 化学工程与技术
    • 16 篇 电气工程
    • 14 篇 电子科学与技术(可...
    • 13 篇 仪器科学与技术
    • 11 篇 生物医学工程(可授...
    • 7 篇 机械工程
    • 7 篇 建筑学
    • 6 篇 安全科学与工程
    • 5 篇 土木工程
    • 5 篇 农业工程
  • 170 篇 理学
    • 122 篇 物理学
    • 58 篇 数学
    • 32 篇 生物学
    • 22 篇 统计学(可授理学、...
    • 17 篇 化学
    • 10 篇 系统科学
  • 78 篇 管理学
    • 69 篇 图书情报与档案管...
    • 6 篇 管理科学与工程(可...
  • 15 篇 医学
    • 13 篇 基础医学(可授医学...
    • 13 篇 临床医学
    • 8 篇 药学(可授医学、理...
    • 6 篇 公共卫生与预防医...
  • 9 篇 法学
    • 7 篇 社会学
  • 8 篇 文学
    • 6 篇 中国语言文学
    • 5 篇 外国语言文学
  • 6 篇 教育学
  • 5 篇 农学
  • 1 篇 经济学

主题

  • 47 篇 speech recogniti...
  • 31 篇 speech
  • 30 篇 training
  • 18 篇 acoustics
  • 14 篇 machine translat...
  • 13 篇 decoding
  • 12 篇 social networkin...
  • 12 篇 speaker recognit...
  • 11 篇 hidden markov mo...
  • 11 篇 computational mo...
  • 11 篇 semantics
  • 10 篇 conferences
  • 9 篇 speech processin...
  • 9 篇 computational li...
  • 9 篇 feature extracti...
  • 9 篇 embeddings
  • 8 篇 training data
  • 8 篇 natural language...
  • 8 篇 pipelines
  • 7 篇 lattices

机构

  • 88 篇 human language t...
  • 54 篇 human language t...
  • 43 篇 center for langu...
  • 21 篇 center for langu...
  • 20 篇 human language t...
  • 20 篇 human language t...
  • 18 篇 center for langu...
  • 15 篇 human language t...
  • 13 篇 center for langu...
  • 12 篇 human language t...
  • 11 篇 human language t...
  • 10 篇 johns hopkins un...
  • 9 篇 johns hopkins un...
  • 8 篇 human language t...
  • 7 篇 human language t...
  • 7 篇 department of co...
  • 7 篇 xiaomi corp.
  • 6 篇 computer and inf...
  • 6 篇 xiaomi corporati...
  • 6 篇 center for langu...

作者

  • 64 篇 dredze mark
  • 50 篇 khudanpur sanjee...
  • 43 篇 van durme benjam...
  • 30 篇 dehak najim
  • 27 篇 sanjeev khudanpu...
  • 21 篇 post matt
  • 20 篇 mcnamee paul
  • 20 篇 hermansky hynek
  • 20 篇 callison-burch c...
  • 19 篇 villalba jesús
  • 18 篇 povey daniel
  • 16 篇 duh kevin
  • 16 篇 mayfield james
  • 15 篇 zelasko piotr
  • 15 篇 daniel povey
  • 15 篇 watanabe shinji
  • 14 篇 wiesner matthew
  • 14 篇 andrews nicholas
  • 13 篇 paul michael j.
  • 13 篇 mccree alan

语言

  • 448 篇 英文
  • 9 篇 其他
检索条件"机构=Human Language Technology Center of Excellence and Center for Language and Speech Processing"
457 条 记 录,以下是61-70 订阅
排序:
Earnings-21: A practical benchmark for ASR in the wild
arXiv
收藏 引用
arXiv 2021年
作者: Del Rio, Miguel Delworth, Natalie Westerman, Ryan Huang, Michelle Bhandari, Nishchal Palakapilly, Joseph McNamara, Quinten Dong, Joshua Zelasko, Piotr Jetté, Miguel *** Center for Language and Speech Processing Human Language Technology Center of Excellence Johns Hopkins University BaltimoreMD United States
Commonly used speech corpora inadequately challenge academic and commercial ASR systems. In particular, speech corpora lack metadata needed for detailed analysis and WER measurement. In response, we present Earnings-2... 详细信息
来源: 评论
A data-driven approach to estimating post-discovery parameters of unexplored oilfields
收藏 引用
Petroleum 2023年 第2期9卷 285-300页
作者: Fransiscus Pratikto Sapto Indratno Kadarsah Suryadi Djoko Santoso Industrial Engineering Department Bandung Institute of TechnologyJl Ganesha 10Bandung40132Indonesia Mathematics Department Bandung Institute of TechnologyJl Ganesha 10Bandung40132Indonesia Geophysical Engineering Department Bandung Institute of TechnologyJl Ganesha 10Bandung40132Indonesia University Center of Excellence on Artificial Intelligence for Vision Institut Teknologi BandungNatural Language Processing&Big Data Analytics(U-CoE AI-VLB)Bandung 40132West JavaIndonesia
Consider a typical situation where an investor is considering acquiring an unexplored *** oilfield has undergone a preliminary geological and geophysical study in which pre-discovery data such as lithology,depth,depos... 详细信息
来源: 评论
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
arXiv
收藏 引用
arXiv 2025年
作者: Reddy, Arun Martin, Alexander Yang, Eugene Yates, Andrew Sanders, Kate Murray, Kenton Kriz, Reno de Melo, Celso M. Van Durme, Benjamin Chellappa, Rama Johns Hopkins Applied Physics Laboratory China Johns Hopkins University United States Human Language Technology Center of Excellence DEVCOM Army Research Laboratory
In this work, we tackle the problem of text-to-video retrieval (T2VR). Inspired by the success of late interaction techniques in text-document, text-image, and text-video retrieval, our approach, Video-ColBERT, introd... 详细信息
来源: 评论
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer
arXiv
收藏 引用
arXiv 2023年
作者: Salesky, Elizabeth Verma, Neha Koehn, Philipp Post, Matt Johns Hopkins University United States Human Language Technology Center of Excellence United States Microsoft United States
We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations. We experiment with two different data settings with a variety of language and script coverage, ... 详细信息
来源: 评论
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream end-to-end ASR
Two-Stage Augmentation and Adaptive CTC Fusion for Improved ...
收藏 引用
IEEE Spoken language technology Workshop
作者: Ruizhi Li Gregory Sell Hynek Hermansky Center for Language and Speech Processing The Johns Hopkins University USA Human Language Technology Center of Excellence The Johns Hopkins University USA
Performance degradation of an Automatic speech Recognition (ASR) system is commonly observed when the test acoustic condition is different from training. Hence, it is essential to make ASR systems robust against vario... 详细信息
来源: 评论
Exploring Prompt-based Multi-task Learning for Multimodal Dialog State Tracking and Immersive Multimodal Conversation  11
Exploring Prompt-based Multi-task Learning for Multimodal Di...
收藏 引用
11th Dialog System technology Challenge, DSTC 2023
作者: Chen, Yirong Li, Ya Wang, Tao Xing, Xiaofen Xu, Xiangmin Liu, Quan Liu, Cong Hu, Guoping Guangdong Provincial Key Laboratory of Human Digital Twin School of EE South China University of Technology Guangzhou China iFLYTEK Research Hefei China Pazhou Lab. Guangzhou China School of Future Technology South China University of Technology Guangzhou China State Key Laboratory of Cognitive Intelligence Hefei China National Engineering Research Center of Speech and Language Information Processing Hefei China
With the rise of the metaverse, immersive multimodal conversation has attracted more and more researchers’ attention. Multimodal contexts will become more important for human-computer interaction in the metaverse, es... 详细信息
来源: 评论
Ambiguous Images With human Judgments for Robust Visual Event Classification
arXiv
收藏 引用
arXiv 2022年
作者: Sanders, Kate Kriz, Reno Liu, Anqi Van Durme, Benjamin Johns Hopkins University Human Language Technology Center of Excellence United States
Contemporary vision benchmarks predominantly consider tasks on which humans can achieve near-perfect performance. However, humans are frequently presented with visual data that they cannot classify with 100% certainty... 详细信息
来源: 评论
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models
arXiv
收藏 引用
arXiv 2021年
作者: Wiesner, Matthew Raj, Desh Khudanpur, Sanjeev Human Language Technology Center of Excellence Johns Hopkins University United States Center for Language and Speech Processing Johns Hopkins University United States
Self-supervised model pre-training has recently garnered significant interest, but relatively few efforts have explored using additional resources in fine-tuning these models. We demonstrate how universal phoneset aco... 详细信息
来源: 评论
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer
Focus on the Present: A Regularization Method for the ASR So...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Nanxin Chen Piotr Żelasko Jesús Villalba Najim Dehak Center for Language and Speech Processing Johns Hopkins University Baltimore MD Human Language Technology Center of Excellence Johns Hopkins University Baltimore MD
This paper introduces a novel method to diagnose the source-target attention in state-of-the-art end-to-end speech recognition models with joint connectionist temporal classification (CTC) and attention training. Our ... 详细信息
来源: 评论
Radically old way of computing spectra: Applications in end-to-end ASR
arXiv
收藏 引用
arXiv 2021年
作者: Sadhu, Samik Hermansky, Hynek Center for Language and Speech Processing Johns Hopkins University United States Human Language Technology Center of Excellence Johns Hopkins University United States
We propose a technique to compute spectrograms using Frequency Domain Linear Prediction (FDLP) that uses all-pole models to fit the squared Hilbert envelope of speech in different frequency sub-bands. The spectrogram ... 详细信息
来源: 评论