咨询与建议

限定检索结果

文献类型

  • 267 篇 会议
  • 155 篇 期刊文献

馆藏范围

  • 422 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 282 篇 工学
    • 184 篇 计算机科学与技术...
    • 164 篇 软件工程
    • 111 篇 信息与通信工程
    • 28 篇 生物工程
    • 27 篇 电子科学与技术(可...
    • 24 篇 电气工程
    • 23 篇 控制科学与工程
    • 21 篇 仪器科学与技术
    • 19 篇 化学工程与技术
    • 11 篇 机械工程
    • 8 篇 生物医学工程(可授...
    • 6 篇 光学工程
    • 5 篇 建筑学
    • 4 篇 土木工程
    • 3 篇 材料科学与工程(可...
  • 176 篇 理学
    • 137 篇 物理学
    • 56 篇 数学
    • 31 篇 生物学
    • 19 篇 化学
    • 16 篇 统计学(可授理学、...
    • 8 篇 系统科学
  • 44 篇 管理学
    • 37 篇 图书情报与档案管...
    • 7 篇 管理科学与工程(可...
  • 11 篇 法学
    • 11 篇 社会学
  • 8 篇 医学
    • 7 篇 临床医学
    • 6 篇 基础医学(可授医学...
    • 5 篇 药学(可授医学、理...
  • 7 篇 文学
    • 6 篇 中国语言文学
    • 5 篇 外国语言文学
  • 4 篇 教育学
    • 4 篇 教育学
  • 3 篇 农学
  • 2 篇 艺术学

主题

  • 59 篇 speech recogniti...
  • 52 篇 training
  • 33 篇 acoustics
  • 31 篇 speech
  • 20 篇 speech processin...
  • 19 篇 feature extracti...
  • 18 篇 hidden markov mo...
  • 18 篇 signal processin...
  • 16 篇 computational mo...
  • 15 篇 conferences
  • 14 篇 speech enhanceme...
  • 13 篇 predictive model...
  • 13 篇 decoding
  • 12 篇 machine translat...
  • 11 篇 speech synthesis
  • 10 篇 training data
  • 10 篇 neural networks
  • 10 篇 data models
  • 9 篇 transformers
  • 9 篇 self-supervised ...

机构

  • 71 篇 national enginee...
  • 51 篇 human language t...
  • 46 篇 center for langu...
  • 31 篇 human language t...
  • 21 篇 center for langu...
  • 21 篇 center for langu...
  • 13 篇 center for langu...
  • 11 篇 iflytek research
  • 10 篇 center for langu...
  • 9 篇 ict cluster sing...
  • 9 篇 human language t...
  • 8 篇 national enginee...
  • 8 篇 center for langu...
  • 8 篇 human language t...
  • 7 篇 center for langu...
  • 7 篇 human language t...
  • 7 篇 university of sc...
  • 7 篇 xiaomi corp.
  • 6 篇 university of sc...
  • 6 篇 state key labora...

作者

  • 49 篇 ling zhen-hua
  • 47 篇 khudanpur sanjee...
  • 35 篇 dehak najim
  • 32 篇 ai yang
  • 29 篇 sanjeev khudanpu...
  • 23 篇 zhen-hua ling
  • 23 篇 dredze mark
  • 19 篇 povey daniel
  • 19 篇 yang ai
  • 18 篇 villalba jesús
  • 18 篇 van durme benjam...
  • 18 篇 daniel povey
  • 17 篇 post matt
  • 16 篇 hermansky hynek
  • 16 篇 lu ye-xin
  • 15 篇 zelasko piotr
  • 14 篇 du hui-peng
  • 13 篇 raj desh
  • 13 篇 gu jia-chen
  • 13 篇 watanabe shinji

语言

  • 344 篇 英文
  • 78 篇 其他
  • 2 篇 中文
检索条件"机构=Center for Language and Speech Processing & Human Language Technology"
422 条 记 录,以下是21-30 订阅
排序:
Stargan-vc Based Cross-Domain Data Augmentation for Speaker Verification  48
Stargan-vc Based Cross-Domain Data Augmentation for Speaker ...
收藏 引用
48th IEEE International Conference on Acoustics, speech and Signal processing, ICASSP 2023
作者: Hu, Hang-Rui Song, Yan Zhang, Jian-Tao Dai, Li-Rong McLoughlin, Ian Zhuo, Zhu Zhou, Yu Li, Yu-Hong Xue, Hui University of Science and Technology of China National Engineering Research Center for Speech and Language Information Processing Hefei China Alibaba Group China
Automatic speaker verification (ASV) faces domain shift caused by the mismatch of intrinsic and extrinsic factors, such as recording device and speaking style, in real-world applications, which leads to severe perform... 详细信息
来源: 评论
Faux Polyglot: A Study on Information Disparity in Multilingual Large language Models
arXiv
收藏 引用
arXiv 2024年
作者: Sharma, Nikhil Murray, Kenton Xiao, Ziang Johns Hopkins University United States Center for Speech and Language Processing United States Human Language Technology Center for Excellence United States
Although the multilingual capability of LLMs offers new opportunities to overcome the language barrier, do these capabilities translate into real-life scenarios where linguistic divide and knowledge conflicts between ... 详细信息
来源: 评论
Joint Energy-Based Model for Robust speech Classification System Against Dirty-Label Backdoor Poisoning Attacks
Joint Energy-Based Model for Robust Speech Classification Sy...
收藏 引用
2023 IEEE Automatic speech Recognition and Understanding Workshop, ASRU 2023
作者: Sustek, Martin Joshi, Sonal Li, Henry Thebaud, Thomas Villalba, Jesus Khudanpur, Sanjeev Dehak, Najim Johns Hopkins University Center for Language and Speech Processing BaltimoreMD United States Brno University of Technology Faculty of Information Technology Czech Republic
Our novel technique utilizes a Joint Energy-based Model (JEM) that integrates both discriminative and generative approaches to increase resistance against dirty-label backdoor attacks. Our approach is especially effec... 详细信息
来源: 评论
Adapting Self-Supervised Models to Multi-Talker speech Recognition Using Speaker Embeddings
Adapting Self-Supervised Models to Multi-Talker Speech Recog...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Zili Huang Desh Raj Paola García Sanjeev Khudanpur Center for Language and Speech Processing and Human Language Technology Center of Excellence Johns Hopkins University Baltimore USA
Self-supervised learning (SSL) methods which learn representations of data without explicit supervision have gained popularity in speech-processing tasks, particularly for single-talker applications. However, these mo... 详细信息
来源: 评论
TEAR: A Cross-Modal Pre-Trained Text Encoder Enhanced by Acoustic Representations for speech Synthesis
IEEE Transactions on Audio, Speech and Language Processing
收藏 引用
IEEE Transactions on Audio, speech and language processing 2025年 33卷 1117-1128页
作者: Shiming Wang Yang Ai Liping Chen Yajun Hu Zhenhua Ling National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
Text encoders play an important role in text-to-speech (TTS) by analyzing text input and converting it into linguistic representations. In order to generate expressive speech from text, pre-training text encoders on l... 详细信息
来源: 评论
PQLM - Multilingual Decentralized Portable Quantum language Model  48
PQLM - Multilingual Decentralized Portable Quantum Language ...
收藏 引用
48th IEEE International Conference on Acoustics, speech and Signal processing, ICASSP 2023
作者: Li, Shuyue Stella Zhang, Xiangyu Zhou, Shu Shu, Hongchao Liang, Ruixing Liu, Hexin Garcia, Leibny Paola Hong Kong University of Science and Technology Department of Physics Hong Kong Nanyang Technological University School of Electrical and Electronic Engineering Singapore Johns Hopkins University Center for Language and Speech Processing United States Johns Hopkins University Human Language Technology Center of Excellence United States
With careful manipulation, malicious agents can reverse engineer private information encoded in pre-trained language models. Security concerns motivate the development of quantum pre-training. In this work, we propose... 详细信息
来源: 评论
ERVQ: Enhanced Residual Vector Quantization With Intra-and-Inter-Codebook Optimization for Neural Audio Codecs
IEEE Transactions on Audio, Speech and Language Processing
收藏 引用
IEEE Transactions on Audio, speech and language processing 2025年 33卷 2539-2550页
作者: Rui-Chen Zheng Hui-Peng Du Xiao-Hang Jiang Yang Ai Zhen-Hua Ling National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
Current neural audio codecs typically use residual vector quantization (RVQ) to discretize audio signals. However, they often experience codebook collapse, which reduces the effective codebook size and leads to subopt... 详细信息
来源: 评论
Towards High-Quality and Efficient speech Bandwidth Extension With Parallel Amplitude and Phase Prediction
IEEE Transactions on Audio, Speech and Language Processing
收藏 引用
IEEE Transactions on Audio, speech and language processing 2024年 33卷 236-250页
作者: Ye-Xin Lu Yang Ai Hui-Peng Du Zhen-Hua Ling National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
speech bandwidth extension (BWE) refers to widening the frequency bandwidth range of speech signals, enhancing the speech quality towards brighter and fuller. This paper proposes a generative adversarial network (GAN)... 详细信息
来源: 评论
Recovering document annotations for sentence-level bitext
arXiv
收藏 引用
arXiv 2024年
作者: Wicks, Rachel Post, Matt Koehn, Philipp Human Language Technology Center of Excellence Johns Hopkins University United States Center of Language and Speech Processing Johns Hopkins University United States Microsoft United States
Data availability limits the scope of any given task. In machine translation, historical models were incapable of handling longer contexts, so the lack of document-level datasets was less noticeable. Now, despite the ... 详细信息
来源: 评论
Finding Spoken Identifications: Using GPT-4 Annotation For An Efficient And Fast Dataset Creation Pipeline  30
Finding Spoken Identifications: Using GPT-4 Annotation For A...
收藏 引用
Joint 30th International Conference on Computational Linguistics and 14th International Conference on language Resources and Evaluation, LREC-COLING 2024
作者: Jahan, Maliha Wang, Helin Thebaud, Thomas Sun, Yinglun Le, Giang Fagyal, Zsuzsanna Scharenborg, Odette Hasegawa-Johnson, Mark Moro-Velazquez, Laureano Dehak, Najim Center for Language and Speech Processing Johns Hopkins University BaltimoreMD United States University of Illinois Urbana-Champaign ChampaignIL United States Multimedia Computing Group Delft University of Technology Netherlands
The growing emphasis on fairness in speech-processing tasks requires datasets with speakers from diverse subgroups that allow training and evaluating fair speech technology systems. However, creating such datasets thr... 详细信息
来源: 评论