Search query: "Institution=Center for Language and Speech Processing & Human Language Technology"
421 records; showing 121-130
Pre-training Language Model as a Multi-perspective Course Learner
arXiv, 2023
Authors: Chen, Beiduo; Huang, Shaohan; Zhang, Zihan; Guo, Wu; Ling, Zhenhua; Huang, Haizhen; Wei, Furu; Deng, Weiwei; Zhang, Qi
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; Microsoft Corporation, Beijing, China
ELECTRA (Clark et al., 2020), the generator-discriminator pre-training framework, has achieved impressive semantic construction capability among various downstream tasks. Despite the convincing performance, ELECTRA st...
Self-supervised Prosody Learning at Phoneme-level with Momentum Contrast for Speech Synthesis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zhao-Ci Liu, Ya-Jun Hu, Liping Chen, Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China; iFLYTEK Research, iFLYTEK Co. Ltd., China
This paper investigates leveraging large-scale speech data to enhance prosodic modeling in speech synthesis, and introduces a model named SP2MC which achieves self-supervised prosody learning at phoneme-level with mom...
WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition
2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Authors: Ma, Jun-Yu; Chen, Beiduo; Gu, Jia-Chen; Ling, Zhen-Hua; Guo, Wu; Liu, Quan; Chen, Zhigang; Liu, Cong
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; State Key Laboratory of Cognitive Intelligence, China; iFLYTEK Research, Hefei, China; Jilin Kexun Information Technology Co. Ltd, China
Zero-shot cross-lingual named entity recognition (NER) aims at transferring knowledge from annotated and rich-resource data in source languages to unlabeled and lean-resource data in target languages. Existing mainstr...
END-TO-END LYRICS RECOGNITION WITH SELF-SUPERVISED LEARNING
arXiv, 2022
Authors: Zhang, Xiangyu; Li, Shuyue Stella; He, Zhanhong; Togneri, Roberto; Garcia, Leibny Paola
Affiliations: Center for Language and Speech Processing, Johns Hopkins University, United States; Human Language Technology Center of Excellence, Johns Hopkins University, United States; Department of Computer Science, University of Western Australia, Australia
Lyrics recognition is an important task in music processing. Despite traditional algorithms such as the hybrid HMM-TDNN model achieving good performance, studies on applying end-to-end models and self-supervised learn...
Deep CLAS: Deep Contextual Listen, Attend and Spell
arXiv, 2024
Authors: Wang, Mengzhi; Xiong, Shifu; Wan, Genshun; Chen, Hang; Gao, Jianqing; Dai, Lirong
Affiliations: iFLYTEK Research, iFLYTEK Co. Ltd., Hefei 230088, China; National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Contextual-LAS (CLAS) has been shown effective in improving Automatic Speech Recognition (ASR) of rare words. It relies on phrase-level contextual modeling and attention-based relevance scoring without explicit contex...
Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
arXiv, 2022
Authors: Peng, Yukun; Ling, Zhenhua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper presents a method of decoupled pronunciation and prosody modeling to improve the performance of meta-learning-based multilingual speech synthesis. The baseline meta-learning synthesis method adopts a single...
NEURAL SPEECH PHASE PREDICTION BASED ON PARALLEL ESTIMATION ARCHITECTURE AND ANTI-WRAPPING LOSSES
arXiv, 2022
Authors: Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper presents a novel speech phase prediction model which predicts wrapped phase spectra directly from amplitude spectra by neural networks. The proposed model is a cascade of a residual convolutional network an...
Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; Alibaba Group, China; ICT Cluster, Singapore Institute of Technology, Singapore
In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD). GeCo exploits a Predictive AutoEncoder (PAE) equipped with self-attention as a ge...
JOINT GENERATIVE-CONTRASTIVE REPRESENTATION LEARNING FOR ANOMALOUS SOUND DETECTION
arXiv, 2023
Authors: Zeng, Xiao-Min; Song, Yan; Zhuo, Zhu; Zhou, Yu; Li, Yu-Hong; Xue, Hui; Dai, Li-Rong; McLoughlin, Ian
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; ICT Cluster, Singapore Institute of Technology, Singapore; Alibaba Group, China
In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD). GeCo exploits a Predictive AutoEncoder (PAE) equipped with self-attention as a ge...
Can Automated Speech Recognition Errors Provide Valuable Clues for Alzheimer’s Disease Detection?
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Yin-Long Liu, Rui Feng, Ye-Xin Lu, Jia-Xin Chen, Yang Ai, Jia-Hong Yuan, Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China; Interdisciplinary Research Center for Linguistic Sciences, University of Science and Technology of China, Hefei, P. R. China
Recent advances in automatic speech recognition (ASR) technology have boosted the viability of fully automated Alzheimer’s disease (AD) detection via ASR transcripts. However, there is a lack of understanding of how ...