Search criteria: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing results 211-220
Wav2f0: Exploring the Potential of Wav2vec 2.0 for Speech Fundamental Frequency Extraction
International Symposium on Chinese Spoken Language Processing
Authors: Rui Feng, Yin-Long Liu, Zhen-Hua Ling, Jia-Hong Yuan
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China; Interdisciplinary Research Center for Linguistic Sciences, University of Science and Technology of China, Hefei, P. R. China
Speech fundamental frequency (F0) extraction is one of the most important tasks in speech signal processing. This paper aims to explore the feasibility of using deep learning for speech fundamental frequency extractio...
WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition
2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Authors: Ma, Jun-Yu; Chen, Beiduo; Gu, Jia-Chen; Ling, Zhen-Hua; Guo, Wu; Liu, Quan; Chen, Zhigang; Liu, Cong
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; State Key Laboratory of Cognitive Intelligence, China; iFLYTEK Research, Hefei, China; Jilin Kexun Information Technology Co. Ltd, China
Zero-shot cross-lingual named entity recognition (NER) aims at transferring knowledge from annotated and rich-resource data in source languages to unlabeled and lean-resource data in target languages. Existing mainstr...
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Fuxi AI Lab, NetEase Inc., Hangzhou, China
Voice conversion for highly expressive speech is challenging. Current approaches struggle with the balance between speaker similarity, intelligibility, and expressiveness. To address this problem, we propose Expressiv...
HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS
IEEE Workshop on Automatic Speech Recognition and Understanding
Authors: Dake Guo, Xinfa Zhu, Liumeng Xue, Tao Li, Yuanjun Lv, Yuepeng Jiang, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China
Recent advances in text-to-speech, particularly those based on Graph Neural Networks (GNNs), have significantly improved the expressiveness of short-form synthetic speech. However, generating human-parity long-form sp...
Entity Linking in the Job Market Domain
arXiv, 2024
Authors: Zhang, Mike; van der Goot, Rob; Plank, Barbara
Affiliations: Department of Computer Science, IT University of Copenhagen, Denmark; Pioneer Centre for Artificial Intelligence, Copenhagen, Denmark; MaiNLP, Center for Information and Language Processing, LMU Munich, Germany; Munich, Germany
In Natural Language Processing, entity linking (EL) has centered around Wikipedia, but remains underexplored for the job market domain. Disambiguating skill mentions can help us to get insight into the labor market de...
Distance-Based Weight Transfer for Fine-Tuning From Near-Field to Far-Field Speaker Verification
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University (NPU), Xi’an, China; Tencent Ethereal Audio Lab, Tencent Corporation, Shenzhen, China
The scarcity of labeled far-field speech is a constraint for training superior far-field speaker verification systems. In general, fine-tuning the model pre-trained on large-scale near-field speech through a small am...
Few-Shot Keyword Spotting from Mixed Speech
arXiv, 2024
Authors: Yuan, Junming; Shi, Ying; Li, LanTian; Wang, Dong; Hamdulla, Askar
Affiliations: School of Computer Science and Technology, Xinjiang University, China; School of Artificial Intelligence, Beijing University of Posts and Telecommunications, China; Center for Speech and Language Technologies, BNRist, Tsinghua University, China; School of Computer Science and Technology, Harbin Institute of Technology, China
Few-shot keyword spotting (KWS) aims to detect unknown keywords with limited training samples. A commonly used approach is the pre-training and fine-tuning framework. While effective in clean conditions, this approach...
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
arXiv, 2025
Authors: Dong, Jinwei; Wang, Xinsheng; Mao, Qirong
Affiliations: School of Computer Science and Communication Engineering, Jiangsu University, China; Jiangsu Engineering Research Center of Big Data Ubiquitous Perception and Intelligent Agriculture Applications, China; Provincial Key Laboratory of Computational Intelligence and New Technologies in Low-Altitude Digital Agriculture, Zhenjiang, China; Audio Speech and Language Processing Group, School of Computer Science, Northwestern Polytechnical University, Xi’an, China
Generative models have attracted considerable attention for speech separation tasks, and among these, diffusion-based methods are being explored. Despite the notable success of diffusion techniques in generation tasks...
A Semantics-Aware Normalizing Flow Model for Anomaly Detection
IEEE International Conference on Multimedia and Expo (ICME)
Authors: Wei Ma, Shiyong Lan, Weikang Huang, Wenwu Wang, Hongyu Yang, Yitong Ma, Yongjie Ma
Affiliations: College of Computer Science, Sichuan University, China; National Key Laboratory of Fundamental Science on Synthetic Vision, China; Center for Vision, Speech and Signal Processing, University of Surrey, UK
Anomaly detection in computer vision aims to detect outliers from input image data. Examples include texture defect detection and semantic discrepancy detection. However, existing methods are limited in detecting both...
Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; Alibaba Group, China; ICT Cluster, Singapore Institute of Technology, Singapore
In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD). GeCo exploits a Predictive AutoEncoder (PAE) equipped with self-attention as a ge...