咨询与建议

限定检索结果

文献类型

  • 528 篇 会议
  • 297 篇 期刊文献
  • 3 册 图书

馆藏范围

  • 828 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 520 篇 工学
    • 387 篇 计算机科学与技术...
    • 336 篇 软件工程
    • 142 篇 信息与通信工程
    • 56 篇 生物工程
    • 45 篇 控制科学与工程
    • 40 篇 电子科学与技术(可...
    • 35 篇 仪器科学与技术
    • 33 篇 化学工程与技术
    • 30 篇 电气工程
    • 21 篇 生物医学工程(可授...
    • 16 篇 机械工程
    • 16 篇 光学工程
    • 7 篇 建筑学
    • 6 篇 材料科学与工程(可...
  • 291 篇 理学
    • 167 篇 物理学
    • 118 篇 数学
    • 62 篇 生物学
    • 55 篇 统计学(可授理学、...
    • 31 篇 化学
    • 18 篇 系统科学
  • 120 篇 管理学
    • 79 篇 图书情报与档案管...
    • 45 篇 管理科学与工程(可...
    • 15 篇 工商管理
  • 15 篇 法学
    • 13 篇 社会学
  • 15 篇 医学
    • 13 篇 临床医学
    • 10 篇 基础医学(可授医学...
    • 8 篇 药学(可授医学、理...
  • 12 篇 文学
    • 8 篇 中国语言文学
    • 8 篇 外国语言文学
  • 10 篇 农学
    • 7 篇 作物学
  • 4 篇 教育学
  • 3 篇 经济学
  • 3 篇 艺术学
  • 1 篇 军事学

主题

  • 77 篇 speech recogniti...
  • 73 篇 training
  • 50 篇 acoustics
  • 46 篇 speech processin...
  • 44 篇 speech
  • 33 篇 hidden markov mo...
  • 31 篇 signal processin...
  • 29 篇 feature extracti...
  • 26 篇 decoding
  • 23 篇 speech enhanceme...
  • 21 篇 computational mo...
  • 20 篇 speech synthesis
  • 20 篇 linguistics
  • 19 篇 predictive model...
  • 18 篇 data models
  • 17 篇 neural networks
  • 17 篇 natural language...
  • 16 篇 accuracy
  • 15 篇 conferences
  • 15 篇 training data

机构

  • 70 篇 national enginee...
  • 55 篇 school of comput...
  • 47 篇 audio speech and...
  • 42 篇 beijing engineer...
  • 27 篇 department of co...
  • 25 篇 center for langu...
  • 21 篇 department of co...
  • 18 篇 mainlp center fo...
  • 18 篇 department of co...
  • 15 篇 audio speech and...
  • 14 篇 iflytek research
  • 14 篇 national enginee...
  • 12 篇 munich
  • 11 篇 department of co...
  • 10 篇 center for infor...
  • 10 篇 ict cluster sing...
  • 10 篇 audio speech and...
  • 9 篇 center for infor...
  • 9 篇 department of co...
  • 9 篇 center for speec...

作者

  • 71 篇 lei xie
  • 54 篇 ling zhen-hua
  • 37 篇 huang heyan
  • 32 篇 ai yang
  • 23 篇 plank barbara
  • 21 篇 zhen-hua ling
  • 18 篇 zheng thomas fan...
  • 18 篇 yarowsky david
  • 18 篇 thomas fang zhen...
  • 18 篇 yang ai
  • 17 篇 wang dong
  • 17 篇 heyan huang
  • 17 篇 khudanpur sanjee...
  • 16 篇 lu ye-xin
  • 15 篇 pengcheng guo
  • 15 篇 gu jia-chen
  • 15 篇 van der goot rob
  • 14 篇 du jun
  • 14 篇 mao xian-ling
  • 14 篇 xie lei

语言

  • 739 篇 英文
  • 84 篇 其他
  • 8 篇 中文
检索条件"机构=Center for Language and Speech Processing and Computer Science"
828 条 记 录,以下是151-160 订阅
排序:
EEVEE: An Easy Annotation Tool for Natural language processing
arXiv
收藏 引用
arXiv 2024年
作者: Sorensen, Axel Peng, Siyao Plank, Barbara van der Goot, Rob Department of Computer Science IT University of Copenhagen Denmark Munich Germany MaiNLP Center for Information and Language Processing LMU Munich Germany
Annotation tools are the starting point for creating Natural language processing (NLP) datasets. There is a wide variety of tools available;setting up these tools is however a hindrance. We propose EEVEE, an annotatio... 详细信息
来源: 评论
Leveraging Prompt Learning and Pause Encoding for Alzheimer’s Disease Detection
arXiv
收藏 引用
arXiv 2024年
作者: Liu, Yin-Long Feng, Rui Yuan, Jia-Hong Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China Interdisciplinary Research Center for Linguistic Sciences University of Science and Technology of China Hefei China
Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer’s disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previo... 详细信息
来源: 评论
DP-MAE: A Dual-Path Masked Autoencoder Based Self-Supervised Learning Method for Anomalous Sound Detection
DP-MAE: A Dual-Path Masked Autoencoder Based Self-Supervised...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Zhuo-Li Liu Yan Song Xiao-Min Zeng Li-Rong Dai Ian McLoughlin National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China ICT Cluster Singapore Institute of Technology Singapore
In this paper, we present a novel general-purpose audio representation learning method named Dual-Path Masked AutoEncoder (DPMAE) for anomalous sound detection (ASD) task. Existing methods mainly focus on frame-level ...
来源: 评论
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings
arXiv
收藏 引用
arXiv 2024年
作者: Xia, Kangxiang Guo, Dake Yao, Jixun Xue, Liumeng Li, Hanzhao Wang, Shuai Guo, Zhao Xie, Lei Zhang, Qingqing Luo, Lei Dong, Minghui Sun, Peng Audio Speech and Language Processing Group ASLP@NPU School of Computer Science Northwestern Polytechnical University Xi’an China China China Magic data China Singapore China Computer Federation China
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge aims to benchmark and advance zero-shot spontaneous style voice cloning, particularly focusing on generating spontaneous behaviors in conversational speech.... 详细信息
来源: 评论
SELM: speech Enhancement using Discrete Tokens and language Models
SELM: Speech Enhancement using Discrete Tokens and Language ...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Ziqian Wang Xinfa Zhu Zihan Zhang YuanJun Lv Ning Jiang Guoqing Zhao Lei Xie Audio Speech and Language Processing Group (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xian China Mashang Consumer Finance Co. Ltd.
language models (LMs) have recently shown superior performances in various speech generation tasks, demonstrating their powerful ability for semantic context modeling. Given the intrinsic similarity between speech gen...
来源: 评论
Are Clinical T5 Models Better for Clinical Text?
arXiv
收藏 引用
arXiv 2024年
作者: Li, Yahan Harrigian, Keith Zirikly, Ayah Dredze, Mark Department of Computer Science University of Southern California United States Department of Computer Science Johns Hopkins University United States Center for Language and Speech Processing Whiting School of Engineering Johns Hopkins University United States Malone Center for Engineering in Healthcare Johns Hopkins University United States
Large language models with a transformerbased encoder/decoder architecture, such as T5 (Raffel et al., 2023), have become standard platforms for supervised tasks. To bring these technologies to the clinical domain, re...
来源: 评论
Preserving Background Sound in Noise-Robust Voice Conversion Via Multi-Task Learning
Preserving Background Sound in Noise-Robust Voice Conversion...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Jixun Yao Yi Lei Qing Wang Pengcheng Guo Ziqian Ning Lei Xie Hai Li Junhui Liu Danming Xie Audio Speech and Language Processing Group (ASLP@NPU) School of Computer Science Northwestern Polytechnical University Xi’an China iQIYI Inc China
Background sound is an informative form of art that is helpful in providing a more immersive experience in real-application voice conversion (VC) scenarios. However, prior research about VC, mainly focusing on clean v... 详细信息
来源: 评论
Stargan-vc Based Cross-Domain Data Augmentation for Speaker Verification
Stargan-vc Based Cross-Domain Data Augmentation for Speaker ...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Hang-Rui Hu Yan Song Jian-Tao Zhang Li-Rong Dai Ian McLoughlin Zhu Zhuo Yu Zhou Yu-Hong Li Hui Xue National Engineering Research Center for Speech and Language Information Processing University of Science and Technology of China Hefei China Alibaba Group China
Automatic speaker verification (ASV) faces domain shift caused by the mismatch of intrinsic and extrinsic factors, such as recording device and speaking style, in real-world applications, which leads to severe perform... 详细信息
来源: 评论
Zero-Shot Personalized Lip-To-speech Synthesis with Face Image Based Voice Control
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Ima...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Zheng-Yan Sheng Yang Ai Zhen-Hua Ling National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei P.R. China
Lip-to-speech (Lip2speech) synthesis, which predicts corresponding speech from talking face images, has witnessed significant progress with various models and training strategies in a series of independent studies. Ho... 详细信息
来源: 评论
Neural speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses
Neural Speech Phase Prediction Based on Parallel Estimation ...
收藏 引用
International Conference on Acoustics, speech, and Signal processing (ICASSP)
作者: Yang Ai Zhen-Hua Ling National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei P.R.China
This paper presents a novel speech phase prediction model which predicts wrapped phase spectra directly from amplitude spectra by neural networks. The proposed model is a cascade of a residual convolutional network an... 详细信息
来源: 评论