Search query: Institution = "Center for Language and Speech Processing and Computer Science"
828 records; showing 141-150
Two-Step Band-Split Neural Network Approach For Full-Band Residual Echo Suppression
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zihan Zhang, Shimin Zhang, Mingshuai Liu, Yanhong Leng, Zhe Han, Li Chen, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; ByteDance, China
This paper describes a Two-step Band-split Neural Network (TBNN) approach for full-band acoustic echo cancellation. Specifically, after linear filtering, we split the full-band signal into wideband (16 kHz) and high-ba...
Delivering Speaking Style in Low-Resource Voice Conversion with Multi-Factor Constraints
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zhichao Wang, Xinsheng Wang, Lei Xie, Yuanzhe Chen, Qiao Tian, Yuping Wang
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Speech Audio & Music Intelligence (SAMI), ByteDance
Conveying the linguistic content and maintaining the source speech’s speaking style, such as intonation and emotion, is essential in voice conversion (VC). However, in a low-resource situation, where only limited utt...
VATEX2020: PLSTM framework for video captioning
2022 International Conference on Machine Learning and Data Engineering, ICMLDE 2022
Authors: Singh, Alok; Singh, Salam Michael; Meetei, Loitongbam Sanayai; Das, Ringki; Singh, Thoudam Doren; Bandyopadhyay, Sivaji
Affiliations: Department of Computer Science and Engineering, National Institute of Technology, Assam, Silchar, India; Center for Natural Language Processing, National Institute of Technology, Assam, Silchar, India
Captioning a video involves condensing the video's information into text, which can be useful in video sentiment analysis, video-guided machine translation (VMT), visual question-answering and humanitarian aid. Th...
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Microsoft Corporation
Direct speech-to-speech translation (S2ST) is an attractive research topic with many advantages compared to cascaded S2ST. However, direct S2ST suffers from the data scarcity problem because the corpora from the speec...
Clever Hans Effect Found in Automatic Detection of Alzheimer’s Disease through Speech
arXiv, 2024
Authors: Liu, Yin-Long; Feng, Rui; Yuan, Jiahong; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; Interdisciplinary Research Center for Linguistic Sciences, University of Science and Technology of China, Hefei, China
We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer’s Disease (AD) detection research. E...
Distinguishable Speaker Anonymization Based on Formant and Fundamental Frequency Scaling
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Jixun Yao, Qing Wang, Yi Lei, Pengcheng Guo, Lei Xie, Namin Wang, Jie Liu
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Huawei Cloud
Speech data on the Internet are proliferating exponentially because of the emergence of social media, and the sharing of such personal data raises obvious security and privacy concerns. One solution to mitigate these ...
An Exploration of Task-Decoupling on Two-Stage Neural Post Filter for Real-Time Personalized Acoustic Echo Cancellation
IEEE Workshop on Automatic Speech Recognition and Understanding
Authors: Zihan Zhang, Jiayao Sun, Xianjun Xia, Ziqian Wang, Xiaopeng Yan, Yijian Xiao, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; ByteDance, China
Deep learning based techniques have been popularly adopted in acoustic echo cancellation (AEC). Utilization of speaker representation has extended the frontier of AEC, thus attracting many researchers’ interest in pe...
Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language
arXiv, 2025
Authors: Abu, Turi; Shi, Ying; Zheng, Thomas Fang; Wang, Dong
Affiliations: Center for Speech and Language Technologies, BNRist, Beijing, China; Department of Computer Science and Technology, Tsinghua University, Beijing, China; School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
We present a novel Automatic Speech Recognition (ASR) dataset for the Oromo language, a widely spoken language in Ethiopia and neighboring regions. The dataset was collected through a crowd-sourcing initiative, encomp...
Perturbation-Restrained Sequential Model Editing
arXiv, 2024
Authors: Ma, Jun-Yu; Wang, Hong; Xu, Hao-Xiang; Ling, Zhen-Hua; Gu, Jia-Chen
Affiliations: University of Science and Technology of China, China; National Engineering Research Center of Speech and Language Information Processing, China; University of California, Los Angeles, United States
Model editing is an emerging field that focuses on updating the knowledge embedded within large language models (LLMs) without extensive retraining. However, current model editing methods significantly compromise the ...
Incident Task Sequence for Service Priority using Cosine Similarity
1st International Conference on Technology Innovation and Its Applications, ICTIIA 2022
Authors: Boonprapapan, Teratam; Horata, Punyaphol; Seresangtakul, Pusadee
Affiliations: Natural Language and Speech Processing Laboratory, College of Computing, Khon Kaen University, Department of Computer Science, Khon Kaen 40002, Thailand; Advanced Smart Computing Laboratory, College of Computing, Khon Kaen University, Department of Computer Science, Khon Kaen 40002, Thailand
The article herein details a procedure for classifying service cases by priority level based on the service level agreement (SLA) between an organization and the customer. The main factor in the article's publicat...