Search query: "Institution = Center for Research in Speech and Language Processing"
360 records; showing 81-90
Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection
International Symposium on Chinese Spoken Language Processing
Authors: Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei; Interdisciplinary Research Center for Linguistic Sciences, University of Science and Technology of China, Hefei
Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Pre...

PNP-RKD: A Positive-Negative Pair based Relational Knowledge Distillation Method for Cross-Domain Speaker Verification
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Qing Gu, Yan Song, Nan Jiang, Pengfei Cai, Ian McLoughlin
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; ICT Cluster, Singapore Institute of Technology, Singapore
Existing deep embedding learning based speaker verification (SV) methods suffer from performance degradation under domain shift conditions. This can be alleviated through unsupervised domain adaptation (UDA) technique...

Exploring Language-Agnostic Speech Representations Using Domain Knowledge for Detecting Alzheimer’s Dementia
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zehra Shah, Shi-Ang Qi, Fei Wang, Mahtab Farrokh, Mashrura Tasnim, Eleni Stroulia, Russell Greiner, Manos Plitsis, Athanasios Katsamanis
Affiliations: Department of Computing Science, University of Alberta, Edmonton, Canada; Institute for Language and Speech Processing, Athena Research Center, Greece
We explore ways to use speech data to screen for indications of Alzheimer’s dementia (AD). In particular, we describe our approach to the ICASSP 2023 Signal Processing Grand Challenge, which involves extrapolating fr...

Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation
arXiv, 2023
Authors: Ai, Yang; Lu, Ye-Xin; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Speech phase prediction, which is a significant research focus in the field of signal processing, aims to recover speech phase spectra from amplitude-related features. However, existing speech phase prediction methods...

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
arXiv, 2023
Authors: Zheng, Rui-Chen; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Audio-visual speech enhancement (AV-SE) aims to enhance degraded speech along with extra visual information such as lip videos, and has been shown to be more effective than audio-only speech enhancement. This paper pr...

Language-Independent Prosody-Enhanced Speech Representations For Multilingual Speech Synthesis
IEEE Spoken Language Technology Workshop
Authors: Chang Liu, Zhen-Hua Ling, Ya-Jun Hu
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; iFLYTEK Co. Ltd., China
This paper proposes language-independent prosody-enhanced speech representations to improve the naturalness of speech synthesis for the target languages that lack prosodic labels. To build text-to-speech (TTS) systems...

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
arXiv, 2023
Authors: Lu, Ye-Xin; Ai, Yang; Ling, Zhen-Hua
Affiliations: The National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Phase information has a significant impact on speech perceptual quality and intelligibility. However, existing speech enhancement methods encounter limitations in explicit phase estimation due to the non-structural na...

Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection
arXiv, 2024
Authors: Cai, Pengfei; Song, Yan; Jiang, Nan; Gu, Qing; McLoughlin, Ian
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; ICT Cluster, Singapore Institute of Technology, Singapore
A significant challenge in sound event detection (SED) is the effective utilization of unlabeled data, given the limited availability of labeled data due to high annotation costs. Semi-supervised algorithms rely on la...

Adapted Multimodal BERT with Layer-Wise Fusion for Sentiment Analysis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Odysseas S. Chlapanis, Georgios Paraskevopoulos, Alexandros Potamianos
Affiliations: National Technical University of Athens, Athens, Greece; Athena Research Center, Institute for Language and Speech Processing, Athens, Greece
Multimodal learning pipelines have benefited from the success of pretrained language models. However, this comes at the cost of increased model parameters. In this work, we propose Adapted Multimodal BERT (AMB), a BER...

An investigation of phrase break prediction in an End-to-End TTS system
arXiv, 2023
Authors: Vadapalli, Anandaswarup
Affiliations: Speech Processing Lab, Language Technologies Research Center, International Institute of Information Technology, Telangana, Hyderabad 500032, India
Purpose: This work explores the use of external phrase break prediction models to enhance listener comprehension in End-to-End Text-to-Speech (TTS) systems. Methods: The effectiveness of these models is evaluated base...