Search query: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing results 121-130

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation
arXiv, 2023
Authors: Zheng, Rui-Chen; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
Audio-visual speech enhancement (AV-SE) aims to enhance degraded speech along with extra visual information such as lip videos, and has been shown to be more effective than audio-only speech enhancement. This paper pr...

Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion
IEEE Spoken Language Technology Workshop
Authors: Yu-Fei Shi; Yang Ai; Ye-Xin Lu; Hui-Peng Du; Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China
We participated in track 2 of the VoiceMOS Challenge 2024, which aimed to predict the mean opinion score (MOS) of singing samples. Our submission secured the first place among all participating teams, excluding the of...

Automatic Channel Selection and Spatial Feature Integration for Multi-Channel Speech Recognition Across Various Array Topologies
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Bingshen Mu; Pengcheng Guo; Dake Guo; Pan Zhou; Wei Chen; Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China; Space AI, Li Auto
Automatic Speech Recognition (ASR) has shown remarkable progress, yet it still faces challenges in real-world distant scenarios across various array topologies each with multiple recording devices. The focal point of ...

MDCTCodec: A Lightweight MDCT-Based Neural Audio Codec Towards High Sampling Rate and Low Bitrate Scenarios
IEEE Spoken Language Technology Workshop
Authors: Xiao-Hang Jiang; Yang Ai; Rui-Chen Zheng; Hui-Peng Du; Ye-Xin Lu; Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China
In this paper, we propose MDCTCodec, an efficient lightweight end-to-end neural audio codec based on the modified discrete cosine transform (MDCT). The encoder takes the MDCT spectrum of audio as input, encoding it in...

Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction
arXiv, 2024
Authors: Shi, Jiayang; Zhu, Junyi; Pelt, Daniël M.; Joost Batenburg, K.; Blaschko, Matthew B.
Affiliations: Leiden Institute of Advanced Computer Science, Leiden University, Netherlands; Center for Processing Speech and Images, KU Leuven, Belgium
Computed Tomography (CT) is pivotal in industrial quality control and medical diagnostics. Sparse-view CT, offering reduced ionizing radiation, faces challenges due to its under-sampled nature, leading to ill-posed re...

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection
International Symposium on Chinese Spoken Language Processing
Authors: Yin-Long Liu; Rui Feng; Jia-Hong Yuan; Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei; Interdisciplinary Research Center for Linguistic Sciences, University of Science and Technology of China, Hefei
Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Pre...

CoUDA: Coherence Evaluation via Unified Data Augmentation
2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Authors: Zhu, Dawei; Wu, Wenhao; Song, Yifan; Zhu, Fangwei; Cao, Ziqiang; Li, Sujian
Affiliations: School of Computer Science, Peking University, China; National Key Laboratory for Multimedia Information Processing, Peking University, China; Institute of Artificial Intelligence, Soochow University, China; Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, China
Coherence evaluation aims to assess the organization and structure of a discourse, which remains challenging even in the era of large language models. Due to the scarcity of annotated data, data augmentation is common...

PNP-RKD: A Positive-Negative Pair based Relational Knowledge Distillation Method for Cross-Domain Speaker Verification
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Qing Gu; Yan Song; Nan Jiang; Pengfei Cai; Ian McLoughlin
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; ICT Cluster, Singapore Institute of Technology, Singapore
Existing deep embedding learning based speaker verification (SV) methods suffer from performance degradation under domain shift conditions. This can be alleviated through unsupervised domain adaptation (UDA) technique...

Corrective Retrieval Augmented Generation
arXiv, 2024
Authors: Yan, Shi-Qi; Gu, Jia-Chen; Zhu, Yun; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; Department of Computer Science, University of California, Los Angeles, United States; Google DeepMind, United Kingdom
Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG)...

HPCNet: Hybrid Pixel and Contour Network for Audio-Visual Speech Enhancement with Low-Quality Video
IEEE Journal on Selected Topics in Signal Processing, 2025
Authors: Chen, Hang; Zhang, Chen-Yue; Wang, Qing; Du, Jun; Siniscalchi, Sabato Marco; Xiong, Shi-Fu; Wan, Gen-Shun
Affiliations: University of Science and Technology of China, National Engineering Research Center of Speech and Language Information Processing, Anhui, Hefei, China; University of Palermo, Palermo, Italy; IFlytek Research, Anhui, Hefei, China
To advance audio-visual speech enhancement (AVSE) research in low-quality video settings, we introduce the multimodal information-based speech processing-low quality video (MISP-LQV) benchmark, which includes a 120-ho...