Search query: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing 131-140
Joint Lemmatization and Morphological Tagging with LEMMING
arXiv, 2024
Authors: Müller, Thomas; Cotterell, Ryan; Fraser, Alexander; Schütze, Hinrich
Affiliations: Center for Information and Language Processing, University of Munich, Germany; Department of Computer Science, Johns Hopkins University, United States
We present LEMMING, a modular log-linear model that jointly models lemmatization and tagging and supports the integration of arbitrary global features. It is trainable on corpora annotated with gold standard tags and ...
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
arXiv, 2023
Authors: Zheng, Rui-Chen; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Audio-visual speech enhancement (AV-SE) aims to enhance degraded speech along with extra visual information such as lip videos, and has been shown to be more effective than audio-only speech enhancement. This paper pr...
Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation
arXiv, 2023
Authors: Ai, Yang; Lu, Ye-Xin; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Speech phase prediction, which is a significant research focus in the field of signal processing, aims to recover speech phase spectra from amplitude-related features. However, existing speech phase prediction methods...
Exploring Language-Agnostic Speech Representations Using Domain Knowledge for Detecting Alzheimer’s Dementia
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zehra Shah; Shi-Ang Qi; Fei Wang; Mahtab Farrokh; Mashrura Tasnim; Eleni Stroulia; Russell Greiner; Manos Plitsis; Athanasios Katsamanis
Affiliations: Department of Computing Science, University of Alberta, Edmonton, Canada; Institute for Language and Speech Processing, Athena Research Center, Greece
We explore ways to use speech data to screen for indications of Alzheimer’s dementia (AD). In particular, we describe our approach to the ICASSP 2023 Signal Processing Grand Challenge, which involves extrapolating fr...
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
arXiv, 2023
Authors: Lu, Ye-Xin; Ai, Yang; Ling, Zhen-Hua
Affiliations: The National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
Phase information has a significant impact on speech perceptual quality and intelligibility. However, existing speech enhancement methods encounter limitations in explicit phase estimation due to the non-structural na...
Language-Independent Prosody-Enhanced Speech Representations For Multilingual Speech Synthesis
IEEE Spoken Language Technology Workshop
Authors: Chang Liu; Zhen-Hua Ling; Ya-Jun Hu
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; iFLYTEK Co. Ltd., China
This paper proposes language-independent prosody-enhanced speech representations to improve the naturalness of speech synthesis for the target languages that lack prosodic labels. To build text-to-speech (TTS) systems...
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
arXiv, 2024
Authors: Blaschke, Verena; Kovačić, Barbara; Peng, Siyao; Schütze, Hinrich; Plank, Barbara
Affiliations: Center for Information and Language Processing, LMU Munich, Germany; Munich, Germany; Department of Computer Science, IT University of Copenhagen, Denmark
Despite the success of the Universal Dependencies (UD) project exemplified by its impressive language breadth, there is still a lack in ‘within-language breadth’: most treebanks focus on standard languages. Even for...
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection
arXiv, 2024
Authors: Cai, Pengfei; Song, Yan; Jiang, Nan; Gu, Qing; McLoughlin, Ian
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; ICT Cluster, Singapore Institute of Technology, Singapore
A significant challenge in sound event detection (SED) is the effective utilization of unlabeled data, given the limited availability of labeled data due to high annotation costs. Semi-supervised algorithms rely on la...
Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Turi Abu; Ying Shi; Thomas Fang Zheng; Dong Wang
Affiliations: Center for Speech and Language Technologies, BNRist, Beijing; Department of Computer Science and Technology, Tsinghua University, Beijing, China; School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
We present a novel Automatic Speech Recognition (ASR) dataset for the Oromo language, a widely spoken language in Ethiopia and neighboring regions. The dataset was collected through a crowdsourcing initiative, encompa...
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-Supervised Contrastive Learning
IEEE International Conference on Multimedia and Expo (ICME)
Authors: Xinfa Zhu; Yuke Li; Yi Lei; Ning Jiang; Guoqing Zhao; Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi’an, China; Mashang Consumer Finance Co. Ltd
This paper aims to build a multi-speaker expressive TTS system, synthesizing a target speaker’s speech with multiple styles and emotions. To this end, we propose a novel contrastive learning-based TTS approach to tra...