Refine search results

Document type

  • 528 conference papers
  • 297 journal articles
  • 3 books

Collection scope

  • 828 electronic documents
  • 0 print holdings

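Subject classification

  • 520: Engineering
    • 387: Computer Science and Technology...
    • 336: Software Engineering
    • 142: Information and Communication Engineering
    • 56: Bioengineering
    • 45: Control Science and Engineering
    • 40: Electronic Science and Technology (may...
    • 35: Instrument Science and Technology
    • 33: Chemical Engineering and Technology
    • 30: Electrical Engineering
    • 21: Biomedical Engineering (may be awarded...
    • 16: Mechanical Engineering
    • 16: Optical Engineering
    • 7: Architecture
    • 6: Materials Science and Engineering (may...
  • 291: Science
    • 167: Physics
    • 118: Mathematics
    • 62: Biology
    • 55: Statistics (may be awarded in science, ...
    • 31: Chemistry
    • 18: Systems Science
  • 120: Management
    • 79: Library, Information and Archives Management...
    • 45: Management Science and Engineering (may...
    • 15: Business Administration
  • 15: Law
    • 13: Sociology
  • 15: Medicine
    • 13: Clinical Medicine
    • 10: Basic Medicine (may be awarded in medicine...
    • 8: Pharmacy (may be awarded in medicine, science...
  • 12: Literature
    • 8: Chinese Language and Literature
    • 8: Foreign Languages and Literatures
  • 10: Agronomy
    • 7: Crop Science
  • 4: Education
  • 3: Economics
  • 3: Art Studies
  • 1: Military Science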

Topics

  • 77: speech recogniti...
  • 73: training
  • 50: acoustics
  • 46: speech processin...
  • 44: speech
  • 33: hidden markov mo...
  • 31: signal processin...
  • 29: feature extracti...
  • 26: decoding
  • 23: speech enhanceme...
  • 21: computational mo...
  • 20: speech synthesis
  • 20: linguistics
  • 19: predictive model...
  • 18: data models
  • 17: neural networks
  • 17: natural language...
  • 16: accuracy
  • 15: conferences
  • 15: training data

Institutions

  • 70: national enginee...
  • 55: school of comput...
  • 47: audio speech and...
  • 42: beijing engineer...
  • 27: department of co...
  • 25: center for langu...
  • 21: department of co...
  • 18: mainlp center fo...
  • 18: department of co...
  • 15: audio speech and...
  • 14: iflytek research
  • 14: national enginee...
  • 12: munich
  • 11: department of co...
  • 10: center for infor...
  • 10: ict cluster sing...
  • 10: audio speech and...
  • 9: center for infor...
  • 9: department of co...
  • 9: center for speec...

Authors

  • 71: lei xie
  • 54: ling zhen-hua
  • 37: huang heyan
  • 32: ai yang
  • 23: plank barbara
  • 21: zhen-hua ling
  • 18: zheng thomas fan...
  • 18: yarowsky david
  • 18: thomas fang zhen...
  • 18: yang ai
  • 17: wang dong
  • 17: heyan huang
  • 17: khudanpur sanjee...
  • 16: lu ye-xin
  • 15: pengcheng guo
  • 15: gu jia-chen
  • 15: van der goot rob
  • 14: du jun
  • 14: mao xian-ling
  • 14: xie lei

Language

  • 739: English
  • 84: Other
  • 8: Chinese

Search query: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing 101-110

Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Qian Wang, Jia-Chen Gu, Zhen-Hua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China
Audio-text retrieval (ATR), which retrieves a relevant caption given an audio clip (A2T) and vice versa (T2A), has recently attracted much research attention. Existing methods typically aggregate information from each...

Considering Temporal Connection between Turns for Conversational Speech Synthesis
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Kangdi Mei, Zhaoci Liu, Huipeng Du, Hengyu Li, Yang Ai, Liping Chen, Zhenhua Ling
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, P.R. China
Conversational speech synthesis aims to synthesize speech of an individual speaker based on history conversation. However, most studies in conversational speech synthesis only focus on the synthesis performance of the...

Privacy-Preserving Blockchain-Based Solutions in the Internet of Things
6th EAI International Conference on Science and Technologies for Smart Cities, SmartCity 2020
Authors: Zapoglou, Nikolaos; Patsakos, Ioannis; Drosatos, George; Rantos, Konstantinos
Affiliations: Department of Computer Science, International Hellenic University, Kavala, Greece; Institute for Language and Speech Processing, Athena Research Center, Xanthi, Greece
Internet of Things (IoT) is a promising, relatively new technology that develops "smart" networks with a variety of uses and applications (e.g., smart cities, smart home and autonomous cars). The diversity o...

An Exploratory Approach to the Corpus Filtering Shared Task WMT20
5th Conference on Machine Translation, WMT 2020
Authors: Kejriwal, Ankur; Koehn, Philipp
Affiliations: Department of Computer Science, Johns Hopkins University, United States; Center for Language and Speech Processing, Johns Hopkins University, United States
This document describes an exploratory look into the Parallel Corpus Filtering Shared Task in WMT20. We submitted scores for both Pashto-English and Khmer-English systems combining multiple techniques like monolingual...

HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Bingshen Mu, Kun Wei, Qijie Shao, Yong Xu, Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China; Tencent AI Lab, Shenzhen, China
Recent advancements in integrating Large Language Models (LLM) with automatic speech recognition (ASR) have performed remarkably in general domains. While supervised fine-tuning (SFT) of all model parameters is often ...

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Pengcheng Guo, He Wang, Bingshen Mu, Ao Zhang, Peikun Chen
Affiliations: Audio Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China
This paper describes our NPU-ASLP system for the Audio-Visual Diarization and Recognition (AVDR) task in the Multi-modal Information based Speech Processing (MISP) 2022 Challenge. Specifically, the weighted prediction...

Is ChatGPT a Good Multi-Party Conversation Solver?
arXiv, 2023
Authors: Tan, Chao-Hong; Gu, Jia-Chen; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
Large Language Models (LLMs) have emerged as influential instruments within the realm of natural language processing; nevertheless, their capacity to handle multi-party conversations (MPCs) – a scenario marked by the ...

Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Pengfei Cai, Yan Song, Nan Jiang, Qing Gu, Ian McLoughlin
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, China; ICT Cluster, Singapore Institute of Technology, Singapore
A significant challenge in sound event detection (SED) is the effective utilization of unlabeled data, given the limited availability of labeled data due to high annotation costs. Semi-supervised algorithms rely on la...

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
arXiv, 2023
Authors: Lu, Ye-Xin; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper proposes MP-SENet, a novel Speech Enhancement Network which directly denoises Magnitude and Phase spectra in parallel. The proposed MP-SENet adopts a codec architecture in which the encoder and decoder are ...

SPEECH RECONSTRUCTION FROM SILENT TONGUE AND LIP ARTICULATION BY PSEUDO TARGET GENERATION AND DOMAIN ADVERSARIAL TRAINING
arXiv, 2023
Authors: Zheng, Rui-Chen; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper studies the task of speech reconstruction from ultrasound tongue images and optical lip videos recorded in a silent speaking mode, where people only activate their intra-oral and extra-oral articulators wit...