Search criteria: "Institution=Center for Language and Speech Processing and Computer Science"
831 records; showing results 51-60

Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension
39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
Authors: Wang, Chenxu; Jian, Ping; Yang, Zhen
Affiliations: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China; Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing Institute of Technology, Beijing, China
Logical reading comprehension is a challenging task that involves understanding the underlying semantics of text and applying reasoning to deduce the correct answer. Prior researches have primarily focused on enhancin...

CSDNet: cross-sketch with dual gated attention for fine-grained image captioning network
Multimedia Tools and Applications, 2024, pp. 1-28
Authors: Hossain, Md. Shamim; Aktar, Shamima; Hossen, Md. Bipul; Hossain, Mohammad Alamgir; Gu, Naijie; Huang, Zhangjin
Affiliations: School of Computer Science and Technology, University of Science and Technology of China, Anhui, Hefei 230027, China; Deqing Alpha Innovation Institute, Huzhou 313299, China; Department of Mathematics, Jashore University of Science and Technology, Jashore 7408, Bangladesh; Department of Statistics, Begum Rokeya University, Rangpur 5404, Bangladesh; National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Anhui, Hefei 230027, China
In the realm of extracting inter and intra-modal interactions, contemporary models often face challenges such as reduced computational efficiency, particularly when dealing with lengthy visual sequences. To address th...

Anchored Monotonic Alignment and Representation Substitution for Rare Spontaneous Behaviors in Spontaneous Speech Synthesis
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Wu, Ning-Qian; Hu, Ya-Jun; Chen, Liping; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China; iFLYTEK Research, iFLYTEK Co. Ltd., China; MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, China
Spontaneous behaviors in speech pose significant challenges for speech synthesis. Existing research has not adequately addressed these behaviors, with most studies relying on specially recorded datasets. In contrast, ...

MULTI-CROSSRE: A Multi-Lingual Multi-Domain Dataset for Relation Extraction
24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023
Authors: Bassignana, Elisa; Ginter, Filip; Pyysalo, Sampo; van der Goot, Rob; Plank, Barbara
Affiliations: Department of Computer Science, IT University of Copenhagen, Denmark; TurkuNLP, Department of Computing, University of Turku, Finland; MaiNLP, Center for Information and Language Processing, LMU Munich, Germany
Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources. We propose MULTI-CROSSRE, the broadest multi-lingual dataset for RE, including 26 languages i...

A Streamable Neural Audio Codec with Residual Scalar-Vector Quantization for Real-Time Communication
arXiv, 2025
Authors: Jiang, Xiao-Hang; Ai, Yang; Zheng, Rui-Chen; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China
This paper proposes StreamCodec, a streamable neural audio codec designed for real-time communication. StreamCodec adopts a fully causal, symmetric encoder-decoder structure and operates in the modified discrete cosin...

Subject Disentanglement Neural Network for Speech Envelope Reconstruction from EEG
IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Authors: Li Zhang; Jiyao Liu; Lei Xie
Affiliations: Audio Speech and Language Processing Group (ASLP), School of Computer Science, Northwestern Polytechnical University (NPU), Xi’an, China
Reconstructing speech envelopes from EEG signals is essential for exploring neural mechanisms underlying speech perception. Yet, EEG variability across subjects and physiological artifacts complicate accurate reconstr...

STAGE-WISE AND PRIOR-AWARE NEURAL SPEECH PHASE PREDICTION
arXiv, 2024
Authors: Liu, Fei; Ai, Yang; Du, Hui-Peng; Lu, Ye-Xin; Zheng, Rui-Chen; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper proposes a novel Stage-wise and Prior-aware Neural Speech Phase Prediction (SP-NSPP) model, which predicts the phase spectrum from input amplitude spectrum by two-stage neural networks. In the initial prior...

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm
arXiv, 2024
Authors: Du, Hui-Peng; Ai, Yang; Zheng, Rui-Chen; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper proposes a novel neural audio codec, named APCodec+, which is an improved version of APCodec. The APCodec+ takes the audio amplitude and phase spectra as the coding object, and employs an adversarial traini...

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
arXiv, 2024
Authors: Jiang, Xiao-Hang; Ai, Yang; Zheng, Rui-Chen; Du, Hui-Peng; Lu, Ye-Xin; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
In this paper, we propose MDCTCodec, an efficient lightweight end-to-end neural audio codec based on the modified discrete cosine transform (MDCT). The encoder takes the MDCT spectrum of audio as input, encoding it in...

Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
arXiv, 2024
Authors: Lu, Ye-Xin; Ai, Yang; Sheng, Zheng-Yan; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
The majority of existing speech bandwidth extension (BWE) methods operate under the constraint of fixed source and target sampling rates, which limits their flexibility in practical applications. In this paper, we pro...