Refine search results

Document type

  • 528 conference papers
  • 297 journal articles
  • 3 books

Collection scope

  • 828 electronic documents
  • 0 print holdings

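Subject classification

  • 520 records: Engineering
    • 387 records: Computer Science and Technology...
    • 336 records: Software Engineering
    • 142 records: Information and Communication Engineering
    • 56 records: Bioengineering
    • 45 records: Control Science and Engineering
    • 40 records: Electronic Science and Technology (...
    • 35 records: Instrument Science and Technology
    • 33 records: Chemical Engineering and Technology
    • 30 records: Electrical Engineering
    • 21 records: Biomedical Engineering (awardable in...
    • 16 records: Mechanical Engineering
    • 16 records: Optical Engineering
    • 7 records: Architecture
    • 6 records: Materials Science and Engineering (...
  • 291 records: Science
    • 167 records: Physics
    • 118 records: Mathematics
    • 62 records: Biology
    • 55 records: Statistics (awardable in Science, ...
    • 31 records: Chemistry
    • 18 records: Systems Science
  • 120 records: Management
    • 79 records: Library, Information and Archival Manag...
    • 45 records: Management Science and Engineering (...
    • 15 records: Business Administration
  • 15 records: Law
    • 13 records: Sociology
  • 15 records: Medicine
    • 13 records: Clinical Medicine
    • 10 records: Basic Medicine (awardable in Medicine...
    • 8 records: Pharmacy (awardable in Medicine, ...
  • 12 records: Literature
    • 8 records: Chinese Language and Literature
    • 8 records: Foreign Languages and Literatures
  • 10 records: Agriculture
    • 7 records: Crop Science
  • 4 records: Education
  • 3 records: Economics
  • 3 records: Art Studies
  • 1 record: Military Science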

Topics

  • 77 records: speech recogniti...
  • 73 records: training
  • 50 records: acoustics
  • 46 records: speech processin...
  • 44 records: speech
  • 33 records: hidden markov mo...
  • 31 records: signal processin...
  • 29 records: feature extracti...
  • 26 records: decoding
  • 23 records: speech enhanceme...
  • 21 records: computational mo...
  • 20 records: speech synthesis
  • 20 records: linguistics
  • 19 records: predictive model...
  • 18 records: data models
  • 17 records: neural networks
  • 17 records: natural language...
  • 16 records: accuracy
  • 15 records: conferences
  • 15 records: training data

Institutions

  • 70 records: national enginee...
  • 55 records: school of comput...
  • 47 records: audio speech and...
  • 42 records: beijing engineer...
  • 27 records: department of co...
  • 25 records: center for langu...
  • 21 records: department of co...
  • 18 records: mainlp center fo...
  • 18 records: department of co...
  • 15 records: audio speech and...
  • 14 records: iflytek research
  • 14 records: national enginee...
  • 12 records: munich
  • 11 records: department of co...
  • 10 records: center for infor...
  • 10 records: ict cluster sing...
  • 10 records: audio speech and...
  • 9 records: center for infor...
  • 9 records: department of co...
  • 9 records: center for speec...

Authors

  • 71 records: lei xie
  • 54 records: ling zhen-hua
  • 37 records: huang heyan
  • 32 records: ai yang
  • 23 records: plank barbara
  • 21 records: zhen-hua ling
  • 18 records: zheng thomas fan...
  • 18 records: yarowsky david
  • 18 records: thomas fang zhen...
  • 18 records: yang ai
  • 17 records: wang dong
  • 17 records: heyan huang
  • 17 records: khudanpur sanjee...
  • 16 records: lu ye-xin
  • 15 records: pengcheng guo
  • 15 records: gu jia-chen
  • 15 records: van der goot rob
  • 14 records: du jun
  • 14 records: mao xian-ling
  • 14 records: xie lei

Language

  • 739 records: English
  • 84 records: Other
  • 8 records: Chinese
Search query: "Institution=Center for Language and Speech Processing and Computer Science"
828 records; showing 1-10
Integrating Time-Frequency Domain Shallow and Deep Features for Speech-EEG Match-Mismatch of Auditory Attention Decoding
Journal of Shanghai Jiaotong University (Science), 2025, pp. 1-7
Authors: Zhang, Yubang; Zhu, Qiushi; Xu, Qingtian; Zhang, Jie
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei 230026, China
Electroencephalogram (EEG) signals provide an important pathway to reflect brain activations, from which auditory attention clues of the listener can be decoded, termed as auditory attention decoding (AAD). However, e...
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm
14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024
Authors: Du, Hui-Peng; Ai, Yang; Zheng, Rui-Chen; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
This paper proposes a novel neural audio codec, named APCodec+, which is an improved version of APCodec. The APCodec+ takes the audio amplitude and phase spectra as the coding object, and employs an adversarial traini...
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra
18th National Conference on Man-Machine Speech Communication, NCMMSC 2023
Authors: Du, Hui-Peng; Lu, Ye-Xin; Ai, Yang; Ling, Zhen-Hua
Affiliations: National Engineering Research Center of Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
In our previous work, we have proposed a neural vocoder called APNet, which directly predicts speech amplitude and phase spectra with a 5 ms frame shift in parallel from the input acoustic features, and then reconstru...
Which is more faithful, seeing or saying? Multimodal sarcasm detection exploiting contrasting sentiment knowledge
CAAI Transactions on Intelligence Technology, 2025, vol. 10, no. 2, pp. 375-386
Authors: Yutao Chen; Shumin Shi; Heyan Huang
Affiliations: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China; Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing, China
Using sarcasm on social media platforms to express negative opinions towards a person or object has become increasingly ***, detecting sarcasm in various forms of communication can be difficult due to conflicting *** t...
Automating Sound Change Prediction for Phylogenetic Inference: A Tukanoan Case Study
4th International Workshop on Computational Approaches to Historical Language Change, LChange 2023
Authors: Chang, Kalvin; Robinson, Nathaniel R.; Cai, Anna; Chen, Ting; Zhang, Annie; Mortensen, David R.
Affiliations: School of Computer Science, Carnegie Mellon University, United States; Center for Language and Speech Processing, Johns Hopkins University, United States
We describe a set of new methods to partially automate linguistic phylogenetic inference given (1) cognate sets with their respective protoforms and sound laws, (2) a mapping from phones to their articulatory features...
System 1 Description of BV-SLP for Sindhi-English Machine Translation in MultiIndic22MT 2024 Shared Task
9th Conference on Machine Translation, WMT 2024
Authors: Joshi, Nisheeth; Katyayan, Pragya; Arora, Palak; Nathani, Bharti
Affiliations: Speech and Language Processing Lab, Banasthali Vidyapith, Rajasthan, India; School of Computer Science, University of Petroleum and Energy Studies, Uttarakhand, India
This paper presents our machine translation system that was developed for the WAT2024 MultiIndic MT shared task. We built our system for the Sindhi-English language pair. We developed two MT systems. The first system ...
Improved G723.1 Codec Speech Quality Under Burst Packet Loss Conditions
5th International Conference on Electrical Engineering and Control Applications, ICEECA 2022
Authors: Bakri, Adil; Mahdjane, Karima; Amrouche, Abderrahmane; Krobba, Ahmed
Affiliations: Scientific Research and Technical Center for the Development of Arabic Language (CRSTDLA), Algiers, Algeria; Speech Communication and Signal Processing Laboratory, Faculty of Electronics and Computer Science, USTHB, Algiers, Algeria
In this paper, a Packet Loss Concealment (PLC) algorithm is proposed for G723.1 CELP-type speech coders in order to improve the quality of decoded speech in VoIP under burst packet loss. The original PLC method implem...
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Image Based Voice Control
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Sheng, Zheng-Yan; Ai, Yang; Ling, Zhen-Hua
Affiliations: University of Science and Technology of China, National Engineering Research Center of Speech and Language Information Processing, Hefei, China
Lip-to-speech (Lip2Speech) synthesis, which predicts corresponding speech from talking face images, has witnessed significant progress with various models and training strategies in a series of independent studies. Ho...
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Ai, Yang; Ling, Zhen-Hua
Affiliations: University of Science and Technology of China, National Engineering Research Center of Speech and Language Information Processing, Hefei, China
This paper presents a novel speech phase prediction model which predicts wrapped phase spectra directly from amplitude spectra by neural networks. The proposed model is a cascade of a residual convolutional network an...
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024, vol. 32, pp. 286-299
Authors: Paraskevopoulos, Georgios; Kouzelis, Theodoros; Rouvalis, Georgios; Katsamanis, Athanasios; Katsouros, Vassilis; Potamianos, Alexandros
Affiliations: National Technical University of Athens, Graduate School of Electrical and Computer Engineering, Athens 10682, Greece; Athena Research Center, Institute for Speech and Language Processing, Marousi 15125, Greece; National Technical University of Athens, Faculty of Electrical and Computer Engineering, Athens 10682, Greece
Modern speech recognition systems exhibit rapid performance degradation under domain shift. This issue is especially prevalent in data-scarce settings, such as low-resource languages, where the diversity of training d...