咨询与建议

限定检索结果

文献类型

  • 232 篇 会议
  • 127 篇 期刊文献
  • 1 册 图书

馆藏范围

  • 360 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 219 篇 工学
    • 140 篇 计算机科学与技术...
    • 123 篇 软件工程
    • 88 篇 信息与通信工程
    • 28 篇 电子科学与技术(可...
    • 26 篇 仪器科学与技术
    • 21 篇 电气工程
    • 20 篇 生物工程
    • 18 篇 控制科学与工程
    • 15 篇 化学工程与技术
    • 13 篇 机械工程
    • 7 篇 建筑学
    • 6 篇 土木工程
    • 3 篇 光学工程
    • 3 篇 生物医学工程(可授...
  • 155 篇 理学
    • 114 篇 物理学
    • 56 篇 数学
    • 23 篇 生物学
    • 20 篇 统计学(可授理学、...
    • 15 篇 化学
    • 5 篇 系统科学
  • 52 篇 管理学
    • 37 篇 图书情报与档案管...
    • 18 篇 管理科学与工程(可...
    • 10 篇 工商管理
  • 13 篇 法学
    • 10 篇 社会学
    • 3 篇 法学
  • 7 篇 教育学
    • 6 篇 教育学
    • 4 篇 心理学(可授教育学...
  • 7 篇 文学
    • 7 篇 外国语言文学
    • 6 篇 中国语言文学
  • 3 篇 医学
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 2 篇 农学

主题

  • 59 篇 speech recogniti...
  • 38 篇 speech processin...
  • 26 篇 training
  • 21 篇 acoustics
  • 19 篇 signal processin...
  • 17 篇 natural language...
  • 17 篇 speech enhanceme...
  • 16 篇 automatic speech...
  • 15 篇 feature extracti...
  • 15 篇 robustness
  • 13 篇 speech
  • 12 篇 speech synthesis
  • 11 篇 error analysis
  • 10 篇 hidden markov mo...
  • 10 篇 predictive model...
  • 9 篇 decoding
  • 8 篇 training data
  • 8 篇 transformers
  • 8 篇 self-supervised ...
  • 8 篇 accuracy

机构

  • 68 篇 national enginee...
  • 18 篇 hitachi ltd. res...
  • 15 篇 institute for la...
  • 15 篇 center for langu...
  • 13 篇 center for langu...
  • 10 篇 iflytek research
  • 10 篇 institute for la...
  • 9 篇 department of in...
  • 9 篇 ict cluster sing...
  • 8 篇 robust speech pr...
  • 8 篇 national enginee...
  • 7 篇 university of sc...
  • 7 篇 iflytek research...
  • 7 篇 school of ece na...
  • 6 篇 robust speech pr...
  • 6 篇 state key labora...
  • 6 篇 institute for la...
  • 6 篇 national enginee...
  • 5 篇 university of sc...
  • 5 篇 ibm thomas j. wa...

作者

  • 51 篇 ling zhen-hua
  • 32 篇 ai yang
  • 21 篇 hansen john h.l.
  • 19 篇 zhen-hua ling
  • 17 篇 hansen john h. l...
  • 16 篇 watanabe shinji
  • 16 篇 lu ye-xin
  • 15 篇 yang ai
  • 14 篇 gu jia-chen
  • 14 篇 katsouros vassil...
  • 14 篇 potamianos alexa...
  • 14 篇 j.h.l. hansen
  • 14 篇 du hui-peng
  • 13 篇 fujita yusuke
  • 13 篇 paraskevopoulos ...
  • 13 篇 katsamanis athan...
  • 12 篇 androutsopoulos ...
  • 10 篇 horiguchi shota
  • 10 篇 shinji watanabe
  • 10 篇 zheng rui-chen

语言

  • 331 篇 英文
  • 29 篇 其他
检索条件"机构=Center for Research in Speech and Language Processing"
360 条 记 录,以下是1-10 订阅
排序:
Integrating Time-Frequency Domain Shallow and Deep Features for speech-EEG Match-Mismatch of Auditory Attention Decoding
收藏 引用
Journal of Shanghai Jiaotong University (Science) 2025年 1-7页
作者: Zhang, Yubang Zhu, Qiushi Xu, Qingtian Zhang, Jie National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei230026 China
Electroencephalogram (EEG) signals provide an important pathway to reflect brain activations, from which auditory attention clues of the listener can be decoded, termed as auditory attention decoding (AAD). However, e... 详细信息
来源: 评论
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm  14
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Com...
收藏 引用
14th International Symposium on Chinese Spoken language processing, ISCSLP 2024
作者: Du, Hui-Peng Ai, Yang Zheng, Rui-Chen Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes a novel neural audio codec, named APCodec+, which is an improved version of APCodec. The APCodec+ takes the audio amplitude and phase spectra as the coding object, and employs an adversarial traini... 详细信息
来源: 评论
APNet2: High-Quality and High-Efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra  1
收藏 引用
18th National Conference on Man-Machine speech Communication, NCMMSC 2023
作者: Du, Hui-Peng Lu, Ye-Xin Ai, Yang Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
In our previous work, we have proposed a neural vocoder called APNet, which directly predicts speech amplitude and phase spectra with a 5 ms frame shift in parallel from the input acoustic features, and then reconstru... 详细信息
来源: 评论
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights  6
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset fo...
收藏 引用
6th Natural Legal language processing Workshop 2024, NLLP 2024, co-located with the 2024 Conference on Empirical Methods in Natural language processing
作者: Chlapanis, Odysseas S. Galanis, Dimitrios Androutsopoulos, Ion Department of Informatics Athens University of Economics and Business Greece Institute for Language and Speech Processing Athena Research Center Greece Archimedes Unit Athena Research Center Greece
We present Legal Argument Reasoning (LAR), a novel task designed to evaluate the legal reasoning capabilities of Large language Models (LLMs). The task requires selecting the correct next statement (from multiple choi... 详细信息
来源: 评论
Multilingual Synthesis of Depictions through Structured Descriptions of Sign: An Initial Case Study  11
Multilingual Synthesis of Depictions through Structured Desc...
收藏 引用
11th Workshop on the Representation and processing of Sign languages: Evaluation of Sign language Resources, sign-lang@LREC-COLING 2024
作者: McDonald, John Efthimiou, Eleni Fotinea, Stavroula-Evita Wolfe, Rosalee School of Computing DePaul University ChicagoIL United States Institute for Language and Speech Processing ATHENA Research Center Athens Greece
Sign language synthesis systems must contend with an enormous variety of possible target languages across the world, and in many locations, such as Europe, the number of sign languages that can be found in a relativel... 详细信息
来源: 评论
Zero-Shot Personalized Lip-To-speech Synthesis with Face Image Based Voice Control  48
Zero-Shot Personalized Lip-To-Speech Synthesis with Face Ima...
收藏 引用
48th IEEE International Conference on Acoustics, speech and Signal processing, ICASSP 2023
作者: Sheng, Zheng-Yan Ai, Yang Ling, Zhen-Hua University of Science and Technology of China National Engineering Research Center of Speech and Language Information Processing Hefei China
Lip-to-speech (Lip2speech) synthesis, which predicts corresponding speech from talking face images, has witnessed significant progress with various models and training strategies in a series of independent studies. Ho... 详细信息
来源: 评论
speech Reconstruction from Silent Tongue and Lip Articulation by Pseudo Target Generation and Domain Adversarial Training  48
Speech Reconstruction from Silent Tongue and Lip Articulatio...
收藏 引用
48th IEEE International Conference on Acoustics, speech and Signal processing, ICASSP 2023
作者: Zheng, Rui-Chen Ai, Yang Ling, Zhen-Hua University of Science and Technology of China National Engineering Research Center of Speech and Language Information Processing Hefei China
This paper studies the task of speech reconstruction from ultrasound tongue images and optical lip videos recorded in a silent speaking mode, where people only activate their intra-oral and extra-oral articulators wit... 详细信息
来源: 评论
Deepfake Algorithm Recognition System with Augmented Data for ADD 2023 Challenge
Deepfake Algorithm Recognition System with Augmented Data fo...
收藏 引用
2023 Workshop on Deepfake Audio Detection and Analysis, DADA 2023
作者: Zeng, Xiao-Min Zhang, Jian-Tao Li, Kang Liu, Zhuo-Li Xie, Wei-Lin Song, Yan National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
In this paper, we describe our submitted systems to the ADD2023 Challenge Track 3–Deepfake algorithm recognition (AR). This task requires not only identifying known deepfake algorithms in closed-set but also distingu... 详细信息
来源: 评论
Neural speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses  48
Neural Speech Phase Prediction Based on Parallel Estimation ...
收藏 引用
48th IEEE International Conference on Acoustics, speech and Signal processing, ICASSP 2023
作者: Ai, Yang Ling, Zhen-Hua University of Science and Technology of China National Engineering Research Center of Speech and Language Information Processing Hefei China
This paper presents a novel speech phase prediction model which predicts wrapped phase spectra directly from amplitude spectra by neural networks. The proposed model is a cascade of a residual convolutional network an... 详细信息
来源: 评论
Within- and Between-Class Sample Interpolation Based Supervised Metric Learning for Speaker Verification  1
收藏 引用
18th National Conference on Man-Machine speech Communication, NCMMSC 2023
作者: Zhang, Jian-Tao Song, Hao-Yu Guo, Wu Song, Yan Dai, Li-Rong National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China The Australian National University Canberra Australia
Metric learning aims to pull together the samples belonging to the same class and push apart those from different classes in embedding space. Existing methods may suffer from inadequate and low-quality sample pairs, r... 详细信息
来源: 评论