咨询与建议

限定检索结果

文献类型

  • 232 篇 会议
  • 127 篇 期刊文献
  • 1 册 图书

馆藏范围

  • 360 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 219 篇 工学
    • 140 篇 计算机科学与技术...
    • 123 篇 软件工程
    • 88 篇 信息与通信工程
    • 28 篇 电子科学与技术(可...
    • 26 篇 仪器科学与技术
    • 21 篇 电气工程
    • 20 篇 生物工程
    • 18 篇 控制科学与工程
    • 15 篇 化学工程与技术
    • 13 篇 机械工程
    • 7 篇 建筑学
    • 6 篇 土木工程
    • 3 篇 光学工程
    • 3 篇 生物医学工程(可授...
  • 155 篇 理学
    • 114 篇 物理学
    • 56 篇 数学
    • 23 篇 生物学
    • 20 篇 统计学(可授理学、...
    • 15 篇 化学
    • 5 篇 系统科学
  • 52 篇 管理学
    • 37 篇 图书情报与档案管...
    • 18 篇 管理科学与工程(可...
    • 10 篇 工商管理
  • 13 篇 法学
    • 10 篇 社会学
    • 3 篇 法学
  • 7 篇 教育学
    • 6 篇 教育学
    • 4 篇 心理学(可授教育学...
  • 7 篇 文学
    • 7 篇 外国语言文学
    • 6 篇 中国语言文学
  • 3 篇 医学
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 2 篇 农学

主题

  • 59 篇 speech recogniti...
  • 38 篇 speech processin...
  • 26 篇 training
  • 21 篇 acoustics
  • 19 篇 signal processin...
  • 17 篇 natural language...
  • 17 篇 speech enhanceme...
  • 16 篇 automatic speech...
  • 15 篇 feature extracti...
  • 15 篇 robustness
  • 13 篇 speech
  • 12 篇 speech synthesis
  • 11 篇 error analysis
  • 10 篇 hidden markov mo...
  • 10 篇 predictive model...
  • 9 篇 decoding
  • 8 篇 training data
  • 8 篇 transformers
  • 8 篇 self-supervised ...
  • 8 篇 accuracy

机构

  • 68 篇 national enginee...
  • 18 篇 hitachi ltd. res...
  • 15 篇 institute for la...
  • 15 篇 center for langu...
  • 13 篇 center for langu...
  • 10 篇 iflytek research
  • 10 篇 institute for la...
  • 9 篇 department of in...
  • 9 篇 ict cluster sing...
  • 8 篇 robust speech pr...
  • 8 篇 national enginee...
  • 7 篇 university of sc...
  • 7 篇 iflytek research...
  • 7 篇 school of ece na...
  • 6 篇 robust speech pr...
  • 6 篇 state key labora...
  • 6 篇 institute for la...
  • 6 篇 national enginee...
  • 5 篇 university of sc...
  • 5 篇 ibm thomas j. wa...

作者

  • 51 篇 ling zhen-hua
  • 32 篇 ai yang
  • 21 篇 hansen john h.l.
  • 19 篇 zhen-hua ling
  • 17 篇 hansen john h. l...
  • 16 篇 watanabe shinji
  • 16 篇 lu ye-xin
  • 15 篇 yang ai
  • 14 篇 gu jia-chen
  • 14 篇 katsouros vassil...
  • 14 篇 potamianos alexa...
  • 14 篇 j.h.l. hansen
  • 14 篇 du hui-peng
  • 13 篇 fujita yusuke
  • 13 篇 paraskevopoulos ...
  • 13 篇 katsamanis athan...
  • 12 篇 androutsopoulos ...
  • 10 篇 horiguchi shota
  • 10 篇 shinji watanabe
  • 10 篇 zheng rui-chen

语言

  • 331 篇 英文
  • 29 篇 其他
检索条件"机构=Center for Research in Speech and Language Processing"
360 条 记 录,以下是31-40 订阅
排序:
Signs and Synonymity Continuing Development of the Multilingual Sign language Wordnet  11
Signs and Synonymity Continuing Development of the Multiling...
收藏 引用
11th Workshop on the Representation and processing of Sign languages: Evaluation of Sign language Resources, sign-lang@LREC-COLING 2024
作者: Schulder, Marc Bigeard, Sam Kopf, Maria Hanke, Thomas Kuder, Anna Wójcicka, Joanna Mesch, Johanna Björkstrand, Thomas Vacalopoulou, Anna Vasilaki, Kyriaki Goulas, Theodore Fotinea, Stavroula-Evita Efthimiou, Eleni Institute of German Sign Language and Communication of the Deaf University of Hamburg Germany Inria Centre University of Lorraine France Department of Linguistics University of Cologne Germany Department of General Linguistics Sign Language Linguistics and Baltic Studies University of Warsaw Poland Department of Linguistics Stockholm University Sweden Institute for Language and Speech Processing Athena Research Center Greece
The Multilingual Sign language Wordnet is the first publicly available wordnet resource for sign languages. It is a growing multilingual resource providing data for eight sign languages to date. During the initial pha... 详细信息
来源: 评论
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm
arXiv
收藏 引用
arXiv 2024年
作者: Du, Hui-Peng Ai, Yang Zheng, Rui-Chen Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes a novel neural audio codec, named APCodec+, which is an improved version of APCodec. The APCodec+ takes the audio amplitude and phase spectra as the coding object, and employs an adversarial traini... 详细信息
来源: 评论
STAGE-WISE AND PRIOR-AWARE NEURAL speech PHASE PREDICTION
arXiv
收藏 引用
arXiv 2024年
作者: Liu, Fei Ai, Yang Du, Hui-Peng Lu, Ye-Xin Zheng, Rui-Chen Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes a novel Stage-wise and Prior-aware Neural speech Phase Prediction (SP-NSPP) model, which predicts the phase spectrum from input amplitude spectrum by two-stage neural networks. In the initial prior... 详细信息
来源: 评论
Multi-Stage speech Bandwidth Extension with Flexible Sampling Rate Control
arXiv
收藏 引用
arXiv 2024年
作者: Lu, Ye-Xin Ai, Yang Sheng, Zheng-Yan Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
The majority of existing speech bandwidth extension (BWE) methods operate under the constraint of fixed source and target sampling rates, which limits their flexibility in practical applications. In this paper, we pro... 详细信息
来源: 评论
MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
arXiv
收藏 引用
arXiv 2024年
作者: Jiang, Xiao-Hang Ai, Yang Zheng, Rui-Chen Du, Hui-Peng Lu, Ye-Xin Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
In this paper, we propose MDCTCodec, an efficient lightweight end-to-end neural audio codec based on the modified discrete cosine transform (MDCT). The encoder takes the MDCT spectrum of audio as input, encoding it in... 详细信息
来源: 评论
ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram
arXiv
收藏 引用
arXiv 2024年
作者: Jiang, Xiao-Hang Du, Hui-Peng Ai, Yang Lu, Ye-Xin Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes ESTVocoder, a novel excitation-spectral-transformed neural vocoder within the framework of source-filter theory. The ESTVocoder transforms the amplitude and phase spectra of the excitation into the... 详细信息
来源: 评论
SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features
arXiv
收藏 引用
arXiv 2024年
作者: Shi, Yu-Fei Ai, Yang Lu, Ye-Xin Du, Hui-Peng Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
Assessing the naturalness of speech using mean opinion score (MOS) prediction models has positive implications for the automatic evaluation of speech synthesis systems. Early MOS prediction models took the raw wavefor... 详细信息
来源: 评论
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
arXiv
收藏 引用
arXiv 2024年
作者: Du, Hui-Peng Lu, Ye-Xin Ai, Yang Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes a novel bidirectional neural vocoder, named BiVocoder, capable both of feature extraction and reverse waveform generation within the short-time Fourier transform (STFT) domain. For feature extracti... 详细信息
来源: 评论
MULTISCALE MATCHING DRIVEN BY CROSS-MODAL SIMILARITY CONSISTENCY FOR AUDIO-TEXT RETRIEVAL
arXiv
收藏 引用
arXiv 2024年
作者: Wang, Qian Gu, Jia-Chen Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
Audio-text retrieval (ATR), which retrieves a relevant caption given an audio clip (A2T) and vice versa (T2A), has recently attracted much research attention. Existing methods typically aggregate information from each... 详细信息
来源: 评论
A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions
arXiv
收藏 引用
arXiv 2024年
作者: Du, Hui-Peng Lu, Ye-Xin Ai, Yang Ling, Zhen-Hua National Engineering Research Center of Speech and Language Information Processing University of Science and Technology of China Hefei China
This paper proposes a novel neural denoising vocoder that can generate clean speech waveforms from noisy mel-spectrograms. The proposed neural denoising vocoder consists of two components, i.e., a spectrum predictor a... 详细信息
来源: 评论