Search query: subject term = "Neural speech coding"
13 records; showing 1-10
Neural Speech Coding for Real-Time Communications Using Constant Bitrate Scalar Quantization
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, vol. 18, no. 8, pp. 1462-1476
Authors: Brendel, Andreas; Pia, Nicola; Gupta, Kishan; Behringer, Lyonel; Fuchs, Guillaume; Multrus, Markus
Affiliation: Fraunhofer Inst Integrated Circuits IIS Erlangen Fraunhofer IIS D-91058 Erlangen Germany
Neural audio coding has emerged as a vivid research direction by promising good audio quality at very low bitrates unachievable by classical coding techniques. Here, end-to-end trainable autoencoder-like models repres...
Scalable and Efficient Neural Speech Coding: A Hybrid Design
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, vol. 30, pp. 12-25
Authors: Zhen, Kai; Sung, Jongmo; Lee, Mi Suk; Beack, Seungkwon; Kim, Minje
Affiliations: Indiana Univ Dept Comp Sci Bloomington IN 47408 USA; Indiana Univ Cognit Sci Program Bloomington IN 47408 USA; Elect & Telecommun Res Inst Daejeon 34129 South Korea; Indiana Univ Dept Intelligent Syst Engn Bloomington IN 47408 USA
We present a scalable and efficient neural waveform coding system for speech compression. We formulate the speech coding problem as an autoencoding task, where a convolutional neural network (CNN) performs encoding an...
A Hybrid DFSMN and Mamba Architecture for Low Bitrate Neural Speech Coding
14th International Symposium on Chinese Spoken Language Processing
Authors: Zhao, Yuhao; Jia, Maoshen; Ru, Jiawei; Tai, Junqi
Affiliations: Beijing Univ Technol Sch Informat Sci & Technol Beijing Peoples R China; Beijing Univ Technol Beijing Dublin Int Coll Beijing Peoples R China
In this paper, we proposed a novel low bitrate neural speech codec based on sequence modeling networks. The proposed method consists of a convolution-based encoder and decoder, a DFSMN-Mamba module, and a vector quant...
A Dual-path Conformer-based Network for Neural Speech Coding
14th International Symposium on Chinese Spoken Language Processing
Authors: Ru, Jiawei; Jia, Maoshen; Zhao, Yuhao; Tao, Liang
Affiliation: Beijing Univ Technol Sch Informat Sci & Technol Beijing Peoples R China
In this paper, we propose a neural speech coding method based on the dual-path conformer, which mainly consists of three steps: (1) the encoding and decoding of the time-frequency spectrum are performed by a structure...
Disentangled Feature Learning for Real-Time Neural Speech Coding
48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Authors: Jiang, Xue; Peng, Xiulian; Zhang, Yuan; Lu, Yan
Affiliations: Communication University of China Beijing China; Microsoft Research Asia Beijing China
Recently end-to-end neural audio/speech coding has shown its great potential to outperform traditional signal analysis based audio codecs. This is mostly achieved by following the VQ-VAE paradigm where blind features ...
NESC: Robust Neural End-2-End Speech Coding with GANs
23rd Interspeech Conference
Authors: Pia, Nicola; Gupta, Kishan; Korse, Srikanth; Multrus, Markus; Fuchs, Guillaume
Affiliation: Fraunhofer IIS Erlangen Erlangen Germany
Neural networks have proven to be a formidable tool to tackle the problem of speech coding at very low bit rates. However, the design of a neural coder that can be operated robustly under real-world conditions remains...
AVS3P10 Standard for Real-time Speech Coding
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Xiao, Wei; Dou, Weibei; Wang, Wenlong; Yi, Gaoxiong; Li, Jingxin; Shang, Shidong
Affiliations: Tencent Ethereal Audio Lab Tencent Shenzhen China; Department of Electronic Engineering Tsinghua University Beijing China; Tencent Ethereal Audio Lab Tencent Beijing China; China Electronics Standardization Institute Beijing China
As the tenth part of the third-generation AVS standard series for real-time speech coding, AVS3P10 is the recent standard completed in the Audio Video Coding Standards Workgroup of China (AVS). Combining the state-of-...
DRED: Deep REDundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, vol. 18, no. 8, pp. 1441-1447
Authors: Valin, Jean-Marc; Buthe, Jan; Mustafa, Ahmed; Klingbeil, Michael
Affiliations: Xiph Org Fdn Jaffrey NH 03452 USA; Amazon Web Serv Palo Alto CA 94303 USA
Despite recent advancements in packet loss concealment (PLC) using deep learning techniques, packet loss remains a significant challenge in real-time speech communication. Redundancy has been used in the past to recov...
PERSONALIZED NEURAL SPEECH CODEC
49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Jang, Inseon; Yang, Haici; Lim, Wootaek; Beack, Seungkwon; Kim, Minje
Affiliations: Elect & Telecommun Res Inst Daejeon 34129 South Korea; Indiana Univ Dept Intelligent Syst Engn Bloomington IN 47408 USA; Univ Illinois Urbana-Champaign Dept Comp Sci Champaign IL 61801 USA; Indiana Univ Indiana PA USA
In this paper, we propose a personalized neural speech codec, envisioning that personalization can reduce the model complexity or improve perceptual speech quality. Despite the common usage of speech codecs where only...
LOW BITRATE LOSS RESILIENCE SCHEME FOR A SPEECH ENHANCING NEURAL CODEC
49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Kolundzija, Mihailo; Kavalekalam, Mathew; Balic, Ivana; Mao, Michelle; Casas, Raul
Affiliation: Cisco Syst San Jose CA 95134 USA
Deep neural networks have proven their efficacy in encoding high-quality speech and audio at remarkably low bitrates, while also demonstrating superior performance in audio packet loss concealment (PLC) compared to tr...