Refine search results

Document type

  • 6 papers: Conference

Collection

  • 6 papers: Electronic documents
  • 0 titles: Print holdings

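Date distribution

Subject classification

  • 6 papers: Engineering
    • 5 papers: Electrical Engineering
    • 5 papers: Computer Science and Technology...
    • 2 papers: Information and Communication Engineering
    • 2 papers: Software Engineering
    • 1 paper: Electronic Science and Technology (...
    • 1 paper: Control Science and Engineering
  • 3 papers: Science
    • 3 papers: Physics
  • 2 papers: Medicine
    • 2 papers: Clinical Medicine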

Topics

  • 6 papers: neural audio cod...
  • 2 papers: packet loss conc...
  • 2 papers: vector quantizat...
  • 1 paper: subjective test
  • 1 paper: karhunen-loeve t...
  • 1 paper: error resilience
  • 1 paper: real-time commun...
  • 1 paper: speech enhanceme...
  • 1 paper: bitrate scalable
  • 1 paper: speech and audio...
  • 1 paper: latent space
  • 1 paper: real-time commun...
  • 1 paper: conditional flow...
  • 1 paper: low bit rate cod...

Institutions

  • 3 papers: commun univ chin...
  • 3 papers: microsoft res as...
  • 1 paper: orange innovat i...
  • 1 paper: univ rennes iris...
  • 1 paper: international au...
  • 1 paper: fraunhofer iis e...
  • 1 paper: orange innovat l...
  • 1 paper: univ rennes iris...
  • 1 paper: msra peoples r c...
  • 1 paper: orange innovat r...

Authors

  • 3 papers: xue huaying
  • 3 papers: jiang xue
  • 3 papers: lu yan
  • 3 papers: peng xiulian
  • 2 papers: zhang yuan
  • 2 papers: philippe pierric...
  • 2 papers: scalart pascal
  • 2 papers: muller thomas
  • 1 paper: zheng chengyu
  • 1 paper: pia nicola
  • 1 paper: multrus markus
  • 1 paper: gros laetitia
  • 1 paper: strauss martin
  • 1 paper: ragot stephane
  • 1 paper: edler bernd
  • 1 paper: ragoti stephane

Language

  • 6 papers: English
Search query: "subject term = neural audio coding"
6 records; results 1-10 are shown below
Post-Training Latent Dimension Reduction in Neural Audio Coding
32nd European Signal Processing Conference (EUSIPCO)
Authors: Muller, Thomas; Ragot, Stephane; Philippe, Pierrick; Scalart, Pascal
Affiliations: Orange Innovat, Lannion, France; Univ Rennes, IRISA, Lannion, France; Orange Innovat, Rennes, France
This work addresses the problem of latent space quantization in neural audio coding. A covariance analysis of latent space is performed on several pre-trained audio coding models (Lyra V2, EnCodec, AudioDec). It is pr...
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Pia, Nicola; Strauss, Martin; Multrus, Markus; Edler, Bernd
Affiliations: Fraunhofer IIS, Erlangen, Germany; International Audio Laboratories Erlangen, Erlangen, Germany
This paper introduces FlowMAC, a novel neural audio codec for high-quality general audio compression at low bit rates based on conditional flow matching (CFM). FlowMAC jointly learns a mel spectrogram encoder, quantiz...
Towards Error-Resilient Neural Speech Coding
23rd Interspeech Conference
Authors: Xue, Huaying; Peng, Xiulian; Jiang, Xue; Lu, Yan
Affiliations: Microsoft Res Asia, Beijing, Peoples R China; Commun Univ China, Beijing, Peoples R China
Neural audio coding has shown very promising results recently in the literature to largely outperform traditional codecs but limited attention has been paid on its error resilience. Neural codecs trained considering o...
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
23rd Interspeech Conference
Authors: Jiang, Xue; Peng, Xiulian; Xue, Huaying; Zhang, Yuan; Lu, Yan
Affiliations: Commun Univ China, Beijing, Peoples R China; Microsoft Res Asia, Beijing, Peoples R China; MSRA, Beijing, Peoples R China
Bitrate scalability is a desirable feature for audio coding in real-time communications. Existing neural audio codecs usually enforce a specific bitrate during training, so different models need to be trained for each...
Speech quality evaluation of neural audio codecs
25th Interspeech Conference
Authors: Muller, Thomas; Ragot, Stephane; Gros, Laetitia; Philippe, Pierrick; Scalart, Pascal
Affiliations: Orange Innovat, Ile De France, France; Univ Rennes, IRISA, Rennes, France
This paper presents speech quality results to characterize the state of the art and technological advance of recent neural audio codecs targeting low bitrates. Audio quality was evaluated in one clean speech experimen...
End-to-End Neural Speech Coding for Real-Time Communications
47th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Authors: Jiang, Xue; Peng, Xiulian; Zheng, Chengyu; Xue, Huaying; Zhang, Yuan; Lu, Yan
Affiliations: Commun Univ China, Beijing, Peoples R China; Microsoft Res Asia, Beijing, Peoples R China
Deep-learning based methods have shown their advantages in audio coding over traditional ones but limited attention has been paid on real-time communications (RTC). This paper proposes the TFNet, an end-to-end neural ...