检索结果-内蒙古大学图书馆

Conference on Applications of Digital Image Processing XLVII

作者： Prativadibhayankaram, Srivatsa Richter, Thomas Foessel, Siegfried Kaup, Andre Fraunhofer Inst Integrated Circuits IIS Moving Picture Technol Erlangen Germany Friedrich Alexander Univ Erlangen Nurnberg Multimedia Commun & Signal Proc Erlangen Germany

ISBN: (纸本)9781510679344;9781510679351

With the advent of learned image compression, numerous models have been developed. These models make use of non-linear transforms that are learnt during the training process, where an image is transformed into a latent space, quantized and entropy coded. At the decoder, the quantized latent is recovered and transformed back to image space through a synthesis transform. In this work, we attempt to present an analysis of the energy distribution across channels. In our prior works, we demonstrated the features captured by the analysis transform, that can provide insights into the bitrate distribution across channels. Building on that, we extend our findings with quantitative measurements. We consider various learned image codecs that are based on the variational autoencoder framework and compare them with Karhunen Loeve transform (KLT) in terms of energy compaction. We also compare the closeness of the learned transforms to KLT to study the relationship between the design of classical codecs and learned codecs.

关键词： linear transform coding non-linear transform coding deep learning image compression

来源：评论

学校读者我要写书评

暂无评论

COLOR LEARNING FOR IMAGE COMPRESSION 30

COLOR LEARNING FOR IMAGE COMPRESSION

引用

30th IEEE International Conference on Image Processing (ICIP)

作者： Prativadibhayankaram, Srivatsa Richter, Thomas Sparenberg, Heiko Foessel, Siegfried Fraunhofer Inst Integrated Circuits IIS Moving Picture Technol Erlangen Germany

ISBN: (纸本)9781728198354

Deep learning based image compression has gained a lot of momentum in recent times. To enable a method that is suitable for image compression and subsequently extended to video compression, we propose a novel deep learning model architecture, where the task of image compression is divided into two sub-tasks, learning structural information from luminance channel and color from chrominance channels. The model has two separate branches to process the luminance and chrominance components. The color difference metric CIEDE2000 is employed in the loss function to optimize the model for color fidelity. We demonstrate the benefits of our approach and compare the performance to other codecs. Additionally, the visualization and analysis of latent channel impulse response is performed.

关键词： Image compression deep learning color learning non-linear transform coding

来源：评论

学校读者我要写书评

暂无评论

Rate-Distortion Optimized Encoding for Deep Image Compression

IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS

引用

IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS 2021年 2卷 633-647页

作者： Schafer, Michael Pientka, Sophie Pfaff, Jonathan Schwarz, Heiko Marpe, Detlev Wiegand, Thomas Heinrich Hertz Inst Nachrichtentech Berlin GmbH Fraunhofer Inst Telecommun Video Commun & Applicat Dept D-10587 Berlin Germany Heinrich Hertz Inst Nachrichtentech Berlin GmbH Fraunhofer Inst Telecommun D-10587 Berlin Germany Free Univ Berlin Dept Math & Comp Sci D-14195 Berlin Germany Berlin Inst Technol Dept Elect Engn & Comp Sci D-10623 Berlin Germany

Deep-learned variational auto-encoders (VAE) have shown remarkable capabilities for lossy image compression. These neural networks typically employ non-linear convolutional layers for finding a compressible representation of the input image. Advanced techniques such as vector quantization, context-adaptive arithmetic coding and variable-rate compression have been implemented in these auto-encoders. Notably, these networks rely on an end-to-end approach, which fundamentally differs from hybrid, block-based video coding systems. Therefore, signal-dependent encoder optimizations have not been thoroughly investigated for VAEs yet. However, rate-distortion optimized encoding heavily determines the compression performance of state-of-the-art video codecs. Designing such optimizations for non-linear, multi-layered networks requires to understand the relationship between the quantization, the bit allocation of the features and the distortion. Therefore, this paper examines the rate-distortion performance of a variable-rate VAE. In particular, one demonstrates that the trained encoder network typically finds features with a near-optimal bit allocation across the channels. Furthermore, one approximates the relationship between distortion and quantization by a higher-order polynomial, whose coefficients can be robustly estimated. Based on these considerations, the authors investigate an encoding algorithm for the Lagrange optimization, which significantly improves the coding efficiency.

关键词： Video coding Image coding Vector quantization nonlinear distortion Bit rate Rate-distortion Signal processing algorithms Deep image compression variational auto-encoders rate-distortion optimized encoding non-linear transform coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：