This paper summarises the design of the Cool-Chic candidate for the Challenge on Learned imagecompression. This candidate attempts to demonstrate that neural coding methods can lead to low complexity and lightweight ...
详细信息
ISBN:
(纸本)9798350385885;9798350385878
This paper summarises the design of the Cool-Chic candidate for the Challenge on Learned imagecompression. This candidate attempts to demonstrate that neural coding methods can lead to low complexity and lightweight image decoders while still offering competitive performance. The approach is based on the already published over fitted lightweight neural networks Cool-Chic, further adapted to the human subjective viewing targeted in this challenge.
The continuously advancing image generation technology has been utilized for high-quality video reconstruction using low-bitrate feature representation. The motion semantic representations provided by existing models ...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
The continuously advancing image generation technology has been utilized for high-quality video reconstruction using low-bitrate feature representation. The motion semantic representations provided by existing models exhibit significant redundancy, indicating that their potential as video compression tools is still to be fully explored. In this work, we propose a motion semantic enhancement model called MSEM for ultra-low-bitrate talking-head video compression, aiming at improving semantic extraction effectiveness and compactness. Specifically, we enhance semantic extraction accuracy by introducing a deformable feature estimator with flexible receptive field shapes. Based on the straight-through gradient estimation, we construct a semantic encoding space that contains more compact semantic representations with low redundancy. Extensive experiments clearly demonstrate that i) compared to mainstream semantic compression models, our method has stronger semantic feature extraction capabilities benefiting from a more reasonable semantic feature impact range, and ii) our method provides an average bitrate reduction for the same visual quality of more than 50% compared to VVC.
Compared to traditional imagecompression methods, learned imagecompression (LIC) methods have demonstrated increasingly superior rate-distortion performance. However, LIC networks are often regarded as black boxes, ...
详细信息
ISBN:
(纸本)9798350390155;9798350390162
Compared to traditional imagecompression methods, learned imagecompression (LIC) methods have demonstrated increasingly superior rate-distortion performance. However, LIC networks are often regarded as black boxes, still lacking a theoretical understanding. Sparse coding provides the sparse and interpretable modeling for analyzing or synthesizing natural images in various signal and image processing applications. Therefore, we introduce convolutional sparse coding (CSC) into transform network for enhancing the interpretability of LIC methods. In this paper, we first employ CSC layers to achieve certain theoretical modeling for LIC network, and adopt a weight sharing strategy in encoderdecoder pair and attention mechanism to balance the complexity and performance. Additionally, we analyze the model robustness against data input perturbations and consider the impact of sparsity trade-off parameter in the CSC layer optimization process. Experimental results demonstrate that our method achieves comparable performance with the corresponding baseline, and our model is more robust.
Deep variational autoencoders for image and video compression have gained significant attraction in the recent years, due to their potential to offer competitive or better compression rates compared to the decades lon...
详细信息
ISBN:
(纸本)9798350347951
Deep variational autoencoders for image and video compression have gained significant attraction in the recent years, due to their potential to offer competitive or better compression rates compared to the decades long traditional codecs such as AVC, HEVC or VVC. However, because of complexity and energy consumption, these approaches are still far away from practical usage in industry. More recently, implicit neural representation (INR) based codecs have emerged, and have lower complexity and energy usage to classical approaches at decoding. However, their performances are not in par at the moment with state-of-the-art methods. In this research, we first show that INR based image codec has a lower complexity than VAE based approaches, then we propose several improvements for INR-based image codec and outperformed baseline model by a large margin.
Overfitted image codecs offer compelling compression performance and low decoder complexity, through the overfitting of a lightweight decoder for each image. Such codecs include Cool-chic, which presents image coding ...
详细信息
ISBN:
(纸本)9789464593617;9798331519773
Overfitted image codecs offer compelling compression performance and low decoder complexity, through the overfitting of a lightweight decoder for each image. Such codecs include Cool-chic, which presents image coding performance on par with VVC while requiring around 2000 multiplications per decoded pixel. This paper proposes to decrease Cool-chic encoding and decoding complexity. The encoding complexity is reduced by shortening Cool-chic training, up to the point where no overfitting is performed at all. It is also shown that a tiny neural decoder with 300 multiplications per pixel still outperforms HEVC. A near real-time CPU implementation of this decoder is made available at https://***/Cool-Chic/.
Digital images are easy to copy, spread, and tampering, therefore, protecting image contents in the digital era has emerged as a key focus of multimedia security research. Recently, researchers have introduced deep le...
详细信息
ISBN:
(纸本)9798350387780;9798350387797
Digital images are easy to copy, spread, and tampering, therefore, protecting image contents in the digital era has emerged as a key focus of multimedia security research. Recently, researchers have introduced deep learning technology into cryptography, but how to reasonably integrate cipher keys into neural networks and how to combining compression with encryption are still a challenge work. This paper proposes a new image encryption-compression network based on CycleGAN, which is strictly in accordance with the principle of cryptography. In the encryption phase, the key and the plain image are fed into the encryption network to generate a one-channel cipher image. Conversely, in the decryption phase, the same key and the cipher image are fed into the decryption network to perform deciphering, while the cipher image cannot be deciphered when the incorrect key is used. Both encryption and decryption networks take use of CycleGAN, and a new auxiliary network is designed to ensure the key sensitivity to the encryptioncompression network, thus the security of the system is enhanced. Experimental results have demonstrated the feasibility of the proposed scheme.
As one of the main carriers of information transmission and storage, image data need to be compressed in some application scenarios. Traditional imagecompression methods, although reducing the size of image files to ...
详细信息
The Dynamic Vision System (DVS) is a novel image acquisition system that works only when there is a brightness change in a pixel, resulting in a stream of events including timestamps, spatial coordinates and the sign ...
详细信息
ISBN:
(纸本)9781728198354
The Dynamic Vision System (DVS) is a novel image acquisition system that works only when there is a brightness change in a pixel, resulting in a stream of events including timestamps, spatial coordinates and the sign of the brightness change (increase or decrease). Although DVS's output data size is much smaller than conventional image systems, it still requires further compression, as the main applications of DVS are embedded systems with limited transmission and storage resources. In this paper, we propose a new method for lossless compression of event data streams based on point cloud representations. The event data stream is organized into a 3D point cloud to which a compression algorithm is applied. In addition, different generation strategies are devised in order to compare the compression performance of the proposed approach. Experimental results show an improved compression ratio of about 22% under lossless conditions.
In recent years, deep learning-based lossy imagecompression have achieved great success. However, the problem of their huge overhead in terms of computational and parametric costs has still not been adequately addres...
详细信息
ISBN:
(纸本)9781728198354
In recent years, deep learning-based lossy imagecompression have achieved great success. However, the problem of their huge overhead in terms of computational and parametric costs has still not been adequately addressed. Inspired by the classical imagecompression methods, deep learning based models are usually combined with an entropy model to maintain the compression performance. Existing methods also introduce side information to serve as a prior on the parameters of the entropy model, which have achieved better rate-distortion performance. Based on the role of side information in learned imagecompression models, we propose an efficient pruning method for such models. In particular, the proposed pruning approach automatically searches for the optimal decoder architecture based on the extent to which each hidden layer in the decoder utilizes side information. The experiment results demonstrate the effectiveness of the proposed method and show that it outperforms all existing related studies in terms of compression performance.
imagecompression is a critical component of digital systems, as it reduces the file size of images while preserving quality, thereby facilitating minimal storage requirements and seamless transmission. Wavelet transf...
详细信息
暂无评论