Search Results - Inner Mongolia University Library

Picture Coding Symposium, PCS

Authors: Keisuke Kaji, Yasuyo Kita, Ichiro Matsuda, Susumu Itoh, Yusuke Kameda (Faculty of Science and Technology, Tokyo University of Science, 2641 Yamazaki, Noda-shi, Chiba, Japan; Sophia University)

ISBN: (print) 9781665492584

An autoregressive image generative model that estimates the conditional probability distributions of image signals pel-by-pel is a promising tool for lossless image coding. In this paper, a generative model based on a convolutional neural network (CNN) was combined with a locally trained adaptive predictor to improve its accuracy. Furthermore, sets of parameters that adjust the estimated probability distribution were numerically optimized for each image to minimize the resulting coding rate. Simulation results indicate that the proposed method improves the coding efficiency obtained by the CNN-based model for most of the tested images.
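As a rough illustration of why better pel-by-pel probability estimates lower the coding rate, the sketch below computes the ideal code length, the sum of -log2 p(x_i | x_<i), for a short row of pels under two toy models. Both models (`uniform_model`, `previous_pel_model`) and the sample row are hypothetical stand-ins, not the CNN-based model or the adaptive predictor from the paper.

```python
import math

def ideal_code_length_bits(pels, cond_prob):
    """Sum of -log2 p(x_i | x_<i) over the pels, taken in coding order."""
    total = 0.0
    for i, value in enumerate(pels):
        p = cond_prob(pels[:i], value)  # probability given already-coded pels
        total += -math.log2(p)
    return total

def uniform_model(context, value, levels=256):
    # Hypothetical baseline: ignores the context entirely.
    return 1.0 / levels

def previous_pel_model(context, value, levels=256, p_same=0.5):
    # Hypothetical toy model: the previous pel repeats with probability p_same;
    # the remaining mass is spread evenly over the other levels.
    if not context:
        return 1.0 / levels
    if value == context[-1]:
        return p_same
    return (1.0 - p_same) / (levels - 1)

row = [10, 10, 10, 12, 12, 12, 12, 12]
rate_uniform = ideal_code_length_bits(row, uniform_model) / len(row)
rate_toy = ideal_code_length_bits(row, previous_pel_model) / len(row)
print(f"uniform: {rate_uniform:.3f} bits/pel, toy model: {rate_toy:.3f} bits/pel")
```

An entropy coder driven by the model would approach these rates, so any adjustment that puts more probability on the pels that actually occur shortens the bitstream.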

Keywords: Adaptation models; Image coding; Adaptive systems; Simulation; Predictive models; Probability distribution; Encoding

Compressed Sensing by Using Measurement Completion for Robust Image Coding

IEEE International Conference on High Performance Computing and Communications (HPCC)

Authors: Bo Zhang, Di Xiao, Sen Bai, Min Li (Communication NCO Academy, Army Engineering University, Chongqing, China; College of Computer Science, Chongqing University, Chongqing, China; Chongqing Institute of Engineering, Chongqing, China)

Compressed sensing (CS) has been demonstrated to be an effective method for robust image coding. However, for existing CS-based image coding schemes, recovery performance drops rapidly at high packet loss rates (PLRs) because the received CS measurements are insufficient for stable recovery. To solve this problem, we propose a novel robust image coding scheme that uses CS with measurement completion. By dividing the original image into many down-sampled images with interweaving permutation (IP) and then sampling them with scrambled 2D CS (2DCS), we obtain the CS measurement vectors of the down-sampled images. Since the CS measurement vectors preserve the correlation of the down-sampled images, they are also highly correlated with each other. By exploring the correlation among the CS measurement vectors, a measurement completion strategy is proposed, which can recover many of the CS measurements lost to packet loss at the decoder side. Simulation results show that the proposed scheme can significantly outperform previous CS-based image coding schemes at high PLRs in terms of rate-distortion (R-D) performance. This advantage makes the proposed scheme a good candidate for image communication systems that need to provide reliable transmission of image data over channels with high PLRs.
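The idea of splitting an image into interleaved down-sampled images can be sketched as a plain polyphase split. This is an assumption for illustration: the abstract does not define the interweaving permutation, and the function names `polyphase_split` and `polyphase_merge` are hypothetical. CS sampling and measurement completion are not shown.

```python
def polyphase_split(image, factor):
    """Split a 2-D list into factor*factor interleaved down-sampled images."""
    subimages = []
    for dy in range(factor):
        for dx in range(factor):
            # Take every factor-th row and column, starting at offset (dy, dx).
            sub = [row[dx::factor] for row in image[dy::factor]]
            subimages.append(sub)
    return subimages

def polyphase_merge(subimages, factor, height, width):
    """Inverse of polyphase_split: interleave the sub-images back together."""
    image = [[None] * width for _ in range(height)]
    for index, sub in enumerate(subimages):
        dy, dx = divmod(index, factor)
        for r, row in enumerate(sub):
            for c, value in enumerate(row):
                image[dy + r * factor][dx + c * factor] = value
    return image

image = [[r * 4 + c for c in range(4)] for r in range(4)]
subs = polyphase_split(image, 2)
restored = polyphase_merge(subs, 2, 4, 4)
```

Because neighbouring pixels land in different sub-images, the sub-images resemble one another, which is the kind of correlation the abstract says carries over to the measurement vectors.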

Keywords: Image coding; Correlation; Smart cities; Simulation; Image communication; Packet loss; Decoding

Block-Level Rate Control for Learnt Image Coding

Picture Coding Symposium, PCS

Authors: Xining Wang, Ming Lu, Zhan Ma (Nanjing University, Nanjing, China)

ISBN: (print) 9781665492584

Learnt image coding (LIC) methods recently offered state-of-the-art efficiency by training separate models for individual bitrates, which is impractical. Variable-rate coding with a single or very few LIC models has since emerged, mostly implemented to process a whole image directly (e.g., a single rate-distortion control factor $\lambda$ for a given image to approach the target rate). This work provides a novel block-level rate control by applying UnEqual Rate Allocation (UERA) to non-overlapping image blocks, which exploits the spatial heterogeneity of the underlying content. Such block-level UERA is enabled by modeling the rate-distortion (R-D) function of each block, by which we optimize block-wise $\lambda$s to maximize the overall R-D performance. Experiments show that our method can accurately adapt to a wide range of bitrates with a single model, and provides performance almost identical to that of solutions using multiple rate-specific models. Additionally, such block-level LIC significantly reduces peak running memory consumption and computational complexity, which is attractive for practical implementations.

Keywords: Training; Adaptation models; Image coding; Computational modeling; Bit rate; Memory management; Rate-distortion

PO-ELIC: Perception-Oriented Efficient Learned Image Coding

arXiv, 2022

Authors: He, Dailan; Yang, Ziming; Yu, Hongjiu; Xu, Tongda; Luo, Jixiang; Chen, Yuan; Gao, Chenjian; Shi, Xinjie; Qin, Hongwei; Wang, Yan (SenseTime Research; Tsinghua University, China)

In the past years, learned image compression (LIC) has achieved remarkable performance. Recent LIC methods outperform VVC in both PSNR and MS-SSIM. However, the low bit-rate reconstructions of LIC suffer from artifacts such as blurring, color drifting and missing texture. Moreover, these varied artifacts make image quality metrics correlate poorly with human perceptual quality. In this paper, we propose PO-ELIC, i.e., Perception-Oriented Efficient Learned Image Coding. Specifically, we adapt ELIC, one of the state-of-the-art LIC models, with adversarial training techniques. We apply a mixture of losses, including hinge-form adversarial loss, Charbonnier loss, and style loss, to finetune the model towards better perceptual quality. Experimental results demonstrate that our method achieves perceptual quality comparable to HiFiC at a much lower bitrate. Copyright © 2022, The Authors. All rights reserved.

Keywords: Image coding

Position-based Motion Vector Prediction for Textual Image Coding

Picture Coding Symposium, PCS

Authors: Donghui Feng, Chen Zhu, Guo Lu, Li Song (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, Shanghai, China; Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, Shanghai, China)

ISBN: (print) 9781665492584

Textual content is becoming increasingly important in video conferencing, while existing screen content encoding tools still produce a high bitrate in text regions. The main coding tool, Intra Block Copy (IBC), inherits the MV prediction mechanism of inter-frame coding, but adjacent text characters typically have unrelated MVs, making it inefficient to predict an MV using only neighboring MVs. To solve this problem, we propose Position-based Motion Vector Prediction, which caches IBC AMVP PU positions as predictors. A character can then find a previously encoded position from which to construct a good MV prediction. Experimental results show the effectiveness of the proposed prediction scheme.

Keywords: Image coding; Bit rate; Encoding; Complexity theory; Videoconferences

Side Information Driven Image Coding for Machines

Picture Coding Symposium, PCS

Authors: Zhongpeng Zhang, Ying Liu (Department of Computer Science and Engineering, Santa Clara University, Santa Clara, CA, USA)

ISBN: (print) 9781665492584

With the continuous improvement of computer vision technology, more and more image information is consumed by machines rather than humans. Image coding for machines (ICM) compresses image data so that they can be sent more efficiently to the receiver side, where machines conduct visual analysis. A typical deep learning-based ICM structure contains one codec network, which compresses and transmits images through the Internet, and one semantic analysis task network, such as image classification or object recognition. In the codec part, the side information is the hyper-prior, or hierarchical layers of hyper-priors, used for the compression of image latent representations. In this paper, we propose a Side Information Driven Image Coding (SIIC) framework based on deep learning. It compresses and transmits only the side information to the receiver for image classification tasks. We obtain a top-1 accuracy of 70.38% on the ImageNet1K dataset with 0.046 bits per pixel.

Keywords: Visualization; Image coding; Codecs; Semantics; Receivers; Streaming media; Internet

Multi-level Latent Fusion in Learning-based Image Coding

IEEE International Symposium on Circuits and Systems

Authors: Jay N. Shingala, Arunkumar Mohananchettiar, Pankaj Sharma, Peng Yin, Arjun Arora, Sean McCarthy, Taoran Lu, Fangjun Pu (Ittiam Systems, India; Dolby Laboratories Inc., Sunnyvale, CA, USA)

ISBN: (digital) 9781665484855

ISBN: (print) 9781665484862

Learning-based image coding has shown promising results for coding of natural images compared to traditional block-based coding schemes. However, improvements are needed for screen content coding. Most of the popular learning-based coding approaches are based on variational autoencoders employing Convolutional Neural Networks (CNNs), which are trained end-to-end on a training dataset. The receptive field area of the latents in these architectures increases with the down-sampling ratio and the kernel size used in each convolution layer. The latents coded from the last layer therefore have a large receptive field, which may not be optimal for coding image sources such as screen content or mixed content containing text, logos and small edges. This paper proposes new methods to adaptively fuse and code the latents from different layers. This enables a novel multi-level receptive field based latent coding architecture that achieves better coding performance for a diverse set of contents. Additionally, Multi-Mixture distribution based entropy modeling of latent features and content-adaptive latent refinements in the encoder are proposed to bring further coding gains. The experimental results show that the approach can significantly improve the coding efficiency for screen content, with average bitrate savings of 36%.

Keywords: Training; Image coding; Codes; Bit rate; Encoding; Entropy; Convolutional neural networks

Rate-Distortion in Image Coding for Machines

Picture Coding Symposium, PCS

Authors: Alon Harell, Anderson De Andrade, Ivan V. Bajić (School of Engineering Science, Simon Fraser University, Burnaby, Canada)

ISBN: (print) 9781665492584

In recent years, there has been a sharp increase in transmission of images to remote servers specifically for the purpose of computer vision. In many applications, such as surveillance, images are mostly transmitted for automated analysis, and rarely seen by humans. Using traditional compression for this scenario has been shown to be inefficient in terms of bit-rate, likely due to the focus on human based distortion metrics. Thus, it is important to create specific image coding methods for joint use by humans and machines. One way to create the machine side of such a codec is to perform feature matching of some intermediate layer in a Deep Neural Network performing the machine task. In this work, we explore the effects of the layer choice used in training a learnable codec for humans and machines. We prove, using the data processing inequality, that matching features from deeper layers is preferable in the sense of rate-distortion. Next, we confirm our findings empirically by re-training an existing model for scalable human-machine coding. In our experiments we show the trade-off between the human and machine sides of such a scalable model, and discuss the benefit of using deeper layers for training in that regard.
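For reference, the data processing inequality named in the abstract can be stated in its standard form as follows. The symbols are assumptions chosen for illustration: $X$ is a source variable, $Y_1$ is computed from $X$, and $Y_2$ is computed from $Y_1$ alone, so the three form a Markov chain. How the paper applies this to the choice of layer is not given in the abstract.

```latex
X \to Y_1 \to Y_2
\quad\Longrightarrow\quad
I(X; Y_2) \le I(X; Y_1)
```

In words, further processing of $Y_1$ cannot increase the information it carries about $X$.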

Keywords: Training; Measurement; Image coding; Codecs; Surveillance; Neural networks; Rate-distortion

Human Action Recognition Based on Image Coding and CNN

IEEE Eurasia Conference on IOT, Communication and Engineering (ECICE)

Authors: Shigang Wang, Zhanglin Lai, Shuai Feng (School of Automation, Guangxi University of Science and Technology, Liuzhou, China)

ISBN: (print) 9781665482097

In human action recognition, action data collected through video or photos are easily affected by factors such as perspective and lighting, and features are difficult to describe and extract. To solve this problem, we studied human skeletal joint data together with a convolutional neural network (CNN). The joint data were converted into a PNG image by image coding. In addition, we proposed 3 data arrangement orders for grayscale image coding. Combined with 4 coding methods and RGB image coding, the coding scheme was expanded to 16 kinds, and a 9-layer CNN model was used to conduct comparative experiments on the 16 coding schemes. Then, the influence of data arrangement order and coding method was discussed based on the action recognition results. The experimental results show that the “Zhi” font coding method under data arrangement order Case 2 makes actions easier to classify, and the accuracy on the test set is 96%.
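One way to picture the joint-data-to-image step is a min-max mapping of joint coordinates to 8-bit gray levels, one row per frame. This is only a sketch under assumptions: the abstract does not give the actual arrangement orders or coding methods, and `encode_grayscale` and the sample frames are hypothetical. Writing the result to a PNG file is left out.

```python
def encode_grayscale(sequence):
    """Map joint coordinates to gray levels 0..255.

    sequence: list of frames, each a list of (x, y, z) joint coordinates.
    Returns one row per frame, with the joints' coordinates flattened in order.
    """
    flat = [[c for joint in frame for c in joint] for frame in sequence]
    lo = min(min(row) for row in flat)
    hi = max(max(row) for row in flat)
    span = (hi - lo) or 1.0  # avoid division by zero for constant input
    return [[round(255 * (v - lo) / span) for v in row] for row in flat]

# Two frames of a hypothetical two-joint skeleton.
frames = [
    [(0.0, 1.0, 2.0), (0.5, 1.5, 2.0)],
    [(0.1, 1.0, 1.9), (0.6, 1.4, 2.0)],
]
pixels = encode_grayscale(frames)
```

Changing the order in which joints and coordinates are flattened changes the image layout, which is the kind of arrangement choice the abstract compares.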

Keywords: Image coding; Image recognition; Gray-scale; Feature extraction; Encoding; Convolutional neural networks; Data mining

LEARNING SPARSE AUTO-ENCODERS FOR GREEN AI IMAGE CODING

arXiv, 2022

Authors: Gille, Cyprien; Guyard, Frédéric; Antonini, Marc; Barlaud, Michel (Université Côte d'Azur, I3S, CNRS, Sophia Antipolis, France; Orange Labs, Sophia Antipolis, France)

Recently, convolutional auto-encoders (CAE) were introduced for image coding. They achieved performance improvements over the state-of-the-art JPEG2000 method. However, these performances were obtained using massive CAEs featuring a large number of parameters and whose training required heavy computational power. In this paper, we address the problem of lossy image compression using a CAE with a small memory footprint and low computational power usage. In order to overcome the computational cost issue, the majority of the literature uses Lagrangian proximal regularization methods, which are time consuming themselves. In this work, we propose a constrained approach and a new structured sparse learning method. We design an algorithm and test it on three constraints: the classical $\ell_1$ constraint, the $\ell_{1,\infty}$ constraint, and the new $\ell_{1,1}$ constraint. Experimental results show that the $\ell_{1,1}$ constraint provides the best structured sparsity, resulting in a high reduction of memory and computational cost, with similar rate-distortion performance as with dense networks. Copyright © 2022, The Authors. All rights reserved.

Keywords: Image coding
