检索结果-内蒙古大学图书馆

Dual Link image coding Based on CCSDS-123

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 2022年 19卷 1页

作者： Bartrina-Rapesta, Joan Auli-Llinas, Francesc Univ Autonoma Barcelona Dept Informat & Commun Engn E-08193 Bellaterra Spain

Predictive coding techniques are attractive for image codecs because they can yield high compression efficiency while spending few computational resources. In remote sensing, predictive techniques are employed in prominent standards to transmit images captured by Earth Observation (EO) satellites. Although EO satellites have full duplex capacity, compression standards for spatial data are devised to use the downlink only. Recently, we presented a dual-link image coding system that employs both the uplink and the downlink to accelerate the transmission of such images. The proposed system was introduced in the wavelet-based JPEG2000 standard, which is not well-suited for satellites due to its complexity. This letter approaches the dual-link scheme to a more suitable standard for spatial data based on predictive coding, more precisely, the Lossless Multispectral and Hyperspectral image compression standard CCSDS-123.0-B.2. The proposed method adapts the dual-link image coding scheme to CCSDS-123.0-B-2 by incorporating a quantizer, a lightweight arithmetic coder, and a rate control technique. Experimental results suggest that the resulting system achieves higher coding ratios than CCSDS-123.0-B-2 and JPEG2000 with dual link.

关键词： image coding Satellites Entropy Encoding Transform coding Standards Redundancy CCSDS-123 0-B-2 dual link image coding remote sensing images

来源：评论

学校读者我要写书评

暂无评论

Prompt-ICM: A Unified Framework towards image coding for Machines with Task-driven Prompts

arXiv

引用

arXiv 2023年

作者： Feng, Ruoyu Liu, Jinming Jin, Xin Pan, Xiaohan Sun, Heming Chen, Zhibo University of Science and Technology of China China Waseda University Japan Eastern Institute of Advanced Study China

image coding for machines (ICM) aims to compress images to support downstream AI analysis instead of human perception. For ICM, developing a unified codec to reduce information redundancy while empowering the compressed features to support various vision tasks is very important, which inevitably faces two core challenges: 1) How should the compression strategy be adjusted based on the downstream tasks? 2) How to well adapt the compressed features to different downstream tasks? Inspired by recent advances in transferring large-scale pre-trained models to downstream tasks via prompting, in this work, we explore a new ICM framework, termed Prompt-ICM. To address both challenges by carefully learning task-driven prompts to coordinate well the compression process and downstream analysis. Specifically, our method is composed of two core designs: a) compression prompts, which are implemented as importance maps predicted by an information selector, and used to achieve different content-weighted bit allocations during compression according to different downstream tasks;b) task-adaptive prompts, which are instantiated as a few learnable parameters specifically for tuning compressed features for the specific intelligent task. Extensive experiments demonstrate that with a single feature codec and a few extra parameters, our proposed framework could efficiently support different kinds of intelligent tasks with much higher coding efficiency. © 2023, CC BY.

关键词： image coding

来源：评论

学校读者我要写书评

暂无评论

VVC+M: PLUG AND PLAY SCALABLE image coding FOR HUMANS AND MACHINES

arXiv

引用

arXiv 2023年

作者： Harell, Alon Foroutan, Yalda Bajić, Ivan V. School of Engineering Science Simon Fraser University BurnabyBC Canada

Compression for machines is an emerging field, where inputs are encoded while optimizing the performance of downstream automated analysis. In scalable coding for humans and machines, the compressed representation used for machines is further utilized to enable input reconstruction. Often performed by jointly optimizing the compression scheme for both machine task and human perception, this results in suboptimal rate-distortion (RD) performance for the machine side. We focus on the case of images, proposing to utilize the pre-existing residual coding capabilities of video codecs such as VVC to create a scalable codec from any image compression for machines (ICM) scheme. Using our approach we improve an existing scalable codec to achieve superior RD performance on the machine task, while remaining competitive for human perception. Moreover, our approach can be trained post-hoc for any given ICM scheme, and without creating a coupling between the quality of the machine analysis and human vision. Copyright © 2023, The Authors. All rights reserved.

关键词： image coding

来源：评论

学校读者我要写书评

暂无评论

Learned Disentangled Latent Representations for Scalable image coding for Humans and Machines

Learned Disentangled Latent Representations for Scalable Ima...

引用

Data Compression Conference (DCC)

作者： Ezgi Özyılkan Mateen Ulhaq Hyomin Choi Fabien Racapé Dept. of Electrical and Computer Engineering New York University InterDigital - Emerging Technologies Lab School of Engineering Science Simon Fraser University

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily supporting input reconstruction. In this work, we propose a learned compression architecture that can be used to build such a codec. We introduce a novel variational formulation that explicitly takes feature data relevant to the desired inference task as input at the encoder side. As such, our learned scalable image codec encodes and transmits two disentangled latent representations for object detection and input reconstruction. We note that compared to relevant benchmarks, our proposed scheme yields a more compact latent representation that is specialized for the inference task. Our experiments show that our proposed system achieves a bit rate savings of 40.6% on the primary object detection task compared to the current state-of-the-art, albeit with some degradation in performance for the secondary input reconstruction task.

关键词： Visualization Codecs image coding Scalability Redundancy Object detection Transforms

来源：评论

学校读者我要写书评

暂无评论

Saliency-Driven Hierarchical Learned image coding for Machines

Saliency-Driven Hierarchical Learned Image Coding for Machin...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Kristian Fischer Fabian Brand Christian Blum André Kaup Multimedia Communications and Signal Processing Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)

We propose to employ a saliency-driven hierarchical neural image compression network for a machine-to-machine communication scenario following the compress-then-analyze paradigm. By that, different areas of the image are coded at different qualities depending on whether salient objects are located in the corresponding area. Areas without saliency are transmitted in latent spaces of lower spatial resolution in order to reduce the bitrate. The saliency information is explicitly derived from the detections of an object detection network. Furthermore, we propose to add saliency information to the training process in order to further specialize the different latent spaces. All in all, our hierarchical model with all proposed optimizations achieves 77.1 % bitrate savings over the latest video coding standard VVC on the Cityscapes dataset and with Mask R-CNN as analysis network at the decoder side. Thereby, it also outperforms traditional, non-hierarchical compression networks.

关键词： Training Video coding image coding Bit rate Neural networks Signal processing Speech processing

来源：评论

学校读者我要写书评

暂无评论

VISUAL ANALYSIS MOTIVATED RATE-DISTORTION MODEL FOR image coding

VISUAL ANALYSIS MOTIVATED RATE-DISTORTION MODEL FOR IMAGE CO...

引用

2021 IEEE International Conference on Multimedia and Expo, ICME 2021

作者： Huang, Zhimeng Jia, Chuanmin Wang, Shanshe Ma, Siwei Institute of Digital Media Peking University Beijing China Information Technology R&D Innovation Center Peking University Shaoxing China

ISBN: (纸本)9781665438643

Optimized for pixel fidelity metrics, images compressed by existing image codec are facing systematic challenges when used for visual analysis tasks, especially under low-bitrate coding. This paper proposes a visual analysis-motivated rate-distortion model for Versatile Video coding (VVC) intra compression. The proposed model has two major contributions, a novel rate allocation strategy and a new distortion measurement model. We first propose the region of interest for machine (ROIM) to evaluate the degree of importance for each coding tree unit (CTU) in visual analysis. Then, a novel CTU-level bit allocation model is proposed based on ROIM and the local texture characteristics of each CTU. After an in-depth analysis of multiple distortion models, a visual analysis friendly distortion criteria is subsequently proposed by extracting deep feature of each coding unit (CU). To alleviate the problem of lacking spatial context information when calculating the distortion of each CU, we finally propose a multi-scale feature distortion (MSFD) metric using different neighboring pixels by weighting the extracted deep features in each scale. Extensive experimental results show that the proposed scheme could achieve up to 28.17% bitrate saving under the same analysis performance among several typical visual analysis tasks such as image classification, object detection, and semantic segmentation. © 2021 IEEE

关键词： image coding

来源：评论

学校读者我要写书评

暂无评论

Predictive side decoding for human-centered multiple description image coding

引用

EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING 2020年第1期2020卷 1-14页

作者： Xu, Yuanyuan Hohai Univ Coll Comp & Informat 8 Fo Cheng Rd Nanjing Peoples R China

Multiple description coding (MDC) provides a favorable solution for human-centered image communication, which takes into account people's varying watching situations as well as people's demand for real-time image display. As an effective technique for MDC, three-description lattice vector quantization (3D-LVQ) is considered for image coding in this paper. Based on intra- and inter-correlation in the 3D-LVQ index assignment as well as wavelet intra-subband correlation, a novel predictive decoding method for 3D-LVQ-based image coding is proposed to enhance side decoding performance, which attempts to predict lost descriptions (sublattice points) in a good way for better reconstructions of wavelet vectors (fine lattice points) in the side decoding. Experimental results validate effectiveness of the proposed decoding scheme in terms of rate-distortion performance.

关键词： image coding Human-centered computing Multiple description coding Lattice vector quantization

来源：评论

学校读者我要写书评

暂无评论

Compressive Sensing Multi-Layer Residual Coefficients for image coding

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2020年第4期30卷 1109-1120页

作者： Chen, Zan Hou, Xingsong Shao, Ling Gong, Chen Qian, Xueming Huang, Yuan Wang, Shidong Xi An Jiao Tong Univ Sch Elect & Informat Engn Xian 710049 Peoples R China Incept Inst Artificial Intelligence Abu Dhabi U Arab Emirates Univ Sci & Technol China Sch Elect & Informat Engn Hefei 230026 Peoples R China Guangdong Xian Jiaotong Univ Acad Foshan 528300 Guangdong Peoples R China Univ East Anglia Sch Comp Sci Norwich NR4 7TJ Norfolk England

Compressive sensing (CS)-based image coding scheme has been enthusiastically studied, but it still has a poor rate-distortion performance compared with the traditional image coding techniques. In this paper, we propose a CS multi-layer residual coding scheme to rectify this problem to a certain extent. By dividing CS measurements into multi-layers and predicting a particular layer's measurements with all its preceding layers' measurements, we can transform CS measurements into multi-layer residual coefficients, which are easier to compress. By calculating the residual between the quantized ground-truth CS measurements and their corresponding quantized inference measurements and using Huffman coding to associate each residual quantization index with a binary code, we can reduce the redundancies among CS measurements efficiently. Besides, the prediction and quantization process is designed to be layer-independent, which can save much of the encoding time. The proposed approach introduces a novel framework for using CS in the compression domain. The experimental results show that the proposed scheme can significantly outperform JPEG2000 and approach or reach the performance of HEVC-Intra on some test images.

关键词： image coding image reconstruction Quantization (signal) Rate-distortion Transform coding Decoding Compressed sensing image coding compressive sensing multi-layer residual coefficients approximate message passing

来源：评论

学校读者我要写书评

暂无评论

Deep-Learning-Based Lossless image coding

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2020年第7期30卷 1829-1842页

作者： Schiopu, Ionut Munteanu, Adrian Vrije Univ Brussel Dept Elect & Informat ETRO B-1050 Brussels Belgium

This paper proposes a novel approach for lossless image compression. The proposed coding approach employs a deep-learning-based method to compute the prediction for each pixel, and a context-tree-based bit-plane codec to encode the prediction errors. First, a novel deep learning-based predictor is proposed to estimate the residuals produced by traditional prediction methods. It is shown that the use of a deep-learning paradigm substantially boosts the prediction accuracy compared with the traditional prediction methods. Second, the prediction error is modeled by a context modeling method and encoded using a novel context-tree-based bit-plane codec. Codec profiles performing either one or two coding passes are proposed, trading off complexity for compression performance. The experimental evaluation is carried out on three different types of data: photographic images, lenslet images, and video sequences. The experimental results show that the proposed lossless coding approach systematically and substantially outperforms the state-of-the-art methods for each type of data.

关键词： image coding Cameras Context modeling Tools Codecs Prediction methods Standards Machine learning image coding context modeling

来源：评论

学校读者我要写书评

暂无评论

HYBRID MODEL-BASED/DATA-DRIVEN GRAPH TRANSFORM FOR image coding

arXiv

引用

arXiv 2022年

作者： Bagheri, Saghar Do, Tam Thuc Cheung, Gene Ortega, Antonio York University Toronto Canada University of Southern California CA United States

Transform coding to sparsify signal representations remains crucial in an image compression pipeline. While the Karhunen-Loève transform (KLT) computed from an empirical covariance matrix C¯ is theoretically optimal for a stationary process, in practice, collecting sufficient statistics from a non-stationary image to reliably estimate C¯ can be difficult. In this paper, to encode an intra-prediction residual block, we pursue a hybrid model-based/data-driven approach: the first K eigenvectors of a transform matrix are derived from a statistical model, e.g., the asymmetric discrete sine transform (ADST), for stability, while the remaining N − K are computed from C¯ for performance. The transform computation is posed as a graph learning problem, where we seek a graph Laplacian matrix minimizing a graphical lasso objective inside a convex cone sharing the first K eigenvectors in a Hilbert space of real symmetric matrices. We efficiently solve the problem via augmented Lagrangian relaxation and proximal gradient (PG). Using WebP as a baseline image codec, experimental results show that our hybrid graph transform achieved better energy compaction than default discrete cosine transform (DCT) and better stability than KLT. © 2022, CC BY.

关键词： image coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：