Multichannel synthetic aperture radar (MC-SAR) allows for high-resolution imaging of a wide swath (HRWS), at the cost of acquiring and downlinking a significantly larger amount of data, compared with conventional SAR systems. In this letter, we discuss the potential of efficient data volume reduction (DVR) for MC-SAR. Specifically, we focus on methods based on transform coding (TC) and linear predictive coding (LPC), which exploit the redundancy introduced in the raw data by the finer azimuth sampling peculiar to the MC system. The proposed approaches, in combination with a variable-bit quantization, allow for the optimization of the resulting performance and data rate. We consider three exemplary yet realistic MC-SAR systems, and we conduct simulations and analyses on synthetic SAR data considering different radar backscatter distributions, which demonstrate the effectiveness of the proposed methods.
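As an illustration of the prediction-plus-quantization idea discussed in this abstract, the sketch below applies a first-order closed-loop DPCM along azimuth with a uniform quantizer to synthetic complex raw data. The array layout, prediction order, and bit budget are assumptions for illustration only, not the letter's actual DVR pipeline.

```python
import numpy as np

def dpcm_quantize_azimuth(raw, bits):
    """First-order closed-loop DPCM along azimuth with a uniform quantizer.

    raw  : 2-D complex array (azimuth x range) of raw samples for one channel
    bits : per-sample bit budget of the (variable-bit) quantizer
    Returns the reconstruction the decoder would obtain.
    """
    levels = 2 ** bits
    step = 2.0 * np.abs(raw).max() / levels + 1e-12
    quantize = lambda x: (np.round(x.real / step) + 1j * np.round(x.imag / step)) * step

    rec = np.zeros_like(raw)
    prev = np.zeros(raw.shape[1], dtype=raw.dtype)   # reconstructed previous azimuth line
    for n in range(raw.shape[0]):
        residual = raw[n, :] - prev                  # predict from the reconstruction (closed loop)
        rec[n, :] = prev + quantize(residual)        # the decoder reproduces exactly this state
        prev = rec[n, :]
    return rec

# Synthetic Gaussian raw data as a stand-in for a radar backscatter distribution.
raw = np.random.randn(256, 128) + 1j * np.random.randn(256, 128)
rec = dpcm_quantize_azimuth(raw, bits=4)
snr = 10 * np.log10(np.mean(np.abs(raw) ** 2) / np.mean(np.abs(raw - rec) ** 2))
print(f"reconstruction SNR: {snr:.1f} dB")
```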
Author: Bajpai, Shrish (Integral Univ, Fac Engn & Informat Technol, Dept Elect & Commun Engn, Lucknow, Uttar Pradesh, India)
The hyperspectral image provides rich spectral information content, which facilitates multiple applications. With the rapid advancement of the spatial and spectral resolution of optical instruments, the image data size has increased manyfold. This calls for a compression algorithm with low coding complexity, low coding memory demand, and high coding efficiency. In recent years, many coding algorithms have been proposed. The wavelet transform-based set-partitioned hyperspectral compression algorithms have superior coding performance. These algorithms employ linked lists or state tables to track the significance/insignificance of the partitioned sets/coefficients. The proposed algorithm uses the pyramid hierarchy property of the wavelet transform: markers are used to track the significance/insignificance of each pyramid level. A single pyramid level contains many sets. An insignificant pyramid level with multiple sets is represented by a single bit in the proposed compression algorithm, whereas in 3D Set Partitioned Embedded bloCK (3D-SPECK) and 3D-Listless SPECK (3D-LSK) a single bit represents only a single insignificant set. As a result, the proposed algorithm requires fewer bits than other wavelet transform compression algorithms at the high bit planes. The simulation results show that the proposed compression algorithm has high coding efficiency with very low coding complexity and a moderate coding memory requirement. The reduced coding complexity improves the performance of the image sensor and lowers power consumption. Thus, the proposed compression algorithm has great potential in low-resource onboard hyperspectral imaging systems.
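The pyramid-level marker idea can be illustrated with a short sketch: for each bit plane, one marker bit records whether any coefficient in an entire pyramid level is significant, so an insignificant level containing many sets still costs a single bit. The array layout and bit-plane loop below are assumptions for illustration, not the paper's exact codec.

```python
import numpy as np

def pyramid_level_markers(coeff_levels, max_bitplane):
    """One significance marker bit per wavelet pyramid level and bit plane.

    coeff_levels : list of arrays, one per pyramid level (all subbands of
                   that level flattened together)
    Returns a list ordered from the highest bit plane down; each entry holds
    one 0/1 marker per level (0 = every set in the level is insignificant at
    that threshold and the whole level is skipped with this single bit).
    """
    markers = []
    for n in range(max_bitplane, -1, -1):
        threshold = 1 << n
        markers.append([int(np.any(np.abs(lvl) >= threshold)) for lvl in coeff_levels])
    return markers
```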
Visual data coding is an enabling technology for various applications and is now ubiquitously adopted in modern image processing, communications, and computer vision systems. To enable interoperability between devices manufactured and services provided by different enterprises, a series of standards targeting visual data coding have been crafted in the past three decades. Several standardization organizations, such as ISO/IEC JTC 1/SC 29 (comprising the Joint Photographic Experts Group (JPEG) and the Moving Picture Experts Group (MPEG)), the ITU-T SG 16 Video Coding Experts Group (VCEG), the IEEE Data Compression Standards Committee Audio Video Coding Working Group (1857 WG), and the MPAI Community, have been creating these standards from many contributions of academia and industry. While most of these visual coding standards have been successfully deployed in many applications, there are more challenges nowadays, especially to accommodate the large volume of visual data with limited storage and limited-bandwidth transmission links. Compression efficiency improvements are still needed, especially considering emerging data representation formats ranging from 8K/HDR image/video to rich plenoptic data.
In this paper we propose novel extensions to JPEG 2000 for the coding of discontinuous media, which includes piecewise smooth imagery such as depth maps and optical flows. These extensions use breakpoints to model discontinuity boundary geometry and apply a breakpoint-dependent Discrete Wavelet Transform (BP-DWT) to the input imagery. The highly scalable and accessible coding features provided by the JPEG 2000 compression framework are preserved by our proposed extensions, with the breakpoint and transform components encoded as independent bit streams that can be progressively decoded. Comparative rate-distortion results are provided along with corresponding visual examples, which highlight the advantages of using breakpoint representations with the accompanying BP-DWT and embedded bit-plane coding. Recently our proposed extensions have been adopted and are in the process of being published as a new Part 17 of the JPEG 2000 family of coding standards.
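A minimal 1-D sketch of the breakpoint-dependent idea, using a Haar-style lifting step as a stand-in for the actual BP-DWT: where a breakpoint separates a sample pair, prediction and update are skipped so the transform does not mix values across the discontinuity. The interface and the Haar kernel are assumptions; Part 17 uses a more general construction.

```python
import numpy as np

def bp_haar_analysis(x, bp):
    """Breakpoint-gated Haar lifting analysis of a 1-D signal.

    x  : 1-D signal (even length assumed here for brevity)
    bp : boolean array, bp[i] is True if a discontinuity lies between
         samples x[2*i] and x[2*i + 1]
    """
    even, odd = x[0::2].astype(float), x[1::2].astype(float)
    detail = np.where(bp, odd, odd - even)             # skip prediction across a breakpoint
    approx = np.where(bp, even, even + 0.5 * detail)   # update step, gated the same way
    return approx, detail

def bp_haar_synthesis(approx, detail, bp):
    """Exact inverse of the gated lifting steps above."""
    even = np.where(bp, approx, approx - 0.5 * detail)
    odd = np.where(bp, detail, detail + even)
    return np.stack([even, odd], axis=1).ravel()
```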
Light fields are one of the emerging 3D representation formats with an effective potential to offer very realistic and immersive visual experiences. This capability comes at the cost of a very large amount of acquired data, whose practical use requires efficient coding solutions. This need was already addressed by the JPEG Pleno Light Field coding standard for static light fields, which specifies two coding modes, named 4D-transform and 4D-Prediction. While the first offers better compression performance for smaller-baseline light fields, the second excels for larger-baseline light fields. This paper proposes a novel light field coding mode, the Slanted 4D-transform coding mode, which extends the 4D-transform coding mode based on the conventional 4D-DCT to offer better compression performance than both available JPEG Pleno coding modes, independently of the baseline. The key idea is to first apply to each 4D block in the light field an adaptive, hierarchical geometric transformation, which makes the data in the block more energy-compaction friendly for the following 4D-DCT. The rate-distortion performance results show that the proposed Slanted 4D-transform codec outperforms both already standardized JPEG Pleno coding modes, with BD-Rate gains of 31.03% and 28.30% over the 4D-transform and 4D-Prediction modes, respectively, thus implying that a single coding mode can efficiently code all types of light fields.
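The core mechanism, a geometric "slant" of each sub-aperture slice followed by a separable 4-D DCT, can be sketched as below. A crude integer shear parameterized by a single disparity slope per angular axis stands in for the paper's adaptive hierarchical geometric transformation; block layout and names are assumptions.

```python
import numpy as np
from scipy.fft import dctn

def slanted_4d_dct(block, slope_u, slope_v):
    """'Slanted' 4-D DCT of a light-field block indexed as (u, v, s, t).

    Each (s, t) sub-aperture slice is shifted by an integer disparity
    proportional to its angular offset before the separable 4-D DCT, which
    makes the block more energy-compaction friendly when the content follows
    that disparity slope.
    """
    U, V, _, _ = block.shape
    slanted = np.empty(block.shape, dtype=float)
    for u in range(U):
        for v in range(V):
            du = int(round(slope_u * (u - U // 2)))
            dv = int(round(slope_v * (v - V // 2)))
            slanted[u, v] = np.roll(block[u, v], shift=(du, dv), axis=(0, 1))
    return dctn(slanted, norm="ortho")   # separable DCT over all four axes

# Example: a 13 x 13 x 16 x 16 block with a disparity slope of ~1 pixel/view.
block = np.random.rand(13, 13, 16, 16)
coeffs = slanted_4d_dct(block, slope_u=1.0, slope_v=1.0)
```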
Multi-view video (MVV) data processed by three-dimensional (3D) video systems often suffer from compression artifacts, which can degrade the rendering quality of 3D spaces. In this paper, we focus on the task of artifact reduction in multi-view video compression using spatial and temporal motion priors. Previous MVV quality enhancement networks using a warping-and-fusion approach employed reference-to-target motion priors to exploit inter-view and temporal correlation among MVV frames. However, these motion priors were sensitive to quantization noise, and the warping accuracy degraded when the target frame used low-quality features in the correspondence search. To overcome these limitations, we propose a novel approach that utilizes bilateral spatial and temporal motion priors, leveraging the geometric relations of a structured MVV camera system, to exploit motion coherency. Our method involves a multi-view prior generation module that produces both unidirectional and bilateral warping vectors to exploit rich features in adjacent reference MVV frames and generate robust warping features. These features are further refined to account for unreliable alignments across MVV frames caused by occlusions. The performance of the proposed method is evaluated in comparison with state-of-the-art MVV quality enhancement networks, and a synthetic MVV dataset is used to train our network to produce the various motion priors. Experimental results demonstrate that the proposed method significantly improves the quality of the reconstructed MVV frames in recent video coding standards such as the multi-view extension of High Efficiency Video Coding and the MPEG immersive video standard.
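The bilateral warping step can be illustrated without the networks: each reference view is sampled at positions displaced by its motion/disparity field toward the target, and the two warped views are blended, giving the aligned features that the fusion and refinement stages then process. Nearest-neighbour sampling and the array shapes below are assumptions for brevity.

```python
import numpy as np

def bilateral_warp(ref_left, ref_right, mv_left, mv_right):
    """Warp two reference views toward the target with per-pixel motion
    vectors and blend them.

    ref_* : (H, W) or (H, W, C) reference frames or feature maps
    mv_*  : (H, W, 2) arrays of (dy, dx) offsets from target to reference
    """
    H, W = ref_left.shape[:2]
    ys, xs = np.mgrid[0:H, 0:W]

    def warp(ref, mv):
        y = np.clip(ys + np.round(mv[..., 0]).astype(int), 0, H - 1)
        x = np.clip(xs + np.round(mv[..., 1]).astype(int), 0, W - 1)
        return ref[y, x]

    return 0.5 * (warp(ref_left, mv_left) + warp(ref_right, mv_right))
```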
The Moving Picture Experts Group (MPEG) is responsible for standardizing MPEG immersive video (MIV) for immersive video coding and is involved in research and development focusing on providing six degrees of freedom through a reference software known as the test model for immersive video. To efficiently compress and transmit multiview videos with texture and depth pairings, the encoder part of the MIV codec framework reduces the pixel rate by removing redundancy between views and densely packing the remaining regions into an atlas as patches. The decoder part reconstructs multiview videos from the transmitted atlas to synthesize and render arbitrary viewports, and the depth information has a significant impact on the quality of the rendered viewport. However, the existing method of handling depth values in the MIV codec fails to adequately address the information loss that occurs during quantization or transmission. To preserve and transmit depth information more accurately, we propose a method for expanding the depth dynamic range using min-max linear scaling on a patch-by-patch basis. In addition, we efficiently encode the per-patch minimum and maximum values of depth required by the decoder to recover the original depth values and include them in the metadata. The experimental results indicate that for computer-generated sequences, the proposed method provides PSNR-based Bjontegaard delta-rate gains of 9.1% and 3.3% in the end-to-end performance for high- and low-bitrate cases, respectively. In addition, subjective quality improvements are observed by reducing the artifacts that primarily occur at the object boundaries in the rendered viewport.
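A sketch of the per-patch min-max depth range expansion and its inverse follows; the parameter names and the 10-bit target are assumptions, while the (d_min, d_max) pair is what the per-patch metadata would carry.

```python
import numpy as np

def expand_patch_depth(depth_patch, bit_depth=10):
    """Stretch one patch's depth values over the full quantizer range.
    Returns the rescaled patch and the (d_min, d_max) pair to signal as
    per-patch metadata so the decoder can invert the mapping."""
    d_min, d_max = float(depth_patch.min()), float(depth_patch.max())
    scale = (2 ** bit_depth - 1) / max(d_max - d_min, 1e-9)
    scaled = np.round((depth_patch - d_min) * scale).astype(np.uint16)
    return scaled, (d_min, d_max)

def restore_patch_depth(scaled, d_min, d_max, bit_depth=10):
    """Decoder-side inverse of the per-patch min-max scaling."""
    scale = (2 ** bit_depth - 1) / max(d_max - d_min, 1e-9)
    return scaled.astype(float) / scale + d_min
```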
This letter proposes a fast dual-layer lossless coding for high dynamic range images (HDRIs) in the Radiance format. The coding, which consists of a base layer and a lossless enhancement layer, provides a standard dynamic range image (SDRI) without requiring an additional algorithm at the decoder and can losslessly decode the HDRI by adding the residual signals (residuals) between the HDRI and the SDRI to the SDRI, if desired. To suppress the dynamic range of the residuals in the enhancement layer, the coding directly uses the mantissa and exponent information from the Radiance format. To further reduce the residual energy, each mantissa is modeled (estimated) as a linear function, i.e., a simple linear regression, of the encoded-decoded SDRI in each region with the same exponent. This is called the simple linear regressive mantissa estimator. Experimental results show that, compared with existing methods, our coding reduces the average bitrate by approximately 1.57-6.68% and significantly reduces the average encoder implementation time by approximately 87.13-98.96%.
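A sketch of the simple linear regressive mantissa estimator described above: within each region sharing the same Radiance exponent, the mantissa is predicted as a rounded affine function of the decoded SDRI, and only the integer residuals go to the enhancement layer. The function name and fitting call are assumptions, not the letter's implementation.

```python
import numpy as np

def mantissa_residuals(mantissa, exponent, sdr):
    """Per-exponent-region linear prediction of the Radiance mantissa from the
    encoded-decoded SDRI; returns integer residuals for the lossless
    enhancement layer plus the (slope, intercept) pair per exponent value."""
    residual = np.zeros(mantissa.shape, dtype=np.int32)
    params = {}
    for e in np.unique(exponent):
        mask = exponent == e
        x, y = sdr[mask].astype(float), mantissa[mask].astype(float)
        if x.size > 1 and x.max() > x.min():
            a, b = np.polyfit(x, y, 1)        # simple linear regression
        else:
            a, b = 0.0, float(y.mean())       # degenerate region: constant predictor
        params[int(e)] = (a, b)
        pred = np.round(a * x + b)            # integer prediction, reproducible at the decoder
        residual[mask] = (y - pred).astype(np.int32)
    return residual, params
```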
This paper presents a method to effectively compress the intermediate-layer feature map of a convolutional neural network for potential structures of Video Coding for Machines, an emerging technology for future machine consumption applications. Notably, most extant studies compress a single feature map and hence cannot fully consider both global and local information within the feature map, which makes it difficult to maintain performance in machine consumption tasks that analyze objects of various sizes in images/videos. To address this problem, a multiscale feature map compression method is proposed that consists of two major processes: receptive-block-based principal component analysis (RPCA) and uniform integer quantization. The RPCA derives the complete basis kernels of a feature map by selecting a set of major basis kernels that can represent a sufficient percentage of the global or local information according to the variable-size receptive blocks of each feature map. After transforming each feature map using the set of major basis kernels, a uniform integer quantizer converts the 32-bit floating-point values of the set of major basis kernels, the corresponding RPCA coefficients, and a mean vector to five-bit integer representation values. Experimental results reveal that the proposed method reduces the amount of feature map data by 99.30% with a loss of 8.30% in the average precision (AP) on the OpenImageV6 dataset and 0.77% in AP(M) and 0.47% in AP(L) on the MS COCO 2017 validation set, while outperforming previous PCA-based feature map compression methods even at higher compression rates.
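The two stages, receptive-block PCA and uniform integer quantization, can be sketched for a single feature map as follows. The block size, energy threshold, and helper names are assumptions; the paper selects basis kernels per variable-size receptive block rather than a fixed tiling.

```python
import numpy as np

def rpca_compress(fmap, block=8, energy=0.95, bits=5):
    """Receptive-block PCA plus uniform integer quantization of one feature map.

    fmap : (C, H, W) feature map, with H and W assumed divisible by `block`
    Keeps the principal components covering `energy` of the variance and maps
    basis kernels, coefficients, and the mean vector to `bits`-bit integers.
    """
    C, H, W = fmap.shape
    # Tile the map into block x block receptive blocks, one row per block.
    blocks = (fmap.reshape(C, H // block, block, W // block, block)
                  .transpose(0, 1, 3, 2, 4)
                  .reshape(-1, block * block))
    mean = blocks.mean(axis=0)
    centered = blocks - mean

    # PCA via SVD; keep the smallest k reaching the target energy fraction.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    k = int(np.searchsorted(np.cumsum(s ** 2) / np.sum(s ** 2), energy)) + 1
    basis = vt[:k]                # major basis kernels
    coeff = centered @ basis.T    # RPCA coefficients

    def quantize(x):              # uniform integer quantizer to `bits` bits
        lo, step = x.min(), (x.max() - x.min()) / (2 ** bits - 1) + 1e-12
        return np.round((x - lo) / step).astype(np.uint8), lo, step

    return quantize(basis), quantize(coeff), quantize(mean)
```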
ISBN (print): 9781450384636
The Discrete Tchebichef Transform (DTT) is a transform method based on discrete orthogonal Tchebichef polynomials, which have found applications in image compression and video coding. Our method constructs all DTT-related discrete orthogonal transforms at the required sizes (corresponding to the coding unit sizes supported by H.266/VVC). To exploit the properties of the Tchebichef polynomials, we make use of a novel discrete orthogonal matrix generation method based on determined DTT roots, and we scale and round the DTT depending on the quantization parameter instead of using an integer approximation, so that an accurate integer DTT matrix is obtained. Experimental results show that this method can improve video quality while requiring a lower bit rate.
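A small sketch of how an integer DTT matrix can be produced: the orthonormal discrete Tchebichef (Gram) polynomial basis on {0, ..., N-1} is obtained here by QR-orthonormalizing the Vandermonde basis and then scaling and rounding to integers. The QR construction and the fixed scaling factor are stand-ins for the paper's root-based generation and QP-dependent scaling, and the Vandermonde conditioning limits this sketch to small N.

```python
import numpy as np

def integer_dtt_matrix(N, scale_bits=6):
    """Illustrative N x N integer DTT matrix.

    The orthonormal discrete Tchebichef (Gram) polynomials on {0, ..., N-1}
    are obtained by QR-orthonormalizing the monomial (Vandermonde) basis,
    then scaled by 2**scale_bits and rounded to integers.
    """
    x = np.arange(N, dtype=float)
    vander = np.vander(x, N, increasing=True)     # columns 1, x, x^2, ...
    q, r = np.linalg.qr(vander)
    q *= np.sign(np.diag(r))                      # fix signs so leading values are positive
    dtt = q.T                                     # rows are the orthonormal basis vectors
    return np.round(dtt * (1 << scale_bits)).astype(int)

print(integer_dtt_matrix(8))
```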