Search results - Inner Mongolia University Library

IEEE Annual Joint Conference: INFOCOM, IEEE Computer and Communications Societies

Authors: Yixuan Guan Xuefeng Liu Jianwei Niu Tao Ren State Key Laboratory of Virtual Reality Technology and Systems School of Computer Science and Engineering Beihang University Beijing China Zhongguancun Laboratory Beijing China Zhengzhou University Research Institute of Industrial Technology School of Information Engineering Zhengzhou University Zhengzhou China State Key Laboratory of Intelligent Game Institute of Software Chinese Academy of Sciences Beijing China

ISBN: (digital) 9798350383508

ISBN: (print) 9798350383515

Federated learning (FL) enables distributed training via periodically synchronizing model updates among participants. Communication overhead becomes a dominant constraint of FL since participating clients usually suffer from limited bandwidth. To tackle this issue, top-k based gradient compression techniques are broadly explored in FL context, manifesting powerful capabilities in reducing gradient volumes via picking significant entries. However, previous studies are primarily conducted on the raw gradients where massive spatial redundancies exist and positions of non-zero (top-k) entries vary greatly between gradients, which both impede the achievement of deeper compressions. Top-k may also degrade the performance of trained models due to biased gradient estimations. Targeting the above issues, we propose FedTC, a novel transform coding based compression framework. FedTC transforms gradients into a new domain with more compact energy distributions, which facilitates reducing spatial redundancies and biases in subsequent sparsification. Furthermore, non-zero entries across clients from different rounds become highly aligned in the transform domain, motivating us to partition the gradients into smaller entry blocks with various alignment levels to better exploit these alignments. Lastly, positions and values of non-zero entries are independently compressed in a block-wise manner with our customized designs, through which a higher compression ratio is achieved. Theoretical analysis and extensive experiments consistently demonstrate the effectiveness of our approach.
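The energy-compaction argument in the abstract above can be illustrated with a small sketch. This is not FedTC itself: the transform (an orthonormal DCT-II), the toy "gradient", and the helper names `dct_matrix` and `top_k` are all assumptions chosen for illustration. It only compares top-k sparsification applied to a raw vector against top-k applied after a transform.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; rows are the basis vectors."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)
    return c

def top_k(v, k):
    """Keep the k largest-magnitude entries of v and zero the rest."""
    out = np.zeros_like(v)
    idx = np.argsort(np.abs(v))[-k:]
    out[idx] = v[idx]
    return out

rng = np.random.default_rng(0)
n, k = 256, 16

# Toy stand-in for a gradient (assumption): spatially correlated signal plus noise.
t = np.linspace(0.0, 1.0, n)
grad = (np.sin(2 * np.pi * 3 * t)
        + 0.5 * np.cos(2 * np.pi * 7 * t)
        + 0.05 * rng.standard_normal(n))

C = dct_matrix(n)
raw_rec = top_k(grad, k)               # sparsify in the original domain
dct_rec = C.T @ top_k(C @ grad, k)     # sparsify in the transform domain, then invert

err_raw = np.linalg.norm(grad - raw_rec) / np.linalg.norm(grad)
err_dct = np.linalg.norm(grad - dct_rec) / np.linalg.norm(grad)
print(f"relative error, top-k on raw vector: {err_raw:.3f}")
print(f"relative error, top-k after DCT:     {err_dct:.3f}")
```

With the same budget of k retained entries, the transform-domain version loses far less of this correlated toy signal. Real gradients may behave differently, so the numbers here say nothing about the paper's results.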

Keywords: Training Quantization (signal) Costs Federated learning Redundancy Transform coding Estimation

Analysis of the Transform Coding Module in the Post-VVC Standard

Artificial Intelligence & Green Energy (ICAIGE), IEEE International Conference on

Authors: Sonda Ben Jdidia Fatma Belghith Ibtissem Wali Nouri Masmoudi Laboratory of Electronics and Information Technologies National Engineering School of Sfax University of Sfax Sfax Tunisia

ISBN: (digital) 9798350389838

ISBN: (print) 9798350389845

The Enhanced Compression Model (ECM) serves as the software foundation for future video coding exploration, extending beyond the capabilities of the current Versatile Video Coding (VVC) standard. This paper conducts statistical analyses on ECM encoded videos, focusing particularly on 1D and 2D transformation types, as well as intra and inter prediction modes across videos from different classes with distinct resolutions. These analyses are performed at the decoder level, where the coding decisions have already been made by the encoder. Results reveal that the selection of transformation type and size, as well as prediction mode (intra or inter), depends on video characteristics such as motion and texture. This study represents a significant advancement in the development of intelligent algorithms based on video characteristics to expedite decision-making in the ECM encoding process.

Keywords: Video coding Quantization (signal) Statistical analysis Video sequences Transform coding Transforms Prediction algorithms Encoding Software Standards

Rounding of Improved DCT Transform Coding for H.266/VVC

13th International Conference on Digital Image Processing (ICDIP)

Authors: Chan, Ka-Hou Im, Sio-Kei Macao Polytech Inst Macau Peoples R China Macao Polytech Inst Sch Appl Sci Macau Peoples R China

ISBN: (digital) 9781510646018

ISBN: (print) 9781510646018

Many video encoders use DCT transform coding to compress the encoded video. For hardware implementation, the DCT is approximated by an integer matrix, which may introduce deviations; these deviations accumulate and become noticeable in larger coding units. Our method constructs all DCT-related discrete orthogonal transforms in the required sizes (corresponding to the coding units supported by H.266/VVC). By using a novel discrete orthogonal matrix generation method with determined DCT-II roots, and by scaling and rounding a regular DCT that depends on the quantization parameter instead of using an integer approximation, we obtain an accurate integer DCT matrix. Experimental results show that this method not only improves video quality but also requires a lower bit rate.
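The scale-and-round step mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the scale is a plain power of two chosen here for convenience, whereas the paper ties the scaling to the quantization parameter and uses its own matrix generation method, neither of which is reproduced. The helper names are hypothetical.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; rows are the basis vectors."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)
    return c

def integer_dct(n, scale):
    """Scale the real-valued DCT-II matrix and round every entry to an integer."""
    return np.rint(dct_matrix(n) * scale).astype(np.int64)

def orthogonality_error(m, scale):
    """Largest deviation of (M M^T) / scale^2 from the identity matrix."""
    g = (m @ m.T) / float(scale) ** 2
    return float(np.max(np.abs(g - np.eye(m.shape[0]))))

# Compare a coarse and a fine scale (both values are assumptions) per block size.
for n in (4, 8, 16, 32):
    for shift in (6, 10):
        scale = 2 ** shift
        m = integer_dct(n, scale)
        print(f"N={n:2d} scale=2^{shift:<2d} "
              f"orthogonality error={orthogonality_error(m, scale):.5f}")
```

A larger scale leaves less rounding error in each entry, so the integer matrix stays closer to orthogonal, at the cost of wider intermediate values in a hardware datapath.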

Keywords: DCT transform coding H.266/VVC Discrete Orthogonal Matrix

Multi-Rate Adaptive Transform Coding for Video Compression

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Lyndon R. Duong Bohan Li Cheng Chen Jingning Han Center for Neural Science NYU New York NY USA Open Codecs Google LLC Mountain View CA USA

Contemporary lossy image and video coding standards rely on transform coding, the process through which pixels are mapped to an alternative representation to facilitate efficient data compression. Despite impressive performance of end-to-end optimized compression with deep neural networks, the high computational and space demands of these models have prevented them from superseding the relatively simple transform coding found in conventional video codecs. In this study, we propose learned transforms and entropy coding that may either serve as (non)linear drop-in replacements, or enhancements for linear transforms in existing codecs. These transforms can be multi-rate, allowing a single model to operate along the entire rate-distortion curve. To demonstrate the utility of our framework, we augmented the DCT with learned quantization matrices and adaptive entropy coding to compress intra-frame AV1 block prediction residuals. We report substantial BD-rate and perceptual quality improvements over more complex nonlinear transforms at a fraction of the computational cost.
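The basic pipeline that the abstract above builds on (transform, quantize with a per-coefficient step, dequantize, inverse transform) can be sketched as below. The quantization matrix `q` is a hand-picked placeholder, not a learned one, and entropy coding is omitted entirely; `encode_block` and `decode_block` are hypothetical names used only for this illustration.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; rows are the basis vectors."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)
    return c

def encode_block(block, q):
    """2-D DCT of a square block, then uniform quantization with step matrix q."""
    c = dct_matrix(block.shape[0])
    coeffs = c @ block @ c.T
    return np.rint(coeffs / q).astype(np.int64)

def decode_block(levels, q):
    """Dequantize the integer levels and apply the inverse 2-D DCT."""
    c = dct_matrix(levels.shape[0])
    return c.T @ (levels * q) @ c

rng = np.random.default_rng(1)
n = 8
x, y = np.meshgrid(np.arange(n), np.arange(n))

# Toy prediction residual (assumption): a smooth ramp plus noise.
block = 2.0 * x + 1.0 * y + rng.standard_normal((n, n))

# Placeholder quantization matrix (assumption): coarser steps at higher frequencies.
q = 1.0 + 0.5 * (x + y)

levels = encode_block(block, q)
rec = decode_block(levels, q)
mse = float(np.mean((block - rec) ** 2))
print("non-zero levels:", int(np.count_nonzero(levels)), "of", n * n)
print(f"reconstruction MSE: {mse:.4f}")
```

Because the transform is orthonormal, the reconstruction error equals the quantization error in the coefficient domain, so the choice of `q` directly trades distortion against the number of non-zero levels an entropy coder would have to represent.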

Keywords: Video coding Quantization (signal) Computational modeling Transform coding Transforms Video compression Entropy coding

Video Compression Using Generalized Binary Partitioning, Trellis Coded Quantization, Perceptually Optimized Encoding, and Advanced Prediction and Transform Coding

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, vol. 30, no. 5, pp. 1281-1295

Authors: Pfaff, Jonathan Schwarz, Heiko Marpe, Detlev Bross, Benjamin De-Luxan-Hernandez, Santiago Helle, Philipp Helmrich, Christian R. Hinz, Tobias Lim, W. -Q. Ma, Jackie Nguyen, Tung Rasch, Jennifer Schafer, Michael Siekmann, Mischa Venugopal, Gayathri Wieckowski, Adam Winken, Martin Wiegand, Thomas Heinrich Hertz Inst Nachrichtentech Berlin GmbH Fraunhofer Inst Telecommun D-10587 Berlin Germany Free Univ Berlin Inst Comp Sci D-14195 Berlin Germany Tech Univ Berlin Dept Telecommun Syst D-10623 Berlin Germany

In this paper, we describe a video coding design that enables a higher coding efficiency than the HEVC standard. The proposed video codec follows the design of block-based hybrid video coding, but includes a number of advanced coding tools. A part of the incorporated advanced concepts was developed by the Joint Video Exploration Team, while others are newly proposed. The key aspects of these newly proposed tools are the following. A video frame is subdivided into rectangles of variable size using a binary partitioning with variable split ratios. Three new approaches for generating spatial intra prediction signals are supported: A line-wise application of conventional intra prediction modes, coupled with a mode-dependent processing order, a region-based template matching prediction method and intra prediction modes based on neural networks. For motion-compensated prediction, a multi-hypothesis mode with more than two motion hypotheses can be used. In transform coding, mode dependent combinations of primary and secondary transforms are applied. Moreover, scalar quantization is replaced by trellis-coded quantization and the entropy coding of the quantized transform coefficients is improved. The intra and inter prediction signals can be filtered using an edge-preserving diffusion filter or a non-linear DCT-based thresholding operation. The video codec includes an adaptive in-loop filter for which one of three classifiers can be chosen on a picture basis. We also incorporated an optional encoder control, which adjusts the quantization parameters based on a perceptually motivated distortion measure. In a random access scenario, our proposed video codec achieves luma BD-rate savings between 32.5% for HDR HLG UHD and 39.6% for SDR UHD over the HEVC (HM software) anchor for different categories of test sequences.

Keywords: Encoding Transforms Video coding Tools Transform coding Video codecs Quantization (signal) Video compression video coding High Efficiency Video Coding (HEVC)

PRE-ECHO REDUCTION IN TRANSFORM AUDIO CODING VIA TEMPORAL ENVELOPE CONTROL WITH MACHINE LEARNING BASED ESTIMATION

49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Kim, Jae-Won Jo, Byeongho Beack, Seungkwon Park, Hochong Kwangwoon Univ Seoul South Korea ETRI Daejeon South Korea

ISBN: (print) 9798350344868; 9798350344851

This paper proposes a new method for pre-echo reduction in transform-based audio coding by controlling the temporal envelope of the waveform. The proposed method comprises two operating modes: temporal envelope flattening and temporal envelope correction of a target signal. The proposed method estimates signal levels with a low temporal resolution from side information using machine learning and converts them into a signal to be applied to the target signal to flatten and correct the temporal envelope. It also adjusts the signals to maintain signal continuity between the non-transient and transient frames. The proposed method differs from conventional methods in that it directly modifies the waveform before encoding and after decoding, which makes it useful as a new coding tool for legacy codecs. A subjective performance evaluation confirms that the proposed method uses fewer bits to provide sound quality equivalent to that of the short-window transform.
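The flattening mode described in the abstract above can be illustrated with a very reduced sketch: divide the waveform by a coarse per-frame level before encoding and multiply it back after decoding. Everything here is an assumption made for illustration. The levels are measured directly from the signal rather than estimated from side information with machine learning, the gains are piecewise constant with no smoothing between transient and non-transient frames, and the codec between the two steps is left out.

```python
import numpy as np

def frame_rms(signal, frame_len, eps=1e-8):
    """Per-frame RMS level: one value per block of frame_len samples."""
    frames = signal.reshape(-1, frame_len)
    return np.sqrt(np.mean(frames ** 2, axis=1)) + eps

def flatten_envelope(signal, frame_len):
    """Pre-processing step: normalize every frame to roughly unit RMS."""
    gains = frame_rms(signal, frame_len)
    return signal / np.repeat(gains, frame_len), gains

def restore_envelope(flat, gains, frame_len):
    """Post-processing step: re-apply the low-resolution levels."""
    return flat * np.repeat(gains, frame_len)

rng = np.random.default_rng(2)
frame_len, n_frames = 64, 8

# Toy transient (assumption): near-silence followed by a sudden loud onset at frame 5.
signal = 0.01 * rng.standard_normal(frame_len * n_frames)
signal[5 * frame_len:] += rng.standard_normal(3 * frame_len)

flat, gains = flatten_envelope(signal, frame_len)
restored = restore_envelope(flat, gains, frame_len)

print("frame levels before:", np.round(frame_rms(signal, frame_len), 3))
print("frame levels after: ", np.round(frame_rms(flat, frame_len), 3))
print("max restore error:  ", float(np.max(np.abs(restored - signal))))
```

After flattening, the quiet frames before the onset sit at the same level as the loud ones, so quantization noise spread across a long transform window is scaled back down in the quiet region when the envelope is restored.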

Keywords: audio coding machine learning pre-echo temporal envelope transform coding

A Transform Coding Strategy for Dynamic Point Clouds

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, vol. 29, pp. 8213-8225

Authors: Milani, Simone Polo, Enrico Limuti, Simone Univ Padua Dept Informat Engn I-35131 Padua Italy

The development of real-time 3D sensing devices and algorithms (e.g., multiview capturing systems, Time-of-Flight depth cameras, LIDAR sensors), as well as the spread of enhanced user applications processing 3D data, have motivated the investigation of innovative and effective coding strategies for 3D point clouds. Several compression algorithms, as well as some standardization efforts, have been proposed in order to achieve high compression ratios and flexibility at a reasonable computational cost. This paper presents a transform-based coding strategy for dynamic point clouds that combines a non-linear transform for geometric data with a linear transform for color data; both operations are region-adaptive in order to fit the characteristics of the input 3D data. Temporal redundancy is exploited both in the adaptation of the designed transform and in predicting the attributes at the current instant from the previous ones. Experimental results showed that the proposed solution obtained a significant bit rate reduction in lossless geometry coding and an improved rate-distortion performance in the lossy coding of color components with respect to state-of-the-art strategies.

Keywords: Three-dimensional displays Image coding Transforms Solid modeling Image color analysis Sensors Heuristic algorithms Dynamic point cloud compression cellular automata transform coding octree voxel color

Graph Based Transforms based on Graph Neural Networks for Predictive Transform Coding

Data Compression Conference (DCC)

Authors: Debaleena Roy Tanaya Guha Victor Sanchez University of Warwick Coventry UK

This paper introduces the GBT-NN, a novel class of Graph-based transform within the context of block-based predictive transform coding using intra-prediction. The GBT-NN is constructed by learning a mapping function to map a graph Laplacian representing the covariance matrix of the current block. Our objective in learning such a mapping function is to design a GBT that performs as well as the KLT without explicitly computing the covariance matrix for each residual block to be transformed. To avoid signalling any additional information required to compute the inverse GBT-NN, we also introduce a coding framework that uses a template-based prediction to predict residuals at the decoder. Evaluation results on several video frames and medical images, in terms of the percentage of preserved energy and mean square error, show that the GBT-NN can outperform the DST and DCT.
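For readers unfamiliar with graph-based transforms, the basic construction behind the abstract above is to take the eigenvectors of a graph Laplacian as the transform basis. The sketch below does this for an unweighted path graph, for which the resulting basis is known to coincide with the DCT-II up to sign. It is not the GBT-NN: no neural network or learned mapping is involved, and the graph, the toy residual, and the function names are assumptions for illustration.

```python
import numpy as np

def path_graph_laplacian(n):
    """Combinatorial Laplacian L = D - A of an unweighted path graph with n nodes."""
    a = np.zeros((n, n))
    idx = np.arange(n - 1)
    a[idx, idx + 1] = 1.0
    a[idx + 1, idx] = 1.0
    return np.diag(a.sum(axis=1)) - a

def graph_transform(laplacian):
    """GBT basis: Laplacian eigenvectors as rows, ordered by increasing eigenvalue."""
    _, vecs = np.linalg.eigh(laplacian)
    return vecs.T

def preserved_energy(coeffs, m):
    """Percentage of the signal energy carried by the first m coefficients."""
    e = coeffs ** 2
    return 100.0 * float(e[:m].sum() / e.sum())

n = 8
U = graph_transform(path_graph_laplacian(n))

residual = np.linspace(1.0, 2.0, n)   # toy 1-D residual (assumption)
coeffs = U @ residual                 # forward GBT
rec = U.T @ coeffs                    # inverse GBT (U is orthonormal)

for m in (1, 2, 4):
    print(f"energy preserved by first {m} coefficient(s): "
          f"{preserved_energy(coeffs, m):.2f}%")
print("max reconstruction error:", float(np.max(np.abs(rec - residual))))
```

Changing the edge weights of the graph changes the basis, which is what lets a GBT adapt to the statistics of a residual block where a fixed DCT cannot.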

Keywords: Laplace equations Transform coding Transforms Mean square error methods Artificial neural networks Graph neural networks Decoding

Distributed MIMO Uplink Capacity under Transform Coding Fronthaul Compression

IEEE International Conference on Communications (IEEE ICC)

Authors: Wiffen, Fred Bocus, Mohammud Z. Doufexi, Angela Nix, Andrew Univ Bristol Commun Syst & Networks Res Grp Bristol Avon England Toshiba Res Europe Ltd Bristol Avon England

ISBN: (print) 9781538680889

In this work we analyse the capacity of the distributed MIMO uplink when transform coding is applied locally at each remote radio head (RRH) to compress fronthaul traffic. Assuming the use of optimal scalar compression, we derive a closed form capacity expression for the distributed MIMO uplink under Gaussian signalling, which is shown to be a function of both local and global channel eigendecompositions. We then outline two rate allocation schemes for efficiently allocating the available fronthaul to the compressed scalars, based on either local or global channel state information (CSI). Numerical results under Rayleigh fading conditions are presented which show that transform coding can provide a significant compression gain relative to direct signal quantisation, which grows as the number of antennas deployed at each RRH increases. Results also show that allocating fronthaul based on global CSI significantly improves performance, especially as the number of RRHs deployed increases.
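As background for the rate allocation problem in the abstract above, the textbook way to split a bit budget across independent Gaussian components is reverse water-filling from rate-distortion theory. The sketch below implements that classic rule only; it is not either of the paper's two allocation schemes, and the component variances and the rate budget are made-up values.

```python
import numpy as np

def reverse_water_filling(variances, total_rate, iters=100):
    """Split total_rate bits across independent Gaussian components.

    Each component gets r_i = max(0, 0.5 * log2(var_i / D)), where the
    "water level" D is found by bisection so that the rates sum to total_rate.
    Assumes all variances are strictly positive and total_rate >= 0.
    """
    variances = np.asarray(variances, dtype=float)
    lo, hi = 0.0, float(variances.max())
    for _ in range(iters):
        d = 0.5 * (lo + hi)
        rates = np.maximum(0.0, 0.5 * np.log2(variances / d))
        if rates.sum() > total_rate:
            lo = d   # too many bits spent: raise the water level
        else:
            hi = d   # within budget: try a lower water level
    rates = np.maximum(0.0, 0.5 * np.log2(variances / hi))
    return rates, hi

# Hypothetical per-component variances after a decorrelating transform.
variances = np.array([4.0, 1.0, 0.25, 0.01])
total_rate = 4.0   # bits per sample vector (assumption)

rates, level = reverse_water_filling(variances, total_rate)
print("rates:", np.round(rates, 3))
print("water level D:", round(level, 4))
print("sum of rates:", round(float(rates.sum()), 6))
```

Components whose variance falls below the water level receive no bits at all, which is one reason a decorrelating transform can yield a compression gain over quantizing every signal dimension directly.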

Keywords: distributed MIMO fronthaul compression transform coding cell-free C-RAN

A TRANSFORM CODING STRATEGY FOR VOXELIZED DYNAMIC POINT CLOUDS

25th IEEE International Conference on Image Processing (ICIP)

Authors: Limuti, Simone Polo, Enrico Milani, Simone Univ Padua Dept Informat Engn Padua Italy

ISBN: (print) 9781479970612

With the advent of virtual and augmented reality applications, 3D and free-viewpoint representations have evolved towards solid scene models using meshes and point clouds. Recent works have addressed point cloud compression via octree-based hierarchical strategies in order to enable multi-resolution coding and visualization at a reasonable computational cost. This paper presents a voxelized dynamic point cloud coding scheme that combines a Cellular Automata block reversible transform for geometric data with a region adaptive transform for color data. Temporal redundancy is removed using a low-complexity prediction scheme to minimize the computational complexity and reduce the coded bit rate. Experimental results showed that the proposed solution obtained a significant bit rate reduction in lossless geometry coding and an improved rate-distortion performance in the lossy coding of color components with respect to state-of-the-art strategies.

Keywords: dynamic point cloud compression cellular automata transform coding octree voxel color
