检索结果-内蒙古大学图书馆

The JPEG Pleno learning-based Point Cloud coding Standard: Serving Man and Machine

IEEE ACCESS 2025年 13卷 43289-43315页

作者： Guarda, Andre F. R. Rodrigues, Nuno M. M. Pereira, Fernando Inst Telecomunicacoes P-1049001 Lisbon Portugal Politecn Leiria ESTG P-2411901 Leiria Portugal Univ Lisbon Inst Super Tecn P-1049001 Lisbon Portugal

Efficient point cloud coding has become increasingly critical for multiple applications such as virtual reality, autonomous driving, and digital twin systems, where rich and interactive 3D data representations may functionally make the difference. Deep learning has emerged as a powerful tool in this domain, offering advanced techniques for compressing point clouds more efficiently than conventional coding methods while also allowing effective computer vision tasks performed in the compressed domain thus, for the first time, making available a common compressed visual representation effective for both man and machine. Taking advantage of this potential, JPEG has recently finalized the JPEG Pleno learning-based Point Cloud coding (PCC) standard offering efficient lossy coding of static point clouds, targeting both human visualization and machine processing by leveraging deep learning models for geometry and color coding. The geometry is processed directly in its original 3D form using sparse convolutional neural networks, while the color data is projected onto 2D images and encoded using the also learning-based JPEG AI standard. The goal of this paper is to provide a complete technical description of the JPEG PCC standard, along with a thorough benchmarking of its performance against the state-of-the-art, while highlighting its main strengths and weaknesses. In terms of compression performance, JPEG PCC outperforms the conventional MPEG PCC standards, especially in geometry coding, achieving significant rate reductions. Color compression performance is less competitive but this is overcome by the power of a full learning-based coding framework for both geometry and color and the associated effective compressed domain processing.

关键词： Transform coding Point cloud compression Encoding Standards Image coding Three-dimensional displays Geometry Image color analysis Artificial intelligence Codecs JPEG Pleno standard learning-based coding man and machine point cloud coding

来源：评论

学校读者我要写书评

暂无评论

Overview of intelligent video coding: from model-based to learning-based approaches

引用

Visual Intelligence 2023年第1期1卷 1-19页

作者： Ma, Siwei Gao, Junlong Wang, Ruofan Chang, Jianhui Mao, Qi Huang, Zhimeng Jia, Chuanmin National Engineering Research Center of Visual Technology School of Computer Science Peking University Beijing 100871 China State Key Laboratory of Media Convergence and Communication Communication University of China Beijing 100024 China Wangxuan Institue of Computer Technology Peking University Beijing 100871 China

Intelligent video coding (IVC), which dates back to the late 1980s with the concept of encoding videos with knowledge and semantics, includes visual content compact representation models and methods enabling structural, detailed descriptions of visual information at different granularity levels (i.e., block, mesh, region, and object) and in different areas. It aims to support and facilitate a wide range of applications, such as visual media coding, content broadcasting, and ubiquitous multimedia computing. We present a high-level overview of the IVC technology from model-based coding (MBC) to learning-based coding (LBC). MBC mainly adopts a manually designed coding scheme to explicitly decompose videos to be coded into blocks or semantic components. Thanks to emerging deep learning technologies such as neural networks and generative models, LBC has become a rising topic in the coding area. In this paper, we first review the classical MBC approaches, followed by the LBC approaches for image and video data. We also discuss and overview our recent attempts at neural coding approaches, which are inspiring for both academic research and industrial implementation. Some critical yet less studied issues are discussed at the end of this paper. © The Author(s) 2023.

关键词： Multimedia communication Video compression Artificial intelligence Model-based coding Generative coding learning-based coding

来源：评论

学校读者我要写书评

暂无评论

Deep learning-based Point Cloud Joint Geometry and Color coding: Designing a Perceptually-Driven Differentiable Training Distortion Metric 8

Deep Learning-based Point Cloud Joint Geometry and Color Cod...

引用

8th IEEE International Conference on Multimedia Big Data (BigMM)

作者： Coelho, Luis Guarda, Andre F. R. Pereira, Fernando Univ Lisbon Inst Super Tecn Lisbon Portugal Inst Telecomunicacoes Lisbon Lisbon Portugal

ISBN: (数字)9781665459631

ISBN: (纸本)9781665459631

Deep learning (DL)-based coding has recently become very popular for multimedia data, notably images and point clouds (PCs). Training a DL coding model using the backpropagation algorithm requires a differentiable loss function. Thus, for PC joint geometry and color coding, both the PC geometry and color distortion metrics must be differentiable. Since the distortion/quality metrics commonly used for the final PC quality assessment do not meet this criterion, new PC distortion metrics have to be designed for DL-based training purposes. Moreover, for PC joint geometry and color coding, it is critical to define the balance between the geometry and color distortions in a meaningful way, ideally driven by the human perception and subjective quality assessment. In this context, this paper proposes a perceptually-driven design for a differentiable PC joint geometry and color distortion metric to be used for training purposes in DL-based coding, notably to define the relative weights for the geometry and color distortions. The obtained perceptually-driven weights achieve a rate reduction of around 3% regarding the default balanced weights at no complexity cost. This is the first proposal in the literature with this purpose and this perceptual approach.

关键词： point cloud learning-based coding distortion metric perceptually-driven

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：