检索结果-内蒙古大学图书馆

IEEE International Conference on Multimedia and Expo (ICME)

作者： Aminlou, Alireza NajafiHaghi, Zahra Namaki-Shoushtari, Majid Hashemi, Mahmoud Reza Univ Tehran Multimedia Proc Lab Sch Elect & Comp Engn Tehran 14174 Iran

ISBN: (纸本)9781612843490

In order to accommodate the wide range of applications and the corresponding platforms where the H.264/AVC standard is currently in place, one should be able to optimize the encoder's computational complexity with a careful selection of the coding configuration parameters. Motion estimation is the most time-consuming part of the encoder which constitutes up to 75% of the computational complexity. In this paper, the optimum selection of configuration parameters, including search range, reference frame, degree of down-sampling and number of truncation bits have been analyzed for the VLSI implementation of integer motion estimation in terms of distortion-complexity performance. Furthermore, the optimum parameter sets have been presented for different video sizes and different constraints on computational power.

关键词： rate-distortion-complexity optimization Integer motion estimation VLSI implementation H.264/AVC encoder

来源：评论

学校读者我要写书评

暂无评论

ELFIC: A Learning-based Flexible Image Codec with rate-distortion-complexity optimization 23

ELFIC: A Learning-based Flexible Image Codec with Rate-Disto...

引用

31st ACM International Conference on Multimedia (MM)

作者： Zhang, Zhichen Chen, Bolin Lin, Hongbin Lin, Jielian Wang, Xu Zhao, Tiesong Fuzhou Univ Fujian Key Lab Intelligent Proc & Wireless Transm Fuzhou Peoples R China City Univ Hong Kong Hong Kong Peoples R China Shenzhen Univ Shenzhen Peoples R China Fujian Sci & Technol Innovat Lab Optoelect Inform Fuzhou Peoples R China

ISBN: (纸本)9798400701085

Learning-based image coding has attracted increasing attentions for its higher compression efficiency than reigning image codecs. However, most existing learning-based codecs do not support variable rates with a single encoder;their decoders are also of fixed, high computational complexity. In this paper, we propose an End-to-end, Learning-based and Flexible Image Codec (ELFIC) that supports variable rate and flexible decoding complexity. First, we propose a general image codec with Nonlinear Feature Fusion Transform (NFFT) as nonlinear transforms to improve its rate-distortion (RD) performance. Second, we propose an Instance-aware Decoding complexity Allocation (IDCA) approach, which exploits image contents for a tradeoff between reconstruction quality and computational complexity in the decoding process. Third, we propose an RD-complexity (RDC) optimization algorithm, which maximizes the image quality under given rate and complexity constraints for the whole framework. Experimental results show that ELFIC achieves variable rate, flexible decoding complexity with the state-of-the-art RD performance. It also supports a more efficient decoding process by focusing on image contents. Source codes are available at https://***/Zhichen-Zhang/ELFIC-Image-Compression.

关键词： Deep image compression deep image coding complexity allocation rate-distortion-complexity optimization

来源：评论

学校读者我要写书评

暂无评论

A Method for rate-distortion-complexity optimization in Versatile Video Coding Standard 26

A Method for Rate-Distortion-Complexity Optimization in Vers...

引用

26th International Computer Conference of the Computer-Society-of-Iran

作者： Rezaeieh, Amir Roodaki, Hoda KN Toosi Univ Technol Fac Comp Engn Tehran Iran

ISBN: (纸本)9781665412414

The most recent video coding standard, named Versatile Video Coding (VVC), greatly improved the compression rate compared to its predecessor, High Efficiency Video Coding (HEVC) using some new coding tools. Though these new option provide appreciable coding gain, its computational complexity is relatively high since the performance of these coding tools need to be evaluated for each Coding Tree Units (CTU) through the rate-distortion optimization (RDO) process. To address this issue, in this paper, first, the effectiveness of the coding tools in various parts of the frame, such as the borderline and central CTU, is investigated. The results of this study show that the coding efficiency of some of these coding tools is much higher for the borderline CTUs due to their specific features. Hence, these coding tools would be only considered enable for the borderline CTUs in rate-distortion process to decrease the computational complexity, without affecting the coding gain considerably. Simulation results show that using this method, the compression efficiency decreased only by 0.64% in average, but the computational complexity is reduced considerably, by 28.31%, in average.

关键词： Versatile Video Coding rate-distortion-complexity optimization encoder coding tools

来源：评论

学校读者我要写书评

暂无评论

Exploring the rate-distortion-complexity optimization in neural image compression

引用

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 2024年 105卷

作者： Gao, Yixin Feng, Runsen Guo, Zongyu Chen, Zhibo Univ Sci & Technol China Hefei Peoples R China

Despite a short history, neural image codecs have been shown to surpass classical image codecs in terms of rate-distortion performance. However, most of them suffer from significantly longer decoding times, which hinders the practical applications of neural image codecs. This issue is especially pronounced when employing an effective yet time-consuming autoregressive context model since it would increase entropy decoding time by orders of magnitude. In this paper, unlike most previous works that pursue optimal RD performance while temporally overlooking the coding complexity, we make a systematical investigation on the rate-distortioncomplexity (RDC) optimization in neural image compression. By quantifying the decoding complexity as a factor in the optimization goal, we are now able to precisely control the RDC trade-off and then demonstrate how the rate-distortion performance of neural image codecs could adapt to various complexity demands. Going beyond the investigation of RDC optimization, a variable-complexity neural codec is designed to leverage the spatial dependencies adaptively according to industrial demands, which supports fine-grained complexity adjustment by balancing the RDC tradeoff. By implementing this scheme in a powerful base model, we demonstrate the feasibility and flexibility of RDC optimization for neural image codecs.

关键词： Neural image compression rate-distortion-complexity optimization Variable-complexity

来源：评论

学校读者我要写书评

暂无评论

Fuzzy SVM-Based Coding Unit Decision in HEVC

引用

IEEE TRANSACTIONS ON BROADCASTING 2018年第3期64卷 681-694页

作者： Zhu, Linwei Zhang, Yun Kwong, Sam Wang, Xu Zhao, Tiesong City Univ Hong Kong Dept Comp Sci Hong Kong Hong Kong Peoples R China City Univ Hong Kong Shenzhen Inst Shenzhen 518057 Peoples R China Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen 518055 Peoples R China Shenzhen Univ Coll Comp Sci & Software Engn Shenzhen 518060 Peoples R China Fuzhou Univ Coll Phys & Informat Engn Fuzhou 350116 Fujian Peoples R China

The latest video compression standard, High Efficiency Video Coding (HEVC), has greatly improved the coding efficiency compared to the predecessor H. 264/AVC. However, equipped with the quadtree structure of coding tree unit partition and other sophisticated coding tools, HEVC brings a significant increase in the computational complexity. To address this issue, a coding unit (CU) decision method based on fuzzy support vector machine (SVM) is proposed for rate-distortion-complexity (RDC) optimization, where the process of CU decision is formulated as a cascaded multi-level classification task. The optimal feature set is selected according to a defined misclassification cost and a risk area is introduced for an uncertain classification output. To further improve the RDC performance, different regulation parameters in SVM are adopted and outliers in training samples are eliminated. Additionally, the proposed CU decision method is incorporated into a joint RDC optimization framework, where the width of risk area is adaptively adjusted to allocate flexible computational complexity to different CUs, aiming at minimizing computational complexity under a configurable constraint in terms of RD performance degradation. Experimental results show that the proposed approach can reduce 58.9% and 55.3% computational complexity on average with the values of Bjonteggard delta peak-signal-to-noise ratio as -0.075 dB and -0.085 dB and the values of Bjontegaard delta bit rate as 2.859% and 2.671% under low delay P and random access configurations, respectively, which has outperformed the state-of-the-art fast algorithms based on statistical information and machine learning.

关键词： Misclassification cost fuzzy support vector machine coding unit decision rate-distortion-complexity optimization High Efficiency Video Coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：