The Versatile Video coding (VVC) standard notably enhances encoding efficiency with the Quad-Tree plus Multi-Type Tree (QTMTT) partition structure. However, the complex QTMTT tool presents substantial challenges in bo...
详细信息
The Versatile Video coding (VVC) standard notably enhances encoding efficiency with the Quad-Tree plus Multi-Type Tree (QTMTT) partition structure. However, the complex QTMTT tool presents substantial challenges in both software and hardware implementation. To overcome those challenges, this paper introduces a hardware-friendly partition decision algorithm for VVC intra and inter coding. Firstly, we propose a lightweight backbone network to extract partition-aware features. Secondly, we employ a Quantisation Parameter (QP) fusion network to regulate the impact of QPs on the partition structure. Additionally, we apply a top-down threshold-driven post-processing algorithm, in which improbable partition types are removed to directly derive the unique partition structure. Experiments show that our method not only exceeds the previous state-of-the-art work in BD-BR performance, but also shows sufficient hardware-friendly characteristics. To the best of our knowledge, this work is among the earliest to comprehensively discuss and implement a hardware-friendly partition decision algorithm.
Video coding is necessary to transmit large amounts of video information. HEVC, which is the latest video coding standard, includes many advanced tools to achieve higher coding efficiency than previous standards, such...
详细信息
Video coding is necessary to transmit large amounts of video information. HEVC, which is the latest video coding standard, includes many advanced tools to achieve higher coding efficiency than previous standards, such as MPEG-1/2/4 and H.264/AVC. In particular, new techniques in inter prediction, such as skip, merge, and advanced motion vector prediction (AMVP) modes, drastically improve coding performance by reducing temporal redundancies between consecutive frames within a video. However, the skip mode, which does not perform a motion estimation, can deteriorate the coding performance, because of inaccurate motion vector prediction in videos with fast and complex motions. In order to solve this problem, a novel skip motion estimation method is proposed for efficient inter coding in HEVC. Unlike the conventional skip mode, the proposed skip mode performs the motion estimation, and then transmits the associated motion information. Experimental results demonstrate that the proposed method achieves the high coding performance with the low decoding complexity.
In order to adapt to various real-time applications, fast coding algorithms for high efficiency video coding (HEVC) standard maintain a hot research topic in recent years. In this paper, a complexity reduction algorit...
详细信息
In order to adapt to various real-time applications, fast coding algorithms for high efficiency video coding (HEVC) standard maintain a hot research topic in recent years. In this paper, a complexity reduction algorithm based on hierarchical classification for HEVC inter coding is proposed. It consists of five fast algorithms which is accomplished by hierarchical classification trees at coding unit (CU) level, prediction unit (PU) level and transformation unit (TU) level respectively. At the beginning of proposed algorithm, intra features and inter features which describe the texture and context properties of CU, PU and TU are extracted from the training set. Then the classification trees for CU, PU and TU are generated by carefully selecting features and designing the classification criteria. By analyzing the spatiotemporal correlation, two strategies including early termination and early split are applied to fast coding by referring to these classification trees. The objective evaluation demonstrates that the proposed algorithm can significantly reduce coding complexity with little compression loss. Particularly the subjective evaluation based on similarity measurement for color histogram approves that decoded video quality between the original HM16.9 algorithm and the proposed algorithm is nearly identical.
In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture buffer are processed using a recurrent neural network to gener...
详细信息
ISBN:
(纸本)9781728176383
In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture buffer are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.
The promising compression efficiency improvement of Versatile Video coding (VVC) compared to High Efficiency Video coding (HEVC) [1] comes at the cost of a non-negligible encoder-side complexity. The largely increased...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
The promising compression efficiency improvement of Versatile Video coding (VVC) compared to High Efficiency Video coding (HEVC) [1] comes at the cost of a non-negligible encoder-side complexity. The largely increased complexity overhead is a possible obstacle towards its industrial implementation. Many papers have proposed acceleration methods for VVC. Still, a better understanding of VVC complexity, especially related to new partitions and coding tools, is desirable to help the design of new and better acceleration methods. For this purpose, statistical analyses have been conducted, with a focus on coding Unit (CU) sizes and inter coding modes.
inter coding in HEVC can greatly improve video compression efficiency. However it also brings huge computational cost due to adaptive partition of coding Tree Unit (CTU) with the quadtree technique. In this paper, a f...
详细信息
ISBN:
(纸本)9783319715896;9783319715889
inter coding in HEVC can greatly improve video compression efficiency. However it also brings huge computational cost due to adaptive partition of coding Tree Unit (CTU) with the quadtree technique. In this paper, a fast CU (coding Unit) decision algorithm is proposed to alleviate the computational burden. This algorithm is fulfilled by analyzing the residual using mean and dispersion. And the optimal RQT (Residual Quad-tree Transform) depth is used innovatively to measure the residual dispersion. First, the optimal RQT depth and avgdis (defined in 3.3) is obtained after inter coding in 2N x 2N PU mode. Then the decision of CU partition is determined through comparing avgdis with the corresponding threshold. Thresholds are predicted based on the distribution of CU partition in the encoded pictures and they can be adaptively changed as the video content changes. Compared to HM13.0 (HEVC test model), the improved algorithm could save about 56% of encoding time on average, with 0.2034% increase of bitrate and the influence on the quality of reconstructed videos is negligible.
A novel inter coding framework for H.264/AVC is proposed to reduce the computational complexity of H.264 encoder on DSP platform. In this paper, inter coding for one P slice is separated into five consecutive loops: M...
详细信息
ISBN:
(纸本)9781424448982
A novel inter coding framework for H.264/AVC is proposed to reduce the computational complexity of H.264 encoder on DSP platform. In this paper, inter coding for one P slice is separated into five consecutive loops: Motion Estimation Loop, Transform & Quantization Loop, Entropy coding Loop, Deblocking Loop and interpolation Loop. All the macroblocks (MBs) within one P slice are processed to g ether in each loop. To further explore memory hierarchy of DSP platforms, MB-group based algorithms are introduced in each loop to reduce frequent data transfers between external memory and internal memory. Finally, the proposed method is implemented on TMS320DM6446 platform and experimental results show that, for the video sequences with CIF (Common intermediate Format) format, the optimized H.264 encoder can achieve the encoding speed of 24 frames per second, which fully meet the real-time requirements of the applications.
High Efficiency Video coding (HEVC), the latest video coding standard, is becoming popular due to its excellent coding performance, in particular in the case of high-resolution video applications. However, the signifi...
详细信息
ISBN:
(纸本)9789881476852
High Efficiency Video coding (HEVC), the latest video coding standard, is becoming popular due to its excellent coding performance, in particular in the case of high-resolution video applications. However, the significant gain in performance is achieved at the cost of substantially higher encoding complexity than its precedent H.264/AVC, in which motion estimation (ME) is one of the most time-consuming parts that effectively removes temporal redundancy. During the development, especially after the release of H.265/HEVC, plenty of fast ME algorithms have been developed to reduce the motion estimation complexity for better application of HEVC into practical real-time video applications. In this review, we provide a comprehensive review of the state-of-the-art fast ME algorithms for HEVC inter coding, for both integer-pixel and fractional-pixel ME algorithms. In all, this review paper provides a comprehensive review of the recent advances of ME for HEVC inter frame coding and hopefully it may provide valuable leads for the improvement, implementation and applications of HEVC inter-prediction as well as for the ongoing development of the next generation video coding standard.
The latest coding standard Versatile Video coding (VVC) developed by the Joint Video Experts Team (JVET) and Video coding Experts Group (VCEG) was finalized in 2020. By introducing several new coding techniques, VVC i...
详细信息
The latest coding standard Versatile Video coding (VVC) developed by the Joint Video Experts Team (JVET) and Video coding Experts Group (VCEG) was finalized in 2020. By introducing several new coding techniques, VVC improves the compression efficiency by 50% compared with H.265/HEVC. However, the coding complexity increases dramatically, which obstructs it from real-time application. To tackle this issue, a fast inter coding algorithm utilizing coding information is proposed to speed up the coding process. First, by analyzing the coding areas of the neighboring CUs, we predict the coding area of the current CU to terminate unnecessary splitting modes. Then, the temporally optimal coding mode generated during the prediction process is further utilized to shrink the candidate modes to speed up the coding process. Finally, the distribution of neighboring prediction modes are exploited to measure the motion complexity of the current CU, based on which the unnecessary prediction modes can be skipped earlier. Experimental results demonstrate that the proposed method can reduce the coding complexity by 40.08% on average with 0.07 dB BDPSNR decrease and 1.56% BDBR increase, which outperforms the state-of-the-art approach.
A novel inter coding framework for H.264/AVC is proposed to reduce the computational complexity of H.264 encoder on DSP platform. In this paper, inter coding for one P slice is separated into five consecutive loops: M...
详细信息
ISBN:
(纸本)9781424448999;9781424448982
A novel inter coding framework for H.264/AVC is proposed to reduce the computational complexity of H.264 encoder on DSP platform. In this paper, inter coding for one P slice is separated into five consecutive loops: Motion Estimation Loop, Transform & Quantization Loop, Entropy coding Loop, Deblocking Loop and interpolation Loop. All the macroblocks (MBs) within one P slice are processed together in each loop. To further explore memory hierarchy of DSP platforms, MB-group based algorithms are introduced in each loop to reduce frequent data transfers between external memory and internal memory. Finally, the proposed method is implemented on TMS320DM6446 platform and experimental results show that, for the video sequences with CIF (Common intermediate Format) format, the optimized H.264 encoder can achieve the encoding speed of 24 frames per second, which fully meet the real-time requirements of the applications.
暂无评论