Versatile Video Coding (VVC) significantly improves the coding efficiency over the preceding high efficiency video coding (HEVC) standard, but at the expense of much higher computational complexity. Specifically for i...
详细信息
ISBN:
(纸本)9781665449083
Versatile Video Coding (VVC) significantly improves the coding efficiency over the preceding high efficiency video coding (HEVC) standard, but at the expense of much higher computational complexity. Specifically for intra coding of VVC, the computational burden is mainly on the brute-force recursive rate-distortion optimization (RDO) search of quadtree with nested multi-type tree (QTMT) based coding unit (CU) partition structure. Consequently, we propose a random forest based algorithm to reduce the complexity of CU partition. The CUs are first divided into three categories, namely simple, fuzzy, and complex CUs. For simple and complex CUs, one random forest classifier is trained to directly predict the optimal partition mode. For fuzzy CUs, another random forest is trained to predict whether the partition process is terminated or not. The experimental results show that the complexity reduction of the proposed algorithm is up to 69% as compared to the VVC reference software (VTM 7.0), and averagely 57% encoding time saving is achieved with 1.21% BDBR increase.
Versatile video coding (VVC) is the newest video compression standard. It adopts quadtree with nestedmultitypetree (QT-MTT) to encode square or rectangular coding units (CUs). The QT-MTT coding structure is more fle...
详细信息
Versatile video coding (VVC) is the newest video compression standard. It adopts quadtree with nestedmultitypetree (QT-MTT) to encode square or rectangular coding units (CUs). The QT-MTT coding structure is more flexible for encoding video texture, but it is also accompanied by many time-consuming algorithms. So, this work proposes fast algorithms to determine horizontal or vertical split for binary or ternary partition of a 32 x 32 CU in the VVC intra coding to replace the rate-distortion optimization (RDO) process, which is time-consuming. The proposed fast algorithms are actually a two-step algorithm, including feature analysis method and deep learning method. The feature analysis method is based on variances of pixels, and the deep learning method applies the convolution neural networks (CNNs) for classification. Experimental results show that the proposed method can reduce encoding time by 28.94% on average but increase Bjontegaard delta bit rate (BDBR) by about 0.83%.
暂无评论