The scalable extension of the high-efficiency videocoding (SHVC) system adopts a hierarchical quadtree-based coding unit (CU) that is suitable for various texture and motion properties of videos. Currently, the test ...
详细信息
The scalable extension of the high-efficiency videocoding (SHVC) system adopts a hierarchical quadtree-based coding unit (CU) that is suitable for various texture and motion properties of videos. Currently, the test model of SHVC identifies the optimal CU size by performing an exhaustive quadtree depth-level search, which achieves a high compression efficiency at a heavy cost in terms of the computational complexity. However, many interactive multimedia applications, such as remote monitoring and video surveillance, which are sensitive to time delays, have insufficient computational power for coding high-definition (HD) and ultra-highdefinition (UHD) videos. Therefore, it is important, yet challenging, to optimize the SHVC coding procedure and accelerate videocoding. In this article, we propose a fast CU quadtree depth-level decision algorithm for inter-frames on enhancement layers that is based on an analysis of inter-layer, spatial, and temporal correlations. When motion/texture properties of coding regions can be identified early, a fast algorithm can be designed for adapting CU depth-level decision procedures to video contents and avoiding unnecessary computations during CU depth-level traversal. The proposed algorithm determines the motion activity level at the treeblock size of the hierarchical quadtree by utilizing motion vectors from its corresponding blocks at the base layer. Based on the motion activity level, neighboring encoded CUs that have larger correlations are preferentially selected to predict the optimal depth level of the current treeblock. Finally, two parameters, namely, the motion activity level and the predicted CU depth level, are used to identify a subset of candidate CU depth levels and adaptively optimize CU depth-level decision processes. The experimental results demonstrate that the proposed scheme can run approximately three times faster than the most recent SHVC reference software, with a negligible loss of compression efficiency. T
This paper describes an extension of the upcoming High Efficiency videocoding (HEVC) standard for supporting spatial and quality scalable video coding. Besides scalablecoding tools known from scalable profiles of pr...
详细信息
ISBN:
(纸本)9780819494399
This paper describes an extension of the upcoming High Efficiency videocoding (HEVC) standard for supporting spatial and quality scalable video coding. Besides scalablecoding tools known from scalable profiles of prior videocoding standards such as H.262/MPEG-2 video and H.264/MPEG-4 AVC, the proposed scalable HEVC extension includes new coding tools that further improve the coding efficiency of the enhancement layer. In particular, new coding modes by which base and enhancement layer signals are combined for forming an improved enhancement layer prediction signal have been added. All scalablecoding tools have been integrated in a way that the low-level syntax and decoding process of HEVC remain unchanged to a large extent. Simulation results for typical application scenarios demonstrate the effectiveness of the proposed design. For spatial and quality scalablecoding with two layers, bit-rate savings of about 20-30% have been measured relative to simulcasting the layers, which corresponds to a bit-rate overhead of about 5-15% relative to single-layer coding of the enhancement layer.
In this paper, we present a secondary transform scheme for inter-layer prediction residue in scalable video coding (SVC). Efficient prediction of the co-located blocks from the base layer (BL) can significantly improv...
详细信息
ISBN:
(纸本)9781479902880
In this paper, we present a secondary transform scheme for inter-layer prediction residue in scalable video coding (SVC). Efficient prediction of the co-located blocks from the base layer (BL) can significantly improve the enhancement layer (EL) coding in SVC, especially when the temporal information from previous EL frames is less correlated than the co-located BL information. However, Guo et al. showed that because of the peculiar frequency characteristics of EL residuals, the conventional DCT Type-2 transform is suboptimal and is often outperformed by either the DCT Type-3, or DST Type-3 when these transforms are applied to the EL residuals. However, their proposed technique requires upto 8 additional transform cores, two of which are of size 32x32. Here, in this work, we propose a secondary transform scheme, where the proposed transform is applied only to the lower 8x8 frequency coefficients after DCT, for block sizes 8x8 to 32x32. Our proposed transform scheme requires at most only 2 additional cores. We also propose a low-complexity 8x8 Rotational Transform as a special case of secondary transforms in this paper. Simulation results show that the proposed transform scheme provides significant BD-Rate improvement over the conventional DCT-based coding scheme for video sequences in the ongoing scalable extensions of HEVC standardization.
The paper describes a scalable video coding extension of the upcoming HEVC videocoding standard for spatial and quality scalablecoding. Besides coding tools known from scalable profiles of prior videocoding standar...
详细信息
ISBN:
(纸本)9781467360371
The paper describes a scalable video coding extension of the upcoming HEVC videocoding standard for spatial and quality scalablecoding. Besides coding tools known from scalable profiles of prior videocoding standards, it includes new coding tools that further improve the enhancement layer coding efficiency. The effectiveness of the proposed scalable HEVC extension is demonstrated by comparing the coding efficiency to simulcast and single-layer coding for several test sequences and coding conditions.
Diffusion Transformers (DiT) deliver impressive generative performance but face prohibitive computational demands due to both the quadratic complexity of token-based self-attention and the need for extensive sampling ...
详细信息
In a heterogeneous landscape of networks, devices and consumption environments, scalability is one of the most important videocoding features. To achieve higher scalablevideo compression efficiency, this paper propo...
详细信息
ISBN:
(纸本)9781479902941
In a heterogeneous landscape of networks, devices and consumption environments, scalability is one of the most important videocoding features. To achieve higher scalablevideo compression efficiency, this paper proposes a novel scalable video coding framework based on predictive videocoding but also exploiting some additional decoder side information. The side information is estimated at both encoder and decoder using a motion compensated temporal interpolation technique, commonly used in distributed videocoding solutions. To improve the B-slices compression efficiency, the side information independently created at each coding layer, notably the base and enhancement layers, is inserted in the corresponding layer decoded picture buffer to be exploited as an additional reference frame in the scalable predictive coding process. Experimental results have shown significant compression efficiency gains, notably up to around 3.5% in bitrate savings regarding the state-of-the-art SVC standard.
Rate Control plays an important role in videocoding, which performs successful transmission of all encoded bits with available limited bandwidth. This paper aims to provide an adaptive rate controller for H. 264 scal...
详细信息
ISBN:
(纸本)9781467361255
Rate Control plays an important role in videocoding, which performs successful transmission of all encoded bits with available limited bandwidth. This paper aims to provide an adaptive rate controller for H. 264 scalable video coding. The Quantization Parameter (QP) plays an important role in any rate controller for encoding demanded bits. The Quantization parameter estimation and the rate controller algorithm follows two pass 1) Initial Quantization Parameter value estimated by simplified Rate Distortion model using Cauchy Density based PDF and Buffer status 2) Rate Distortion Optimization method to determine minimum cost for the mode and motion vector using the estimated Quantization Parameter. The experimental result shows that our rate control algorithm is better than the FixedQPencoder of JSVM 9.19.15 in terms of PSNR and bit rate. Moreover our scheme performs in single iteration to encode the frame with the desired QP and Buffer status calculated to prevent from overflow and underflow.
video streaming over wireless network has a very broad application prospect. However, the quality of video will be seriously deteriorated due to the nature of time-varying and noisy wireless channels. In this paper, w...
详细信息
ISBN:
(纸本)9781479913190
video streaming over wireless network has a very broad application prospect. However, the quality of video will be seriously deteriorated due to the nature of time-varying and noisy wireless channels. In this paper, we propose a wireless video streaming method based on network coding and scalable video coding (SVC). In order to maximize the throughput and video quality simultaneously, a forwarding node should mix the data packets from different streams and flows according to their importance values, which are provided directly by a SVC-coded video stream. Simulation results show that our method can improve both video quality and throughput significantly.
This paper presents a early mode decision algorithm, which is proposed to reduce the complexity of the mode selection process for enhancement layers in H.264 scalable video coding. The proposed algorithm consists of t...
详细信息
ISBN:
(纸本)9783037855744
This paper presents a early mode decision algorithm, which is proposed to reduce the complexity of the mode selection process for enhancement layers in H.264 scalable video coding. The proposed algorithm consists of the following three main steps. We firstly divide all the macroblocks into 4 classes according to the mode of collocated macroblocks in the base layer. Then, the macroblocks are subdivided with trained BP (Back Propagation) network according to the mode of neighboring macroblocks. Finally, we choose different mode selection algorithms for different divided cases, and check whether the algorithms are agreeable. Compared to JSVM 9.18, experiment results show that, with this algorithm, 30% encoding time can be saved with a negligible loss in BDSNR, and BDBR can be significantly reduced.
In the developing scalable extension of the HEVC/H. 265 standard, a low-resolution baselayer picture may be used for predicting a higher resolution enhancement layer. This requires an upsampling process to generate th...
详细信息
ISBN:
(纸本)9781479902941
In the developing scalable extension of the HEVC/H. 265 standard, a low-resolution baselayer picture may be used for predicting a higher resolution enhancement layer. This requires an upsampling process to generate the prediction, and this upsampling process is traditionally a linear and time invariant interpolator. In this paper, we consider an upsampling design that is both non-linear and content adaptive. This choice is motivated by the compression noise in the baselayer. We propose a novel approach to include content-aware filtering into the upsampling process. It has low complexity. More importantly, it has low latency and uses the same number of line buffers as a linear interpolator. Results show the efficacy of the method. Specifically, using the test model for the scalable extension of HEVC/H.265, we observe an average bit-rate reduction of 1.1%, when the change in resolution between layers is a factor of two in each dimension.
暂无评论