The depth maps intra-frame prediction of 3d High Efficiency videocoding (3d-HEVC) inherits all texture encoding techniques provided by HEVC and provides new coding tools for depth map predictions. These tools compris...
详细信息
The depth maps intra-frame prediction of 3d High Efficiency videocoding (3d-HEVC) inherits all texture encoding techniques provided by HEVC and provides new coding tools for depth map predictions. These tools comprise algorithms, such as bipartition modes, intra-picture skip, anddC-only. This paper details these tools and shows how they work together with the original HEVC algorithms in the depth map intra-frame prediction for allowing high-efficiency encoding. Besides, this paper analyzes the encoding time and the encoding mode distribution of the intra-frame prediction tools over different quantization scenarios. We aim to provide support for upcoming works on depth map encoding, including complexity reduction and control, real-time embedded systems implementations, and even the development of improved tools to encode depth maps.
In 3d video coding, some dependent-view coding tools may utilize deriveddisparity vectors (dV) to locate inter-view correspondences, such as the Backward block-based View Synthesis Prediction (BVSP) anddepth-based M...
详细信息
ISBN:
(纸本)9781479923427
In 3d video coding, some dependent-view coding tools may utilize deriveddisparity vectors (dV) to locate inter-view correspondences, such as the Backward block-based View Synthesis Prediction (BVSP) anddepth-based Motion Vector Prediction (dMVP) in ATM. A typical way to derive a dV for a target block, as performed in ATM, is to convert reconstructeddepth values associated with the block to a dV. However, this approach only works in depth-first coding order (dFCO) and becomes inapplicable when the dependent-view texture is coded prior to the depth, i.e., in texture-first coding order (TFCO). In this paper, a sparse dV field is synthesized from the depth map of a coded reference view to provide the dVs required by dependent-view coding tools in TFCO. The synthesized sparse dV field is accurate to support disparity-aidedcoding tools in different coding orders, while only introducing 2% decoding time increase. With the proposed method, BVSP anddMVP can be applied in both TFCO anddFCO, which mitigates the large coding performance gap (around 20% Bd rate) between TFCO anddFCO of ATM.
At present, the rate control algorithm for multiview high-efficiency videocoding (MV-HEVC) does not have the capability of efficient coding tree unit(CTU) layer bit allocation, and the video quality varies greatly fo...
详细信息
At present, the rate control algorithm for multiview high-efficiency videocoding (MV-HEVC) does not have the capability of efficient coding tree unit(CTU) layer bit allocation, and the video quality varies greatly for sequences with sudden scene changes or large motions. To overcome this limitation, this paper proposes a rate control algorithm for MV-HEVC based on scene detection. Firstly, we established rho domain rate control model based on multi-objective optimization. Then, it uses image similarity to make reasonable bit allocation among viewpoints. If the video scene is switched, the image similarity is recalculated, and then the correlation between the weights of the interview point rates and the correlation between the viewpoints are analyzed. Finally, the frame layer rate control considers the layer B-frame and other factors in allocating the code rate, and the basic unit layer rate control adopts different quantization methods according to the content complexity of the CTU. Experimental results show that the proposed rate control algorithm can maintain goodcoding efficiency anddecrease the average video quality variation by 25.29%.
Holoscopic imaging, also known as integral imaging, has been recently attracting the attention of the research community, as a promising glassless 3d technology due to its ability to create a more realistic depth illu...
详细信息
ISBN:
(纸本)9780819497062
Holoscopic imaging, also known as integral imaging, has been recently attracting the attention of the research community, as a promising glassless 3d technology due to its ability to create a more realistic depth illusion than the current stereoscopic or multiview solutions. However, in order to gradually introduce this technology into the consumer market and to efficiently deliver 3d holoscopic content to end-users, backward compatibility with legacy displays is essential. Consequently, to enable 3d holoscopic content to be delivered and presented on legacy displays, a display scalable 3d holoscopic coding approach is required. Hence, this paper presents a display scalable architecture for 3d holoscopic videocoding with a three-layer approach, where each layer represents a different level of display scalability: Layer 0 - a single 2d view;Layer 1 - 3d stereo or multiview;and Layer 2 - the full 3d holoscopic content. In this context, a prediction method is proposed, which combines inter-layer prediction, aiming to exploit the existing redundancy between the multiview and the 3d holoscopic layers, with self-similarity compensated prediction (previously proposed by the authors for non-scalable 3d holoscopic videocoding), aiming to exploit the spatial redundancy inherent to the 3d holoscopic enhancement layer. Experimental results show that the proposed combined prediction can improve significantly the rate-distortion performance of scalable 3d holoscopic videocoding with respect to the authors' previously proposed solutions, where only inter-layer or only self-similarity prediction is used.
This paper presents an advanceddepth intra-coding approach for 3d video coding based on the High Efficiency videocoding (HEVC) standard and the multiview video plus depth (MVd) representation. This paper is motivate...
详细信息
This paper presents an advanceddepth intra-coding approach for 3d video coding based on the High Efficiency videocoding (HEVC) standard and the multiview video plus depth (MVd) representation. This paper is motivated by the fact that depth signals have specific characteristics that differ from those of natural signals, i.e., camera-view video. Our approach replaces conventional intra-picture coding for the depth component, targeting a consistent and efficient support of 3dvideo applications that utilize depth maps or polygon meshes or both, with a high depth coding efficiency in terms of minimal artifacts in rendered views and meshes with a minimal number of triangles for a given bit rate. For this purpose, we introduce intra-picture prediction modes based on geometric primitives along with a residual coding method in the spatial domain, substituting conventional intra-prediction modes and transform coding, respectively. The results show that our solution achieves the same quality of rendered or synthesized views with about the same bit rate as MVdcoding with the 3dvideo extension of HEVC (3d-HEVC) for high-quality depth maps and with about 8% less overall bit rate as with 3d-HEVC without relateddepth tools. At the same time, the combination of 3dvideo with 3d computer graphics content is substantially simplified, as the geometry-baseddepth intra signals can be represented as a surface mesh with about 85% less triangles, generateddirectly in the decoding process as an alternative decoder output.
H.264/MVC multi-view videocoding provides a better compression rate compared to the simulcast coding using hierarchical B-picture prediction structure exploiting inter-and intra-view redundancy. However, this techniq...
详细信息
ISBN:
(纸本)9781467300469
H.264/MVC multi-view videocoding provides a better compression rate compared to the simulcast coding using hierarchical B-picture prediction structure exploiting inter-and intra-view redundancy. However, this technique imposes random access frame delay as well as requiring huge computational time. In this paper a novel technique is proposed using 3d motion estimation (3d-ME) to overcome the problems. In the 3d-ME technique, a 3d frame is formed using the same temporal frames of all views and ME is carried out for the current 3d frame using the immediate previous 3d frame as a reference frame. As the correlation among the intra-view images is higher compared to the correlation among the inter-view images, the proposed3d-ME technique reduces the overall computational time and eliminates the frame delay with comparable rate-distortion (Rd) performance compared to H.264/MVC. Another technique is also proposed in the paper where an extra reference 3d frame comprising dynamic background frames (the most common frame of a scene i.e., McFIS) of each view is used for 3d-ME. Experimental results reveal that the proposed3d-ME-McFIS technique outperforms the H.264/MVC in terms of improved Rd performance by reducing computational time and by eliminating the random access frame delay.
The advanced3dvideo system employs the multi-view video plus depth (MVd) format to support free-viewpoint navigation and comfortable 3dvideo. Therefore, the prediction structure of the multi-view videocoding (MVC)...
详细信息
ISBN:
(纸本)9781457720482
The advanced3dvideo system employs the multi-view video plus depth (MVd) format to support free-viewpoint navigation and comfortable 3dvideo. Therefore, the prediction structure of the multi-view videocoding (MVC) can be used for 3d video coding. The view synthesis prediction method is designed to exploit inter-view correlation using the virtual view generation;hence it is suitable for 3d video coding. In this paper, we propose an efficient framework for 3d video coding using view synthesis prediction to compress multi-view color anddepth data simultaneously. We designed the coding procedure of MVddata with four types of view synthesis methods according to the view position. The experimental results showed that the proposed framework improved the coding performance at most 0.9 dB for the multi-view color videos.
Following the success of the audio video standard (AVS) for 2dvideocoding, in 2008, the China AVS workgroup starteddeveloping 3dvideo (3dV) coding techniques. In this paper, we discuss the background, technica...
详细信息
Following the success of the audio video standard (AVS) for 2dvideocoding, in 2008, the China AVS workgroup starteddeveloping 3dvideo (3dV) coding techniques. In this paper, we discuss the background, technical features, and applications of AVS 3dV coding technology. We introduce two core techniques used in AVS 3dV coding: inter-view prediction and enhanced stereo packing coding. We elaborate on these techniques, which are used in the AVS real-time 3dV encoder. An application of the AVS 3dV coding system is presented to show the great practical value of this system. Simulation results show that the advanced techniques used in AVS 3dV coding provide remarkable coding gain compared with techniques used in a simulcast scheme.
Efficient video compression algorithms in advanced multimedia broadcasting systems are in high demand. In the last decades, different video compression tools have been developed which can influence the final Quality o...
详细信息
Efficient video compression algorithms in advanced multimedia broadcasting systems are in high demand. In the last decades, different video compression tools have been developed which can influence the final Quality of Experience in different ways. This paper has two goals. The first goal is to present a study of different compression algorithms available for stereoscopic 3dvideos. The second goal is to present the possibilities in the creation of new stereoscopic models. The well-establishedvideo codecs (AVC, MVC, HEVC and MV-HEVC) are considered as encoders. Generic objective video quality metrics are used to analyze the compression efficiencies of the considered codecs, extended with results from subjective tests. The correlations between the objective and subjective scores are analyzed statistically. due to unsatisfactory results of generic 2d metrics for the stereoscopic sequences used in the test, new objective models are presented. Such models show improved correlation with subjective stereoscopic video quality. The validation, verification and a description of models are presented in detail.
This paper presents a novel motion estimation scheme for depth-map coding. depth map has a couple of different characteristics compared with the common color scene, and therefore, careful investigation of the conventi...
详细信息
This paper presents a novel motion estimation scheme for depth-map coding. depth map has a couple of different characteristics compared with the common color scene, and therefore, careful investigation of the conventional fast motion search schemes is required. In this paper, I first provide the necessity of the depth-oriented motion search scheme based on experiments and then analyze the problems of the conventional methods. On the basis of the analysis, I propose to use the initial position refinement step during a motion search. In detail, the modification of the one-at-a-time search scheme is proposed for improvement of the prediction accuracy, and an initial direction selection and a 1-bit transform scheme follow for complexity reduction. The experimental results show that the proposed scheme can achieve coding gains of more than 3% on average over the state-of-the-art zonal search methods while keeping the overall complexity increase low.
暂无评论