ISBN: (print) 9781479939039
MVD (multi-view video plus depth) can synthesize virtual viewpoints effectively and reduce the number of views that must be transmitted. Among previous approaches to depth image coding, HEVC has outperformed many other algorithms. However, mode decision in HEVC is time-consuming because of its high computational complexity. To improve the encoding efficiency of depth images in MVD, a fast mode decision algorithm is proposed. Using the depth information, the proposed algorithm derives the residual characteristics of depth images statistically and sets corresponding thresholds that reduce the number of candidate modes, which leads to early termination of CU splitting. The encoding time is thereby greatly decreased. Experimental results show a sharp reduction in encoding time with negligible degradation in video quality.
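The idea of threshold-based early termination can be illustrated with a minimal Python sketch. It is not the algorithm from the paper: the residual measure (mean absolute deviation from the block mean, standing in for a DC-prediction residual), the threshold value, the CU sizes, and the function name `split_cu` are all assumptions made for illustration.

```python
import numpy as np

def split_cu(depth, x, y, size, threshold, min_size=8):
    """Return the leaf CUs as (x, y, size) tuples.

    Hypothetical rule: stop splitting once the stand-in residual of the
    block falls below `threshold`, or the minimum CU size is reached.
    """
    block = depth[y:y + size, x:x + size].astype(np.float64)
    # Stand-in for a prediction residual: deviation from the block mean.
    residual = np.abs(block - block.mean()).mean()
    if size <= min_size or residual < threshold:
        return [(x, y, size)]  # early termination: no further modes are tested
    half = size // 2
    leaves = []
    for dy in (0, half):
        for dx in (0, half):
            leaves += split_cu(depth, x + dx, y + dy, half, threshold, min_size)
    return leaves

# Synthetic 64x64 depth map: flat background with one flat foreground square.
depth = np.full((64, 64), 50, dtype=np.uint8)
depth[0:32, 0:32] = 200
leaves = split_cu(depth, 0, 0, 64, threshold=2.0)
```

Because depth maps consist largely of smooth regions, a rule of this kind stops after one split on the synthetic input above, where an exhaustive search would continue down to the smallest CU size.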
ISBN: (print) 0819460958
Emerging 3-D displays show several views of the scene simultaneously. Direct transmission of a selection of these views is impractical, because different types of displays support different numbers of views and the decoder has to interpolate the intermediate views. The transmission of multiview image information can be simplified by transmitting only the texture data for the central view and a corresponding depth map. In addition to the coding of the texture data, this technique requires efficient coding of depth maps. Since the depth map represents the scene geometry and thereby covers the 3-D perception of the scene, sharp edges corresponding to object boundaries should be preserved. We propose an algorithm that models depth maps using piecewise-linear functions (platelets). To adapt to varying scene detail, we employ a quadtree decomposition that divides the image into blocks of variable size, each block being approximated by one platelet. To preserve sharp object boundaries, the support area of each platelet is adapted to the object boundary. The subdivision of the quadtree and the selection of the platelet type are optimized such that a global rate-distortion trade-off is realized. Experimental results show that the described method improves the picture quality after compression of depth maps by 1-3 dB compared with a JPEG-2000 encoder.
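The quadtree subdivision under a rate-distortion criterion can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: each block is fitted with a single plane only (the boundary-adapted platelet types are omitted), the bit costs `BITS_PER_LEAF` and `BITS_PER_SPLIT` are invented constants, and the names `fit_plane` and `rd_quadtree` are hypothetical. A block is kept whole when its Lagrangian cost D + lam * R does not exceed the summed cost of its four children.

```python
import numpy as np

BITS_PER_LEAF = 3 * 8 + 1  # assumed: three 8-bit plane coefficients plus a flag
BITS_PER_SPLIT = 1         # assumed: one flag signalling a subdivision

def fit_plane(block):
    """Least-squares fit of z = a*x + b*y + c; returns the squared error."""
    h, w = block.shape
    ys, xs = np.mgrid[0:h, 0:w]
    A = np.column_stack([xs.ravel(), ys.ravel(), np.ones(h * w)])
    z = block.ravel().astype(np.float64)
    coeffs, *_ = np.linalg.lstsq(A, z, rcond=None)
    return float(np.sum((A @ coeffs - z) ** 2))

def rd_quadtree(depth, x, y, size, lam, min_size=4):
    """Return (cost, leaves) minimising D + lam * R for this block."""
    block = depth[y:y + size, x:x + size]
    leaf_cost = fit_plane(block) + lam * BITS_PER_LEAF
    if size <= min_size:
        return leaf_cost, [(x, y, size)]
    half = size // 2
    split_cost, split_leaves = lam * BITS_PER_SPLIT, []
    for dy in (0, half):
        for dx in (0, half):
            cost, leaves = rd_quadtree(depth, x + dx, y + dy, half, lam, min_size)
            split_cost += cost
            split_leaves += leaves
    # Keep the block whole unless splitting lowers the Lagrangian cost.
    if leaf_cost <= split_cost:
        return leaf_cost, [(x, y, size)]
    return split_cost, split_leaves
```

With a fixed multiplier `lam`, comparing each parent against its children bottom-up yields the tree with the lowest total cost, which is one common way to realize a global rate-distortion trade-off over a quadtree. A smooth ramp stays a single block, while a block containing a depth step is subdivided.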
ISBN: (print) 0819459763
An efficient way to transmit multi-view images is to send the texture image together with a corresponding depth image. The depth image specifies the distance between each pixel and the camera. With this information, arbitrary views can be generated at the decoder. In this paper, we propose a new algorithm for the coding of depth images that provides an efficient representation of smooth regions as well as geometric features such as object contours. Our algorithm uses a segmentation procedure based on a quadtree decomposition and models the depth image content with piecewise-linear functions. We achieved a bit rate as low as 0.33 bit/pixel without any entropy coding. The appeal of the coding algorithm is that, by exploiting specific properties of depth images, no degradation appears along discontinuities, which is important for perceived depth.
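How a decoder might generate another view from texture plus depth can be sketched in Python under strong simplifying assumptions that are not taken from the paper: rectified cameras, a purely horizontal shift of the virtual camera, a grayscale texture, strictly positive depth values, and the standard relation disparity = focal length x baseline / depth. The function name `synthesize_view` and the sign of the shift are illustrative choices.

```python
import numpy as np

def synthesize_view(texture, depth, focal_px, baseline):
    """Forward-warp a grayscale texture to a horizontally shifted camera.

    Pixels that receive no source sample remain 0 (disocclusion holes).
    """
    h, w = depth.shape
    out = np.zeros_like(texture)
    zbuf = np.full((h, w), np.inf)  # nearest depth seen at each target pixel
    disparity = np.rint(focal_px * baseline / depth).astype(int)
    for y in range(h):
        for x in range(w):
            xt = x - disparity[y, x]  # assumed sign: virtual camera to the right
            if 0 <= xt < w and depth[y, x] < zbuf[y, xt]:
                zbuf[y, xt] = depth[y, x]  # closer surfaces occlude farther ones
                out[y, xt] = texture[y, x]
    return out
```

Because the shift of each pixel depends directly on its depth value, a coding error at a depth discontinuity moves foreground or background pixels to the wrong position in the synthesized view, which is why preserving those discontinuities matters for perceived depth.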