As the number of views comprised in multi-viewvideos increases, some challenging problems emerge. Besides the bandwidth problems caused by the huge data flow, the calculation power needed by the multi-view encoder is...
详细信息
ISBN:
(纸本)9781612841625
As the number of views comprised in multi-viewvideos increases, some challenging problems emerge. Besides the bandwidth problems caused by the huge data flow, the calculation power needed by the multi-view encoder is an even higher burden than that of a single view encoder. In this paper, a complexity efficient way to encode a single-time instant of 5x3 view frames is presented. Some of the P frames known from the traditional encoding schemes have been replaced by a new type of frame called the D frame, in which the disparity vector of a block in a view can be derived from the other views due to the strong geometrical correspondence existing between adjacent views. Experimental results show that 20.2% complexity gain is achieved without compromising quality and bit-rate by wisely selecting threshold values at different QPs.
Based on deeply analyzing and studying of multi-view video coding technology in JMVC (Joint multi-view video coding), an effective coding algorithm in views of model decision and fast block searching was put forward i...
详细信息
Based on deeply analyzing and studying of multi-view video coding technology in JMVC (Joint multi-view video coding), an effective coding algorithm in views of model decision and fast block searching was put forward in this paper. First, we use different search algorithms for interview prediction and frame prediction, respectively, to reduce search time and improve search efficiency. Second, we analyze the relationship among the model of current macroblock, the model of adjacent coded macroblocks and the corresponding position of reference frame. According to the result we can optimize the model decision algorithm for current coding macroblock, and reduce the search time for refined small block. The improved fast search algorithm is effective to the sequences containing fast motion area, and the improved model decision algorithm is effective to the sequences containing slow motion. Therefore, we combine the two improved algorithms, and apply them to sequences with different characteristics. The experimental results show that the improved algorithms get significant improvement in coding effect. Compared with the original algorithm in JMVC, the improved algorithm can reduce the coding time by 84.5% under the premise of insuring reconstruction video quality and coding overhead. (C) 2015 Elsevier GmbH. All rights reserved.
multi-viewvideo streaming is an emerging video paradigm that enables new interactive services, such as 3D video, free viewpoint television, and immersive teleconferencing. Because of the high bandwidth cost they come...
详细信息
multi-viewvideo streaming is an emerging video paradigm that enables new interactive services, such as 3D video, free viewpoint television, and immersive teleconferencing. Because of the high bandwidth cost they come with, multi-view streaming applications can greatly benefit from the use of network coding, in particular in transmission scenarios such as wireless network, where the channels have limited capacity and are affected by losses. In this paper, we address the topic of cooperative streaming of multi-viewvideo content, wherein users who recently acquired the content can contribute parts of it to their neighbors by providing linear combinations of the video packets. We propose a novel method for selection and network encoding of the transmitted frames based on the users' preferences for the different views and the rate-distortion properties of the stream. Using network coding enables the users to retrieve the content in a faster and more reliable manner and without the need for coordination among the senders. Our experimental results prove that our preference-based approach provides a high-quality decoding even when the uplink capacity of each node is only a small fraction of the rate of the stream.
Recently, with the increasing demand for virtual reality (VR), experiencing immersive contents with VR has become easier. However, a tremendous amount of calculation and bandwidth is required when processing 360 video...
详细信息
Recently, with the increasing demand for virtual reality (VR), experiencing immersive contents with VR has become easier. However, a tremendous amount of calculation and bandwidth is required when processing 360 videos. Moreover, additional information such as the depth of the video is required to enjoy stereoscopic 360 contents. Therefore, this paper proposes an efficient method of streaming high-quality 360 videos. To reduce the bandwidth when streaming and synthesizing the 3DoF+ 360 videos, which supports limited movements of the user, a proper down-sampling ratio and quantization parameter are offered from the analysis of the graph between bitrate and peak signal-to-noise ratio. High-efficiency videocoding (HEVC) is used to encode and decode the 360 videos, and the view synthesizer produces the video of intermediate view, providing the user with an immersive experience.
An efficient compression algorithm for multi-viewvideo sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimen...
详细信息
An efficient compression algorithm for multi-viewvideo sequences, which are captured by two-dimensional (2D) camera arrays, is proposed in this work. First, we propose a novel prediction structure, called three-dimensional hierarchical B prediction (3DHBP), which can efficiently reduce horizontal inter-view redundancies, vertical inter-view redundancies, and temporal redundancies in multi-viewvideos. Second, we develop a view interpolation scheme based on the bilateral disparity estimation. The interpolation scheme yields high quality view frames by adapting disparity estimation and compensation procedures using the information in neighboring frames. Simulation results demonstrate that the proposed multi-view video coding algorithm provides significantly better rate-distortion (R-D) performance than the conventional algorithm, by employing the 3DHBP structure and using interpolated view frames as additional reference frames. 2009 Elsevier Inc. All rights reserved.
The multiview extension of HEVC (MV-HEVC) with an ever increasing number of coding parameters allows much better coding performance compared to the previous multiviewvideocoding standards. Usually, controlling the v...
详细信息
ISBN:
(纸本)9781509059638
The multiview extension of HEVC (MV-HEVC) with an ever increasing number of coding parameters allows much better coding performance compared to the previous multiviewvideocoding standards. Usually, controlling the various coding parameters in a videocoding standard to obey the rate constraints in various applications is a major challenge. Rate Control Algorithms (RCA) is used in videocoding standards to control the coding parameters to satisfy different rate constraints. Although RCA is not part of a videocoding standard but the specific features or aspects of the videocoding standard can improve its efficiency considerably. In this paper, we propose a method to investigate the effect of a new feature of MV-HEVC, namely Advanced Motion Vector Prediction (AMVP), to improve the efficiency of the corresponding rate control algorithm. Rate control algorithms usually utilize a rate-distortion (RD) model as a basic tool to describe the relationship between the rate and the quality of the encoded video. We will show that considering the aforementioned feature of MV-HEVC videocoding standard as a view-level RD model parameter can improve the effectiveness of the RD model. Evaluation results indicate that using this new feature of MV-HEVC in view-level RD model, we can predict the rate of each view with relatively high precision and a low estimation error of 11% on average.
In this paper a multi-view Distributed videocoding scheme for mobile applications is presented. Specifically a new fusion technique between temporal and spatial side information in Zernike Moments domain is proposed....
详细信息
ISBN:
(纸本)9780819484185
In this paper a multi-view Distributed videocoding scheme for mobile applications is presented. Specifically a new fusion technique between temporal and spatial side information in Zernike Moments domain is proposed. Distributed videocoding introduces a flexible architecture that enables the design of very low complex video encoders compared to its traditional counterparts. The main goal of our work is to generate at the decoder the side information that optimally blends temporal and interview data. multi-view distributed coding performance strongly depends on the side information quality built at the decoder. At this aim for improving its quality a spatial view compensation/prediction in Zernike moments domain is applied. Spatial and temporal motion activity have been fused together to obtain the overall side-information. The proposed method has been evaluated by rate-distortion performances for different inter-view and temporal estimation quality conditions.
In this paper, a distributed image coding scheme for multi-viewvideo through an efficient generation of side information is proposed. A distributed videocoding technique corrects the errors in the side information, ...
详细信息
In this paper, a distributed image coding scheme for multi-viewvideo through an efficient generation of side information is proposed. A distributed videocoding technique corrects the errors in the side information, which is generated with the original image, by using the channel coding technique at the decoder. Therefore, the more correct the generated side information is, the better the performance of distributed videocoding. The proposed technique is to apply the distributed videocoding schemes to the image coding for multi-viewvideo. It generates side information by selectively and efficiently using both 3-dimensional warping based on the depth map with spatially adjacent frame's and motion-compensated temporal interpolation with temporally adjacent frames. In this scheme the difference between the adjacent frames, the sizes of the motion vectors for the adjacent blocks, and the edge information are used as the selection criteria. From the experiments, it was observed that the quality of the side information generated by the proposed technique was improved by the average peak signal-to-noise ratio of 0.97dB than the one by motion-compensated temporal interpolation or 3-dimensional warping. The result from analyzing the rate-distortion curves revealed that the proposed scheme could reduce the bit-rate by 8.01% on average at the same peak signal-to-noise ratio value, compared to previous work.
This paper proposes a new motion vector (MV) prediction method in multi-view video coding (MVC). In order to exploit the information in adjacent views, inter-view MVs as well as temporal MVs are used in conventional M...
详细信息
This paper proposes a new motion vector (MV) prediction method in multi-view video coding (MVC). In order to exploit the information in adjacent views, inter-view MVs as well as temporal MVs are used in conventional MVC. Since the inter-view MVs are usually uncorrelated with the temporal MVs and most neighboring partitions have temporal MVs only, the conventional DPCM coding gain of inter-view MV is very low and thus the inter-view MVs are seldom selected. In order to increase the probability of interview MV selection, we define a virtual inter-view MV which can be generated from temporal MVs. Then, an inter-view MV is predicted using these neighboring virtual inter-view MVs, leading to less prediction error than using the temporal MVs. As a result, bit-rates are decreased by up to 9% for the view-temporal prediction structure. (C) 2010 Elsevier Inc. All rights reserved.
multi-view video coding (MVC) adopts variable size mode decision to achieve high coding efficiency. However, its high computational complexity is a bottleneck of enabling MVC into practical real-time applications. In ...
详细信息
multi-view video coding (MVC) adopts variable size mode decision to achieve high coding efficiency. However, its high computational complexity is a bottleneck of enabling MVC into practical real-time applications. In this paper, an early termination strategy is proposed for DIRECT mode decision of MVC by exploiting mode homogeneity and rate distortion (RD) cost correlation. By comparing the RD cost between DIRECT mode and Inter16x16 mode, an adaptive threshold is defined based on the MB's mode homogeneity and RD cost so as to early terminate the remaining inter and intra modes. Experimental results show that compared with the original JMVC model, the proposed approach can reduce the total encoding time from 65.08% to 91.45% (80.43% on average). Meanwhile, the Bjontegaard delta peak signal-to-noise ratio only decreases 0.031 dB and Bjontegaard delta bit rate increases 0.97% on average, which is a negligible loss of coding efficiency and superior to the performance of state-of-the-art methods.
暂无评论