Motion estimation (ME) plays an important role in digital video compression, but finding an optimal motion vector is computationally expensive. Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. The computational complexity of motion estimation in multi-view video coding increases in proportion to the number of cameras. To reduce computational complexity while maintaining image quality, this paper proposes a modified TZ search method for motion estimation in multi-view video coding. The proposed search method exploits the characteristics of the distribution of motion vectors to place the search points. Experimental results show that the proposed method can be more than 77 times faster than Pel Block Search and 1.8 .5 times faster than the TZ search method (JMVC), with image quality degradation of about 0.05 to 1 dB and 0.01 to 0.24 dB, respectively.
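To make the cost that fast search methods try to avoid concrete, here is a minimal sketch of exhaustive block matching with a sum-of-absolute-differences (SAD) criterion. It is not the modified TZ search from the paper, whose search-point placement the abstract does not specify. The function names `sad` and `full_search`, the frame layout (lists of rows of integer luma samples), and the choice of SAD as the matching criterion are all assumptions made for illustration.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at
    (bx, by) and the block of `ref` displaced by (dx, dy)."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def full_search(cur, ref, bx, by, n, search_range):
    """Exhaustively test every integer displacement within +/- search_range.

    Returns (best_vector, best_cost, evaluated), where `evaluated` counts the
    candidate positions that were actually tested. Candidates whose block
    would fall outside the reference frame are skipped.
    """
    height, width = len(ref), len(ref[0])
    best_vector, best_cost, evaluated = (0, 0), None, 0
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            x0, y0 = bx + dx, by + dy
            if x0 < 0 or y0 < 0 or x0 + n > width or y0 + n > height:
                continue  # displaced block is not fully inside the frame
            cost = sad(cur, ref, bx, by, dx, dy, n)
            evaluated += 1
            if best_cost is None or cost < best_cost:
                best_vector, best_cost = (dx, dy), cost
    return best_vector, best_cost, evaluated
```

The number of tested candidates grows with the square of the search range, and in a multi-view encoder the search is repeated for every block of every view. A fast method reduces the `evaluated` count by testing only a subset of positions.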
The trend in video viewing has been evolving beyond simply providing a multi-view ***, a function that allows selection and viewing of a clip from a multi-view service that captures a specific range or object has been *** particular, the free-view service is an extended concept of multi-view and provides a freer ***, since numerous videos and additional data are required for its construction, all of the clips constituting the content cannot be simultaneously *** certain clips are selected and provided to the *** the video is not the preferred video, change request is made, and a delay occurs during retransmission from the *** due to frequent re-requests degrade the overall quality of *** free-view services, selectively transmitting the video according to the user's desired viewpoint and region of interest within the limited network of available videos is *** this study, we propose a method of screening and providing the correct video based on objects in the *** on the method of recognizing the object in each clip, we designed a method of setting its priority based on information about the object's location for each *** the transmission and receiving process using this information, the selected video can be rapidly recognized and ***, we present a service system configuration method and propose video selection examples for free-view services.
Computational imaging and light field technology promise to deliver the required six degrees of freedom for natural scenes in virtual reality. Existing extensions of standardized video coding formats, such as multi-view coding and multi-view plus depth, are currently the most conventional light field video coding solutions. The latest multi-view coding format, a direct extension of the High Efficiency Video Coding (HEVC) standard, is called multi-view HEVC (MV-HEVC). MV-HEVC treats each light field view as a separate video sequence and uses syntax elements similar to standard HEVC to exploit redundancies between neighboring views. To achieve this, inter-view and temporal prediction schemes are deployed with the aim of finding the best trade-off between coding performance and reconstruction quality. The number of possible prediction structures is unlimited, and many have been proposed in the literature. Although some of them are efficient in terms of compression ratio, they complicate random access because of their dependencies on previously decoded pixels or frames. Random access is an important feature in video delivery and a crucial requirement in multi-view video coding. In this work, we propose and compare different prediction structures for coding light field video using MV-HEVC, with a focus on both compression efficiency and random accessibility. Experiments on three different short-baseline light field video sequences show the trade-off between bit rate and distortion, as well as the average number of views/frames that must be decoded to display any random frame at any time instance. The findings of this work indicate the most appropriate prediction structure depending on the available bandwidth and the required degree of random access.
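The random-access measure mentioned in the abstract, the average number of views/frames that must be decoded to display a given frame, can be illustrated with a small sketch. It assumes a prediction structure is given as a dictionary mapping each frame to the frames it references directly. The names `decode_set` and `average_random_access_cost` and this representation are hypothetical, not taken from the paper or from the MV-HEVC reference software.

```python
def decode_set(frame, refs):
    """Return every frame that must be decoded to display `frame`,
    including `frame` itself, by following reference dependencies."""
    needed = set()
    stack = [frame]
    while stack:
        current = stack.pop()
        if current in needed:
            continue
        needed.add(current)
        stack.extend(refs.get(current, ()))
    return needed


def average_random_access_cost(refs):
    """Average size of the decode set over all frames in the structure."""
    frames = set(refs)
    for dependencies in refs.values():
        frames.update(dependencies)
    return sum(len(decode_set(f, refs)) for f in frames) / len(frames)
```

For three views, a chain (`v2` predicted from `v1`, `v1` from `v0`) gives costs 1, 2 and 3, so the average is 2.0. A star in which `v1` and `v2` both reference `v0` gives 1, 2 and 2, so the average is about 1.67. Shorter dependency paths improve random access, usually at some cost in compression efficiency.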
ISBN: (print) 9781467329880; 9781467329897
Multi-view video coding (MVC) produces more realistic three-dimensional scenes than mono and stereo video coding by using disparity (or depth) information derived from cameras placed in parallel. However, performance metrics such as chip area, power consumption, and processing time depend on how the multi-view frames are scheduled. This paper proposes a low-complexity frame waiting time aware scheduler (WTaS) for faster MVC. To reduce the frame memory size for MVC, we design a shared frame memory scheme in which the frame memories of many views share a smaller frame memory, giving high frame memory utilization. Experimental results show that the proposed low-complexity WTaS scheduler can reduce the frame memory size by 30% to 50% without degrading the processing time.
ISBN: (print) 9781538692233
Light field imaging has emerged as a technology that enables the capture of images and video with richer information. Captured content is composed of numerous views aligned in both horizontal and vertical directions providing full parallax, offering light intensity and directional information, but at the same time significantly increasing bandwidth requirements. Several multi-view coding methods have attempted to tackle this problem. However, these approaches do not fully assess the intricacies that are found in light field content. This paper proposes a prediction structure for coding light field content using the MV-HEVC standard, exploiting the inter-view correlations in two directions along with the high similarity between views around the center of each frame. Experimental results show BD-rate gains up to 38% compared to an existing state-of-the-art method.
ISBN: (digital) 9781510629684; (print) 9781510629684
Light field video is a promising technology for delivering the required six degrees of freedom for natural content in virtual reality. Existing multi-view coding (MVC) and multi-view plus depth (MVD) formats, such as MV-HEVC and 3D-HEVC, are the most conventional light field video coding solutions since they can compress video sequences captured simultaneously from multiple camera angles. 3D-HEVC treats a single view as a video sequence and the other sub-aperture views as gray-scale disparity (depth) maps. MV-HEVC, on the other hand, treats each view as a separate video sequence, which allows the use of motion-compensated algorithms similar to HEVC. While MV-HEVC and 3D-HEVC provide similar results, MV-HEVC does not require any disparity maps to be readily available, and it has a more straightforward implementation since it uses only syntax elements rather than additional prediction tools for inter-view prediction. However, there are many degrees of freedom in choosing an appropriate prediction structure, and it is still unknown which one is optimal for a given set of application requirements. In this work, various prediction structures for MV-HEVC are implemented and tested. The findings reveal the trade-off between compression gains, distortion, and random access capabilities in MV-HEVC light field video coding. The results give an overview of the best solutions developed in the context of this work and of prediction structure algorithms proposed in state-of-the-art literature. This overview provides a useful benchmark for future development of light field video coding solutions.
New video applications, such as 3D video and free viewpoint video, require efficient compression of multi-view video. In addition to temporal redundancy, exploiting the inter-view redundancy is crucial to improving the performance of multi-view video coding. In this paper, we present a novel method to construct the optimal inter-view prediction structure for multi-view video coding using simulated annealing. In the proposed model, the design of the prediction structure is converted to the arrangement of the coding order. A simulated annealing algorithm is then employed to minimize the total cost and obtain the best coding order. This method is applicable to arbitrary irregular camera arrangements. Experimental results show that the annealing process converges rapidly to satisfactory results and that the generated prediction structure outperforms the reference prediction structure of the joint multi-view video model (JMVM) by 0.1 to 0.8 dB in PSNR.
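The idea of turning prediction-structure design into a search over coding orders can be sketched with a generic simulated annealing loop. The abstract does not give the paper's cost function, so the one below is an assumption: the first view in the order pays a hypothetical intra cost, and every later view pays the cheapest hypothetical pairwise prediction cost to any view coded before it. The names `order_cost` and `anneal_order`, the swap move, and the geometric cooling schedule are illustrative choices, not the authors' method.

```python
import math
import random


def order_cost(order, pair_cost, intra_cost):
    """Assumed cost model: the first view is intra coded; each later view is
    predicted from the cheapest view that precedes it in the coding order."""
    total = intra_cost[order[0]]
    for i in range(1, len(order)):
        view = order[i]
        total += min(pair_cost[view][r] for r in order[:i])
    return total


def anneal_order(pair_cost, intra_cost, steps=2000, t0=10.0, alpha=0.995, seed=0):
    """Search for a low-cost coding order by simulated annealing.

    Requires at least two views. Returns (best_order, best_cost).
    """
    rng = random.Random(seed)
    n = len(intra_cost)
    current = list(range(n))
    current_cost = order_cost(current, pair_cost, intra_cost)
    best, best_cost = current[:], current_cost
    temperature = t0
    for _ in range(steps):
        i, j = rng.sample(range(n), 2)  # propose swapping two positions
        candidate = current[:]
        candidate[i], candidate[j] = candidate[j], candidate[i]
        candidate_cost = order_cost(candidate, pair_cost, intra_cost)
        delta = candidate_cost - current_cost
        # Always accept improvements; accept worse orders with a probability
        # that shrinks as the temperature falls.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current, current_cost = candidate, candidate_cost
            if current_cost < best_cost:
                best, best_cost = current[:], current_cost
        temperature *= alpha  # geometric cooling
    return best, best_cost
```

Because the cost depends only on the order and a pairwise cost table, the sketch makes no assumption about camera geometry, which is consistent with the abstract's point that the approach applies to irregular camera arrangements.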
ISBN: (print) 9781538650417
As the demand for virtual reality (VR) increases, immersive content has become easy to experience with VR. However, processing 360 videos requires a tremendous amount of computation and bandwidth. Moreover, to enjoy stereoscopic 360 content, additional information such as the depth of the video must be provided. This paper therefore proposes an efficient method for streaming high-quality 360 videos. To reduce the bandwidth needed for streaming and synthesizing 3DoF+ 360 video, which supports limited movements of the user, suitable down-sampling ratios and quantization parameters are identified from the relationship between bitrate and peak signal-to-noise ratio (PSNR). High Efficiency Video Coding (HEVC) is used to encode and decode the 360 video, and a view synthesizer produces the intermediate-view video to provide the user with an immersive experience.
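PSNR is the quality axis of the bitrate curves this abstract refers to. The sketch below computes it for two equally sized single-channel frames using the standard definition, 10 log10(peak^2 / MSE). The function name `psnr` and the list-of-rows frame layout are assumptions, as is the default peak of 255 for 8-bit samples.

```python
import math


def psnr(reference, degraded, peak=255):
    """Peak signal-to-noise ratio in dB between two equally sized frames,
    each given as a list of rows of sample values."""
    height, width = len(reference), len(reference[0])
    squared_error = 0
    for ref_row, deg_row in zip(reference, degraded):
        for a, b in zip(ref_row, deg_row):
            squared_error += (a - b) ** 2
    mse = squared_error / (height * width)
    if mse == 0:
        return math.inf  # identical frames
    return 10 * math.log10(peak * peak / mse)
```

Evaluating this for each combination of down-sampling ratio and quantization parameter, alongside the resulting bitrate, gives the points of a rate-distortion curve from which an operating point can be chosen.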
ISBN: (print) 9781612843490
The multi-view video coding (MVC) technique provides significantly better coding efficiency than simulcast transmission of different camera viewpoints. This is done by exploiting both motion and disparity compensation techniques to compress the different viewpoint videos. This paper proposes a more efficient MVC technique that transmits the optimally selected compensation replacements with respect to more accurate compensation vector predictors obtained using the multi-view geometry and the depth information. In this technique, the SKIP mode is also modified to adapt its compensation direction from both the temporal and the viewpoint reference frames. Experimental results show that the proposed MVC technique gives an average gain in video quality of about 0.5 dB for the inter-view predicted viewpoints.
ISBN: (print) 9781612841625
Multi-view video coding (MVC) employs both motion and disparity estimation within the encoding process. These provide a significant increase in coding efficiency at the expense of a substantial increase in computational requirements. This paper presents a fast motion and disparity estimation technique that uses the multi-view geometry together with the depth information and the corresponding encoded motion vectors from the reference view to produce more reliable motion and disparity vector predictors for the current view. This allows for a smaller search area, which reduces the computational cost of the multi-view encoding system. Experimental results confirm that the proposed techniques can provide a speed-up of up to 4.2 times with a negligible loss in rate-distortion performance for both the color and the depth MVC.