The H.264/AVC video coding standard aims to deliver significantly better compression performance than all earlier video coding standards. To achieve this, a robust rate-distortion optimization (RDO) technique is employed to select the best coding mode and reference frame for each macroblock. As a result, complexity and computational load increase drastically. This paper presents a fast mode decision algorithm for H.264/AVC intraprediction based on local edge information. Prior to intraprediction, an edge map is created and a local edge direction histogram is established for each subblock. Based on the distribution of the edge direction histogram, only a small subset of the intraprediction modes is chosen for RDO calculation. Experimental results show that the fast intraprediction mode decision scheme increases the speed of intracoding significantly with negligible loss of peak signal-to-noise ratio.
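The edge-histogram idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Sobel operator, the four 45-degree direction bins, the amplitude weighting, and the names `edge_direction_histogram` and `dominant_directions` are all assumptions made for illustration, and the mapping from bins to actual H.264 intraprediction modes is left out.

```python
import math

def edge_direction_histogram(block, bins=4):
    """Amplitude-weighted histogram of edge directions in a 2-D luma block.

    block is a list of rows of pixel values. Sobel gradients are computed
    on interior pixels only. Bin 0 is centred on horizontal edges (0 deg),
    bin 2 on vertical edges (90 deg) when bins == 4. (Assumed layout.)
    """
    h, w = len(block), len(block[0])
    width = 180.0 / bins
    hist = [0.0] * bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (block[y - 1][x + 1] + 2 * block[y][x + 1] + block[y + 1][x + 1]
                  - block[y - 1][x - 1] - 2 * block[y][x - 1] - block[y + 1][x - 1])
            gy = (block[y + 1][x - 1] + 2 * block[y + 1][x] + block[y + 1][x + 1]
                  - block[y - 1][x - 1] - 2 * block[y - 1][x] - block[y - 1][x + 1])
            amp = abs(gx) + abs(gy)
            if amp == 0:
                continue  # flat pixel, no edge information
            # The edge runs perpendicular to the gradient.
            angle = (math.degrees(math.atan2(gy, gx)) + 90.0) % 180.0
            idx = int(((angle + width / 2) % 180.0) // width)
            hist[idx] += amp
    return hist

def dominant_directions(hist, keep=2):
    """Indices of the `keep` strongest bins; only these would go to RDO."""
    return sorted(range(len(hist)), key=lambda i: -hist[i])[:keep]

vertical_edge = [[0, 0, 0, 255, 255, 255] for _ in range(6)]
hist = edge_direction_histogram(vertical_edge)
candidates = dominant_directions(hist)
```

For the block with a single vertical edge, all histogram weight falls in the vertical bin, so a mode decision built on this sketch would evaluate the vertical-like predictors first and skip most of the others.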
To ensure the fidelity of virtual views, the rate-distortion optimization (RDO) criterion for the 3D extension of High Efficiency Video Coding (3D-HEVC) introduces the synthesized view distortion (SVD) into the rate-distortion (RD) cost. Obtaining accurate SVDs requires a rendering operation, which has fairly high computational complexity. To address this problem, a fast RDO method for depth maps is proposed that checks the RD cost while it is being calculated. Specifically, for a given coding mode, the RD cost is the sum of several cumulative items. If the accumulated RD cost equals or exceeds the minimum RD cost of the previously coded modes, the RD cost calculation for that mode need not continue. Existing methods for reducing encoding complexity usually aim at reducing the number of tested modes or block partitions. To the best of our knowledge, this is the first time that the latent redundant complexity in the RD cost calculation itself has been investigated and removed. Experimental results demonstrate that, compared with the 3D-HEVC reference software, the proposed method saves 28.1% of depth coding time with a small coding gain (0.04% BD-rate saving). An additional test evaluates four typical fast coding methods with and without the proposed method. Extensive results verify that the proposed method can be combined seamlessly with state-of-the-art methods.
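The early-termination rule described in this abstract can be sketched as follows. This is a simplified illustration, not the 3D-HEVC reference software: the function names and the mode dictionary are hypothetical, and the sketch assumes every cost item is non-negative, which is what makes stopping early safe.

```python
def rd_cost_with_early_exit(cost_terms, best_so_far):
    """Accumulate RD cost items; stop once the mode can no longer win.

    Assumes each item is non-negative, so the running total never
    decreases. Returns None when the mode is abandoned early.
    """
    total = 0.0
    for term in cost_terms:
        total += term
        if total >= best_so_far:
            return None  # equal to or above the current minimum: give up
    return total

def pick_mode(modes):
    """modes maps a (hypothetical) mode name to its iterable of cost items."""
    best_mode, best_cost = None, float("inf")
    for name, terms in modes.items():
        cost = rd_cost_with_early_exit(terms, best_cost)
        if cost is not None:
            best_mode, best_cost = name, cost
    return best_mode, best_cost

# Mode "B" is abandoned after its first item (20 >= 15).
result = pick_mode({"A": [5, 5, 5], "B": [20, 1, 1], "C": [3, 3, 3]})
```

Passing the items as a lazy iterable (for example a generator that renders one region at a time) is what would actually save work, since the remaining items of an abandoned mode are never computed.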
This paper describes a 360-degree video coding scheme submitted in response to the joint call for proposals on video compression with capability beyond HEVC, issued by ITU-T SG16 Q.6 (VCEG) and ISO/IEC JTC1/SC29/WG11 (MPEG) in October 2017. The proposed coding scheme uses projection format adaptation and spherical neighboring relationships to improve coding efficiency. A hybrid angular cubemap projection format is used to adapt the sampling within each face. The faces are packed adaptively, and the frame packing configuration is updated at regular intervals. Geometry padding of reference pictures is used to improve inter prediction. For intra prediction, inter prediction, and in-loop filters, the processes are modified to avoid using "wrong" neighbors in the frame-packed picture. Finally, post-filtering is used to reduce artifacts at face discontinuities. Experimental results demonstrate the superior compression efficiency of the proposed 360-degree video coding scheme. Based on the end-to-end weighted-to-spherically-uniform PSNR metric, it achieves average bit rate savings of 33.9% and 13.5% over the HM and joint exploration model anchors, respectively, encoded in the padded equirectangular projection format.
Building on transform-domain Wyner-Ziv video coding, this paper studies key-frame coding and proposes a coding mode selection method based on frequency band partitioning. First, the key frames are transformed by DCT and divided into a low-frequency band and a high-frequency band according to their frequency characteristics. The low-frequency band is then coded with Wyner-Ziv coding and the high-frequency band with intraframe coding. The coding mode decision is moved to the decoder, which further improves system performance without increasing encoder complexity. Compared with the reference algorithm, the proposed coding mode selection algorithm improves the peak signal-to-noise ratio by 1-5 dB.
In the classical block-matching motion-estimation approach, the motion vectors which result in minimum distortion between the estimated and the actual image block are chosen. However, these motion vectors may not be optimal in terms of coding efficiency. An analysis by synthesis method which selects the optimal motion vectors, using the resulting bit rate and distortion, is presented. A significant reduction in bit rate is achieved with virtually no degradation in objective image quality. H.263 is used in simulation experiments to test the algorithm.
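One common way to trade bit rate against distortion when choosing a motion vector is a Lagrangian cost J = D + λR. The sketch below uses that formulation as an assumption; the abstract does not state the exact criterion of its analysis-by-synthesis method. The function name, the candidate tuples, and the numbers are hypothetical.

```python
def select_motion_vector(candidates, lam):
    """Pick the motion vector minimising J = D + lam * R.

    candidates: iterable of (mv, distortion, rate_bits) tuples, where the
    distortion and rate are the values that result from actually coding
    the block with that vector (assumed to be supplied by the caller).
    """
    return min(candidates, key=lambda c: c[1] + lam * c[2])[0]

# Hypothetical candidates: the second has lower distortion but costs more bits.
candidates = [((0, 0), 100, 2), ((3, -1), 90, 20)]
mv_distortion_only = select_motion_vector(candidates, lam=0.0)
mv_rate_aware = select_motion_vector(candidates, lam=1.0)
```

With `lam=0.0` the choice reduces to classical minimum-distortion block matching; with a positive `lam` the cheaper vector wins whenever its extra distortion is outweighed by the bits it saves.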
This paper addresses the important aspect of compressing and transmitting video signals generated by wireless broadband networks while heeding the architectural demands imposed by these networks in terms of energy constraints as well as the channel uncertainty related to the wireless communication medium. Driven by the need to develop light, robust, energy-efficient, and low-delay video delivery schemes, a distributed video coding based framework dubbed PRISM is introduced. PRISM addresses the wireless video sensor network requirements far more effectively than current state-of-the-art standards like MPEG. This paper focuses on the case of a single video camera and uses it as a platform to describe the theoretical principles and practical aspects underlying distributed video coding.
In multi-view video coding, inter-view and temporal redundancies decrease the coding efficiency and video quality, and they need to be eliminated. This paper proposes a motion-estimation-based H.264 video coding method that uses an optimal search range, intended for video broadcasting from a studio. First, a point-matching tool is used to match corresponding points in the previous and current frames. Movement vectors are then computed from these points to estimate the corresponding points in the next frame, and the estimated points are used as the centers of circles that define the individual search ranges. The corresponding points in the next frame are found within these search ranges using optical flow. Finally, they are encoded with the disparities and transmitted using the H.264 standard. In experiments with standard videos, the proposed method improves the PSNR (peak signal-to-noise ratio) by approximately 0.2-0.3 dB and the computation time by approximately 84 ms per 100 frames.
Basic characteristics of variable-rate video coders applied to asynchronous transfer mode (ATM) transmission are described. Burstiness of video information is evaluated for conference-type scenes using various coding algorithms. Three measures (distribution, autocorrelation, and coefficient of variation) are introduced to evaluate burstiness. Video sources are modeled and characterized by the autoregressive process and coefficient of variation. Video quality improvement achieved with variable-rate transmission is evaluated using signal-to-noise ratio (SNR) and subjective ratings. An improvement of 5-10 dB in temporal SNR and 1 rank in mean opinion score are reported.
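Two of the burstiness measures named in this abstract can be computed from a bits-per-frame series as in the sketch below. The definitions used here (population standard deviation over mean, and the biased sample autocorrelation) are standard textbook forms and are assumed, not taken from the paper; the sample series is made up.

```python
import statistics

def coefficient_of_variation(bits_per_frame):
    """Population standard deviation divided by the mean."""
    return statistics.pstdev(bits_per_frame) / statistics.fmean(bits_per_frame)

def autocorrelation(bits_per_frame, lag):
    """Biased sample autocorrelation at the given lag (requires variance > 0)."""
    n = len(bits_per_frame)
    mean = statistics.fmean(bits_per_frame)
    var = sum((v - mean) ** 2 for v in bits_per_frame)
    cov = sum((bits_per_frame[i] - mean) * (bits_per_frame[i + lag] - mean)
              for i in range(n - lag))
    return cov / var

# Made-up series that alternates between small and large frames.
series = [1, 3, 1, 3]
cv = coefficient_of_variation(series)   # 0.5
rho1 = autocorrelation(series, lag=1)   # -0.75
```

A larger coefficient of variation indicates a burstier source, and a slowly decaying autocorrelation indicates that bursts persist across frames, which matters when sizing buffers for variable-rate transmission.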
Current techniques for coding images and video sources with resilience to channel errors can remove much of the need for complex high-redundancy channel coding and provide a graceful degradation of performance with decreasing channel quality. The main function of these error-resilient techniques is to reduce the propagation of errors within the decoded data. Two main techniques are discussed in detail in this paper: the error-resilient entropy code (EREC) and pyramid vector quantisation (PVQ). The paper concludes with a brief comparison of the relative merits of these systems and areas for further consideration.
Since the 1970s, various image and video coding techniques have been explored, and some of them have been included in the video coding standards issued by the International Organization for Standardization (ISO)/International Electrotechnical Commission (IEC) Moving Picture Experts Group (MPEG) and the International Telecommunication Union-Telecommunication Standardization Sector (ITU-T) Video Coding Experts Group (VCEG). MPEG is the most successful standards development organization (SDO) for multimedia compression standardization. In particular, most of the widely deployed video coding standards of the past 30 years have been developed within this working group. One of the first, standardized in 1996, was the MPEG-2 standard [1], which is still in use as a digital TV standard in many countries.