ISBN: 9781479902941 (print)
This work presents a performance comparison of the two latest video coding standards, H.264/MPEG-AVC and H.265/MPEG-HEVC (High Efficiency Video Coding), as well as the recently published proprietary video coding scheme VP9. According to the experimental results, which were obtained for a whole test set of video sequences using similar encoding configurations for all three examined representative encoders, H.265/MPEG-HEVC provides significant average bit-rate savings of 43.3% and 39.3% relative to VP9 and H.264/MPEG-AVC, respectively. The experiments also show that the VP9 encoder produces an average bit-rate overhead of 8.4% at the same objective quality when compared to an open H.264/MPEG-AVC encoder implementation, the x264 encoder. On the other hand, the typical encoding times of the VP9 encoder are more than 100 times higher than those measured for the x264 encoder. When compared to the full-fledged H.265/MPEG-HEVC reference software encoder implementation, the VP9 encoding times are lower by a factor of 7.35, on average.
ISBN: 9786163618238 (print)
A depth map is a type of video sequence that carries the depth information of 3D objects. It is an important coding feature of recent 3D video coding standards and has been adopted in the latest 3D coding approaches, e.g. MV-HEVC and 3D-HEVC. It has been shown that support for depth map coding can significantly improve the coding performance for 3D video and provide more flexibility for 3D applications. Previous work shows that depth maps have coding properties that differ from those of traditional 2D sequences, and that many coding tools perform and behave differently on these two kinds of video. This paper investigates and analyzes these phenomena for depth map coding.
ISBN: 9781424407286 (print)
Nearly all block-based transform techniques developed so far for image and video coding applications choose the 2-D discrete cosine transform (DCT) of a square block shape. With almost no exception, this conventional DCT is implemented separably through two 1-D transforms, along the vertical and horizontal directions, respectively. In one of our recent works, we developed a directional DCT framework in which the first transform may follow a direction other than the vertical or horizontal one, while the second transform is arranged to be a horizontal one. Compared to the conventional DCT, our directional DCT framework has been demonstrated to provide better coding performance for image blocks that contain directional edges, a common scenario in many image and video signals. In this paper, we pursue an in-depth theoretical analysis to understand how the coding gain is produced in the directional DCT framework and how large it can be.
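As a point of reference for the conventional case this abstract describes, the following minimal sketch computes a 2-D DCT as two 1-D passes, vertical first and then horizontal. It is not the authors' directional DCT. The function names `dct_1d` and `dct_2d_separable` and the orthonormal DCT-II scaling are assumptions chosen for illustration.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence (scaling is an assumption)."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        acc = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                  for i in range(n))
        out.append(scale * acc)
    return out


def dct_2d_separable(block):
    """Conventional 2-D DCT: a 1-D DCT down each column, then along each row."""
    rows, cols = len(block), len(block[0])
    # Vertical pass: transform every column independently.
    col_coeffs = [dct_1d([block[r][c] for r in range(rows)])
                  for c in range(cols)]
    # Re-arrange so that vertical[k][c] is the k-th coefficient of column c.
    vertical = [[col_coeffs[c][k] for c in range(cols)] for k in range(rows)]
    # Horizontal pass: transform every row of the intermediate result.
    return [dct_1d(row) for row in vertical]


if __name__ == "__main__":
    flat = [[1.0] * 4 for _ in range(4)]
    for row in dct_2d_separable(flat):
        print([round(c, 6) for c in row])
```

For a constant block, all of the energy ends up in the top-left (DC) coefficient. In a directional variant of the kind the abstract describes, the first pass would run along a different direction while the second pass stays horizontal; that part is not sketched here.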
ISBN: 9781728185798 (print)
This paper reviews video codecs that are in use or under development, the codec development process, current trends, and the challenges and opportunities for the research community. Video coding standards are undergoing a paradigm shift: standardisation organisations are now standardising multiple video standards concurrently, while royalty-free video compression standards are also being developed and standardised. The introduction of enhancement-layer-based coding standards will extend the lifetime of legacy video codecs by finding a middle ground between improved coding efficiency, computational complexity and power requirements. The video coding landscape is changing, challenged by the emergence of multiple video coding standards for different use cases. These changes may offer opportunities for the coding industry, especially for New Zealand researchers serving niche markets in video games, computer-generated videos and animations.
ISBN: 0780370252 (print)
The ERIC (Energy Ramp In Critical-band) criterion for window switching in perceptual audio coding is proposed in this paper. This new criterion can effectively distinguish real burst signals from stationary signals with high perceptual entropy. Therefore, compared with the Perceptual Entropy criterion used in the MPEG standard, the proposed criterion reduces the likelihood of misjudging the window type. This results in improvements in both audio quality and coding efficiency.
ISBN: 9781424445936 (print)
We recognize that the key challenge in image compression is to efficiently represent and encode high-frequency image structural components, such as edges, patterns, and textures. Existing image compression schemes attempt to predict image data using its spatial neighborhood. In this work, we develop an efficient image compression scheme based on super-spatial prediction of structural units. This so-called super-spatial prediction breaks the neighborhood constraint, attempting to find an optimal prediction of structural components within the whole image domain. We consider only lossless image compression. Our extensive experimental results demonstrate that the proposed scheme is very competitive with, and even outperforms, state-of-the-art image compression methods.
ISBN: 9781538646953 (print)
The second-generation digital terrestrial video broadcasting system (DVB-T2) offers, compared to its predecessor, extended system configuration options for broadcasting TV services in different transmission scenarios. Following the report ITU-R BT.2254-2, this paper presents the minimum carrier-to-noise (C/N) values required for correct reception of the DVB-T2 signal, fulfilling the quasi-error-free (QEF) condition, under portable indoor reception scenarios. The influence of different code rates (CRs), M-QAM modulations, guard interval (GI) lengths and pilot patterns (PPs) on DVB-T2 performance is studied. In addition, this study takes into account in-phase and quadrature (IQ) errors, which can occur in the Orthogonal Frequency Division Multiplexing (OFDM) modulator. All the theory-based results are confirmed by extensive laboratory measurements. The obtained results extend the ITU-R BT.2254-2 report with a new set of C/N values for DVB-T2 signals under different portable indoor reception scenarios, which broadcasters can use for DVB-T2 network planning.
ISBN: 9780819492319 (print)
In this paper, we investigate the frequency sensitivity of the human visual system. The human visual system reacts differently at different frequencies. Based on this observation, we used different quantization steps for different frequency components to explore the possibility of improving coding efficiency while maintaining perceptual video quality. In other words, small quantization steps were used for sensitive frequency components while large quantization steps were used for less sensitive frequency components.
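The idea of frequency-dependent quantization can be sketched as follows. This is a minimal illustration, not the quantization used in the paper: the step-size rule `step_for`, its parameters `base_step` and `slope`, and the assumption that sensitivity simply falls as the frequency index grows are all hypothetical.

```python
def step_for(u, v, base_step=4.0, slope=2.0):
    """Hypothetical rule: the step grows linearly with the frequency index u + v."""
    return base_step + slope * (u + v)


def quantize(coeffs):
    """Map each transform coefficient to an integer level using its own step."""
    return [[round(c / step_for(u, v)) for v, c in enumerate(row)]
            for u, row in enumerate(coeffs)]


def dequantize(levels):
    """Reconstruct coefficients by scaling each level with the same step."""
    return [[lvl * step_for(u, v) for v, lvl in enumerate(row)]
            for u, row in enumerate(levels)]


if __name__ == "__main__":
    # Made-up transform coefficients; index (0, 0) is the lowest frequency.
    coeffs = [[100.0, 37.5, -12.0],
              [25.0, -8.0, 3.0],
              [-6.0, 2.0, 1.0]]
    levels = quantize(coeffs)
    print(levels)
    print(dequantize(levels))
```

With this rule, the reconstruction error of each coefficient is bounded by half of its own step, so low-index components are reproduced more accurately than high-index ones. A perceptually tuned scheme would replace `step_for` with steps derived from measured sensitivity.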
ISBN: 9781479941711 (print)
Embedded zerotree wavelet (EZW) coding is an effective image coding algorithm. A study of its output stream shows that the P, N, T and Z symbols occur with unequal probabilities. To reduce the number of bits needed for coding, this paper proposes a joint EZW and Huffman encoding algorithm. The experimental results show that, compared with the standalone EZW algorithm, the joint encoding algorithm improves the efficiency of image compression and coding.
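To illustrate why unequal symbol probabilities leave room for savings, the sketch below builds a Huffman code for a stream of P, N, T and Z symbols. The sample stream is made up, and the abstract does not say how the paper couples Huffman coding to the EZW passes, so `huffman_code` and `encode` are hypothetical helpers showing only the entropy-coding step.

```python
import heapq
from collections import Counter


def huffman_code(symbols):
    """Build a prefix code (symbol -> bit string) from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:
        # Degenerate case: a single distinct symbol still needs one bit.
        return {next(iter(freq)): "0"}
    # Heap entries: (count, tie_breaker, partial code table for that subtree).
    heap = [(count, i, {sym: ""})
            for i, (sym, count) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        c1, _, t1 = heapq.heappop(heap)
        c2, _, t2 = heapq.heappop(heap)
        # Merge the two rarest subtrees, prefixing their codes with 0 and 1.
        merged = {s: "0" + code for s, code in t1.items()}
        merged.update({s: "1" + code for s, code in t2.items()})
        heapq.heappush(heap, (c1 + c2, next_id, merged))
        next_id += 1
    return heap[0][2]


def encode(symbols, code):
    """Concatenate the code words of all symbols into one bit string."""
    return "".join(code[s] for s in symbols)


if __name__ == "__main__":
    # Made-up symbol stream in which T dominates.
    stream = "TTTTTTTTZZZZPPN"
    code = huffman_code(stream)
    bits = encode(stream, code)
    print(code)
    print(len(bits), "bits with Huffman vs", 2 * len(stream), "bits at 2 bits/symbol")
```

Frequent symbols receive short code words and rare ones longer code words, so a skewed stream takes fewer bits than a fixed 2-bit-per-symbol representation.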
ISBN: 9781424456536 (print)
One approach to increasing compression efficiency beyond the data rates achievable by state-of-the-art video codecs is to use content-based methods in which not all pixels are conventionally encoded. One way to reduce the data rate is to use different coding methods for pixels belonging to areas that contain a large amount of detail and are costly to encode, for example textures. This can be extended by focusing on the semantic meaning of objects represented in the video sequence and by taking Human Visual System properties into consideration. The goal is to determine where "detail-irrelevant" regions are located in the frame and to synthesize them with acceptable perceptual quality. In this paper, we discuss the effects and trade-offs of these techniques based on a set of perceptual experiments and analyze how these areas can influence the viewer's attention.