In this paper, we propose a new intra-frame coding approach using the predictive graph transform (PGT). The predicted block together with the reference pixels are modeled as a normal distributed random vector with res...
详细信息
ISBN:
(纸本)9781479923410
In this paper, we propose a new intra-frame coding approach using the predictive graph transform (PGT). The predicted block together with the reference pixels are modeled as a normal distributed random vector with respect to a graph whose edges represent the correlations between pixels. This model is more flexible than the Gaussian Markov random field (GMRF) model in the sense that it enables us to adapt the graph both before and after the collection of the statistics. The optimal prediction and the transform of the prediction residual are then derived jointly. Two PGT based intra coding schemes are proposed: one is based on global image statistics and the other is mode-adaptive, i.e., the graph is adaptive to different directional modes defined in H.264/AVC. The simulations show the advantage of our proposed approach over standard intra predictive transform coding in terms of both prediction quality and coding gain assuming the model parameters are known at decoder.
This paper provides a theoretical foundation for the closed-form expression of model observers on compressed images. in medical applications, model observers, especially the channelized Hotelling observer, have been s...
详细信息
ISBN:
(纸本)0819452858
This paper provides a theoretical foundation for the closed-form expression of model observers on compressed images. in medical applications, model observers, especially the channelized Hotelling observer, have been successfully used to predict human observer performance and to evaluate image quality for detection tasks in various backgrounds. To use model observers, however, requires knowledge of noise statistics. This paper first identifies quantization noise as the sole distortion source in transform coding, one of the most commonly used methods for image compression. Then, it represents transform coding as a I-D block-based matrix expression, it further derives first and second moments, and the probability density function (pdf) of the compression noise at pixel, block and image levels. The compression noise statistics depend on the transform matrix and the quantization matrix in the transform coding algorithm. Compression noise is jointly normally distributed when the dimension of the transform (the block size) is typical and the contents of image sets vary randomly. Moreover, this paper uses JPEG as a test example to verify the derived statistics. The test simulation results show that the closed-form expression of JPEG quantization and compression noise statistics correctly predicts the estimated ones from actual images.
Modern perceptual image coders reach impressively high subjective quality even at low bit-rates but tend to denoise or "detexturize" the coded pictures. Traditionally, two independent parametric approaches, ...
详细信息
ISBN:
(纸本)9781538660959
Modern perceptual image coders reach impressively high subjective quality even at low bit-rates but tend to denoise or "detexturize" the coded pictures. Traditionally, two independent parametric approaches, known as texture and film grain synthesis, have been applied in the spatial domain as pre and post-processors around the codec to counteract such effects. In this work, a unified alternative, operating directly within the spectral domain of conventional transform codecs with tight coupling to the transform coefficient quantizer, is proposed. Due to its design, this spectrally adaptive noise filling tool (SANFT) enables highly input adaptive realizations by reusing the coder's existing optimized spatial and spectral partitioning algorithms. Formal subjective evaluation in the context of a "main still picture" High Efficiency Video coding (HEVC) implementation confirms the benefit of the proposal.
The JPEG lossy compression technique in medical imagery has several disadvantages (at higher compression ratios), mainly due to block-distortion. We therefore investigated two methods, the lapped orthogonal transform ...
详细信息
ISBN:
(纸本)0819423564
The JPEG lossy compression technique in medical imagery has several disadvantages (at higher compression ratios), mainly due to block-distortion. We therefore investigated two methods, the lapped orthogonal transform (LOT) and the DCT/DST coder, for the use on medical image data. These techniques are block-based but they reduce the block- distortion by spreading it out over the entire image. These compression techniques were applied on four different types of medical images (MRI image, x-ray image, angiogram and CT- scan). They were then compared with results from JPEG and variable block size DCT coders. At a first stage, we determined the optimal block size for each image and for each technique. It was found that for a specific image, the optimal block size was independent of the different transform coders. For the x-ray image, the CT-scan and the angiogram an optimal block size of 32 by 32 was found, while for the MRI image the optimal block size was 16 by 16. Afterwards, for all images the rate-distortion curves of the different techniques were calculated, using the optimal block size. The overall conclusion from our experiments is that the LOT is the best transform among the ones being investigated for compressing medical images of many different kinds. However, JPEG should be used for very high image qualities, as it then requires almost the same bit rate as the LOT and as it requires fewer computations than the LOT technique.
Demand for screen content videos that contain computer generated text and graphics is growing. They are very different from natural videos, because they include much sharper edge transitions and very repetitive patter...
详细信息
ISBN:
(纸本)9781467399616
Demand for screen content videos that contain computer generated text and graphics is growing. They are very different from natural videos, because they include much sharper edge transitions and very repetitive patterns. On this type of material, the efficacy of the conventional discrete cosine transform (DCT) is questionable because it relies on the assumption that a Gauss-Markov model leads to a base-band signal. However, the assumption may not hold true for screen content material. This work exploits a class of staircase transforms. Unlike the DCT whose bases are samplings of sinusoidal functions, the staircase transforms have their bases sampled from staircase functions, which better approximate the sharp transitions often encountered in the context of screen content. The staircase transform is integrated into a hybrid transform coding scheme, in conjunction with DCT. It is experimentally shown that the proposed approach provides an average of 2.9% compression performance gains in terms of BD-rate reduction. A perceptual comparison further demonstrates that the use of staircase transform achieves substantial reduction in ringing artifact due to the Gibbs phenomenon.
This paper is concerned with transform coding of correlated sources in conjunction with variable rate quantization at high resolution. The approach builds on our prior work on optimality conditions for transform codin...
详细信息
ISBN:
(纸本)9780769546568
This paper is concerned with transform coding of correlated sources in conjunction with variable rate quantization at high resolution. The approach builds on our prior work on optimality conditions for transform coding in the point-to-point setting. The first contribution involves transform coding with decoder side information. In this setting, side information is only available to the decoder, whereas the encoder knows the joint statistics. The necessary and sufficient condition for optimality of a unitary transform in the side information setting is derived, namely, such transform minimizes a conditional divergence-based measure of inter-dependence of the transform coefficients, given the side information. This optimality result subsumes prior, known results that were restricted to the Gaussian case, where the conditional Karhunen-Loeve transform is optimal. The second contribution involves distributed transform coding, where two correlated sources are to be transform coded separately, but decoded jointly. The necessary and sufficient condition for optimality of unitary transforms in the distributed coding setting is derived. It is then specialized to produce closed form optimal transforms for specific source densities, including the case of jointly Gaussian sources.
One problem of conventional digital transmission systems is that the transmission fidelity always saturates at the design channel quality (worst-case channel condition);this saturation is due to the irreversible quant...
详细信息
ISBN:
(纸本)9781479903566
One problem of conventional digital transmission systems is that the transmission fidelity always saturates at the design channel quality (worst-case channel condition);this saturation is due to the irreversible quantization error introduced by the source coder. Hybrid digital analog (HDA) codes address this problem by additionally transmitting the inherent quantization error by pseudo-analog methods. However, the design and the decoding of these HDA codes is complex or even impossible for long block lengths (look-up table decoding). In this study we propose an HDA transmission system for sources with correlation supporting long block lengths. This is achieved by combining transform coding with HDA transmission using well-known digital channel codes. For the evaluation Monte Carlo simulations are used with a normalized discrete cosine transform (DCTN) and Turbo codes as the digital part and LMMSE estimation in the pseudo-analog part of the HDA system. The proposed HDA system with transform coding outperforms the purely digital transmission system at all channel qualities and exploits the expected gain due to source correlation.
We examine the performance of the KLT for transform coding applications. The KLT has long been viewed as the best available block transform for transform coding. This paper treats fixed-rate and variable-rate transfor...
详细信息
ISBN:
(纸本)0769518966
We examine the performance of the KLT for transform coding applications. The KLT has long been viewed as the best available block transform for transform coding. This paper treats fixed-rate and variable-rate transform codes. The fixed-rate approach uses an optimal fixed-rate scalar quantizer to describe the transform coefficients;the variable-rate approach uses a uniform scalar quantizer followed by an optimal entropy code. Earlier work shows that for the variable-rate case there exist sources on which the KLT is not unique and the optimal transform code matched to a "worst'' KLT yields performance as much as 1.5 dB worse than the optimal transform code matched to a "best" KLT. In this paper, we strengthen that result to show that in both the fixed-rate and the variable-rate coding frameworks there exist sources for which the performance penalty for using a "worst" KLT can be made arbitrarily large. Further, we demonstrate in both frameworks that there exist sources for which even a best KLT gives suboptimal performance. Finally, we show that even for vector sources where the KLT yields independent coefficients, the KLT can be suboptimal for fixed-rate coding.
Modern hybrid video codecs like Versatile Video coding (VVC) heavily rely on transform coding tools. Given a prediction signal at the encoder, the residual is transformed using trigonometric transforms. Rate-distortio...
详细信息
ISBN:
(纸本)9798350358483;9798350358490
Modern hybrid video codecs like Versatile Video coding (VVC) heavily rely on transform coding tools. Given a prediction signal at the encoder, the residual is transformed using trigonometric transforms. Rate-distortion-optimized quantization (RDOQ) and entropy coding of the transformed residual is well-understood due to the orthogonality and the energy compaction of these transforms. Within this setting, there is considerable success in optimizing secondary orthogonal transforms. The most prominent example is the Low-Frequency Non-Separable transform (LFNST) in VVC. However, training nonlinear transforms without re-designing the RDOQ and entropy coding stage is a hard problem. In learned image compression, variational autoencoders have shown impressive results, but they use their own entropy model, remain difficult to train for small blocks and RDOQ is nontrivial for them. This paper describes a novel design of a nonlinear transform network for block-based video coding. Given a transform block, a fully-connected neural network predict coefficients from previously reconstructed ones and the adherent block boundary, such that only the residual coefficients need to be transmitted. Furthermore, another neural network filters the entire transform block before the inverse transform is applied and the intra prediction signal is added. Against the Versatile Video coding Test Model 14.2 (VTM-14.2), luma bit-rate savings of approximately 1.9 % are reported for the All-Intra configuration.
In this paper, a high efficient arbitrary transform blocks hardware allocation framework is proposed, which can adapt the prediction tree structure and then improve the utilization ratio as well as a parallel hardware...
详细信息
ISBN:
(纸本)9781467364997
In this paper, a high efficient arbitrary transform blocks hardware allocation framework is proposed, which can adapt the prediction tree structure and then improve the utilization ratio as well as a parallel hardware design to improve the data throughput. This method will configure an appropriate combination of five different size inverse transform units: Fast IDST, 4x4 IDCT, 8x8 IDCT, 16x16 IDCT and 32x32 IDCT. If the input video stream changed, it will reconfigure the combination and allocated the hardware resources to retain a high utilization ratio in hardware framework. Experiments show that the performance of the proposed method is improved from 48.8% to 96.2% under various conditions. The proposed method can enhance the efficiency of H.265 decoder.
暂无评论