A minimum edge difference (MED) predictor is proposed to enhance the performance of the basic JPEG (Joint Photographic Experts Group) system. The MED predictor uses the edge difference between adjacent blocks for pred...
详细信息
A minimum edge difference (MED) predictor is proposed to enhance the performance of the basic JPEG (Joint Photographic Experts Group) system. The MED predictor uses the edge difference between adjacent blocks for prediction of the DC coefficient of the current block. Analysis based on a statistical model shows that the proposed predictor can encode the DC coefficients more effectively than the JPEG predictor and the optimal single element 1-D predictor. Computer simulation using real images also shows that, when the proposed MED predictor is used, there is about 5% bit rate reduction over the basic JPEG system.< >
The concept and design methods for efficient use of vector quantization (VQ) in high-fidelity audio coding are presented. It is demonstrated that with constrained-storage VQ (CSVQ), tree-structured codebooks can be co...
详细信息
The concept and design methods for efficient use of vector quantization (VQ) in high-fidelity audio coding are presented. It is demonstrated that with constrained-storage VQ (CSVQ), tree-structured codebooks can be constructed for very high rates without incurring an exponential growth in storage complexity and without impairing the rate-distortion performance. Nonlinear interpolative VQ allows efficient coding of the power envelope needed for transform-coefficient normalization and adaptive distortion assignment. These techniques lead to a substantial reduction in the overall bit rate and codebook storage for the audio coder.< >
A novel very-low-bit-rate textured image compression scheme is proposed in this scheme, both texture self-similarity property and affine transform are used as basis to achieve coding tasks. An input texture image can ...
详细信息
A novel very-low-bit-rate textured image compression scheme is proposed in this scheme, both texture self-similarity property and affine transform are used as basis to achieve coding tasks. An input texture image can be reconstructed from both the prototype block spectral and the corresponding affine transform parameters. These parameters are estimated from the prototype block spectral and that of the rest blocks using the multiresolution Fourier transform, MFT, of the input image. In the synthesizing process, two different groups of parameters are generated, the MFT coefficients of the prototype block and the estimated affine parameters amongst blocks. In order to achieve the very-low-bit-rate coding scheme, this paper presents the quantization and coding scheme of the transformed coefficients and the estimated affine parameters. Consequently, figures of the compression and distortion ratio of the reconstructed image can be measured to analyze the structure of textured image over various scales.
transform based image compression has difficulty with image regions containing edges. Edge compensated transform coding (ECTC) addresses this problem by preprocessing to remove edges. This preprocessing is adapted to ...
详细信息
transform based image compression has difficulty with image regions containing edges. Edge compensated transform coding (ECTC) addresses this problem by preprocessing to remove edges. This preprocessing is adapted to transform coding. The edge information is sent in a side channel and the edges are replaced at the receiver. Subjective improvement is demonstrated.< >
In this paper, we introduce a multistage vector quantization technique (MVQ-OBA) applied to the transform coefficients where the effective number of bits assigned to each coefficient is proportional to the coefficient...
详细信息
In this paper, we introduce a multistage vector quantization technique (MVQ-OBA) applied to the transform coefficients where the effective number of bits assigned to each coefficient is proportional to the coefficient variance. A (scalar) optimal bit allocation map for a given bit rate, {B(ij)}, is first found based on the variances of the transform coefficients. The optimal bit allocation map {B(ij)} is then sliced into a set of bit allocation planes {B(k, ij), k = 0, 1, ...} by applying a set of thresholds {T(k), k = 0, 1, ...} Here, B(k, ij) indicates the number of bits assigned to coefficient (i, j) at stage k. The transformed image is then vector quantized on a stage-by-stage basis where, at each stage k, only the (residual error) coefficients assigned a nonzero number of bits are combined into vectors and vector quantized with a codebook of size 2-SIGMA-i, j (B)k, ij. Since only part of the coefficients are included into vectors and a relatively small codebook is used at each stage, both the overhead for transmitting the codebook and the complexity in codebook design are significantly reduced. Furthermore, as MVQ-OBA operates in a multistage manner where the information transmitted up to each stage corresponds to an approximation of the image, it is well suited for progressive image transmission. Finally, MVQ-OBA is shown to be competitive with other transform coding techniques in terms of performance.
We extend Wyner-Ziv high-rate quantization and transform coding theory to the case in which a noisy observation of some source data is available at the encoder, but we are interested in estimating the unseen source da...
详细信息
We extend Wyner-Ziv high-rate quantization and transform coding theory to the case in which a noisy observation of some source data is available at the encoder, but we are interested in estimating the unseen source data at the decoder, with the help of side information. Ideal Slepian-Wolf coders are assumed, thus rates are conditional entropies of quantization indices given the side information. transform coders of noisy images for different communication constraints are compared. Experimental results show that the Wyner-Ziv transform coder achieves a performance close to the case in which the side information is also available at the encoder.
A novel splitting strategy for variable block-size image coding based on edge information is presented, which makes it possible to reduce the number of blocks to be encoded while maintaining an in-depth split in edge ...
详细信息
A novel splitting strategy for variable block-size image coding based on edge information is presented, which makes it possible to reduce the number of blocks to be encoded while maintaining an in-depth split in edge areas. This approach has been applied to a standard transform coding algorithm, and results have been compared with those obtained by a classical splitting structure. Tests showed a notable increase in the compression factor under the same SNR conditions. Reconstructed images are affected by the blocking effect to a smaller extent, as the problem of generating a large number of small blocks in the uniform areas surrounding an edge is overcome by the proposed split topology.< >
Modern hybrid video codecs like Versatile Video coding (VVC) heavily rely on transform coding tools. Given a prediction signal at the encoder, the residual is transformed using trigonometric transforms. Rate-distortio...
详细信息
ISBN:
(数字)9798350358483
ISBN:
(纸本)9798350358490
Modern hybrid video codecs like Versatile Video coding (VVC) heavily rely on transform coding tools. Given a prediction signal at the encoder, the residual is transformed using trigonometric transforms. Rate-distortion-optimized quantization (RDOQ) and entropy coding of the transformed residual is well-understood due to the orthogonality and the energy compaction of these transforms. Within this setting, there is considerable success in optimizing secondary orthogonal transforms. The most prominent example is the Low-Frequency Non-Separable transform (LFNST) in VVC. However, training nonlinear transforms without re-designing the RDOQ and entropy coding stage is a hard problem. In learned image compression, variational autoencoders have shown impressive results, but they use their own entropy model, remain difficult to train for small blocks and RDOQ is nontrivial for them. This paper describes a novel design of a nonlinear transform network for block-based video coding. Given a transform block, a fully-connected neural network predict coefficients from previously reconstructed ones and the adherent block boundary, such that only the residual coefficients need to be transmitted. Furthermore, another neural network filters the entire transform block before the inverse transform is applied and the intra prediction signal is added. Against the Versatile Video coding Test Model 14.2 (VTM-14.2), luma bit-rate savings of approximately 1.9 % are reported for the All-Intra configuration.
Nonstationarity in image textures requires implementation of variable-size partitioning into stationary regions for coding purposes. The popular methods in regular decomposition fail in accurately following the variou...
详细信息
Nonstationarity in image textures requires implementation of variable-size partitioning into stationary regions for coding purposes. The popular methods in regular decomposition fail in accurately following the various region boundaries. In the present work, the boundaries of different textures in the image are detected using a one-dimensional model of each row of the image. The autoregressive algorithm operates in 8 directions to enable an efficient detection of the boundaries. The edge map is then achieved by taking the result of edge detection operation in all directions into account. transform coding and run-length coding are applied to encode the textures and the boundaries, respectively.< >
暂无评论