This is a correction to the “transform coding Techniques in HEVC” article published in the IEEE Selected Topics in Signal Processing, vol. 7, no. 6, Dec. 2013
This is a correction to the “transform coding Techniques in HEVC” article published in the IEEE Selected Topics in Signal Processing, vol. 7, no. 6, Dec. 2013
Improvements in compression performance can be achieved if the image transform is able to adapt to the shape of the objects present. Ne propose a new hybrid shape-adaptive orthogonal transform which is based on M-chan...
详细信息
Improvements in compression performance can be achieved if the image transform is able to adapt to the shape of the objects present. Ne propose a new hybrid shape-adaptive orthogonal transform which is based on M-channel subband decomposition and shape-adaptive DCT techniques, The signal extension method for the region-based dyadic wavelet transform is applied to region-based image coding using M-channel filter banks. Simulation results show that the proposed method outperforms shape-adaptive DCT coding.
The method of transform coding for image-data compression is generalized by regarding transform coding as a least-squares approximation of two-dimensional functions. By an orthogonalization of basis functions with res...
详细信息
The method of transform coding for image-data compression is generalized by regarding transform coding as a least-squares approximation of two-dimensional functions. By an orthogonalization of basis functions with respect to a particular segment shape, a generalized transform coding scheme is derived. The algorithm contains all block-oriented transforms as special cases and allows the construction of new transforms, e.g. polynomial or spline transforms. The theoretical results are converted into coder and decoder structures enabling region-oriented transform coding without a transmission of the orthogonal basis functions for each segment.< >
It has been shown [1] that an analysis/synthesis system based on a sinusoidal representation leads to synthetic speech that is essentially indistinguishable from the original. By exploiting the peak-to-peak correlatio...
详细信息
It has been shown [1] that an analysis/synthesis system based on a sinusoidal representation leads to synthetic speech that is essentially indistinguishable from the original. By exploiting the peak-to-peak correlation of the sine-wave amplitudes [2], a harmonic model for the sine-wave frequencies, and a predictive model for the sine-wave phases [3], it has also been shown that the sine-wave parameters can be coded at 8 kbps. In this paper a new technique is described for coding the sine-wave amplitudes based on the idea of a pitch-adaptive channel vocoder. Using this amplitude-coding strategy and operating at a total bit rate of 4.8 kbps, it was possible to code and transmit enough phase information so that very intelligible, natural sounding speech could be synthesized. This 4.8 kbps system has been implemented in real-time and has achieved a Diagnostic Rhyme Test (DRT) score of 95. At 2.4 kbps no explicit phase information could be coded, but by phase-locking all of the sine waves to the fundamental, by adding a pitch-adaptive quadratic phase, and by adding a voicing dependent random phase to each sine wave, natural sounding synthetic speech could be obtained. This new system is currently being implemented in real-time so that intelligibility tests can be performed.
In low-bitrate audio coding, modern coders often rely on efficient parametric techniques to enhance the performance of the waveform preserving transform coder core. While the latter features well-known perceptually ad...
详细信息
In low-bitrate audio coding, modern coders often rely on efficient parametric techniques to enhance the performance of the waveform preserving transform coder core. While the latter features well-known perceptually adapted quantization of spectral coefficients, parametric techniques reconstruct the signal parts that have been quantized to zero by the encoder to meet the low-bitrate constraint. Large numbers of zeroed spectral values and especially consecutive zeros constituting gaps often lead to audible artifacts at the decoder. To avoid such artifacts the new 3GPP Enhanced Voice Services (EVS) coding standard utilizes noise filling and intelligent gap filling (IGF) techniques, guided by spectral envelope information. In this paper the underlying considerations of the parametric energy adjustment and transmission in EVS and its relation to noise filling, IGF, and tonality preservation are presented. It is further shown that complex-valued IGF envelope calculation in the encoder improves the temporal energy stability of some signals while retaining real-valued decoder-side processing.
A technique for sine-wave synthesis is described that uses the fast Fourier transform overlap-add method at a 100 Hz rate based on sine-wave parameter coded at a 50 Hz rate. This technique leads to an implementation r...
详细信息
A technique for sine-wave synthesis is described that uses the fast Fourier transform overlap-add method at a 100 Hz rate based on sine-wave parameter coded at a 50 Hz rate. This technique leads to an implementation requiring less than one-half the computational power of a digital-signal-processor chip. The synthesis method implicitly introduces a frequency jitter which renders the encoded synthetic speech more natural. For speech computed by additive acoustic noise, the synthesizer, in conjunction with straightforward noise suppression, greatly improve the quality of the synthetic speech, rendering the sinusoidal transform coder (STC) algorithm a truly robust system. More recent architecture studies of the STC algorithm suggests that an entire implementation requires no more than two ADSP2 100 chips.< >
A progressive image transmission scheme is presented which combines block transforms, a quadtree data structure and vector quantization. The experimental results demonstrate that the scheme achieves lossless progressi...
详细信息
A progressive image transmission scheme is presented which combines block transforms, a quadtree data structure and vector quantization. The experimental results demonstrate that the scheme achieves lossless progressive transmission with compression. The quadtree data structure makes it possible to transmit the images progressively. Compression is obtained by using vector quantization on each level. Lossless reproduction is guaranteed by delivering the residual errors due to quantization from high level to low level and using an entropy coder on the final residual error image.< >
A fast and computationally less complex coding technique is described which uses partial-LOT computation algorithm and efficiently discards perceptually insignificant high frequency transform coefficients. The coding ...
详细信息
A fast and computationally less complex coding technique is described which uses partial-LOT computation algorithm and efficiently discards perceptually insignificant high frequency transform coefficients. The coding process involves AC energy classification, human visual system weighted normalization and quantization. The values of normalization factors are image independent and governed only by the bit-rate and activity index of the image blocks. Zones of various shapes and sizes are chosen for each activity class to perform zonal sampling which efficiently discards high frequency coefficients having zero values. A progressive transmission version of the proposed technique is also given. This technique gives better performance at comparable bit-rate than that obtained by using LOT/CVQ and LOT/VQ schemes.
Summary form only given. We designed a family of integer-to-integer (i2i) approximations to the Cartesian-to-polar transformation and analyzed its behavior for high-rate transform coding. Denoting (ordinary, continuou...
详细信息
Summary form only given. We designed a family of integer-to-integer (i2i) approximations to the Cartesian-to-polar transformation and analyzed its behavior for high-rate transform coding. Denoting (ordinary, continuous) polar coordinates by (r, 0), our precise high-rate analysis relates the performance to the differential entropies of r 2 and 0, which are often easy to evaluate. One may thus predict when there is an improvement over linear transform coding. The analysis matches our simulations for coding of Gaussian scale mixtures and other polar-separable sources. The advantage over the best linear transform coder can be large. Our hope is to extend the polar-coordinate results to a general theory for nonlinear transform coding based on i2i implementations of arbitrary nonlinear transformations
This paper c the numerous transform coding strategies used for image and video compression when it comes to their effectiveness, implementation problems, and ability packages. transform coding is an information compre...
详细信息
ISBN:
(数字)9798350370249
ISBN:
(纸本)9798350370270
This paper c the numerous transform coding strategies used for image and video compression when it comes to their effectiveness, implementation problems, and ability packages. transform coding is an information compression approach that reduces the number of statistics by exploiting redundancies such as spatial and spectral correlations within the data. The primary remodel coding strategies mentioned encompass the discrete cosine rework (DCT), discrete wavelet remodel (DWT), block-based movement compensation (BMC), and changed discrete cosine remodel (MDCT). Furthermore, the paper compares the extraordinary remodel coding strategies in phrases in their performance, performance, and complexity. The paper starts by introducing the motivation at the back of rework coding techniques and their position in records compression. It then discusses how the numerous rework coding techniques compare to each other and why one technique can be favored over the alternative. The paper examines the overall performance of the special strategies by thinking about parameters including top sign-to-noise ratio (PSNR), compression fee, and objective/subjective nice. Finally, the paper affords an overview of the unique applications in which transform coding may be used. Standard, this paper affords an insightful evaluation of the diverse transform coding strategies used for information compression.
暂无评论