This paper introduces the 3D color set partitioning in hierarchical trees (3D-CSPIHT) low bit rate embedded video coding scheme. The codec exploits the correlation between temporal and spatial wavelet coefficients and...
详细信息
This paper introduces the 3D color set partitioning in hierarchical trees (3D-CSPIHT) low bit rate embedded video coding scheme. The codec exploits the correlation between temporal and spatial wavelet coefficients and the interdependency between luminance and chrominance components to code color video sequences without the need for explicit bit allocation. Besides offering rate scalability, the new codec also produces multi-resolution scalable code streams. The hierarchical variable size block matching motion estimation technique is also integrated to demonstrate the motion estimation option with 3D-CSPIHT. The coding results show that 3D-CSPIHT produces better performance and visual quality compared to 3D-SPIHT.
Thanks to advances in sensor technology, today we have many applications (space-borne imaging, medical imaging, etc.) where images of large sizes are generated. Straightforward application of wavelet techniques for ab...
详细信息
Thanks to advances in sensor technology, today we have many applications (space-borne imaging, medical imaging, etc.) where images of large sizes are generated. Straightforward application of wavelet techniques for above images involves certain difficulties. embedded coders such as EZW and SPIHT require that the wavelet transform of the full image be buffered for coding. Since the transform coefficients also require storing in high precision, buffering requirements for large images become prohibitively high. In this paper, we first devise a technique for embedded coding of large images using zero trees with reduced memory requirements. A 'strip buffer' capable of holding few lines of wavelet coefficients from all the subbands belonging to the same spatial location is employed. A pipeline architecure for a line implementation of above technique is then proposed. Further, an efficient algorithm to extract an encoded bitstream corresponding to a region of interest in the image has also been developed. Finally, the paper describes a strip based non-embedded coding which uses a single pass algorithm. This is to handle high-input data rates. (C) 2002 Elsevier Science B.V. All rights reserved.
The evolution of mobile network and the popularization of mobile devices;the demand for multimedia services and 3D graphics applications on limited resource devices is more contemporary. Most of the works on multimedi...
详细信息
The evolution of mobile network and the popularization of mobile devices;the demand for multimedia services and 3D graphics applications on limited resource devices is more contemporary. Most of the works on multimedia transmission are focused on bit errors and packet losses due to the fading channel environment of a wireless network. Error resilient multimedia is significant research topic which can be adapted to the different conditions in a wireless environment. The current solutions in transmission of multimedia across different networks include some type of transcoder where the source is partially or fully decoded, and re-encoded to suit the network conditions. This paper introduces a flexible progressive coding framework for 3D meshes, which can be adapted to the different conditions imposed by wired and wireless channels at the bitstream level. By avoiding the computationally complex steps of transcoding between networks, could deteriorate decoded model quality. The framework also allows refined degradation of model quality when the network conditions are poor due to congestion or deep fades.
Progressive image transmission is difficult in the presence of a noisy channel, mainly due to the propagation of errors during the decoding of a progressive bitstream. Excellent results for this problem are made possi...
详细信息
Progressive image transmission is difficult in the presence of a noisy channel, mainly due to the propagation of errors during the decoding of a progressive bitstream. Excellent results for this problem are made possible through combined source-channel coding, a method that matches the channel code to the source operational rate distortion as well as channel conditions. This paper focuses on the key component of combined source-channel coding: rate allocation. We develop a parametric methodology for rate allocation, in progressive source-channel coding. The key to this technique is an empirical model of decoded bit-error rate as a function of the channel code rate. We investigate several scenarios. In the case of the memoryless channel, we present closed-form expressions. For the fading channel and channels with feedback, where closed-form results are elusive, our analysis leads to low-complexity algorithms.. The results presented in this paper are applicable to any progressive source code, and any family of channel codes.
Three dimensional nature of hyperspectral data with huge amount of correlation in spatial and spectral domain makes transform coding methods more efficient for compression. Transform methods concentrate signal power i...
详细信息
Three dimensional nature of hyperspectral data with huge amount of correlation in spatial and spectral domain makes transform coding methods more efficient for compression. Transform methods concentrate signal power in a few coefficients resulting in better low bit rate performance with low computational complexity. A set of 3D hybrid transforms obtained by combining various 1D spectral decorrelator and 2D spatial decorrelator are investigated for their performance evaluation. Wavelet-based methods generate clustered coefficients having parent-child relationship between the subbands. This property can be exploited by entropy encoders to generate bit streams. For entropy encoding, various 2D-set partitioning methods are studied. 2D-set partitioning in hierarchical trees and 2D-tree block encoding exploit parent-child relationship, and 2D-set partitioning in embedded blocks exploits spatial correlation between neighboring pixels within the sub-band in space and frequency of transformed band images. 2D-set partitioning in blocks of hierarchical trees (2D-SPBHT) exploits energy clustering as well as tree structure of wavelet transform simultaneously. It is shown that 2D-SPBHT provides better performance at all the bitrates as compared to other 2D-set partitioning methods irrespective of the 3D transformation used.
We have developed novel progressive scalable region-of-interest (ROI) image compression schemes with rate-distortion-complexity tradeoff based on vector quantization. Residual vector quantization (RVQ) equips the enco...
详细信息
We have developed novel progressive scalable region-of-interest (ROI) image compression schemes with rate-distortion-complexity tradeoff based on vector quantization. Residual vector quantization (RVQ) equips the encoder with a multi-resolution apparatus which is useful for rate-distortion tradeoff. Having all advantages of RVQ, jointly suboptimized RVQ provides a distortion-complexity adjustment. The systems are unbalanced in the sense that the decoder has less computational requirements than the encoder. The proposed jointly suboptimized RVQ method provides an interactive tool for fast ROI-based browsing from image archives.
This paper describes a highly scalable hybrid image coding scheme (HS-HIC). The proposed hybrid coding scheme combines simply modification of the data in the wavelet domain and the high performance of the set partitio...
详细信息
This paper describes a highly scalable hybrid image coding scheme (HS-HIC). The proposed hybrid coding scheme combines simply modification of the data in the wavelet domain and the high performance of the set partitioning in hierarchical trees (SPIHT) coding. The modification of the subband image data is done based on the transformation of the high-frequency subband (details) in the wavelet domain. It is based on linear modification of 9-subband image data within three-layer in the wavelet domain. Except the image data in LL3, all other image data will be linearly modified based on the discrete Fourier transform (DFT) components. The modification process provides a new subband image data containing almost the same information as the original one but having a smaller frequency spectrum. The modified data is then located in the corresponding position and the simple SPIHT coder followed by adaptive arithmetic coder is applied on the resulting hierarchical representation to generate the symbol stream. Simulation results demonstrate that, with small addition in the computational complexity of the coding process, the PSNR performance of the proposed algorithm is much higher than that of the SPIHT test coder and some of famous image coding techniques. (c) 2007 Elsevier Inc. All rights reserved.
In the latest large-screen TVs, an RGBW color domain is widely used to enhance the brightness of liquid crystal display (LCD) panels. As the correlations among RGBW color components are different from those among conv...
详细信息
In the latest large-screen TVs, an RGBW color domain is widely used to enhance the brightness of liquid crystal display (LCD) panels. As the correlations among RGBW color components are different from those among conventional RGB components, a compression algorithm developed for an RGB image may not be efficient for RGBW compression. For high-fidelity RGBW image coding, a novel prediction-based coding algorithm is proposed in this paper. The proposed algorithm is composed of two prediction steps that take advantage of spatial correlation and inter-color correlations, respectively. For residual coding, this paper proposes a fixed-ratio coding method based on Golomb-Rice coding. The proposed encoder is composed of two sub-coders: pre-coder and post-coder. The pre-coder estimates the length of the bitstream for various quantization levels. Based on that estimation, the appropriate quantization level is selected. The post-coder generates the final bitstream. Experimental results show that the average peak signal-tonoise ratio (PSNR) of the proposed algorithm is 51.16 dB. The throughput of the proposed encoder and decoder hardwares is 26.7 Gb/s, and their respective gate counts are 56.1 K and 39.6 K.
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented, An infinite impulse response (IIR) quadrature mir...
详细信息
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented, An infinite impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition, A perceptual model, computed using subband spectral analysis, optimizes the coder's perceptual quality, Dynamic bit allocation and prioritization is combined with embedded quantization resulting in little performance degradation relative to a nonembedded implementation, The coder output is scalable from high quality at higher bit rates to lower quality at lower bit rates, supporting a wide range of service and resource utilization, The lower bit-rate representation is obtained simply through truncation of the higher bit-rate representation, Since source-rate adaptation is performed through truncation of the encoded stream, interaction with the coder is not required, making the embedded coder ideally suited for rate-adaptive communication systems, performance for both speech and music was verified through subjective listening tests.
An embedded wavelet-based coder for the compression of hyperspectral imagery is described. The proposed coder, three-dimensional (3-D) tarp, employs an explicit estimate of the probability of coefficient significance ...
详细信息
An embedded wavelet-based coder for the compression of hyperspectral imagery is described. The proposed coder, three-dimensional (3-D) tarp, employs an explicit estimate of the probability of coefficient significance to drive a nonadaptive arithmetic coder, resulting in a simple implementation suited to vectorized acceleration in single-instruction-multiple-data (SIMD) hardware. The proposed 3-D tarp coder is compared to other prominent coders for the compression of hyperspectral imagery, and state-of-the-art rate-distortion performance is observed.
暂无评论