A new quadtree segmented predictive image coding technique is presented in this paper for exploiting the correlation between adjacent image blocks and uniformity in variable block size image blocks. For exploiting cor...
详细信息
ISBN:
(纸本)0819450235
A new quadtree segmented predictive image coding technique is presented in this paper for exploiting the correlation between adjacent image blocks and uniformity in variable block size image blocks. For exploiting correlation between adjacent image blocks, a predictive coding technique is used for reducing the inter block correlation. The proposed quadtree technique decomposes an image into variable size block, and each segmented block is then coded at a different bit rate according to the level of visual activity inside the block. A novel classification scheme, which operates based on the distribution of the block residuals, is employed to determine the activity level inside the block. hi this method, the orientation of the pattern appearing inside the block will be computed as an aid to the classification. To preserve edge integrity, a block pattern-based coding technique is proposed and incorporated to the predictive coding method for coding the high-activity blocks of the segmented image. The use of a set of parameters associated with the pattern appearing inside a high activity block at the receiver, together with the inter block correlation information. reduce the cost of instruction and saves the encoding time. Experiments have been conducted to compare with other predictive and quadtree- based techniques. Results show a lower bit rate at competitive reconstruction quality.
This paper presents a multiple description (MD) video codec based on the principles side-information coding. In particular, we highlight certain key components of the codec design that contribute significantly to the ...
详细信息
This paper presents a multiple description (MD) video codec based on the principles side-information coding. In particular, we highlight certain key components of the codec design that contribute significantly to the rate-distortion performance of the proposed codec. These include the use of randomized permutations of the quantization codebook in conjunction with binary LDPC codes for partitioning the available bit-rate among the coefficient bit-planes. Another key component of the proposed codec is the use of pdf estimation for improved decoder reconstruction. Lastly, we use a bank of sequential LDPC decoders to efficiently decode the transmitted coset information. Empirical evaluation demonstrates the superior performance of the proposed codec for the communication of encoded video over packet erasure channels.
This paper presents a method for losslessly compressing multi-channel electroencephalogram signals. The Karhunen-Loeve transform is used to exploit the inter-correlation among the EEG channels. The transform is approx...
详细信息
This paper presents a method for losslessly compressing multi-channel electroencephalogram signals. The Karhunen-Loeve transform is used to exploit the inter-correlation among the EEG channels. The transform is approximated using lifting scheme which results in a reversible realization under finite precision processing. An integer time-frequency transform is applied to further minimize the temporal redundancy.
A new image coder is presented in which an image is divided into blocks, each block is quadtree segmented, and each segment is coded using a form of predictive coding. We provide nearly-optimal segmentation and quanti...
详细信息
A new image coder is presented in which an image is divided into blocks, each block is quadtree segmented, and each segment is coded using a form of predictive coding. We provide nearly-optimal segmentation and quantization rules for this framework, as well as an iterative codebook design algorithm. In simulations, our new system comes within 1 dB of JPEG 2000 and outperforms JPEG by about 1.5 dB at moderate rates and up to 4.5 dB at low rates. Additionally, our method captures edges better and has potentially lower complexity than transform-based methods.
Video communication through wireless channels is still a challenging problem due to the limitations in bandwidth and the presence of channel errors. Since,many video sources are originally coded at a high rate and wit...
详细信息
Video communication through wireless channels is still a challenging problem due to the limitations in bandwidth and the presence of channel errors. Since,many video sources are originally coded at a high rate and without considering,the different channel conditions that may be encountered later, a means to repurpose this content for delivery over a dynamic wireless channel is needed. Transcoding is typically used to reduce the rate and change the format of the originally encoded video source to match network conditions and terminal capabilities. Given the existence of channel errors that can easily corrupt video quality, there is also the need to make the bitstream more resilient to transmission errors. In this article we provide an overview of the error resilience tools found in today's video coding standards and describe a variety of techniques that may be used to achieve error-resilient video transcoding.
Shannon's information theory provides a principled framework for the quantitative analysis of brain responses during the encoding and representation of event streams. In particular, entropy measures the expected u...
详细信息
Shannon's information theory provides a principled framework for the quantitative analysis of brain responses during the encoding and representation of event streams. In particular, entropy measures the expected uncertainty of events in a given context. This contextual uncertainty or unpredictability may, itself, be important for balancing [bottom-up] sensory information and [top-down] prior expectations during perceptual synthesis. Using event-related functional magnetic resonance imaging (fMRI), we found that the anterior hippocampus is sensitive to the entropy of a visual stimulus stream. In contrast, activity in an extensive bilateral cortico-thalamic network was dictated by the surprise or information associated with each particular stimulus. In short, we show that the probabilistic structure or context in which events occur is an important predictor of hippocampal activity. (c) 2005 Elsevier Ltd. All rights reserved.
This paper proposes an encoding method for high-quality, low-delay audio communication that is robust to losses in packetized transmission. Robustness is provided by a multiple description vector quantization (MDVQ) t...
详细信息
This paper proposes an encoding method for high-quality, low-delay audio communication that is robust to losses in packetized transmission. Robustness is provided by a multiple description vector quantization (MDVQ) technique that is designed to minimize the mean-squared error (MSE). The key to applying this technique effectively is the use of psycho-acoustically controlled pre- and post-filters that make the mean-squared quantization error perceptually relevant. Experiments show that the MDVQ-based encoder yields better results-in both MSE and subjective audio quality-than simple alternative coders with the same low delay.
Synchronous presentation of stimuli to the auditory and visual systems can modify the formation of a percept in either modality. For example, perception of auditory speech is improved when the speaker's facial art...
详细信息
Synchronous presentation of stimuli to the auditory and visual systems can modify the formation of a percept in either modality. For example, perception of auditory speech is improved when the speaker's facial articulatory movements are visible. Neural convergence onto multisensory sites exhibiting supra-additivity has been proposed as the principal mechanism for integration. Recent findings, however, have suggested that putative sensory-specific cortices are responsive to inputs presented through a different modality. Consequently, when and where audiovisual representations emerge remain unsettled. In combined psychophysical and electroencephalography experiments we show that visual speech speeds up the cortical processing of auditory signals early (within 100 ms of signal onset). The auditory-visual interaction is reflected as an articulator-specific temporal facilitation (as well as a nonspecific amplitude reduction). The latency facilitation systematically depends on the degree to which the visual signal predicts possible auditory targets. The observed auditory-visual data support the view that there exist abstract internal representations that constrain the analysis of subsequent speech inputs. This is evidence for the existence of an "analysis-by-synthesis" mechanism in auditory-visual speech perception.
This paper presents a novel and efficient diagnostically lossless compression for 3D medical image sets. This compression scheme provides the 3D medical image sets with a progressive transmission capability. An automa...
详细信息
This paper presents a novel and efficient diagnostically lossless compression for 3D medical image sets. This compression scheme provides the 3D medical image sets with a progressive transmission capability. An automated filter-and-threshold based preprocessing technique is used to remove noise outside the diagnostic region. Then a wavelet decomposition feature vector based approach is applied to determine the reference image for the entire 3D medical image set. The selected reference image contains the most discernible anatomical structures within a relative large diagnostic region. It is progressively encoded by a lossless embedded zerotree wavelet method so the validity of an entire set can be determined early. This preprocessing technique is followed by an optimal predictor plus a 1st-level integer wavelet transform to de-correlate the 3D medical image set. Run-length and arithmetic coding are used to further remove coding redundancy. This diagnostically lossless compression method achieves an average compression of 2.1038, 2.4292, and 1.6826 bits per pixel for three types of 3D magneticresonance image sets. The integrated progressive transmission capability degrades the compression performance by an average of 7.25%, 6.60%, and 4.49% for the above three types. Moreover, our compression without and with progressive transmission achieves better compression than the state-of-the-art. (c) 2005 Elsevier Inc. All rights reserved.
This paper presents a novel and efficient diagnostically lossless compression for 3D medical image sets. This compression scheme provides the 3D medical image sets with a progressive transmission capability. An automa...
详细信息
This paper presents a novel and efficient diagnostically lossless compression for 3D medical image sets. This compression scheme provides the 3D medical image sets with a progressive transmission capability. An automated filter-and-threshold based preprocessing technique is used to remove noise outside the diagnostic region. Then a wavelet decomposition feature vector based approach is applied to determine the reference image for the entire 3D medical image set. The selected reference image contains the most discernible anatomical structures within a relative large diagnostic region. It is progressively encoded by a lossless embedded zerotree wavelet method so the validity of an entire set can be determined early. This preprocessing technique is followed by an optimal predictor plus a 1st-level integer wavelet transform to de-correlate the 3D medical image set. Run-length and arithmetic coding are used to further remove coding redundancy. This diagnostically lossless compression method achieves an average compression of 2.1038, 2.4292, and 1.6826 bits per pixel for three types of 3D magneticresonance image sets. The integrated progressive transmission capability degrades the compression performance by an average of 7.25%, 6.60%, and 4.49% for the above three types. Moreover, our compression without and with progressive transmission achieves better compression than the state-of-the-art. (c) 2005 Elsevier Inc. All rights reserved.
暂无评论