Thresholding is a technique for suppressing small transform coefficients in DCT-based coding. Recently, dynamic programming has been presented to optimize thresholding in a rate-distortion sense. This contribution inv...
详细信息
ISBN:
(纸本)0819423564
Thresholding is a technique for suppressing small transform coefficients in DCT-based coding. Recently, dynamic programming has been presented to optimize thresholding in a rate-distortion sense. This contribution investigates the extension of this method for SNR scalability. Because of the tight coupling between base and enhancement layer in SNR scalability, a 2D dynamic programming algorithm has been developed.
A new method for coding of displaced frame differences (DFD) is proposed. It is in its main aspects close to the classical pyramid approach of Burt and Adelson. In particular a least squares L2 Laplacian pyramid is em...
详细信息
ISBN:
(纸本)0819423564
A new method for coding of displaced frame differences (DFD) is proposed. It is in its main aspects close to the classical pyramid approach of Burt and Adelson. In particular a least squares L2 Laplacian pyramid is employed which decomposes the DFD into several levels with differing spatial resolution. This pyramid is quantized and coded following a layered quantization approach together with a layered coding method based on conditional arithmetic coding. The DFD encoder outputs an embedded bit stream. Thus the coder control may truncate the bitstream at any point, and can keep a fixed rate. Simulation results show promising rate-distortion performance for low bit rate video coding.
作者:
Bonneau, Robert J.Columbia University
Image and Advanced Television Laboratory Shapiro Research Bldg. 530 W. 120th Street New York NY 10027 United States
Traditional wavelet edge detection and encoding schemes preserve shape features of objects effectively at a variety of spatial scales and also allow an efficient means of image and videocompression. However these sch...
详细信息
Frame rate conversion requires interpolation of image frames at time instances, where the original sequence has not been sampled. This can be done in high quality only by means of motion-compensated algorithms, theref...
详细信息
ISBN:
(纸本)0819423564
Frame rate conversion requires interpolation of image frames at time instances, where the original sequence has not been sampled. This can be done in high quality only by means of motion-compensated algorithms, therefore the knowledge of motion present in the sequence is essential. This motion information has to be determined from the image sequence itself. In this paper a motion-based image segmentation algorithm is proposed, and its application for motion- compensated (MC) frame rate conversion is presented. The segmentation algorithm that can trace multiple rigid objects with translational movement, is based on vector quantization of the estimated motion field determining a set of global motion vectors and segmenting the images into multiple moving areas. Then, the spatial order of the objects (which one is in front of the other) is determined. Interpolation is performed based on the results of the segmentation, the set of motion vectors, and the proper handling of covered and uncovered areas. Furthermore, an accelerated motion model developed previously by the authors is applied, in order to further improve the performance of the MC frame rate converter.
In this and an accompanying paper in the same proceedings a hierarchical video coding scheme is presented which is designed mainly for video communications at very low bit rates and therefore based on the hybrid codin...
详细信息
ISBN:
(纸本)0819423564
In this and an accompanying paper in the same proceedings a hierarchical video coding scheme is presented which is designed mainly for video communications at very low bit rates and therefore based on the hybrid coding principle. Both the displacement vector field and the displaced frame differences are decomposed into a Laplacian-type pyramid, and these pyramids are encoded in an embedded fashion using zero-trees and conditioning contexts, respectively. The focus of this paper lies on the motion estimation and the design of the motion vector field coding. Especially, the aspect of estimating and coding vector fields of different resolution is investigated to achieve optimal coding efficiency in coding situations, where there is not enough data rate available to code higher resolution vector fields completely.
The forthcoming introduction of helical scan digital data tape recorders with high access bandwidth and large capacity will facilitate the recording and retrieval of a wide variety of multimedia information from diffe...
详细信息
ISBN:
(纸本)0819420425
The forthcoming introduction of helical scan digital data tape recorders with high access bandwidth and large capacity will facilitate the recording and retrieval of a wide variety of multimedia information from different sources, such as computer data and digital audio and video. For the compression of digital audio and video, the MPEG standard has internationally been accepted. Although helical scan tape recorders can store and playback MPEG compressed signals transparently they are not well suited for carrying out special playback modes, in particular fast forward and fast reverse. Only random portions of a original MPEG bitstream are recovered on fast playback. Unfortunately these shreds of information cannot be interpreted by a standard MPEG decoder, due to loss of synchronization and missing reference pictures. In the EC-sponsored RACE project DART (digital Data Recorder Terminal) the possibilities for recording and fast playback of MPEG video on a helical scan recorder have been investigated. In the approach we present in this paper, we assume that not transcoding is carried out on the incoming bitstream at recording time, nor that any additional information is recorded. To use the shreds of information for the reconstruction of interpretable pictures, a bitstream validator has been developed to achieve conformance to the MPEG-2 syntax during fast playback. The concept has been validated by realizing hardware demonstrators that connect to a prototype helical scan digital data tape recorder.
A new interpolation algorithm for 2D data is presented that is based on the least-squares minimization and the use of splines. This interpolation technique is then integrated into a double source decomposition scheme ...
详细信息
ISBN:
(纸本)0819423564
A new interpolation algorithm for 2D data is presented that is based on the least-squares minimization and the use of splines. This interpolation technique is then integrated into a double source decomposition scheme for image data compression. First, a least-squares interpolation is implemented and applied to a uniform sampling image. Second, the splines and the analysis of the entropy allow us to reconstruct the final image. Experimental results show that the proposed image interpolation algorithm is very efficient. The major advantages of this new method over traditional block-coding techniques are the absence of the tiling effect and a more effective exploitation of interblock correlation.
We make a case that taking the number of bits to code each motion vector into account when estimating motion for videocompression results in significantly better performance at low bit rates, using simulation studies...
详细信息
ISBN:
(纸本)0819420425
We make a case that taking the number of bits to code each motion vector into account when estimating motion for videocompression results in significantly better performance at low bit rates, using simulation studies on established benchmark videos. First, by modifying a `vanilla' implementation of the H.261 standard, we show that choosing motion vectors explicitly to minimize rate (in a greedy manner), subject to implicit constraints on distortion, yields better rate-distortion tradeoffs than minimizing notions of prediction error. Locally minimizing a linear combination of rate and distortion results in further improvements. Using a heuristic function of the prediction error and the motion vector code-length results in compression performance comparable to the more computationally intensive coders while requiring a practically small amount of computation. We also show that making coding control decisions to minimize rate yields further improvements.
Software implementations of MPEG decompression provide flexibility at low cost but suffer performance problems, including poor cache behavior. For MPEG video, decompressing the video in the implied order does not take...
详细信息
ISBN:
(纸本)0819420425
Software implementations of MPEG decompression provide flexibility at low cost but suffer performance problems, including poor cache behavior. For MPEG video, decompressing the video in the implied order does not take advantage of coherence generated by dependent macroblocks and, therefore, undermines the effectiveness of processor caching. In this paper, we investigate the caching performance gain which is available to algorithms that use different traversal algorithms to decompress these MPEG streams. We have found that the total cache miss rate can be reduced considerably at the expense of a small increase in instructions. To show the potential gains available, we have implemented the different traversal algorithms using the standard Berkeley MPEG player. Without optimizing the MPEG decompression code itself, we are able to obtain better cache performance for the traversal orders examined. In one case, faster decompression rates are achieved by making better use of processor caching, even though additional overhead is introduced to implement the different traversal algorithm. With better instruction-level support in future architectures, low cache miss rates will be crucial for the overall performance of software MPEG video decompression.
Low power consumption is a requirement for any battery powered portable equipment. When designing ASICs for image and videocompression, emphasis has been placed mainly on building circuits that are fast enough to sat...
详细信息
ISBN:
(纸本)0819423564
Low power consumption is a requirement for any battery powered portable equipment. When designing ASICs for image and videocompression, emphasis has been placed mainly on building circuits that are fast enough to satisfy the high data throughput associated with image and video processing. The imminent development of portable systems featuring full multimedia applications, adds the low-power constraint to the design of VLSI circuits for this kind of application. Several techniques such as lowering the supply voltage, architectural parallelization, pipelining etc., have been proposed in the literature to achieve low-power consumption. In this paper we report a VLSI circuit featuring a power management user-controllable technique that trades image quality for power consumption in a transform-based algorithm.
暂无评论