Effective quantization and rate control are very important issues in real time videocompression in order to control picture quality and maintain target bit rate. MPEG-2 test model 5 (TM5) describes an adaptive quanti...
详细信息
ISBN:
(纸本)0819420425
Effective quantization and rate control are very important issues in real time videocompression in order to control picture quality and maintain target bit rate. MPEG-2 test model 5 (TM5) describes an adaptive quantization scheme that exploits human visual system properties to improve subjective quality. The TM5 algorithm, however produces blocking artifacts and distortion in edges on flat background in some pictures. Moreover, the TM5 quantization scheme can not effectively control the quality in predicted pictures, since the activity classification performed in the pixel domain is weakly correlated with the actual quantization performed in the transform domain. In this paper, we propose a new quantization scheme that addresses these deficiencies and also takes into account the variation in compaction property of the DCT kernel as this relates to orientation of edges and structures within a block. The proposed scheme involves a two step activity determination procedure. In the first step, activity is determined based on the actual block variance in the pixel domain and in the second step, a correction factor is applied depending upon the effectiveness of the DCT kernel in the transform domain. Our results show that the proposed scheme results in improved picture quality and better bit-distribution as compared to MPEG-2 TM5.
The paper describes a coding optimization strategy in conformity with the recent ITU-T H.263 Recommendation for videophone sequences at bitrates less than 64 Kbit/s. The optimization algorithm jointly selects the temp...
详细信息
ISBN:
(纸本)0819423564
The paper describes a coding optimization strategy in conformity with the recent ITU-T H.263 Recommendation for videophone sequences at bitrates less than 64 Kbit/s. The optimization algorithm jointly selects the temporal position of the frames to be coded and the coding mode (l, P or PB) of the selected frames. The decision is based on the observation, on a group of frames, of an 'activity' parameter, representing the variation of each frame with respect to the last coded one. The proposed strategy produces coded sequences with average frame rates lower than those produced by a non optimized coder, and a better visual quality of the single frame. However, the activity parameters evaluation, and the observation of several candidates, requires a greater delay, buffer size and complexity of the coding algorithm.
In the MPEG-2 video coding an interlaced frame can be encoded as either a frame-picture or two field-pictures. The selection of picture structure (frame/field) has a strong impact on picture quality. In order to achie...
详细信息
ISBN:
(纸本)0819423564
In the MPEG-2 video coding an interlaced frame can be encoded as either a frame-picture or two field-pictures. The selection of picture structure (frame/field) has a strong impact on picture quality. In order to achieve the best possible picture quality, an adaptive scheme is proposed in this paper to select the optimal picture structure on a frame by frame basis. The selection of picture structure is performed in connection with that of the optimal scanning path. First, the scanning path (zig-zag scan/alternate scan) is chosen based on a post-analysis of DCT-coefficients. Secondly, the optimal picture structure is selected for the next frame according to the chosen scanning path, i.e. a zig-zag scan corresponds to frame picture structure, while an alternate scan corresponds to field picture structure. Furthermore, the TM5 buffer control algorithm is extended to support the coding with adaptive frame/field picture structure. Finally, simulation results verify the adaptive scheme proposed in this paper.
We describe a motion-compensated filtering scheme for preprocessing of video based on motion vectors computed from an MPEG encoder. A modified clustering filter is used to filter the pixels in the spatiotemporal kerne...
详细信息
ISBN:
(纸本)0819420425
We describe a motion-compensated filtering scheme for preprocessing of video based on motion vectors computed from an MPEG encoder. A modified clustering filter is used to filter the pixels in the spatiotemporal kernel generated by the motion trajectories. The filter scheme can be incorporated into the encoder with very slight modifications, since it uses the results of the encoder motion estimation to filter each image prior to performing the rest of the encoding functions. In this way, our work differs from previous work that require additional motion estimation. We have tested our scheme with different types of noise. It is observed that the motion estimation in the encoder is more accurate, resulting in the motion-compensated difference signal that has smaller energies. This leads to finer quantization. Visual quality of the compressed video is significantly enhanced. Furthermore, reduced noise leads to better performance of other modules that rely on the statistics of the intermediate signals (for example, the intra/inter coding decision).
The CNN Universal Machine is applied to object-oriented image compressionalgorithms and proves its universality as a hardware platform for future applications. The estimated processing times allow a real-time analysi...
详细信息
The CNN Universal Machine is applied to object-oriented image compressionalgorithms and proves its universality as a hardware platform for future applications. The estimated processing times allow a real-time analysis of the video sequence and outdo the performances reported for comparable digital devices.
video data encoded using the Motion Picture Experts Group (MPEG) standards is highly susceptible to errors during transmission or storage. We investigate whether the error tolerance of coded MPEG video can be improved...
详细信息
ISBN:
(纸本)0819420425
video data encoded using the Motion Picture Experts Group (MPEG) standards is highly susceptible to errors during transmission or storage. We investigate whether the error tolerance of coded MPEG video can be improved by varying the size of each slice depending on the class of coded picture. We evaluate the effect of varying the slice size within each of the three MPEG picture classes on the error tolerance of the coded sequence. We encode a number of test sequences and subject each one to simulated transmission errors. We show that reducing the slice size in I and P pictures improves the decoded quality in the presence of errors. The slice size in B pictures can be increased without significantly reducing the tolerance to errors. Reducing the slice size in I and P pictures whilst increasing the slice size in B pictures can significantly improve the tolerance of the coded sequence to transmission errors without increasing the amount of coded data. A video sequence encoded in this way complies with the MPEG1 and MPEG2 standards.
This paper presents a performance comparison of different loop filtering techniques in a generic hybrid video coding algorithm. This study will compare the performance of the filtering techniques by integrating each o...
详细信息
ISBN:
(纸本)0819420425
This paper presents a performance comparison of different loop filtering techniques in a generic hybrid video coding algorithm. This study will compare the performance of the filtering techniques by integrating each of the loop filters separately into a MPEG-1 compliant codec, and coding a number of video sequences at various bit-rates and motion compensation (MC) accuracies. The performance of the filters will be assessed in terms of the energy of the displaced frame difference. Comparisons are conducted between four filtering techniques: (1) the 1:2:1 loop filter described in ITU Recommendation H.261;(2) an MC-accuracy dependent 3-tap filter, whose tap weights are based upon a first-order Markov model of the source;(3) a spatially-adaptive filter for the blocking-effect based on the theory of Projections Onto Convex Sets (POCS);and (4) an anisotropic filter for the reduction of the blocking-effect. Our results will examine the effect that traditional low-pass loop filters have on MC prediction quality, and compare this to a POCS-based loop filter. The filtering of only the blocking-effect will also provide an indication of the contribution that the blocking-effect has on the overall high-frequency distortions that are reduced by the low-pass loop filters.
In block-based motion-compensated video coding, a fixed-resolution motion field with one motion vector per image block is used to improve the prediction of the frame to be coded. All motion vectors are encoded with th...
详细信息
ISBN:
(纸本)0819420425
In block-based motion-compensated video coding, a fixed-resolution motion field with one motion vector per image block is used to improve the prediction of the frame to be coded. All motion vectors are encoded with the same fixed accuracy, typically 1 or 1/2 pixel accuracy. In this work, we explore the benefits of encoding the motion vectors with other accuracies, and of encoding different motion vectors with different accuracies within the same frame. To do this, we analytically model the effect of motion vector accuracy and derive expressions for the encoding rates for both motion vectors and difference frames, in terms of the accuracies. Minimizing these expressions leads to simple formulas that indicate how accurately to encode the motion vectors in a classical block-based motion-compensated video coder. These formulas also show that the motion vectors must be encoded more accurately where more texture is present, and less accurately when there is much interframe noise. We implement video coders based on our analysis and present experimental results on real video frames. These results suggest that our equations are accurate, and that significant bit rate savings can be achieved when our optimal motion vector accuracies are used.
In this paper we present some error concealment techniques for MPEG-2 video coded and multiplexed streams damaged by ATM cell losses. Decoder early resynchronization limits the effects of transmission errors by decodi...
详细信息
ISBN:
(纸本)0819420425
In this paper we present some error concealment techniques for MPEG-2 video coded and multiplexed streams damaged by ATM cell losses. Decoder early resynchronization limits the effects of transmission errors by decoding some information that is normally discarded from the damaged MPEG-2 video bitstreams. A part of this information cannot be completely decoded due to its differential coding among macroblocks (DC levels, motion vectors). Three different techniques are presented for the case of DC level recovery in Intra pictures. Two of them are predictive techniques, one operating in the frequency domain and the other in the spatial domain. The third technique provides an exact reconstruction of DC values using special data coded into the user data area of the MPEG-2 video bitstream. For not resynchronized areas classical temporal and spatial concealment techniques are used. These techniques have been tested on a simulated environment which includes an implementation of MPEG-2 elementary video coding and decoding and MPEG-2 system standards. The ATM transmission part has been simulated by means of specialized simulation software. Results relative to the presented concealment techniques are included.
Reversible compression of color images is gaining the ever- increasing attention of multimedia publishing industries for collections of works-of-art. In fact, the availability of high-resolution high-quality multispec...
详细信息
ISBN:
(纸本)0819423564
Reversible compression of color images is gaining the ever- increasing attention of multimedia publishing industries for collections of works-of-art. In fact, the availability of high-resolution high-quality multispectral scanners demands robust and efficient coding techniques capable to capture inter-band redundancy without destroying the underlying intra-band correlation. Although DPCM schemes (e.g., lossless JPEG) are employed for reversible compression, their straightforward extension to true-color (e.g., RGB, XYZ) image data usually leads to a negligible coding gain or even to a performance penalty with respect to individual coding of each color component. Previous closest neighbor (PCN) prediction has been recently proposed for lossless data compression of multispectral images, in order to take advantage of inter-band data correlation. The basic idea to predict the value of the current pixel in the current band on the basis of the best zero-order predictor on the previously coded band has been applied by extending the set of predictors to those adopted by lossless JPEG. On a variety of color images, one of which acquired directly from a painting by the VASARI Scanner at the Uffizi Gallery with a very high resolution (20 pel/mm, 8 MSB for each of the XYZ color components), experimental results show that the method is suitable for inter-frame decorrelation and outperforms lossless JPEG and, to a lesser extent, PCN.
暂无评论