Video coding standard H.264/AVC employs zig-zag scan and field scan to map transform coefficients from two dimensional matrix to one dimensional array. The zig-zag scan is used for transform coefficients in frame macr...
详细信息
Video coding standard H.264/AVC employs zig-zag scan and field scan to map transform coefficients from two dimensional matrix to one dimensional array. The zig-zag scan is used for transform coefficients in frame macroblocks (MBs), and the field scan is used in field MBs. This paper presents a most probable scan mode (MPSM) decision for H.264 inter picture coding. Besides the two scans, a horizontal scan is illustrated. For a 4x4 block, one out of the three scan modes is selected as the most probable one, and is used for the transform coefficient scan. Furthermore, an improved MPSM decision is discussed. Simulation results report that the proposed method yields an average of 1.15% bit rate reduction over the H.264 baseline profile.
This paper presents a new algorithm for reducing aliasing artifacts related to the conventional spatial scalable subband/wavelet coding for recovering video at lower resolution. The proposed scheme has been fully inte...
详细信息
This paper presents a new algorithm for reducing aliasing artifacts related to the conventional spatial scalable subband/wavelet coding for recovering video at lower resolution. The proposed scheme has been fully integrated with the H.264/AVC JSVM reference software and experimentally evaluated against the conventional methods for intraframe spatial scalable video coding. The simulation results show our method can effectively remove the aliasing artifacts for scalable decoding at lower resolution without performance penalty at full resolution.
Given the JPEG syntax, the rate-distortion performance a JPEG optimization method can improve is limited. Part of the limitation comes from the poor context modeling used by a JPEG coder, which fails to take full adva...
详细信息
Given the JPEG syntax, the rate-distortion performance a JPEG optimization method can improve is limited. Part of the limitation comes from the poor context modeling used by a JPEG coder, which fails to take full advantage of the pixel correlation existing in both space and frequency domains. Consequently, context-based arithmetic coding is proposed in the literature to replace the Huffman coding used in JPEG for better rate-distortion performance. In this paper, we extend our previous JPEG compatible joint optimization algorithm to a context-based arithmetic coding scenario. Experimental results show that an extra of 10~15% size reduction or 0.5 dB compression gain can be achieved on top of JPEG compatible joint optimization with the same level of complexity.
In this paper, a region of interest (ROI) MRI image coding scheme is proposed. This scheme use the Maxshift ROI image coding method supported in JPEG 2000 image coding standard. An automatic ROI detection method is al...
详细信息
In this paper, a region of interest (ROI) MRI image coding scheme is proposed. This scheme use the Maxshift ROI image coding method supported in JPEG 2000 image coding standard. An automatic ROI detection method is also introduced. Experiments results demonstrate that the proposed method has good performance on MRI image compression.
In this work, we develop a fast binary partition tree based variable size video coding system. New adaptive algorithms proposed herein are applied to a video encoder with binary partition trees. First, to reduce the c...
详细信息
ISBN:
(纸本)9781424456536;9781424456543
In this work, we develop a fast binary partition tree based variable size video coding system. New adaptive algorithms proposed herein are applied to a video encoder with binary partition trees. First, to reduce the computation for block-matching, an adaptive search area method is described which adjusts the searching region according to the size of each block. Second, an early termination method is introduced which terminates the binary partitioning process adaptively according to the statistics of the peak-signal-to-noise-ratio values during each step of block splitting. Third, we put forward a new model for fast rate-distortion (RD) estimation to decrease the computation of matching pursuit (MP) transform coding. Simulation results show that the proposed techniques provide significant gain in computation speed with little or no sacrifice of RD performance, when compared with non-adaptive binary partitioning scheme.
Region-of-interest (ROI) image coding is one of the new features included in the JPEG2000 image coding standard. Two methods are defined in the standard: the Maxshift method and the generic scaling based method. In th...
详细信息
ISBN:
(纸本)9781424435623
Region-of-interest (ROI) image coding is one of the new features included in the JPEG2000 image coding standard. Two methods are defined in the standard: the Maxshift method and the generic scaling based method. In this paper, a new region-of-interest coding method called Perceptually Optimized Shift (POShift) is proposed. Unlike other proposed methods, the POShift method realigns the bitplanes based on perceptually optimized order, which makes the visually most important bitplanes (in both ROI and BG) to be coded firstly. Experimental results indicate that the proposed method significantly outperforms the previous ROI coding schemes in overall ROI coding performance.
This paper considers a coding scheme for data transmission over erasure channels which is also known as multiple description coding. The LMMSE prefilter method of Romano [1] is reviewed and generalized to allow three ...
详细信息
ISBN:
(纸本)9781424414833
This paper considers a coding scheme for data transmission over erasure channels which is also known as multiple description coding. The LMMSE prefilter method of Romano [1] is reviewed and generalized to allow three different operational modes of the prefilter. They include the possibility to decrease or increase the number of descriptions to be transmitted. We derive explicitly the Hessian matrix for an efficient calculation of the prefilter. We also study the properties of the distortion measure theoretically.
This paper proposes a new quantization for transform coefficients based on algebraic quantization. The coefficients are represented by a few pulses multiplied by a unique amplitude. The coefficients to be transmitted ...
详细信息
ISBN:
(纸本)9781424414833
This paper proposes a new quantization for transform coefficients based on algebraic quantization. The coefficients are represented by a few pulses multiplied by a unique amplitude. The coefficients to be transmitted are selected by optimizing an error criterion, that determines the signs, positions and amplitudes of the pulses. This simple quantization has been implemented in a wavelet-based wideband scalable coder, and has been proved to provide a perceptually better quality than SPIHT on speech signal and music.
This paper proposes a performance-aware transform IP design which can be configured to appropriate hardware for different performance requirements on demand without requiring additional data bandwidth in Multi-mode vi...
详细信息
This paper proposes a performance-aware transform IP design which can be configured to appropriate hardware for different performance requirements on demand without requiring additional data bandwidth in Multi-mode video coding (JPEG/MPEG-1/2/4/H.261/H.263/H.264). Based on the scalable-DA approach, three schemes of hardware configurations which are respectively composed of 3, 6, and 12 data-paths are illustrated. The three schemes of the proposed performance-aware DCT/IDCT can achieve CIF, 720HD, and digital cinema video formats when operated at 9.13 MHz, 41.48 MHz, and 188.75 MHz, respectively.
This algorithm is specially aimed at steganalytic algorithm of JPEG image and used to detect whether hidden information exists in it. According to the structural feature of JPEG image and through analyzing the DCT coe...
详细信息
This algorithm is specially aimed at steganalytic algorithm of JPEG image and used to detect whether hidden information exists in it. According to the structural feature of JPEG image and through analyzing the DCT coefficient, the algorithm firstly extract quantification matrix, and judge whether hidden information exists in the image according to the relationship between image and the quantification matrix. This algorithm introduces an improved method of extracting quantification matrix, and raises the efficiency and accuracy of extracting quantification matrix, which makes the algorithm acquire a comparatively high accuracy rate of detecting.
暂无评论