The side-match Vector quantizer (SMVQ) is a well known method for image coding. We propose the smooth side-match vector quantizer (SSMVQ). The SSMVQ improves coding quality by making smoother the transition of the gra...
详细信息
The side-match Vector quantizer (SMVQ) is a well known method for image coding. We propose the smooth side-match vector quantizer (SSMVQ). The SSMVQ improves coding quality by making smoother the transition of the gray levels of the adjacent blocks. Moreover, the smooth overlap-match vector quantizer (SOMVQ) is proposed to improve the performance of the overlap-match vector quantizer (OMVQ). Experimental results reveal that SSMVQ and SOMVQ have the higher peak SNR (PSNR) values, better visual perception quality, and lower bit rate than SMVQ and overlap-match vector quantizer (OMVQ), respectively. (C) 2000 Society of Photo-Optical Instrumentation Engineers. [S0091-3286(00)03407-3].
A new gray-scale image coding technique has been developed, in which an extended DPCM approach has been combined with entropy coding. This technique has been implemented in a freeze-frame videoconferencing system whic...
详细信息
A new gray-scale image coding technique has been developed, in which an extended DPCM approach has been combined with entropy coding. This technique has been implemented in a freeze-frame videoconferencing system which is now operational at IBM sites throughout the world. Following image preprocessing, the two fields of the interlaced 512 x 480 pixel video frame are compressed sequentially with different algorithms. The reconstructed image quality is improved by subsequent image postprocessing, the final reconstructed image being almost indistinguishable from the original image. Typical gray-scale video images compress to about a half bit per pixel and transmit over 4.8 kbit/s dial-up telephone lines in about a half minute. The gray-scale image processing and compression algorithms are described in this paper.
The Joint Picture Expert Group (JPEG) committee has been standardizing next-generation image compression, called JPEG XL, to meet the specific needs for a responsive web, wide color gamut, and high dynamic range. JPEG...
详细信息
The Joint Picture Expert Group (JPEG) committee has been standardizing next-generation image compression, called JPEG XL, to meet the specific needs for a responsive web, wide color gamut, and high dynamic range. JPEG XL supports lossy and lossless compression. A variable-sized discrete cosine transform (DCT) block is used for lossy compression. A block partitioning method is regarded as a critical function for the performance of JPEG XL. The current DCT block partitioning method used in JPEG XL is highly dependent on the compression rate and tends to assign small-sized DCT blocks to homogeneously textured regions (HTRs) having similar or regular patterns. We propose a region-adaptive DCT block partitioning method that assigns larger blocks to the HTR. The proposed method identifies the HTRs by using a combined metric employing a sum-modified Laplacian, zero-crossing, and colorfulness metric for measuring the region homogeneity. Objective, subjective, and visual comparison evaluations with the ten images recommended by the JPEG working group were provided to show the improvement in coding performance. The proposed method shows its superiority in terms of the compression efficiency evaluated using six objective metrics, subjective tests with 15 participants, visual comparison improvements in the HTR, and gains in the execution time.
In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, an input image is analyzed into a semantic map as structural ...
详细信息
In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, an input image is analyzed into a semantic map as structural representation and semantic-wise texture representation and further compressed into bitstreams at the encoder side. On the decoder side, the received bitstreams of dual-layer representations are decoded and reconstructed for target image synthesis with generative models. Moreover, the attention mechanism is introduced into the model architecture for texture representation modeling and a coherency regularization is proposed to further optimize the texture representation space by aligning the representation space with the source pixel space for higher synthesis quality. Besides, we also propose a cross-channel entropy module and control the quantization scale to facilitate rate-distortion optimization. Upon compressing the decomposed components into the bitstream, the simple yet effective representation philosophy benefits image compression in many aspects. First, in terms of compression performance, compact representations, and high visual synthesis quality can bring remarkable advantages. Second, the proposed framework yields a physically explainable bitstream composed of the structural segment and semantic-wise texture segments. Third and most importantly, subsequent vision tasks (e.g., content manipulation) can receive fundamental support from the semantic-aware visual decomposition and synthesis mechanism. Extensive experimental results demonstrate the superiority of the proposed framework towards efficient visual representation learning, high efficiency image compression (< 0.1 bpp), and intelligent visual applications (e.g., manipulation and analysis).
A new technique to recover the information loss in a block-based image coding system is developed in this paper, The proposed scheme is based on fuzzy logic reasoning and can be divided into three main steps: 1) hiera...
详细信息
A new technique to recover the information loss in a block-based image coding system is developed in this paper, The proposed scheme is based on fuzzy logic reasoning and can be divided into three main steps: 1) hierarchical compass interpolation/extrapolation (HCIE) in the spatial domain for initial recovery of lost blocks that mainly contain low-frequency information such as smooth background 2) coarse spectra interpretation by fuzzy logic reasoning for recovery of lost blocks that contain high-frequency information such as complex textures and fine features 3) sliding window iteration (SWI), which is performed in both spatial and spectral domains to efficiently integrate the results obtained in steps 1) and 2) such that the optimal result can be achieved in terms of surface continuity on block boundaries and a set of fuzzy inference rules. The proposed method, which is suitable for recovering both isolated and contiguous block losses, provides a new approach for error concealment of block-based image coding systems such as the JPEG coding standard and vector quantization-based coding algorithms, The principle of the proposed scheme can also be applied to block-based video compression schemes such as the H.261, MPEG, and HDTV standards, Simulation results are presented to illustrate the effectiveness of the proposed method,
In this paper, a novel scheme for low-power image coding and decoding based on vector quantization is presented. The proposed scheme uses small codebooks, and block transformations are applied to the codewords during ...
详细信息
In this paper, a novel scheme for low-power image coding and decoding based on vector quantization is presented. The proposed scheme uses small codebooks, and block transformations are applied to the codewords during coding. Using small codebooks, the proposed scheme has reduced memory requirements in comparison to classical vector quantization. The transformations applied to the codewords extend computationally the small codebooks compensating for the quality degradation introduced by the small codebook size. Thus the coding task becomes computation-based rather than memory-based, leading to significant power savings since memory-related power consumption forms the major part of the total power consumption of a system. Since the parameters or the transformations depend on the image block under coding, the small codebooks are dynamically adapted to the specific block under coding leading to acceptable image qualities. The proposed scheme leads to power savings of a factor of 10 in coding and of a factor of 3 in decoding, at least in comparison to classical full-search vector quantization. The main factor affecting both image quality and power consumption is the size of the codebook that is used.
Based on the shape adaptive discrete cosine transform (SA-DCT) and its application in image coding, this paper proposes a new image coding algorithm based on image segmentation and shape adaptive all phase biorthogona...
详细信息
ISBN:
(纸本)9781467357913
Based on the shape adaptive discrete cosine transform (SA-DCT) and its application in image coding, this paper proposes a new image coding algorithm based on image segmentation and shape adaptive all phase biorthogonal transform (SA-APBT). In this paper, an image is divided into two parts, the region-of-interest (ROI) and the background area, which can be encoded separately. Intra macroblocks (all pixels are located in ROI or background region) are processed with APBT;while marginal macroblocks (part of pixels are located in ROI) are processed with SA-APBT. Experimental results are obtained with the test images. It can be concluded that the coding performance of the proposed algorithm is better than that of conventional algorithms. Both the objective quality and subjective effect are improved.
In this paper we present a method for motion compensated image coding based upon a two-step displacement estimation procedure. The first step utilizes a maximum a posteriori (MAP) estimator to determine the best integ...
详细信息
In this paper we present a method for motion compensated image coding based upon a two-step displacement estimation procedure. The first step utilizes a maximum a posteriori (MAP) estimator to determine the best integer displacement, while the second step requires solving for the regression coefficients that supply the same information as the noninteger portion of the displacement. This approach is a different and simplified procedure in that the integer displacement is measured first, and then a linear combination of only four values from the previous image, shifted by the measured integer displacement, is used. This procedure differs both from the ones which measure the displacement vector D first and interpolate the previous image, and from the ones which use only linear prediction. This method is derived and results are presented for two separate 40-frame digital image sequences. A sum of absolute error distortion measure is used to determine the optimal structure of the residual quantizer.
Fast-growing intelligent media processing applications demand efficient processing throughout the processing chain from the edge to the cloud, and the complexity bottleneck usually lies in the parallel decoding of mul...
详细信息
Fast-growing intelligent media processing applications demand efficient processing throughout the processing chain from the edge to the cloud, and the complexity bottleneck usually lies in the parallel decoding of multiple-channel compressed bitstreams before analyzing. This occurs because the traditional media coding scheme generates a binary stream without a semantic structure, which is unable to be operated directly at the bitstream level to support different tasks such as classification, recognition, detection, etc. Therefore, in this article, we propose a learning-based semantically structured image coding (SSIC) framework to generate a semantically structured bitstream (SSB), where each part of the bitstream represents a specific object and can be directly used for the aforementioned intelligent tasks. Specifically, we integrate an object location extraction module into the compression framework to locate and align objects in the feature domain. Then, each object together with the background is compressed separately and reorganized to form a structured bitstream to enable the analysis or reconstruction of specific objects directly from partial bitstream. Furthermore, in contrast to existing learning-based compression schemes that train the specific model for a specific bitrate, we share most of the model parameters among various bitrates to significantly reduce the model size for variable-rate compression. The experimental results demonstrate the effectiveness of the proposed coding scheme whose compression performance is comparable to existing image coding schemes, where intelligent tasks such as classification and pose estimation can be directly performed on a partial bitstream without performance degradation, significantly reducing the complexity for analyzing tasks.
By using a lifting scheme, we present a technique to construct compactly supported wavelets whose coefficients are composed of free variables that lie in an interval. By selecting the coefficients of the 9 to 7 wavele...
详细信息
By using a lifting scheme, we present a technique to construct compactly supported wavelets whose coefficients are composed of free variables that lie in an interval. By selecting the coefficients of the 9 to 7 wavelet filter and associated lifting scheme, an efficient approach via wavelets for image compression is developed. Furthermore, the rationalized coefficients wavelet filter that can be implemented with simple integer arithmetic is achieved, and its characteristic is close to the well-known original irrational coefficients 9 to 7 wavelet filters developed by Cohen, Daubechies, and Feauveau. Software and hardware simulations show that the new method has very low complexity, and simultaneously preserves a high quality of a compressed image. (C) 2003 Society of Photo-Optical Instrumentation Engineers.
暂无评论