In this paper, a two-stage block-mode classification scheme for H.264|MPEG-4 Part 10 AVC is presented as a pattern classification approach using SVMs in order to reduce the high computational complexity of its encoders. For the block-mode classification, feature vectors for each macroblock are formed from SATD and CBP values and fed to the SVMs to detect the large and small block modes. In the experiments, the proposed scheme yields average correct classification rates of 80% and 95% for the first and second stages, respectively, which leads to a 35% to 55% reduction in total encoding time while maintaining negligible bit rate increases and PSNR drops for test sequences at QCIF, CIF, and 4CIF resolutions and various quantization parameter values.
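As a rough illustration of the SATD feature mentioned above, the sketch below computes the sum of absolute transformed differences for one 4x4 block using a Hadamard transform. The function names are hypothetical, the 4x4 block size is an assumption, and the normalization factor that some encoders apply is omitted; how the paper combines SATD with CBP into its SVM feature vector is not specified in the abstract and is not modeled here.

```python
import numpy as np


def hadamard4():
    """Return the 4x4 Hadamard matrix built from the 2x2 kernel."""
    h2 = np.array([[1, 1], [1, -1]])
    return np.kron(h2, h2)


def satd4x4(block, pred):
    """Sum of absolute Hadamard-transformed differences for a 4x4 block.

    Assumption: no normalization is applied; real encoders often scale
    the result (for example, divide by 2).
    """
    diff = np.asarray(block, dtype=np.int64) - np.asarray(pred, dtype=np.int64)
    h = hadamard4()
    transformed = h @ diff @ h.T
    return int(np.abs(transformed).sum())


# A constant residual of 1 concentrates all energy in the DC coefficient.
flat = np.zeros((4, 4), dtype=np.int64)
print(satd4x4(flat + 1, flat))  # prints 16
```

A low SATD suggests that a large block mode already predicts the macroblock well, which is the kind of cue a classifier could exploit.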
Intra coding is one of the most effective ways of reducing the impact of error propagation caused by predictive coding. However, intra coding requires a higher bitrate than inter coding. To use inter coding and still reduce error propagation, it is important that inter macroblocks predict from "safe" areas that have a decreased chance of spreading errors. To this end, we propose a low-complexity method of biasing the prediction mechanism towards recently intra-updated macroblocks. We devise a method of adjusting the distortion used in rate-distortion optimization to take into account the temporal distance to the last intra macroblock. Our simulations show that our intra-distance derived weighting (IDW) method improves video coding performance in a lossy environment by up to 1.4 dB for a modest increase in bitrate.
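The abstract does not give the actual weighting function, so the following is only a sketch of the general idea: scale the distortion term of the rate-distortion cost by a factor that grows with the number of frames since the referenced area was last intra coded. The linear weight, the `alpha` and `max_weight` parameters, and the dictionary field names are all hypothetical choices made for illustration.

```python
def idw_weight(intra_distance, alpha=0.1, max_weight=2.0):
    """Hypothetical weight: grows linearly with frames since last intra update."""
    return min(1.0 + alpha * intra_distance, max_weight)


def rd_cost(distortion, rate, lam, intra_distance):
    """Rate-distortion cost J = w(d) * D + lambda * R with a distance-based weight."""
    return idw_weight(intra_distance) * distortion + lam * rate


def pick_reference(candidates, lam):
    """Choose the candidate prediction with the lowest weighted RD cost."""
    return min(
        candidates,
        key=lambda c: rd_cost(c["distortion"], c["rate"], lam, c["intra_distance"]),
    )


candidates = [
    # Slightly better match, but the area was last intra coded 8 frames ago.
    {"name": "stale", "distortion": 100.0, "rate": 10.0, "intra_distance": 8},
    # Slightly worse match from a freshly intra-updated area.
    {"name": "fresh", "distortion": 110.0, "rate": 10.0, "intra_distance": 0},
]
print(pick_reference(candidates, lam=1.0)["name"])  # prints fresh
```

With an unweighted cost the "stale" candidate would win; the weighting tips the decision towards the recently refreshed area.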
Video compression plays an important role in multimedia data storage and transmission. Video compression techniques remove spatial and temporal redundancy using intra-frame and inter-frame coding, respectively. A high level of compression can be achieved through inter-frame coding. In this paper, the performance of four matching criteria for the temporal coding of video signals is compared on a hardware design written in VHDL: Minimum Mean Absolute Error, Vector Matching Criterion, Smooth Constrained Mean Absolute Error, and a proposed algorithm using new pixel values. The three-step search algorithm is used as the block-matching technique because it is a simple and efficient search algorithm that provides near-optimum results in only three steps. For the various videos, it has been experimentally observed that the minimum average error per pixel and the minimum number of search points per block drop to as low as 0 per pixel and 13.31 per block, respectively, with the proposed criterion, whereas the values of the same parameters for the same set of frames are very high with the other criteria.
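For readers unfamiliar with the three-step search, here is a minimal software sketch of the classic algorithm with a mean absolute error criterion. It is not the paper's VHDL design or its proposed criterion; the 8x8 block size, the initial step of 4, and the function names are assumptions chosen for illustration.

```python
import numpy as np


def mean_abs_error(a, b):
    """Mean absolute error between two equally sized blocks."""
    return float(np.mean(np.abs(a.astype(np.int64) - b.astype(np.int64))))


def three_step_search(cur, ref, top, left, size=8, step=4):
    """Classic three-step search for the block of `cur` at (top, left).

    Returns ((dy, dx), cost, points), where points counts evaluated candidates.
    """
    height, width = ref.shape
    block = cur[top:top + size, left:left + size]
    best_dy, best_dx = 0, 0
    best_cost = mean_abs_error(block, ref[top:top + size, left:left + size])
    points = 1
    while step >= 1:
        center_dy, center_dx = best_dy, best_dx
        for dy in (-step, 0, step):
            for dx in (-step, 0, step):
                if dy == 0 and dx == 0:
                    continue  # the center was already evaluated
                y = top + center_dy + dy
                x = left + center_dx + dx
                if y < 0 or x < 0 or y + size > height or x + size > width:
                    continue  # candidate falls outside the reference frame
                cost = mean_abs_error(block, ref[y:y + size, x:x + size])
                points += 1
                if cost < best_cost:
                    best_cost = cost
                    best_dy, best_dx = center_dy + dy, center_dx + dx
        step //= 2  # halve the step: 4, 2, 1
    return (best_dy, best_dx), best_cost, points


rng = np.random.default_rng(0)
ref = rng.integers(0, 256, size=(32, 32))
cur = np.roll(ref, (-4, -4), axis=(0, 1))  # cur[y, x] == ref[y + 4, x + 4]
motion, cost, points = three_step_search(cur, ref, top=12, left=12)
print(motion, cost, points)
```

The plain algorithm evaluates up to 25 candidates per block (1 center plus 8 per step), which gives a baseline against which a reduced count such as the 13.31 points per block reported above can be read.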
ISBN: 9781424423538 (print)
This paper compares two prediction structures for predictive perceptual audio coding in the context of the ultra low delay (ULD) coding scheme. One structure is based on the commonly used AR signal model, leading to an IIR predictor in the decoder. The other structure is based on an MA signal model, leading to an FIR predictor in the decoder. We find that the AR-based predictor performs slightly better over an undisturbed transmission channel, but the MA-based predictor performs much better in the presence of transmission errors. For a bit error rate (BER) of 1.0e-5, the proposed MA model predictor achieves a mean objective difference grade (ODG) of -0.66, whereas the AR model predictor only reaches -3.42.
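The robustness difference between the two decoder structures can be seen in a toy first-order example: an IIR (AR-model) decoder feeds its own output back, so a single corrupted residual sample keeps echoing, while an FIR (MA-model) decoder only depends on a finite window of received residuals. The coefficient 0.9, the signal length, and the function names below are assumptions for illustration; this is not the ULD codec.

```python
import numpy as np


def decode_ar(residual, a):
    """IIR decoder: out[n] = e[n] + a * out[n-1] (feedback on decoded output)."""
    out = np.zeros(len(residual))
    prev = 0.0
    for n, e in enumerate(residual):
        prev = e + a * prev
        out[n] = prev
    return out


def decode_ma(residual, b):
    """FIR decoder: out[n] = e[n] + b * e[n-1] (depends only on received residuals)."""
    out = np.zeros(len(residual))
    prev_e = 0.0
    for n, e in enumerate(residual):
        out[n] = e + b * prev_e
        prev_e = e
    return out


def error_footprint(decoder, coeff, length=32, hit=5, eps=1e-9):
    """Count output samples disturbed by one corrupted residual sample."""
    clean = np.zeros(length)
    corrupt = clean.copy()
    corrupt[hit] += 1.0  # a single transmission error
    diff = np.abs(decoder(corrupt, coeff) - decoder(clean, coeff))
    return int(np.count_nonzero(diff > eps))


print(error_footprint(decode_ar, 0.9))  # prints 27: the error persists to the end
print(error_footprint(decode_ma, 0.9))  # prints 2: the error is confined to the filter length
```

In the IIR case the disturbance decays only as fast as the predictor's poles allow, which is consistent with the much lower ODG reported for the AR predictor under bit errors.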
As the newest video coding standard, H.264/AVC adopts highly efficient predictive coding and variable-length entropy coding to achieve high compression efficiency. On the other hand, transmission errors have become the major problem faced by video broadcasting service providers. Error concealment (EC) is adopted here to handle slices containing large contiguous corrupted areas. Considering that error propagation from a corrupted slice to succeeding ones is the key factor affecting video quality, this paper proposes a novel temporal EC scheme comprising a bidirectional motion vector (MV) retrieval method and an adaptive EC ordering based on it. The background and the parts of a slice with steady motion are given first and second priority, respectively. Combined with our proposed improved boundary matching algorithm (IBMA), which provides a more accurate distortion function, experimental results show that our proposal achieves better performance over channels with different error rates than the EC algorithm adopted in the H.264 reference software.
This paper presents a prediction-based image-hiding scheme that embeds secret data into compression codes during image compression. This scheme employs a two-stage structure: a prediction stage and an entropy coding stage. The secret data is embedded into the difference values of a given image after the prediction stage is performed. According to the experimental results, the image quality is better than that of Jpeg-Jsteg and its improved scheme (Inform. Sci. 141 (1-2) (2002) 123). The average image quality of the stego-images in the proposed scheme is greater than 50 dB when the hiding capacity is 1 bit per pixel, whereas the values for Jpeg-Jsteg and the scheme of Chang et al. (Inform. Sci. 141 (1-2) (2002) 123) are 37.04 and 33.73 dB, respectively. The hiding capacity of the proposed scheme is 65,536 bits when embedding 1 bit per pixel, whereas it is 53,248 bits in the scheme of Chang et al. (Inform. Sci. 141 (1-2) (2002) 123) and less than 3000 bits in Jpeg-Jsteg. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
ISBN: 9781424439904 (print)
Recently we proposed a block-based conditional correlation coefficient model for natural videos in the spatial-temporal domain. The conditioning is on local texture and the optimal parameters can be calculated for a specific video with a mean absolute error (MAE) usually smaller than 5%. We used this conditional correlation model and the classic results on conditional rate distortion functions to calculate new theoretical rate distortion bounds for videos which appear to be the only valid theoretical rate distortion bounds with regard to the current cutting-edge video compression technologies such as those standardized in AVC/H.264. In this paper, we focus on utilizing the new block-based local-texture-dependent correlation model to derive rate distortion bounds for blocking and optimal prediction across neighboring blocks. We study the penalty paid in average rate when the correlation among the neighboring blocks is discarded completely or is incorporated partially through predictive coding. We calculate the thresholds in average rate and distortion when incorporating the correlation among the neighboring blocks through optimal predictive coding becomes worse than completely discarding this correlation. We also discuss the role of local texture in inter-frame prediction.
The use of neural networks as nonlinear predictors in many applications, including predictive image coding, has been successfully demonstrated by many researchers. However, almost all of these research papers have focused on the architecture of the neural network, and very little attention has been given to the design of the training and testing data. This paper demonstrates how the choice of training data can dramatically affect the performance of neural networks in image prediction. The important design factors of the training and testing data are assessed, and the outcomes of the various simulations are presented.
Through research on lossy compression of encrypted coded images, lossy predictive coding is analyzed, and optimal predictors together with optimal quantization are established. The resulting image coding model works fairly well when applied to the encrypted image at a network user's entry point on a Web-based information network platform.
ISBN: 9781424438273 (print)
Predictive coding techniques are commonly used for lossless image compression because they effectively remove statistical redundancy between pixels. However, prediction errors can be large for pixels around boundaries. In this paper, we introduce techniques commonly used in control systems to enhance the coding efficiency of predictive coding. A predictive coding system behaves like a multi-input single-output system in which the predictor itself can be taken as the system model. The purpose of a control system is to follow the system command as precisely as possible, so the objectives of the two systems are the same. Moreover, an edge or a boundary among image pixels can be regarded as a step command in a control system. These observations lead to the idea of using control techniques to improve the prediction result for pixels around boundaries. To realize this idea, we use an adaptive Takagi-Sugeno fuzzy neural network (TS-FNN) as the predictor. Furthermore, the widely used proportional controller is implemented implicitly in the consequent part of the network so that the prediction error can be further compensated for pixels around boundaries. Our experiments show that the proposed approach achieves very good prediction results even without using any online training area for the network adaptation process, which makes the proposed system more feasible under limited resources. Finally, comparisons with existing state-of-the-art lossless predictors and coders are given to highlight the advantages of the proposed approach.