In conventional video coding schemes such as H.264/AVC, prediction value p is obtained for the original pixel value x via inter/intra prediction and the prediction residue d = (x - p) is calculated prior to orthogonal...
详细信息
In conventional video coding schemes such as H.264/AVC, prediction value p is obtained for the original pixel value x via inter/intra prediction and the prediction residue d = (x - p) is calculated prior to orthogonal transform and quantization. Encoding the prediction residue is nothing but ignoring the variety of distribution Pr(d\p) with respect to p, that is, using Pr(d) as a whole instead. Because H(d\p) les H(d), coding d using Pr(d) is not a good idea. For example, when p = 255, the prediction residue is always less than or equal to zero (i.e., d les 0) because 0 < x < 255 (for 8-bit case). Conventional prediction residue coding methods do not take this boundary into account and therefore involve coding inefficiency.
We provide a theoretical analysis of the performance of differential predictive coding using fixed-lag smoothing of the standard decoder output. This performance is compared to related results for coding using latency...
详细信息
We provide a theoretical analysis of the performance of differential predictive coding using fixed-lag smoothing of the standard decoder output. This performance is compared to related results for coding using latency at the encoder, and causal encoding with delayed decoding, as well as with some prior theoretical analyses of these methods. Surprisingly, it is shown that fixed-lag smoothing of the standard decoder output with causal encoding achieves the asymptotic and finite lag performance promised by a completely reoptimized decoder.
We consider linear predictive coding and noise shaping for coding and transmission of auto-regressive (AR) sources over lossy networks. We generalize an existing framework to arbitrary filter orders and propose use of...
详细信息
ISBN:
(纸本)9781424464258;9780769539942
We consider linear predictive coding and noise shaping for coding and transmission of auto-regressive (AR) sources over lossy networks. We generalize an existing framework to arbitrary filter orders and propose use of fixed-lag smoothing at the decoder, in order to further reduce the impact of transmission failures. We show that fixed-lag smoothing up to a certain delay can be obtained without additional computational complexity by exploiting the state-space structure. We prove that the proposed smoothing strategy strictly improves performance under quite general conditions. Finally, we provide simulations on AR sources, and channels with correlated losses, and show that substantial improvements are possible.
In current computation neuroscience, it is widely accepted that the mammalian entorhinal-hippocampus complex provides representations of spatial environment, exhibiting specific responses to particular locations and d...
详细信息
ISBN:
(数字)9789887581581
ISBN:
(纸本)9798350366907
In current computation neuroscience, it is widely accepted that the mammalian entorhinal-hippocampus complex provides representations of spatial environment, exhibiting specific responses to particular locations and directions. Various computation models of spatial cell firing have been proposed to unravel the mysteries of spatial cognition representation in the brain. However, an increasing body of experimental evidence suggests that the brain operates more akin to a predictive *** acquiring and utilizing limited environmental information, the brain constructs internal models during interaction with the external world, generating predictive representations of internal states following environmental changes. Building upon this premise, we propose a predictive coding theory-based entorhinal-hippocampus spatial cognition model. This model incorporates vestibular proprioceptive cues and border vector cells discharge responses to initially derive unimodal place cell discharge fields,followed by hippocampal place cells projecting to the entorhinal cortex via excitatory-inhibitory synaptic connections, ultimately generating periodic hexagonal grid cell discharge fields. Experimental results demonstrate that the neuronal discharge patterns obtained by this model align with physiological research findings, thereby providing new insights for further investigations into brain cognitive mechanisms.
In view of the information security questions, the information hiding technology already becomes the hot spot in the research field. On the basis of the predictive coding, an algorithm using the prediction error to ca...
详细信息
In view of the information security questions, the information hiding technology already becomes the hot spot in the research field. On the basis of the predictive coding, an algorithm using the prediction error to carry on the information hiding is proposed in this paper. In order to restrain the error diffusion which possibly appears during the anti-predictive coding in the information hiding process, an improved predictive coding algorithm is put forward. Through the experiments, the performances of the basic algorithm and improved algorithm are tested, resulting in the proof of the thread correctness. At the same time, the improved algorithm achieves the ultra large information capacity of 0.953 bits/byte and the PSNR of 49.184 dB so as to verify the validity of the improved algorithm.
A novel coding strategy for block-based compressive sensing named spatially directional predictive coding (SDPC) is proposed, which efficiently utilizes the intrinsic spatial correlation of natural images. At the enco...
详细信息
ISBN:
(纸本)9781479923427
A novel coding strategy for block-based compressive sensing named spatially directional predictive coding (SDPC) is proposed, which efficiently utilizes the intrinsic spatial correlation of natural images. At the encoder, for each block of compressive sensing (CS) measurements, the optimal prediction is selected from a set of prediction candidates that are generated by four designed directional predictive modes. Then, the resulting residual is processed by scalar quantization (SQ). At the decoder, the same prediction is added onto the de-quantized residuals to produce the quantized CS measurements, which is exploited for CS reconstruction. Experimental results substantiate significant improvements achieved by SDPC-plus-SQ in rate distortion performance as compared with SQ alone and DPCM-plus-SQ.
Summary form only given. A compression algorithm for high quality speech signal using predictive coding techniques is developed. Code-excited linear predictive coding (CELPC) is one of the key techniques to compress s...
详细信息
Summary form only given. A compression algorithm for high quality speech signal using predictive coding techniques is developed. Code-excited linear predictive coding (CELPC) is one of the key techniques to compress speech signal to a bit-rate around 4.8 Kbps. However, due to the heavy computational requirement in the CELPC and speech signals usually can be divided into two portions: namely the based-band and the high-band frequency range. A hybrid CELPC and voice excited linear predictive coding (VELPC) scheme is developed for speech coding to reduce the complexity of the original CELPC. In the algorithm, a speech signal is firstly divided into two portions, the based-band and high-band respectively, in frequency domain, and then the low portion is coded with CELPC and the high-band portion is coded with VELPC. The test experiments showed this new coder can produce synthesized speech with good quality at a better bit rates than the original CELPC. When using the coding methods for the base-band and the high-band signal, we must decide how to divide the speech signal into two portions. In choosing the bandwidth of the base-band signal, there is a trade-off between the coding quality and the bit rate. In our experiment, the bandwidth of the base-band signal is chosen as one fourth of that of the original speech. Subjective evaluation experiments were conducted to test the performance of the hybrid CELPC and VELPC technique. For speech signal sampled at 8 kHz, a bit rate of 4.0 kbps can be achieved with frame intervals of 23 ms. The experimental results showed that the quality of the synthesized speech using hybrid coding technique at the bit rate of 4.0 kbps was almost the same as that of the CELPC at the bit rate of 4.8 kbps.
The problem of robust communication of predictive encoded video in a joint source-channel setting is addressed. Specifically, the problem of predictive mismatch, where there is a drift between the state of the encoder...
详细信息
The problem of robust communication of predictive encoded video in a joint source-channel setting is addressed. Specifically, the problem of predictive mismatch, where there is a drift between the state of the encoder and the decoder is addressed as a variant of the Wyner-Ziv problem. A video encoding algorithm based on the H.26L video codec, which presents the propagation of error in predictively encoded video in the event of predictive mismatch (or drift) between the encoder and the decoder, is proposed. One of the main advantages of the proposed approach is that there is minimal loss in performance over the standard H.26L encoder during error-free transmission, while simultaneously allowing error recovery in the event of errors. Using turbo codes as coset codes, the performance of the proposed codec is evaluated and the efficacy of the proposed framework is demonstrated. The performance of the proposed approach can only improve with the use of superior coset codes.
We present a new speech coding, named neural predictive coding, based on an original neural predictive structure. The system is designed to predict the samples of a speech signal window from previous ones. The goal of...
详细信息
We present a new speech coding, named neural predictive coding, based on an original neural predictive structure. The system is designed to predict the samples of a speech signal window from previous ones. The goal of this coding is to extract the signal discriminative features relative to the database from which it is extracted. After a precise description of our coding, we discuss about the coding phase of the algorithm. We present experiences which show an improvement on telephone speech recognition compared with other predictive or frequency coding. The signal windows are extracted from the telephone speech signal database NTIMIT.
In this paper, a new fuzzy logic-based lossy predictive coding system for gray-scale still image compression is developed. The proposed coder employs a recently introduced adaptive fuzzy prediction methodology in the ...
详细信息
In this paper, a new fuzzy logic-based lossy predictive coding system for gray-scale still image compression is developed. The proposed coder employs a recently introduced adaptive fuzzy prediction methodology in the predictor design. In addition, it adopts a novel fuzzy gradient-adaptive quantization scheme. The proposed coding technique possesses superior performance over its non-fuzzy counterparts especially at low bit quantization. This is due to the inherent adaptivity in the fuzzy prediction methodology as well as the gradient-adaptive quantization scheme. Simulation results are provided to demonstrate the efficient performance of the proposed fuzzy predictive coding system.
暂无评论