predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not perceive signal distortion on the basis of r.m.s. error regardless of its spectral shape relative to ...
详细信息
predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not perceive signal distortion on the basis of r.m.s. error regardless of its spectral shape relative to the signal spectrum. Specifically, for speech signals, the locations of the formant frequencies and their rates of change with time influence the audibility, and thus the subjective distortion of any quantizing noise. In this paper, methods for reducing the subjective distortion in predictive coders for speech siganls are described and evaluated. Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
The primary visual cortex (V1) has been classically viewed as an immutable feature detector, with robust responses to low-level characteristics of objects in the visual field. Recent studies have shown the capacity of...
详细信息
The primary visual cortex (V1) has been classically viewed as an immutable feature detector, with robust responses to low-level characteristics of objects in the visual field. Recent studies have shown the capacity of this cortical area to perform more complex computations. Nominally, the phenomenon of sequence learning relies on the ability of V1 to encode the serial order and temporal frequency of a spatiotemporal visual sequence. Investigating the mechanisms driving this phenomenon through the lens of predictive coding will further the understanding of how V1 operates locally to encode time and learns to predict the future based on minimal sensory information. Through in vivo multi-unit recordings from awake mice, this study sought to isolate neural evidence for predictive processing within the paradigm of sequence learning. Seventy unique units were isolated from forty-two mice subjected to experimentation. Preliminary analyses revealed a significant effect that agrees with the initial report on sequence learning but contradicts predictive processing theory. Further investigation is required to draw more robust conclusions about the predictive computations that occur during sequence learning. Increased sample size and refinement of data analysis will likely lead to interesting results.
Prediction techniques have been extensively studied for images and video compression over the last few decades. Currently, most existing images and video compression solutions contain some predictive encoding in their...
详细信息
Prediction techniques have been extensively studied for images and video compression over the last few decades. Currently, most existing images and video compression solutions contain some predictive encoding in their algorithms. For example, it is well known that the 25-year history of the joint image group (JPEG) image coding standard predictive encodes quantified DC transform coefficients. There some other sorts of predictive coding techniques which implement redundancies inside images and videos. Similarly, in some particular cases such as image sequences, the compression system usually uses the motion compensation prediction to utilize the time redundancy. We discuss some of the most essential predictive coding techniques used in state-of-the-art image and video encoders. Then we provide some basic concepts about video color spaces representation. Various technologies including linear and non-linear predictions methods are presented in the following *** this thesis, we have implemented and presented an Artificial Neural Network Approach for the encoding of JPEG images. We have shown that JPEG compression can be significantly improved with our decoding method. In these experiments, we have implemented MLP and RNN models on tiny-imagenet data sets. We performed stateless and stateful prediction without using spatial information as well as predictions using partial spatial information and full spatial information. We use the partial and full spatial redundancy in the image to implement a nonlinear mapping function. Compared with the original image, the proposed method outperforms the JPEG image in generating predictive images with relatively small distortion.
This paper describes quantitative and qualitative improvements in 6400 b/s adaptive predictive voice coding when delayed decision is used to quantize the residual signal into one bit/ sample. Signal-to-quantizing nois...
详细信息
This paper describes quantitative and qualitative improvements in 6400 b/s adaptive predictive voice coding when delayed decision is used to quantize the residual signal into one bit/ sample. Signal-to-quantizing noise curves as a function of the delay decision length provide quantitative indications of the improvements while listening tests of sentences processed with and without delayed decisions provide measures of qualitative performance. Although delayed decision does provide better waveform tracking and yields better than 2 dB improvements in signal-to-quantizing noise, auditory tests indicate that listeners cannot hear the improvement with decision delays of 5 or less. Consequently, the increased complexity of delayed decision with 6400 b/s adaptive predictive coding does not result in a useable improvement in voice transmission.
A new particle dynamics model (PDM) is proposed for the prediction-based lossless data compression. The structure, algorithm and properties of PDM used to generate the desired predictive coding are discussed. The prop...
详细信息
A new particle dynamics model (PDM) is proposed for the prediction-based lossless data compression. The structure, algorithm and properties of PDM used to generate the desired predictive coding are discussed. The proposed PDM approach has advantages in terms of parallelism, scalability, and easy hardware implementation over other sequential lossless compression methods.
A lossy coding scheme is proposed for separate encoding and joint decoding of two correlated sequences. The algorithm simultaneously exploits the correlation between the sequences (using a binning-based quantization s...
详细信息
A lossy coding scheme is proposed for separate encoding and joint decoding of two correlated sequences. The algorithm simultaneously exploits the correlation between the sequences (using a binning-based quantization scheme) and that between the samples of each sequence (using linear prediction). Under the proposed coding regime, optimal prediction filter design fundamentally deviates from the traditional approach. More specifically, it is, in general, not optimal to employ first-order prediction for noisy observations of a first-order Markov source, even when the noise is negligibly small. Moreover, even if the prediction filter is constrained to be of degree 1, the optimal filter coefficient is different from the correlation coefficient of the Markov source. In the particular example treated in this paper, it is shown that optimal first- and second-order prediction respectively enjoy up to 0.9 dB and 1.15 dB improvement over the traditional approach.
This study aims to reevaluate the role of the N 400 component in language processing from the perspective of predictive coding theory and investigate the impact of surprisal on N400 amplitude. An ERP experiment was co...
详细信息
ISBN:
(数字)9798331505431
ISBN:
(纸本)9798331505448
This study aims to reevaluate the role of the N 400 component in language processing from the perspective of predictive coding theory and investigate the impact of surprisal on N400 amplitude. An ERP experiment was conducted using three surprisal conditions (Best Completion, Unrelated, Implausible) with a BERT-based model optimized for Japanese contexts. Analysis of EEG data revealed distinct N400 amplitudes and brain connectivity patterns across the surprisal conditions. Notably, changes in alpha-band phase synchronization reflected cognitive processes related to the degree of mismatch between predicted and actual input in each context. These findings suggest that higher levels of surprisal lead to increased cognitive load, manifesting as dynamic changes in brain network activity.
In this article we evaluate the use of predictive coding for compression of SAR phase history data. We first show that the data are mainly correlated along range lines. Then, we exploit this result to define a new DPC...
详细信息
In this article we evaluate the use of predictive coding for compression of SAR phase history data. We first show that the data are mainly correlated along range lines. Then, we exploit this result to define a new DPCM-based compression algorithm named RDPCM-BAQ. The performance of this algorithm is compared with that of BAQ on SIR-C/X-SAR data, showing a significant improvement in signal-to-noise ratio of up to 2 dB with respect to BAQ.
Nowadays, wearable sensors or portable devices have great potentials for real-time monitoring of health and fitness of an individual but they are constrained with limited battery power. Therefore, exploring lightweigh...
详细信息
ISBN:
(纸本)9781665408110
Nowadays, wearable sensors or portable devices have great potentials for real-time monitoring of health and fitness of an individual but they are constrained with limited battery power. Therefore, exploring lightweight signal processing technique is highly demanded for accurately measuring the pulse rate (PR) and respiration rate (RR) from the photoplethysmo-gram (PPG) signal in addition to the data compression in order to reduce or even eliminate the need for frequent charging of devices and replacement of batteries. In this paper, we present a lightweight unified predictive coding framework for achieving simultaneous data compression, PR and RR extraction from the PPG signal. Evaluation results demonstrate that the proposed unified framework can achieve compression ratio of 4:1 with energy saving of 52.38 %. For PR estimation, the method had mean absolute error (MAE) of 1.20 (bpm), Pearson coefficient of 0.9829 and Bland Altman ratio of 5.37. The RR estimation had promising MAE results of 3.1 (1.5-5.6 for 25th-75th percentiles) and outperforms the existing methods.
Recent work in offline reinforcement learning (RL) has demonstrated the effectiveness of formulating decision-making as return-conditioned supervised learning. Notably, the decision transformer (DT) architecture has s...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
Recent work in offline reinforcement learning (RL) has demonstrated the effectiveness of formulating decision-making as return-conditioned supervised learning. Notably, the decision transformer (DT) architecture has shown promise across various domains. However, despite its initial success, DTs have underperformed on several challenging datasets in goal-conditioned RL. This limitation stems from the inefficiency of return conditioning for guiding policy learning, particularly in unstructured and suboptimal datasets, resulting in DTs failing to effectively learn temporal compositionality. Moreover, this problem might be further exacerbated in long-horizon sparse-reward tasks. To address this challenge, we propose the predictive coding for Decision Transformer (PCDT) framework, which leverages generalized future conditioning to enhance DT methods. PCDT utilizes an architecture that extends the DT framework, conditioned on predictive codings, enabling decision-making based on both past and future factors, thereby improving generalization. Through extensive experiments on eight datasets from the AntMaze and FrankaKitchen environments, our proposed method achieves performance on par with or surpassing existing popular value-based and transformer-based methods in offline goal-conditioned RL. Furthermore, we also evaluate our method on a goal-reaching task with a physical robot.
暂无评论