A novel least-squares formulation of the vector linear prediction (VLP) problem is presented. Based on this formulation, we develop two new design methods for obtaining the optimal vector predictor for frame-adaptive ...
详细信息
A novel least-squares formulation of the vector linear prediction (VLP) problem is presented. Based on this formulation, we develop two new design methods for obtaining the optimal vector predictor for frame-adaptive prediction: the covariance method and the autocorrelation method, which bear the names of the corresponding methods in scalar LPC analysis. Our formulation reveals several previously unrecognized properties of the resulting normal equation. Simulation results for VLP of speech waveforms confirm that the two proposed methods indeed give higher prediction gain than previously developed methods.
A new predictor for DPCM based on a recurrent neural network is proposed. The proposed predictor which uses the bilinear recurrent neural network (BLRNN), has shown good performance for time-series prediction problems...
详细信息
ISBN:
(纸本)0780348591
A new predictor for DPCM based on a recurrent neural network is proposed. The proposed predictor which uses the bilinear recurrent neural network (BLRNN), has shown good performance for time-series prediction problems, and it is applied to DPCM for image compression with predictive coding. The performance of DPCM with BLRNN predictor is compared with the conventional DPCM with different predictors, such as the linear predictor and median predictor. The results show that the proposed method gives improved results over the conventional DPCM with linear predictor or median predictor in terms of PSNR or reconstructed images.
A single-board 16 kb/s and 32 kb/s APC speech coding terminal is described. The board is configurable as either two receivers, two transmitters, or a single-channel full-duplex coder. The terminal consists of two iden...
详细信息
A single-board 16 kb/s and 32 kb/s APC speech coding terminal is described. The board is configurable as either two receivers, two transmitters, or a single-channel full-duplex coder. The terminal consists of two identical, independent TI TMS32020- based modules. Included in each module is a TMS32020, an analog I/O section, A/D/A conversion, digital I/O circuitry, static RAMs, and EPROMs. The board size is 5.6" × 6.0", and can be packaged in a standard secretarial desk phone shell. A software architecture using this hardware to implement the 16/32 kb/s Adaptive predictive coding with Hybrid Quantization (APCHQ) algorithm is described.
In the last years, deep video coding has attracted a lot of research interest. Usually, it employs the concept of inter coding by transmitting features in a latent space that represent a motion field or a residual. Ho...
详细信息
ISBN:
(数字)9798350358483
ISBN:
(纸本)9798350358490
In the last years, deep video coding has attracted a lot of research interest. Usually, it employs the concept of inter coding by transmitting features in a latent space that represent a motion field or a residual. However, in such a setting there are still redundancies between the features of consecutive frames. In previous approaches, these redundancies are exploited for compression by adding an additional input at the encoder and decoder. However, this often comes at the cost of changing the whole network architecture. In this paper, we present a conditional coding for motion features which utilizes already transmitted features for coding the features of the current picture in a more effective way. This concept can be applied on top of any existing coding framework. Our coding experiments, which were conducted for JVET test sequences, demonstrate that the proposed conditional motion feature coding can yield bit-rate savings of up to 9% relative to an independent coding of the motion features for individual pictures.
We investigate the coding of multiview images obtained from a set of multiple cameras. To exploit the inter-view correlation, two view-prediction tools have been implemented and used in parallel: a block-based motion ...
详细信息
We investigate the coding of multiview images obtained from a set of multiple cameras. To exploit the inter-view correlation, two view-prediction tools have been implemented and used in parallel: a block-based motion compensation scheme and a depth image based rendering technique (DIBR). Whereas DIBR relies on an accurate depth image, the block-based motion-compensation scheme can be performed without any geometry information. Our encoder adaptively selects the most appropriate prediction scheme using a rate-distortion criterion for an optimal prediction-mode selection. The attractiveness of the algorithm is that the compression algorithm is robust against inaccurately estimated depth images and requires only one single reference camera for fast random-access to different views. We present experimental results for several multiview sequences, that result in a quality improvement of up to 1.4 dB as compared to H.264 compression.
We propose a new multi-view video coding method using adaptive selection of motion/disparity compensation based on H.264/AVC. One of the key points of the proposed method is the use of view interpolation as a tool for...
详细信息
We propose a new multi-view video coding method using adaptive selection of motion/disparity compensation based on H.264/AVC. One of the key points of the proposed method is the use of view interpolation as a tool for disparity compensation by assigning reference picture indices to interpolated images. Experimental results show that significant gains can be obtained compared to the conventional approach that was often used
We introduce a compressible representation of 3D geometry (including its attributes, such as color texture) intermediate between polygonal meshes and point clouds called a polygon cloud. Polygon clouds, compared to po...
详细信息
ISBN:
(纸本)9781509041183
We introduce a compressible representation of 3D geometry (including its attributes, such as color texture) intermediate between polygonal meshes and point clouds called a polygon cloud. Polygon clouds, compared to polygonal meshes, are more robust to live capture noise and artifacts. Furthermore, dynamic polygon clouds, compared to dynamic point clouds, are easier to compress, if certain challenges are addressed. In this paper, we propose methods for compressing dynamic polygon clouds using transform coding of color and motion residuals. We find that, compared to static polygon clouds and a fortiori static point clouds, dynamic polygon clouds can improve color compression by up to 2-3 dB in fidelity, and can improve geometry compression up to a factor of 2-5 in bit rate.
The multi-view video coding improves the coding efficiency by utilizing motion-compensated prediction (MCP) and disparity-compensated prediction (DCP). However, the complexity of the inter frame prediction is very hig...
详细信息
The multi-view video coding improves the coding efficiency by utilizing motion-compensated prediction (MCP) and disparity-compensated prediction (DCP). However, the complexity of the inter frame prediction is very high, especially when the rate-distortion optimization is used. This paper presents a fast inter frame prediction algorithm to reduce the complexity. Firstly the prediction type is decided according to reference frames. Then some unuseful search regions in view direction are removed. Finally a fast inter mode decision strategy is proposed based on the relationship between MCP and DCP. Experimental results verify that the proposed algorithm can greatly increase the speed of prediction with negligible loss of coding efficiency.
We propose probabilistic method to determine motion vector (MV) for block matching algorithm (BMA). Proposed method allow us to exploit random distribution of motion vector in successive video frames for selection of ...
详细信息
We propose probabilistic method to determine motion vector (MV) for block matching algorithm (BMA). Proposed method allow us to exploit random distribution of motion vector in successive video frames for selection of initial search points for first iteration and refinement stage used in further iteration tracks motion vector in continuously changing video sequence. In our proposed algorithm due to adaptive step size it is able to track motion vectors of low motion as well as high motion video. The Simulation result shows that our proposed algorithm Probability based search motion estimation (PBSME) outperforms all sub-optimal motion estimation (ME) algorithms in terms of quality and speed up performance and in many cases PSNR of proposed algorithm is comparable to Full Search with several times faster.
An adaptive block truncation encoding scheme is described. The adaptive block size encoding approach used removes the bottlenecks of compromising the reduction factor and fidelity of the decoded image present in the c...
详细信息
An adaptive block truncation encoding scheme is described. The adaptive block size encoding approach used removes the bottlenecks of compromising the reduction factor and fidelity of the decoded image present in the corresponding fixed block size scheme. The results show that the adaptive scheme produces a better reduction ratio, keeping approximately the same fidelity measure. The scheme is suitable for implementation with the pyramidal architecture which has a hierarchical nature of processing and needs greatly reduced logarithmic distances for passing information among the processing elements of the meshes. An added advantage of the present system is that it extracts most of the edge information and localizes it at the lowest mesh with respect to the host. This simplifies greatly the further processing of the image regarding its edges and regions.< >
暂无评论