ISBN (print): 0819424358
This paper addresses the representation of images and video objects of arbitrary shape with surface models based on active meshes. Usually, control point optimization for such models relies on the mean squared error between the original signal and the model. However, computing this criterion requires parametrizing both the original and the model surfaces. On the one hand, the parametrizations of the original and model manifolds are independent; on the other hand, many choices exist. This approach therefore provides a solution that depends on these choices: changing them changes the approximation result. To avoid this problem, we developed an iterative method based on surface evolution theory which computes the 3D control points by minimising a purely geometric criterion. In this paper, we illustrate the use of this approach by computing an adaptive mesh-based representation of an image surface. This approach is particularly suited both to compression and to manipulations such as spatial scalability and more general geometric transformations.
ISBN (print): 0819424358
A rate-distortion framework is used to define a displacement vector-field estimation technique for use in video coding. This technique achieves maximum reconstructed image quality under the constraint of a target bitrate for the coding of the vector sequence. Use of this technique is evaluated for two application areas in which the need for high compression of displacement vector fields is particularly acute. The first is motion-field coding for very low bit rate image sequence transmission as in videophone applications. The second application area is coding for the transmission of disparity fields. This is needed for the generation at the receiver of intermediate viewpoints through spatial interpolation. It is also needed in a number of other applications requiring accurate depth knowledge, including 3D medical data transmission and transmission of scenes to be postprocessed using depth-keyed segmentation. Experimental results illustrating the performance of the proposed technique in these application areas are presented and evaluated.
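The trade-off between reconstruction quality and vector bitrate described above is often expressed as a Lagrangian cost J = D + λR. The following is a minimal sketch of that general idea, not the authors' estimator: the candidate tuples, the `best_vector` helper, and the multiplier `lam` are all hypothetical names introduced for illustration.

```python
def best_vector(candidates, lam):
    """Return the candidate vector minimizing the Lagrangian cost D + lam * R.

    candidates: iterable of (vector, distortion, rate_bits) tuples (illustrative data).
    lam: Lagrange multiplier (assumed tuning parameter); larger values
         favor vectors that are cheaper to code.
    """
    return min(candidates, key=lambda c: c[1] + lam * c[2])[0]


# Three hypothetical candidates for one block: (vector, distortion, bits).
candidates = [((0, 0), 120.0, 1), ((1, 0), 90.0, 4), ((2, -1), 60.0, 9)]

for lam in (0.0, 8.0, 50.0):
    print(lam, best_vector(candidates, lam))
```

With `lam = 0` the lowest-distortion vector wins regardless of its cost; as `lam` grows, the choice shifts toward vectors that need fewer bits.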
ISBN (print): 0819424358
Many battlefield applications require the ability to transmit images over narrow-bandwidth noisy channels. Previous research has demonstrated that Predictive Trellis-Coded Quantization (PTCQ) incorporating a nonlinear prediction filter results in a method of robust source coding. Robust source coding provides both compression and noise mitigation without the need to allocate additional bandwidth for channel coding. However, the traditional PTCQ algorithm is suboptimal. This suboptimality arises from the prediction operation; a trellis path is eliminated in favor of the survivor path at each stage in time to form the input to the prediction filter. It is reasonable to assume that this eliminated path may have produced a lower overall distortion than the survivor path. In this paper we address this suboptimality by incorporating a look-ahead stage into the PTCQ algorithm. This "less-greedy" approach allows coding gains with a slight increase in overhead. The resulting algorithm yields an image encoding technique that enables resilient image transmission over tactical channels.
ISBN (print): 0819427497
This paper describes a procedure for model-based coding of all channels of a multiview image sequence. The 3D model is initialized by accurate adaptation of a 2D wireframe model to the foreground object of one of the views. The rigid 3D motion is estimated for each triangle, and spatial homogeneity neighbourhood constraints are used to improve the reliability of the estimation efficiency and to smooth the motion field produced. A novel technique is used to estimate flexible motion of the nodes of the wireframe from the rigid 3D motion vectors of the wireframe triangles containing each node. Kalman filtering is used to track both rigid 3D motion of each triangle and flexible deformation of each node of the wireframe. The performance of the resulting 3D flexible motion estimation method is evaluated experimentally.
ISBN (print): 0819424358
We present an algorithm to segment image sequences from motion information. A dense vector field estimated by a Wiener-based pel-recursive method is the key to separating a viewed scene into regions with different apparent displacement, according to a four-parameter motion model. A preprocessing stage using mathematical morphology enhances pel-recursive motion estimation. The proposed segmentation model, based on Markov Random Field (MRF) theory, considers, besides the motion field, other information sources (gray level, intensity edges, non-compensated pixels) that help describe the problem more accurately. The maximum a posteriori (MAP) criterion is used to optimize the solution, and the optimization is performed with a deterministic approach. The complete segmentation algorithm includes initialization, region numbering and labeling, parameter estimation of the motion model in each region, and optimization of the segmentation field. Results on synthetic and real sequences are shown.
ISBN (print): 0819427497
In this paper, a novel multiresolution algorithm for low bit-rate image compression is presented. High-quality low bit-rate image compression is achieved by first decomposing the image into approximation and detail subimages with a shift-orthogonal multiresolution analysis. Then, at the coarsest resolution level, the coefficients of the transformation are encoded by an orthogonal matching pursuit algorithm with a wavelet packet dictionary. Our dictionary consists of convolutional splines of up to order two for the detail and approximation subbands. The intercorrelation between the various resolutions is then exploited by using the same bases from the dictionary to encode the coefficients of the finer resolution bands at the corresponding spatial locations. To further exploit the spatial correlation of the coefficients, the embedded zerotree wavelet (EZW) algorithm [1] is used to identify the potential zero trees. The coefficients of the representation are then quantized and arithmetic encoded at each resolution, and packed into a scalable bit stream structure. Our new algorithm is highly bit-rate scalable, and performs better than the segmentation-based matching pursuit [2,3] and EZW encoders at lower bit rates, based on subjective image quality and peak signal-to-noise ratio (PSNR).
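For readers unfamiliar with the PSNR figure used in the comparison above, the standard definition is PSNR = 10 log10(peak² / MSE). The sketch below assumes 8-bit images flattened into plain sequences of pixel values; the function name `psnr` and the `peak` parameter are illustrative choices, not taken from the paper.

```python
import math


def psnr(original, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(original) != len(reconstructed) or len(original) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(peak * peak / mse)


print(psnr([10, 20, 30, 40], [12, 18, 30, 41]))
```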
ISBN (print): 0819424358
This paper presents a simple color segmentation technique that could be used in model-based very low bit-rate coding approaches for videophone applications, in which delimitation of the speaker's face is required. This work attempts to segment the speaker's face using color cues. To take better advantage of the color content of images, the color segmentation is carried out in HSI (Hue, Saturation, Intensity) space, with the three components used in two steps. The original image is first split into two groups of regions, one with higher saturation values and the other with lower saturation values, by using an adaptive threshold value applied to the saturation histogram. In the high-saturation regions, the hue component can furnish useful references for further segmentation, while in the low-saturation regions the intensity component can play a similar role. For each group of regions, a multi-thresholding technique based on either the hue or the intensity component is then proposed for the subsequent segmentation. After both groups of regions are segmented, a combination of the two segmentation results provides the final segmented image. Experiments with images taken from typical "head-and-shoulders" videophone sequences are carried out and some results are presented.
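The first step described above, splitting pixels by saturation and then choosing hue or intensity as the cue, can be sketched as follows. This is an illustration under stated assumptions rather than the authors' procedure: it uses the common geometric RGB-to-HSI conversion, and the fixed `threshold` argument stands in for the adaptive histogram-based threshold, whose derivation the abstract does not give. Both function names are hypothetical.

```python
import math


def rgb_to_hsi(r, g, b):
    """Convert RGB components in [0, 1] to (hue in degrees, saturation, intensity)."""
    intensity = (r + g + b) / 3.0
    if intensity == 0.0:
        return 0.0, 0.0, 0.0  # black: hue and saturation are undefined, report 0
    saturation = 1.0 - min(r, g, b) / intensity
    # The radicand is non-negative in exact arithmetic; clamp against rounding.
    denom = math.sqrt(max(0.0, (r - g) ** 2 + (r - b) * (g - b)))
    if denom == 0.0:
        return 0.0, saturation, intensity  # gray: hue is undefined, report 0
    cos_h = max(-1.0, min(1.0, 0.5 * ((r - g) + (r - b)) / denom))
    hue = math.degrees(math.acos(cos_h))
    if b > g:
        hue = 360.0 - hue
    return hue, saturation, intensity


def split_by_saturation(pixels, threshold):
    """Split pixels into a high-saturation group (cue: hue) and a low one (cue: intensity).

    threshold is an assumed fixed value standing in for the adaptive threshold.
    Returns two lists of (pixel_index, cue_value).
    """
    high, low = [], []
    for idx, (r, g, b) in enumerate(pixels):
        hue, sat, inten = rgb_to_hsi(r, g, b)
        if sat >= threshold:
            high.append((idx, hue))
        else:
            low.append((idx, inten))
    return high, low


pixels = [(1.0, 0.0, 0.0), (0.5, 0.5, 0.5), (0.0, 0.0, 1.0)]
high, low = split_by_saturation(pixels, threshold=0.3)
print(high, low)
```

Each group would then be segmented further by multi-thresholding on its own cue, which this sketch leaves out.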
ISBN (print): 188065332X
Assuming that welding data among structural parts, including the positions and directions of their contact points and lines, can be easily obtained from CAD data, we have proposed image processing algorithms to reliably detect only the terminal points for the welding, and to precisely detect the welding line for torch position control. The proposed algorithms have been applied to a test piece model of large-scale steel tube structures, and continuous tracking along the welding line while measuring its welding gap has been presented.
ISBN (print): 0819427497
Methods of near-lossless image compression based on the criterion of maximum allowable deviation of pixel values are described in this paper. Predictive and multiresolution techniques for performing near-lossless compression are investigated. A procedure for near-lossless compression using a modification of lossless predictive coding techniques to satisfy the specified tolerance is described. Simulation results with modified versions of two of the best lossless predictive coding techniques known, CALIC and JPEG-LS, are provided. It is shown that the application of lossless coding based on reversible transforms in conjunction with pre-quantization is inferior to predictive techniques for near-lossless compression. A partial embedding two-layer scheme is proposed in which an embedded multiresolution coder generates a lossy base layer, and a simple but effective context-based lossless coder codes the difference between the original image and the lossy reconstruction. Simulation results show that this lossy plus near-lossless technique yields compression ratios very close to those obtained with predictive techniques, while providing the feature of a partially embedded bit-stream.
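One common way to modify a lossless predictive coder so that every pixel stays within a tolerance δ is to quantize the prediction residual uniformly with step 2δ + 1 and to predict from reconstructed values. The sketch below illustrates that idea with a deliberately trivial previous-sample predictor. It is not CALIC, JPEG-LS, or necessarily the authors' exact modification, and the function names are hypothetical.

```python
def quantize_residual(e, delta):
    """Map an integer residual e to a quantizer index with step 2*delta + 1."""
    step = 2 * delta + 1
    if e >= 0:
        return (e + delta) // step
    return -((delta - e) // step)


def dequantize_residual(q, delta):
    """Reconstruct the residual from its quantizer index."""
    return q * (2 * delta + 1)


def encode_decode_row(row, delta):
    """Closed-loop predictive coding of one row with a previous-sample predictor.

    The predictor uses the reconstructed value, as a decoder would, so the
    per-pixel error never exceeds delta and does not accumulate.
    No clamping to the valid pixel range is applied in this sketch.
    """
    recon = []
    prev = 0  # assumed initial prediction
    for x in row:
        q = quantize_residual(x - prev, delta)
        prev = prev + dequantize_residual(q, delta)
        recon.append(prev)
    return recon


row = [100, 104, 109, 90, 91, 255, 0]
print(encode_decode_row(row, delta=2))
```

Setting `delta = 0` gives a step of 1 and the scheme becomes lossless.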
ISBN (print): 0819427497
This paper presents a simple and effective method to determine global camera motion using raw MPEG-1 motion vector information obtained straight from real MPEG-1 streams such as those of the new HITACHI MP-EG1A digital camcorder. The simple approach we have experimented with robustly fits a global affine optic flow model to the motion vectors. Other more robust methods are also proposed. In order to cope with the Group-of-Frames (GOF) discontinuity of the MPEG stream, B frames are used backward to determine the "missing link" to a previous GOF, thereby ensuring continuity of the motion estimation across a reasonable number of frames. As a testbed, we have applied the method to the image mosaicing problem, for which interesting results have been obtained. Although several other methods exist to perform camera motion estimation, the approach presented here is particularly interesting because it exploits "free" information present in MPEG streams and bypasses the highly expensive correlation process.
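Fitting a global affine flow model to block motion vectors can be sketched as an ordinary least-squares problem: u ≈ a0 + a1·x + a2·y and v ≈ b0 + b1·x + b2·y. The code below is a plain least-squares illustration with no outlier rejection, so it is not the robust fit the abstract refers to. The `(x, y, u, v)` input layout and both function names are assumptions, and extracting vectors from an MPEG-1 stream is out of scope here.

```python
def solve3(m, rhs):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    a = [row[:] + [r] for row, r in zip(m, rhs)]  # augmented matrix
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("degenerate block layout (points are collinear)")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            f = a[r][col] / a[col][col]
            for c in range(col, 4):
                a[r][c] -= f * a[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):  # back substitution
        x[r] = (a[r][3] - sum(a[r][c] * x[c] for c in range(r + 1, 3))) / a[r][r]
    return x


def fit_affine(blocks):
    """Least-squares affine fit to motion vectors.

    blocks: list of (x, y, u, v), block centre and its motion vector.
    Returns ((a0, a1, a2), (b0, b1, b2)).
    """
    n = [[0.0] * 3 for _ in range(3)]  # normal matrix, shared by u and v
    ru, rv = [0.0] * 3, [0.0] * 3
    for x, y, u, v in blocks:
        phi = (1.0, float(x), float(y))
        for i in range(3):
            ru[i] += phi[i] * u
            rv[i] += phi[i] * v
            for j in range(3):
                n[i][j] += phi[i] * phi[j]
    return tuple(solve3(n, ru)), tuple(solve3(n, rv))


# Synthetic vectors generated from a known affine model on a 3x3 block grid.
blocks = [(x, y, 1.0 + 0.1 * x - 0.2 * y, -2.0 + 0.05 * x + 0.3 * y)
          for x in (0, 16, 32) for y in (0, 16, 32)]
print(fit_affine(blocks))
```

A robust variant would typically discard or down-weight vectors with large residuals and refit. That step is omitted here.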