Digital video databases in compressed form are becoming widely available. In applications such as video browsing, and picture in picture, for a lower bitrate, there is a need to down-sample the video before transmissi...
详细信息
Digital video databases in compressed form are becoming widely available. In applications such as video browsing, and picture in picture, for a lower bitrate, there is a need to down-sample the video before transmission. The conventional approach to downscale a compressed video sequence is to decompress it at the video server, perform the down-sampling in the pixel domain and then recompress it for efficient delivery. This process is computationally intensive due to the motion estimation process required during the recompression phase. In the alternative compressed domain approach, the motion vectors of the downscaled video sequence are computed directly from the motion vectors of the original full size stream. In this paper we propose a compressed domain technique that generates a better estimate for the downscaled motion vectors. Simulations suggest that the performance achieved with the proposed method is superior by up to 1 dB to the current compressed domain techniques.
We describe a multi-chip CMOS VLSI visual motion processing system which combines analog circuitry with an asynchronous digital interchip communications protocol to allow more complex motion processing than is possibl...
详细信息
We describe a multi-chip CMOS VLSI visual motion processing system which combines analog circuitry with an asynchronous digital interchip communications protocol to allow more complex motion processing than is possible with all the circuitry in the focal plane. The two basic VLSI building blocks are a sender chip which incorporates a 2D imager array and transmits the position of moving spatial edges, and a receiver chip which computes a 2D optical flow vector field from the edge information. The elementary two-chip motion processing system consisting of a single sender and receiver is first characterized. Subsequently, two three-chip motion processing systems are described. The first such system uses two sender chips to compute the presence of motion only at a particular stereoscopic disparity. The second such system uses two receivers to simultaneously compute a linear and polar topographic mapping of the image plane, resulting in information about image translation, rotation, and expansion. These three-chip systems demonstrate the modularity and flexibility of the multi-chip neuromorphic approach.
In this paper, an efficient analysis method for global head motion, and a synthesis method of 3-D face scenes using model-based image coding techniques on the Internet browser are presented. By applying the techniques...
详细信息
In this paper, an efficient analysis method for global head motion, and a synthesis method of 3-D face scenes using model-based image coding techniques on the Internet browser are presented. By applying the techniques of model-based image coding in facial image sequence, human face or anthropomorphic agents can be synthesized with real facial motion on the internet browser using VRML and/or JAVA. While typical model-based image coding produces synthetic face scenes in 2-D space at the receiver terminals, the proposed scheme can produce synthetic face scenes in 3-D space interactively supporting some visual standard. This enables users to synthesize 3-D face scenes at any terminals. In this sense, it improves the portability of synthesis in model-based image coding. To speed up the computation in the analysis process, a new method that generates referential frame dynamically is proposed. To produce a 3-D face model which is suitable for the purpose of application, an idea of automatic face modeling using different 3-D face models is presented. Experimental results show the usefulness of applying the techniques of model-based image coding to the analysis of facial motion and synthesis of 3-D face scene on the internet browser.
The proceedings contains 52 papers from the conference on visual communications and image processing 98. The topics discussed include: edge-assisted upper-bands coding techniques;robust embedded zerotree wavelet codin...
详细信息
The proceedings contains 52 papers from the conference on visual communications and image processing 98. The topics discussed include: edge-assisted upper-bands coding techniques;robust embedded zerotree wavelet coding algorithm;motion estimation and compensation based on region-constrained warping prediction;new algorithm for motion estimation on interlaced video;performace evaluation of the MPEG-4 visual coding standard and standard-based software-only video conferencing codec on ultra SPARC.
Super-resolution enhancement algorithms are used to estimate a high-resolution video still (HRVS) from several low-resolution frames, provided that objects within the digital image sequence move with subpixel incremen...
详细信息
Super-resolution enhancement algorithms are used to estimate a high-resolution video still (HRVS) from several low-resolution frames, provided that objects within the digital image sequence move with subpixel increments, A Bayesian multiframe enhancement algorithm is presented to compute an HRVS using the spatial information present within each frame as well as the temporal information present due to object motion between frames, However, the required subpixel-resolution motion vectors must be estimated from low-resolution and noisy video frames, resulting in an inaccurate motion held which can adversely impact the quality of the enhanced image. Several subpixel motion estimation techniques are incorporated into the Bayesian multiframe enhancement algorithm to determine their efficacy in the presence of global data transformations between frames (i.e., camera pan, rotation, tilt, and zoom) and independent object motion. visual and quantitative comparisons of the resulting high-resolution video stills computed from two video frames and the corresponding estimated motion fields show that the eight-parameter projective motion model is appropriate for global scene changes, while block matching and Horn-Schunck optical flow estimation each have their own advantages and disadvantages when used to estimate independent object motion. (C) 1998 Academic Press.
We propose a novel method of arbitrarily focused image generation using multiple differently focused images. First, we describe our previously proposed select and merge method for all focused image acquisition. We can...
详细信息
We propose a novel method of arbitrarily focused image generation using multiple differently focused images. First, we describe our previously proposed select and merge method for all focused image acquisition. We can get good results by using this method but it is not easy to extend this method for generating arbitrarily focused images. Then, based on the assumption that depth of a scene changes stepwise, we derive a formula for reconstruction between the desired arbitrarily focused image and multiple acquired images;we can reconstruct the arbitrarily focused image by iterative use of the formula. We also introduce coarse-to-fine estimation of point spread functions (PSFs) of the acquired images. We reconstruct arbitrarily focused images for a natural scene. In other words, we simulate virtual cameras and generate images focused on arbitrary depths. (C) 1998 SPIE and IS&T. [S1017-9909(98)02201-6].
Model-based image coding is a well-known solution for image communication at very low bit-rate. But very complex techniques and large amount of computation are involved in these systems. It is especially difficult to ...
详细信息
ISBN:
(纸本)0819431249
Model-based image coding is a well-known solution for image communication at very low bit-rate. But very complex techniques and large amount of computation are involved in these systems. It is especially difficult to automatically extract Facial Definition Parameters (FDPs) and Facial Animation Parameters (FAPs), which are defined in MPEG-4, from 2D image to represent 3D moving objects. In this paper, an algorithm using intra- and inter-frame information to estimate feature parameters is proposed. It utilizes spatial information (edge information) as well as temporal difference between successive frames. The combination using of 2 kinds of information makes the system more robust. Physiological symmetry and proportion is another kind of knowledge used here to make the system to less computational intenseness.
The proceedings contains 53 papers from the conference on SPIE: visual Communication and imageprocessing'98. The topics discussed include: image segmentation;motion estimation;video coding;videophones;layered cod...
详细信息
The proceedings contains 53 papers from the conference on SPIE: visual Communication and imageprocessing'98. The topics discussed include: image segmentation;motion estimation;video coding;videophones;layered coding systems;low bit-rate encoding;motion compensation;halftone images;multistandard compression;linear-phase filter banks;motion discontinuity patterns;image compression;image quality;video databases;multiresolution analysis;loseless image compressors;vector quantization and multipass color printers.
In this paper, we propose a novel post-processing techinque to reduce blocking artifacts for block-based image coding schemes. Our approach focuses mainly on the reconstruction of the surface continuity, including the...
详细信息
In this paper, we propose a novel post-processing techinque to reduce blocking artifacts for block-based image coding schemes. Our approach focuses mainly on the reconstruction of the surface continuity, including the continuity of edges and textures in the image objects. To do this, we introduce a set of continuous functions suitable for characterizing edges and, by a linear combination of these functions, estimate the original artifact-free image. We propose the Projections onto the OverComplete Basis (POCB) algorithm to find linear coefficients satisfying a constraint which controls the difference between the blocky and estimated images. The proposed and conventional techniques are tested on various images compressed by the JPEG standard and vector quantization. The simulation results show that the proposed technique yields better results both objectively and subjectively.
This paper considers the decomposition of video sequences into so-called video object planes, which is required for the content-based representation of visual objects in MPEG-4. A new segmentation algorithm is describ...
详细信息
This paper considers the decomposition of video sequences into so-called video object planes, which is required for the content-based representation of visual objects in MPEG-4. A new segmentation algorithm is described that identifies physical objects using a morphological motion filter. For the object of interest, a two-dimensional binary model is derived based on areas in the scene that are moving differently from the background. This model is updated each frame to pick up possible rotation and changes in shape of the object. Temporal correspondence is established by a Hausdorff object tracker: The binary model sequences guide the actual video object plane extraction. Since the model points correspond to edges detected by the Canny operator; a high object boundary location is achieved. Experimental results demonstrate that our proposed technique can successfully extract the physical object from video sequences.
暂无评论