Detecting interesting regions in pictures has become important for reducing the computational complexity of time-consuming processes such as object recognition. In this paper we assume that figure-ground segmentation of a picture is a necessary first step towards evaluating the attractiveness of regions. Vision research has found that the V4 cortex in the visual system contributes to figure-ground selectivity. Using this finding, we propose a method for extracting figure regions from a picture based on the color and texture contrasts of regions. The method classifies regions previously segmented from a picture as figure regions or ground regions using a discrimination function defined with contrast parameters. The contrast parameters include the color difference and the texture difference between a region and its surroundings, as well as the focus of the region. The discrimination function is constructed from the characteristics of the extracted figures and ground obtained in our experiments. In further experiments evaluating the accuracy of the proposed method, it extracts the figure regions selected by human subjects with 80% accuracy.
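The abstract does not give the form of the discrimination function, so the following is only a minimal sketch of the general idea: a weighted sum of the three contrast parameters compared against a threshold. The linear form, the weights, the threshold, and all names are assumptions for illustration, not the authors' function.

```python
from dataclasses import dataclass


@dataclass
class RegionContrast:
    """Contrast parameters of one segmented region, each assumed normalized to [0, 1]."""
    color_diff: float    # color difference between the region and its surroundings
    texture_diff: float  # texture difference between the region and its surroundings
    focus: float         # how sharply focused the region is


# Hypothetical weights and threshold; the paper derives its function from experiments.
WEIGHTS = (0.5, 0.3, 0.2)
THRESHOLD = 0.4


def discriminate(region: RegionContrast) -> str:
    """Label a region 'figure' or 'ground' using an assumed linear discriminant."""
    score = (WEIGHTS[0] * region.color_diff
             + WEIGHTS[1] * region.texture_diff
             + WEIGHTS[2] * region.focus)
    return "figure" if score > THRESHOLD else "ground"
```

A high-contrast, well-focused region such as `RegionContrast(0.9, 0.8, 0.7)` scores above the threshold and is labeled a figure, while a uniformly low-contrast region falls to ground.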
Digital video databases in compressed form are becoming widely available. In applications such as video browsing and picture-in-picture, the video must be down-sampled before transmission to achieve a lower bitrate. The conventional approach to downscaling a compressed video sequence is to decompress it at the video server, perform the down-sampling in the pixel domain, and then recompress it for efficient delivery. This process is computationally intensive because of the motion estimation required during recompression. In the alternative compressed-domain approach, the motion vectors of the downscaled video sequence are computed directly from the motion vectors of the original full-size stream. In this paper we propose a compressed-domain technique that generates a better estimate of the downscaled motion vectors. Simulations suggest that the proposed method outperforms current compressed-domain techniques by up to 1 dB.
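To illustrate the compressed-domain idea, the sketch below shows a common baseline rather than the technique proposed in the paper: when the frame is downscaled by two in each dimension, four neighboring macroblocks collapse into one, and its motion vector is taken as the average of the four original vectors, scaled by the same factor. The function name and the averaging rule are assumptions for illustration.

```python
def downscale_motion_vector(mvs, factor=2):
    """Estimate one downscaled motion vector from the original block vectors.

    mvs:    list of (x, y) motion vectors of the full-size blocks that map
            onto a single block after downscaling (four for factor 2).
    factor: spatial downscaling factor applied to the frame.
    """
    n = len(mvs)
    avg_x = sum(x for x, _ in mvs) / n
    avg_y = sum(y for _, y in mvs) / n
    # Displacements shrink by the same factor as the frame dimensions.
    return (avg_x / factor, avg_y / factor)


# Four neighboring macroblock vectors from the full-size stream.
estimate = downscale_motion_vector([(4, 2), (6, 2), (4, 4), (6, 4)])
```

Plain averaging works well when the four vectors agree and poorly when they diverge, which is the kind of case a better estimator has to handle.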
We describe a multi-chip CMOS VLSI visual motion processing system which combines analog circuitry with an asynchronous digital interchip communications protocol to allow more complex motion processing than is possible with all the circuitry in the focal plane. The two basic VLSI building blocks are a sender chip which incorporates a 2D imager array and transmits the position of moving spatial edges, and a receiver chip which computes a 2D optical flow vector field from the edge information. The elementary two-chip motion processing system consisting of a single sender and receiver is first characterized. Subsequently, two three-chip motion processing systems are described. The first such system uses two sender chips to compute the presence of motion only at a particular stereoscopic disparity. The second such system uses two receivers to simultaneously compute a linear and polar topographic mapping of the image plane, resulting in information about image translation, rotation, and expansion. These three-chip systems demonstrate the modularity and flexibility of the multi-chip neuromorphic approach.
This paper presents an efficient analysis method for global head motion and a synthesis method for 3-D face scenes that uses model-based image coding techniques in an Internet browser. By applying model-based image coding to facial image sequences, human faces or anthropomorphic agents can be synthesized with real facial motion in the Internet browser using VRML and/or Java. While typical model-based image coding produces synthetic face scenes in 2-D space at the receiver terminals, the proposed scheme can produce synthetic face scenes in 3-D space interactively, supporting some visual standard. This enables users to synthesize 3-D face scenes at any terminal, and in this sense improves the portability of synthesis in model-based image coding. To speed up computation in the analysis process, a new method that generates a reference frame dynamically is proposed. To produce a 3-D face model suited to the application, an approach to automatic face modeling using different 3-D face models is presented. Experimental results show the usefulness of applying model-based image coding techniques to the analysis of facial motion and the synthesis of 3-D face scenes in the Internet browser.
Super-resolution enhancement algorithms are used to estimate a high-resolution video still (HRVS) from several low-resolution frames, provided that objects within the digital image sequence move with subpixel increments. A Bayesian multiframe enhancement algorithm is presented to compute an HRVS using the spatial information present within each frame as well as the temporal information present due to object motion between frames. However, the required subpixel-resolution motion vectors must be estimated from low-resolution and noisy video frames, resulting in an inaccurate motion field which can adversely impact the quality of the enhanced image. Several subpixel motion estimation techniques are incorporated into the Bayesian multiframe enhancement algorithm to determine their efficacy in the presence of global data transformations between frames (i.e., camera pan, rotation, tilt, and zoom) and independent object motion. Visual and quantitative comparisons of the resulting high-resolution video stills computed from two video frames and the corresponding estimated motion fields show that the eight-parameter projective motion model is appropriate for global scene changes, while block matching and Horn-Schunck optical flow estimation each have their own advantages and disadvantages when used to estimate independent object motion. (C) 1998 Academic Press.
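Of the motion estimators compared above, block matching is the simplest to show in code. The following is a minimal integer-pixel sketch using an exhaustive search and the sum of absolute differences (SAD); the paper needs subpixel accuracy, which this sketch does not provide, and the function names and the (dy, dx) convention are assumptions for illustration.

```python
def sad(a, b, ay, ax, by, bx, size):
    """Sum of absolute differences between two size x size blocks."""
    return sum(abs(a[ay + i][ax + j] - b[by + i][bx + j])
               for i in range(size) for j in range(size))


def block_match(ref, cur, y, x, size, search):
    """Find the displacement (dy, dx) such that the block of `cur` at (y, x)
    best matches the block of `ref` at (y + dy, x + dx).

    ref, cur: 2-D lists of pixel intensities with equal dimensions.
    search:   maximum displacement tested in each direction.
    """
    h, w = len(ref), len(ref[0])
    best, best_cost = (0, 0), None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            ry, rx = y + dy, x + dx
            # Skip candidate blocks that fall outside the reference frame.
            if ry < 0 or rx < 0 or ry + size > h or rx + size > w:
                continue
            cost = sad(cur, ref, y, x, ry, rx, size)
            if best_cost is None or cost < best_cost:
                best_cost, best = cost, (dy, dx)
    return best
```

Each block gets its own vector, so the method can follow independently moving objects, but the result is limited to the search grid and can be misled by noise in flat regions.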
The proceedings contain 52 papers from the conference on Visual Communications and Image Processing '98. The topics discussed include: edge-assisted upper-bands coding techniques; robust embedded zerotree wavelet coding algorithm; motion estimation and compensation based on region-constrained warping prediction; new algorithm for motion estimation on interlaced video; performance evaluation of the MPEG-4 visual coding standard and standard-based software-only video conferencing codec on UltraSPARC.
We propose a novel method of arbitrarily focused image generation using multiple differently focused images. First, we describe our previously proposed select-and-merge method for all-focused image acquisition. This method gives good results, but it is not easy to extend to the generation of arbitrarily focused images. Then, based on the assumption that the depth of a scene changes stepwise, we derive a formula relating the desired arbitrarily focused image to the multiple acquired images; we can reconstruct the arbitrarily focused image by iterative use of the formula. We also introduce coarse-to-fine estimation of the point spread functions (PSFs) of the acquired images. We reconstruct arbitrarily focused images for a natural scene. In other words, we simulate virtual cameras and generate images focused on arbitrary depths. (C) 1998 SPIE and IS&T. [S1017-9909(98)02201-6].
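The abstract does not describe the select-and-merge method in detail. As a rough sketch of the general idea, the code below builds an all-focused image by picking, at each pixel, the value from whichever input image is locally sharpest. Using the absolute Laplacian as the sharpness measure, and all function names, are assumptions for illustration.

```python
def laplacian_abs(img, y, x):
    """Absolute 4-neighbor Laplacian at (y, x), with edge pixels replicated."""
    h, w = len(img), len(img[0])

    def px(i, j):
        return img[min(max(i, 0), h - 1)][min(max(j, 0), w - 1)]

    return abs(4 * px(y, x) - px(y - 1, x) - px(y + 1, x)
               - px(y, x - 1) - px(y, x + 1))


def select_and_merge(images):
    """Merge differently focused images by taking the sharpest source per pixel.

    images: list of 2-D lists of intensities, all with the same dimensions.
    Ties are resolved in favor of the earliest image in the list.
    """
    h, w = len(images[0]), len(images[0][0])
    merged = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sharpest = max(images, key=lambda im: laplacian_abs(im, y, x))
            merged[y][x] = sharpest[y][x]
    return merged
```

A per-pixel selection like this can only return pixels that are in focus somewhere in the stack, which is consistent with the abstract's remark that the approach does not extend easily to arbitrary focus settings.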
ISBN: (print) 0819431249
Model-based image coding is a well-known solution for image communication at very low bit rates, but these systems involve very complex techniques and a large amount of computation. It is especially difficult to automatically extract the Facial Definition Parameters (FDPs) and Facial Animation Parameters (FAPs) defined in MPEG-4 from 2D images to represent 3D moving objects. In this paper, an algorithm that uses intra- and inter-frame information to estimate feature parameters is proposed. It utilizes spatial information (edge information) as well as the temporal difference between successive frames. Combining the two kinds of information makes the system more robust. Physiological symmetry and proportion are another kind of knowledge used here to make the system less computationally intensive.
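One simple way to combine the two cues named above is to keep only pixels that both changed between frames and lie on an edge. The sketch below does this with a frame difference and a horizontal gradient; the specific operators, thresholds, and names are assumptions for illustration and are not taken from the paper.

```python
def moving_edge_mask(prev, cur, diff_thresh, edge_thresh):
    """Mark pixels that are both temporally changed and on a horizontal edge.

    prev, cur: 2-D lists of intensities for two successive frames.
    Returns a 2-D list of booleans; the last column is left False because
    the forward difference has no right-hand neighbor there.
    """
    h, w = len(cur), len(cur[0])
    mask = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w - 1):
            temporal = abs(cur[y][x] - prev[y][x])   # inter-frame cue
            edge = abs(cur[y][x + 1] - cur[y][x])    # intra-frame (spatial) cue
            mask[y][x] = temporal > diff_thresh and edge > edge_thresh
    return mask
```

Requiring both cues suppresses static background edges and changes in flat regions, which is one plausible reading of why combining the two kinds of information improves robustness.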
The proceedings contain 53 papers from the SPIE conference on Visual Communications and Image Processing '98. The topics discussed include: image segmentation; motion estimation; video coding; videophones; layered coding systems; low bit-rate encoding; motion compensation; halftone images; multistandard compression; linear-phase filter banks; motion discontinuity patterns; image compression; image quality; video databases; multiresolution analysis; lossless image compressors; vector quantization and multipass color printers.
In this paper, we propose a novel post-processing technique to reduce blocking artifacts for block-based image coding schemes. Our approach focuses mainly on the reconstruction of surface continuity, including the continuity of edges and textures in the image objects. To do this, we introduce a set of continuous functions suitable for characterizing edges and, by a linear combination of these functions, estimate the original artifact-free image. We propose the Projections onto the OverComplete Basis (POCB) algorithm to find linear coefficients satisfying a constraint which controls the difference between the blocky and estimated images. The proposed and conventional techniques are tested on various images compressed by the JPEG standard and by vector quantization. The simulation results show that the proposed technique yields better results both objectively and subjectively.