ISBN: (print) 0769512720
Super-resolution is a technique that produces higher resolution images from low resolution images (LRIs). In practice, the improvement in resolution is limited. The aim of this paper is to address the question of whether fundamental limits exist for super-resolution. Specifically, this paper provides explicit limits for a major class of super-resolution algorithms, called reconstruction-based algorithms, under both real and synthetic conditions. Our analysis is based on perturbation theory of linear systems. We also show that a sufficient number of LRIs can be determined to reach the limit. Both real and synthetic experiments are carried out to verify our analysis.
This paper presents a method for computing the visual hull that is based on two novel representations: the rim mesh, which describes the connectivity of contour generators on the object surface; and the visual hull mesh, which describes the exact structure of the surface of the solid formed by intersecting a finite number of visual cones. We describe the topological features of these meshes and show how they can be identified in the image using epipolar constraints. These constraints are used to derive an image-based practical reconstruction algorithm that works with weakly calibrated cameras. Experiments on synthetic and real data validate the proposed approach.
We describe a scene text extraction system for handheld devices that provides enhanced information perception services to the user. It uses a color camera attached to a personal digital assistant as an input device to capture scene images from the real world, and it employs image enhancement and segmentation methods to extract written information from the scene, convert it to text, and show it to the user, so that the user can see both the real world and the information together. We implemented a prototype application: automatic sign/text language translation for foreign travelers, who can use the system whenever they want to see text or signs in their own language when these are originally written in a foreign language in the scene.
TV conference systems have come into wide use recently. Participants at each site take part in a TV conference through the video image on a screen and the voice from the partner site. In general, a fixed video camera shoots the scene at each site. The video image taken by a fixed camera, however, lacks variation. In addition, a fixed camera does not efficiently capture the areas to which the participants at the partner site pay attention. One candidate for overcoming these defects is to have computer-controlled cameras shoot the scene. In this paper we propose an algorithm for taking the best shot with computer-controlled cameras. The shot is decided mainly by estimating the area of the image with a high degree of attention, using not only visual but also auditory information. Using an experimental system, we confirm the effectiveness of our approach through a questionnaire answered by the participants of TV conferences.
Color histograms are a central feature in many image retrieval systems. Indeed, they are part of the MPEG-7 standard. But histograms suffer from the "curse of dimensionality", in which the number of bins increases exponentially with the number of dimensions. There is therefore a need for methods for simplifying histograms. This paper presents a new method for simplifying histograms based on a cascade of increasing-scale graph morphology filters. The system we choose preserves scale space causality and so preserves the modes of the histogram. The method is quick to compute and is therefore a practically useful feature. We present results using the MPEG-7 Common Color Dataset that show that these new compressed features have a retrieval performance that is equivalent to full histograms.
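To make the growth in bin count concrete, the following is a minimal Python sketch of a joint color histogram. It does not implement the graph morphology filters described in the abstract; the function names, the sparse dictionary storage, and the default of 4 bins per channel are illustrative assumptions.

```python
def histogram_bin_count(bins_per_axis: int, dimensions: int) -> int:
    """Number of cells in a joint histogram: grows exponentially with dimensions."""
    return bins_per_axis ** dimensions


def color_histogram(pixels, bins_per_axis: int = 4, levels: int = 256):
    """Joint RGB histogram, stored sparsely as {(r_bin, g_bin, b_bin): count}.

    Assumes 8-bit channels and that `levels` is divisible by `bins_per_axis`.
    """
    width = levels // bins_per_axis  # intensity values covered by one bin
    counts = {}
    for r, g, b in pixels:
        key = (r // width, g // width, b // width)
        counts[key] = counts.get(key, 0) + 1
    return counts


if __name__ == "__main__":
    # 8 bins per channel already gives 512 cells for RGB.
    print(histogram_bin_count(8, 3))
    print(color_histogram([(0, 0, 0), (10, 10, 10), (255, 255, 255)]))
```

Because most cells of a high-dimensional histogram are empty for any single image, the sparse dictionary keeps the sketch small, but the number of possible cells still grows as `bins_per_axis ** dimensions`.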
In this paper, we propose a novel method, called local non-negative matrix factorization (LNMF), for learning spatially localized, parts-based subspace representation of visual patterns. An objective function is defined to impose a localization constraint, in addition to the non-negativity constraint in the standard NMF. This gives a set of bases which not only allows a non-subtractive (part-based) representation of images but also manifests localized features. An algorithm is presented for the learning of such basis components. Experimental results are presented to compare LNMF with the NMF and PCA methods for face representation and recognition, demonstrating the advantages of LNMF.
Eye movements are an important aspect of human visual behavior. The temporal and space-variant nature of sampling a visual scene requires frequent attentional gaze shifts, saccades, to fixate onto different parts of an image. Experimental evidence suggests that fixations are often directed towards the most informative regions in the visual scene. We develop a model and its simulation that can select such regions based on prior knowledge of similar scenes. Having representations of scene categories as a probabilistic combination of hypothetical objects, i.e., prototypical regions with certain properties, it is possible to assess the likely contribution of each image region to the successive recognition process. Using conditional probabilities for each region given the scene category, the model can then predict its informative value and initiate a sequential spatial information-gathering algorithm, analogous to an eye movement saccade to a new fixation. This algorithm establishes the most likely scene category for a given image.
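The sequential information-gathering idea can be illustrated with a small Python sketch. This is not the authors' model: it assumes discrete observations per region, a hypothetical table `likelihood[category][region][observation]`, and it chooses the next fixation by minimizing the expected entropy of the category posterior, which is one common way to formalize "most informative region".

```python
import math


def entropy(dist):
    """Shannon entropy (bits) of a {category: probability} distribution."""
    return -sum(p * math.log2(p) for p in dist.values() if p > 0)


def update(prior, likelihood, region, obs):
    """Bayes update of the category distribution after observing `obs` at `region`."""
    post = {c: prior[c] * likelihood[c][region].get(obs, 0.0) for c in prior}
    z = sum(post.values())
    # If the observation is impossible under every category, keep the prior.
    return {c: v / z for c, v in post.items()} if z > 0 else dict(prior)


def expected_entropy(prior, likelihood, region):
    """Expected posterior entropy if the model were to fixate `region` next."""
    observations = set()
    for c in prior:
        observations.update(likelihood[c][region])
    total = 0.0
    for o in observations:
        p_o = sum(prior[c] * likelihood[c][region].get(o, 0.0) for c in prior)
        if p_o > 0:
            total += p_o * entropy(update(prior, likelihood, region, o))
    return total


def next_fixation(prior, likelihood, regions):
    """Pick the region whose observation is expected to reduce uncertainty most."""
    return min(regions, key=lambda r: expected_entropy(prior, likelihood, r))


if __name__ == "__main__":
    # Hypothetical two-category example: the top region (sky) is uninformative.
    likelihood = {
        "beach": {"top": {"sky": 0.9, "other": 0.1}, "bottom": {"sand": 0.9, "road": 0.1}},
        "street": {"top": {"sky": 0.9, "other": 0.1}, "bottom": {"sand": 0.1, "road": 0.9}},
    }
    prior = {"beach": 0.5, "street": 0.5}
    region = next_fixation(prior, likelihood, ["top", "bottom"])
    print(region, update(prior, likelihood, region, "sand"))
```

Repeating `next_fixation` and `update` over the remaining regions gives a sequence of simulated fixations, and the category with the highest final probability is the model's estimate.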
We study the problem of aligning two 3D line reconstructions expressed in Plücker line coordinates. We introduce the 6×6 3D line motion matrix that acts on Plücker coordinates in projective, affine or Euclidean space. We characterize its algebraic properties and its relation to the usual 4×4 point motion matrix, and propose various methods for estimating 3D motion from line correspondences, based on image-related and 3D cost functions. We assess the quality of the different estimation methods using simulated data and real images.
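As an illustration of how a 6×6 matrix can act on Plücker coordinates, here is a minimal Python sketch restricted to the Euclidean case. It assumes the coordinate ordering (direction, moment), which may differ from the convention used in the paper, and a rigid motion x' = Rx + t; all function names are hypothetical. Under these assumptions the direction d maps to Rd and the moment m maps to Rm + t × (Rd).

```python
def cross(a, b):
    """Cross product of two 3-vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def matvec(m, v):
    """Matrix-vector product for list-of-lists matrices."""
    return [sum(x * y for x, y in zip(row, v)) for row in m]


def matmul(a, b):
    """Matrix-matrix product for list-of-lists matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def skew(t):
    """Skew-symmetric matrix [t]x such that [t]x v = t x v."""
    return [[0, -t[2], t[1]], [t[2], 0, -t[0]], [-t[1], t[0], 0]]


def plucker_from_points(p, q):
    """Plücker coordinates (direction, moment) of the line through p and q."""
    d = [qi - pi for pi, qi in zip(p, q)]
    return d + cross(p, q)  # moment p x d equals p x q


def line_motion_matrix(R, t):
    """6x6 matrix [[R, 0], [[t]x R, R]] for the assumed (direction, moment) order."""
    tR = matmul(skew(t), R)
    top = [list(R[i]) + [0, 0, 0] for i in range(3)]
    bottom = [list(tR[i]) + list(R[i]) for i in range(3)]
    return top + bottom


if __name__ == "__main__":
    R = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]  # 90 degree rotation about z
    t = [1, 2, 3]
    p, q = [1, 0, 0], [0, 1, 1]
    moved = matvec(line_motion_matrix(R, t), plucker_from_points(p, q))
    print(moved)
```

Moving the two points first and then forming the line gives the same six coordinates as applying the 6×6 matrix to the original line, which is a quick way to check the block structure.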
The article addresses the problem of model-based image segmentation by fitting deformable models to the image data. From uncertain a priori knowledge of the model parameters, an initial probability distribution of the model edge in the image is obtained. From the vicinity of the surmised edge, local statistics are learned for both sides of the edge. These local statistics provide locally adapted criteria to distinguish the two sides of the edge, even in the presence of spatially changing properties such as texture, shading, or color. Based on the local statistics, the model parameters are iteratively refined using a MAP estimation. Experiments with RGB images show that the method is capable of achieving high subpixel accuracy and robustness even in the presence of texture, shading, clutter, and partial occlusion.
Pulsed-field Gel Electrophoresis (PFGE) is an important tool in genomic analysis. The result of PFGE is presented in an image. Each image contains several lanes, and each lane consists of bands. Two lanes are considered identical if the relative positions of their bands are the same. We present a method that uses a computer to extract the lanes and compare them in electrophoresis images. The presented method consists of two major steps. The first step is image processing and lane extraction. The second step is to convert a lane into a chain code representation. The lane comparison is carried out by calculating the longest common subsequence (LCS) between lanes. We define the distance between lanes in terms of the LCS and the lengths of the two lanes. Two lanes with a smaller distance tend to have similar patterns. This method eliminates very different patterns, helping biologists reduce the number of lanes that need to be compared.
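The lane comparison step can be sketched in Python as follows. The abstract does not give the exact distance formula, so the normalization used here, 1 - 2·LCS / (len(a) + len(b)), is an assumption chosen only because it depends on the LCS and both lengths; the function names and the string encoding of chain codes are likewise illustrative.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence, by row-wise dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            if x == y:
                cur.append(prev[j - 1] + 1)
            else:
                cur.append(max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def lane_distance(a, b):
    """Assumed distance in [0, 1]: 0 for identical codes, 1 for nothing in common."""
    if not a and not b:
        return 0.0
    return 1.0 - 2.0 * lcs_length(a, b) / (len(a) + len(b))


if __name__ == "__main__":
    # Hypothetical chain codes: each character encodes one step along a lane.
    lane_a = "0011010011"
    lane_b = "0011011011"
    print(lcs_length(lane_a, lane_b), lane_distance(lane_a, lane_b))
```

Pairs of lanes whose distance exceeds a chosen threshold could then be discarded before manual comparison; the threshold itself would have to be tuned on real gel images.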