A web-based system for retrieving imaged documents from a digital library is described in this paper. First, some image preprocessing is performed off-line on the underlying imaged document to extract its word objects...
详细信息
computer animated agents and robots bring a social dimension to human computer interaction and force us to think in new ways about how computers could be used in daily life. Face to face communication is a real-time p...
详细信息
Some people's faces are easier to recognize than others, but it is not obvious what subject-specific factors make individual faces easy or difficult to recognize. This study considers 11 factors that might make re...
详细信息
ISBN:
(纸本)0769519008
Some people's faces are easier to recognize than others, but it is not obvious what subject-specific factors make individual faces easy or difficult to recognize. This study considers 11 factors that might make recognition easy or difficult for 1,072 human subjects in the FERET dataset. The specific factors are: race (white, Asian, African-American, or other), gender, age (young or old), glasses (present or absent), facial hair (present or absent), bangs (present or absent), mouth (closed or other), eyes (open or other), complexion (clear or other), makeup (present or absent), and expression (neutral or other). An ANOVA is used to determine the relationship between these subject covariates and the distance between pairs of images of the same subject in a standard Eigenfaces subspace. Some results are not terribly surprising. For example, the distance between pairs of images of the same subject increases for people who change their appearance, e.g., open and close their eyes, open and close their mouth or change expression. Thus changing appearance makes recognition harder. Other findings are surprising. Distance between pairs of images for subjects decreases for people who consistently wear glasses, so wearing glasses makes subjects more recognizable. Pairwise distance also decreases for people who are either Asian or African-American rather than white. A possible shortcoming of our analysis is that minority classifications such as African-Americans and wearers-of-glasses are underrepresented in training. Followup experiments with balanced training addresses this concern and corroborates the original findings. Another possible shortcoming of this analysis is the novel use of pairwise distance between images of a single person as the predictor of recognition difficulty. A separate experiment confirms that larger distances between pairs of subject images implies a larger recognition rank for that same pair of images, thus confirming that the subject is harder to recog
We describe a statistical model for natural images that is built upon a multi-scale wavelet decomposition. The model consists of first- and higher-order statistics that capture certain statistical regularities of natu...
详细信息
We describe a statistical model for natural images that is built upon a multi-scale wavelet decomposition. The model consists of first- and higher-order statistics that capture certain statistical regularities of natural images. We show how this model can be useful in several digital forensic applications, specifically in detecting various types of digital tampering.
Here we consider optical triangulation scanning as a means of creating permanent architectural archives in the form of accurate ground plans and other orthographic views. We present plan drawings created with laser sc...
详细信息
Here we consider optical triangulation scanning as a means of creating permanent architectural archives in the form of accurate ground plans and other orthographic views. We present plan drawings created with laser scan data and use these documents to make comparisons with existing documents. Finally, we present a new technique for decreasing the laser scanning field time required to create plans and other views.
A technique is presented to combine n data points, each available with point-dependent uncertainty, when only a subset of these points come from N \ll n sources, where N is unknown. We detect the significant modes of ...
详细信息
A technique is presented to combine n data points, each available with point-dependent uncertainty, when only a subset of these points come from N \ll n sources, where N is unknown. We detect the significant modes of the underlying multivariate probability distribution using a generalization of the nonparametric mean shift procedure. The number of detected modes automatically defiens N, while the appurtenance of a point to the basin of attraction of a mode provides the fusion rule.
We formulate multiple-view geometry for omni-directional and panorama-camera systems. The mathematical formu-lations enable us to derive the geometrical and algebraic constraints for multiple panorama-camera configura...
详细信息
We formulate multiple-view geometry for omni-directional and panorama-camera systems. The mathematical formu-lations enable us to derive the geometrical and algebraic constraints for multiple panorama-camera configurations. The constraints permit us to reconstruct three-dimensional objects for a large feasible region.
We propose a new Hidden Markov Model with time-dependent states. Estimation of this model is shown to be as fast and easy as the estimation of regular HMMs. We demonstrate the usefulness and feasibility of such time-d...
详细信息
We propose a new Hidden Markov Model with time-dependent states. Estimation of this model is shown to be as fast and easy as the estimation of regular HMMs. We demonstrate the usefulness and feasibility of such time-dependent HMMs with an application in which illegitimate access to personnel-only rooms in airports etc. can be distinguished from access by legitimate personnel, based on differences in the time of access or differences in the motion trajectories.
In this paper, we describe the use of Genetic Programming (GP) techniques to learn a visual feature detection for a mobile robot navigation task. We provide experimental results across a number of different environmen...
详细信息
In this paper, we describe the use of Genetic Programming (GP) techniques to learn a visual feature detection for a mobile robot navigation task. We provide experimental results across a number of different environments, each with different characteristics, and draw conclusions about the performance of the learned feature detector. We also explore the utility of seeding the initial population with a previously evolved individual, and discuss the performance of the resulting individuals.
暂无评论