The proceedings contain 111 papers. The topics discussed include: computervision for homeland security: a perspective on its promise and pitfalls;human motion recognition based on statistical shape analysis;clusterin...
详细信息
ISBN:
(纸本)0780393856
The proceedings contain 111 papers. The topics discussed include: computervision for homeland security: a perspective on its promise and pitfalls;human motion recognition based on statistical shape analysis;clustering of human actions using invariant body shape descriptor and dynamic time warping;relevance learning for spectral clustering with applications on image segmentation and video behaviour profiling;using local and global object's information to track vehicles in urban scenes;denoising image sequences does not require motion estimation;tracking motion objects in infrared videos;multiple object tracking using elastic matching;human tracking in real-time video for varying illumination;a real time surveillance system for metropolitan railways;human posture recognition using active contours and radial basis function neural network;a novel method for graffiti detection using change detection algorithm;and a novel scheme of face verification using active appearance models.
Currently, iris identification systems are not easy to use since they need a strict cooperation of the user during the snapshot acquisition process. Several acquisitions are generally needed to obtain a workable image...
详细信息
ISBN:
(纸本)354029032X
Currently, iris identification systems are not easy to use since they need a strict cooperation of the user during the snapshot acquisition process. Several acquisitions are generally needed to obtain a workable image of the iris for recognition purpose. To make the system more flexible and open to large public applications, we propose to work on the entire sequence acquired by a camera during the enrolment. Hence the recognition step can be applied on a selected number of the "best workable images" of the iris within the sequence. In this context, the aim of the paper is to present a method for pupil tracking based on a dynamic Gaussian Mixture Model (GMM) together with Kalman prediction of the pupil position along the sequence. The method has been experimented on a real video sequence captured by a near Infra-Red (IR) sensitive camera and has shown its effectiveness in nearly real time computing.
Hiding data in curves can be achieved by parameterizing a curve using the B-spline model and adding spread spectrum sequences in B-spline control points. In this paper, we propose an iterative alignment-minimization a...
详细信息
In the first part we discuss the problem of recovering the calibration of a network of pan-tilt-zoom cameras. The intrinsic parameters of each camera over its full range of zoom settings are estimated through a two st...
详细信息
In the first part we discuss the problem of recovering the calibration of a network of pan-tilt-zoom cameras. The intrinsic parameters of each camera over its full range of zoom settings are estimated through a two step procedure. We first determine the intrinsic parameters at the camera's lowest zoom setting very accurately by capturing an extended panorama. The camera intrinsics are then determined at discrete steps in a monotonically increasing zoom sequence that spans the full zoom range of the cameras. Both steps are fully automatic and do not assume any knowledge of the scene structure. We validate our approach by calibrating two different types of pan tilt zoom cameras placed in an outdoor environment. We also show the high-resolution panoramic mosaics built from the images captured during this process. The second section deals with the calibration of omnidirectional cameras. A broad class of both central and non-central cameras, such as fish-eye and catadioptric cameras, can be reduced to 1D radial cameras under the assumption of known center of radial distortion. We study the multi-view geometry of 1D radial cameras. For cameras in general configuration, we introduce a quadrifocal tensor. From this tensor a metric reconstruction of the 1D cameras as well as the observed features can be obtained. In a second phase this reconstruction can then be used as a calibration object to estimate a non-parametric non-central model for the cameras. We study some degenerate cases, including pure rotation. In the case of a purely rotating camera we obtain a trifocal tensor. This allows us to obtain a metric reconstruction of the plane at infinity. Next, we use the plane at infinity as a calibration device to non-parametrically estimate the radial distortion.
Fingerprint direction field estimation is a key step among most of fingerprint enhancement systems, and so far, in most of systems it heavily relies on local ridge gradient information. However, this method is sensiti...
详细信息
Recently, Biometrics-based personal identification is regarded as an effective method of person's identity with recognition automation and high performance. In this paper, the palmprint recognition method based on...
详细信息
In this paper, we present a novel approach to solving the supervised dimensionality reduction problem by encoding an image object as a general tensor of 2nd or higher order. First, we propose a discriminant tensor cri...
详细信息
ISBN:
(纸本)0769523722
In this paper, we present a novel approach to solving the supervised dimensionality reduction problem by encoding an image object as a general tensor of 2nd or higher order. First, we propose a discriminant tensor criterion (DTC), whereby multiple interrelated lower-dimensional discriminative subspaces are derived for feature selection. Then, a novel approach called k-mode cluster-based discriminant analysis is presented to iteratively learn these subspaces by unfolding the tensor along different tensor dimensions. We call this algorithm discriminant analysis with tensor representation (DATER), which has the following characteristics: 1) multiple interrelated subspaces can collaborate to discriminate different classes; 2) for classification problems involving higher-order tensors, the DATER algorithm can avoid the curse of dimensionality dilemma and overcome the small sample size problem; and 3) the computational cost in the learning stage is reduced to a large extent owing to the reduced data dimensions in generalized eigenvalue decomposition. We provide extensive experiments by encoding face images as 2nd or 3rd order tensors to demonstrate that the proposed DATER algorithm based on higher order tensors has the potential to outperform the traditional subspace learning algorithms, especially in the small sample size cases.
This paper describes a method for establishing stereo correspondences using diffuse polarization information. To do this we exploit the spontaneous polarization of light caused by reflection from dielectric surfaces t...
详细信息
This paper proposes the hands-on implementations of the "Formosa middle size soccer robot", which is developed based on the Intelligent vision fusion sensing architecture. The proposed autonomous soccer robo...
详细信息
This paper presents a method for estimating the orientation of planar text surfaces using the edge-direction distribution (EDD) extracted from the image as input to a neural network. We consider canonical rotations an...
详细信息
ISBN:
(纸本)3540290699
This paper presents a method for estimating the orientation of planar text surfaces using the edge-direction distribution (EDD) extracted from the image as input to a neural network. We consider canonical rotations and we developed a mathematical model to analyze how the EDD changes with the rotation angle under orthographic projection. In order to improve performance and solve quadrant ambiguities, we adopt an active-vision approach by considering a pair of images (instead of only one) with a slight rotation difference between them. We then use the difference between the two EDDs as input to the network. Starting with camera-captured front-parallel images with text, we apply single-axis synthetic rotations to verify the validity of the EDD transform model and to train and test the network. The presented text-pose estimation method is intended to provide navigation guidance to a mobile robot capable of reading the textual content encountered in its environment.
暂无评论