In this paper we introduce a generalization of Fisher-Rao's discriminant analysis and its application in a human-computer interaction scenario: a sensing chair. Our algorithm shows to be able to successfully estim...
详细信息
In this paper we introduce a generalization of Fisher-Rao's discriminant analysis and its application in a human-computer interaction scenario: a sensing chair. Our algorithm shows to be able to successfully estimate the underlying distributions of the pressure maps data of the sensing chair. Other linear discriminant techniques, such as LDA, had been found to be inadequate for the job; typically yielding inferior results than PCA. We compare our approach to several template-based approaches and show that the new discriminant function is comparable to the best approach classifier. This is important because generally each application tends to prefer a different algorithm. Fortunately, our new algorithm is usually the top one (or comparable to the top one). In this paper we will however restrict our study tothe classification of sitting postures.
We present a novel matchpoint acquisition method capable of producing accurate correspondences at subpixel precision. Given the known representation of the point to be matched, such as a projected fiducial in a struct...
详细信息
We present a novel matchpoint acquisition method capable of producing accurate correspondences at subpixel precision. Given the known representation of the point to be matched, such as a projected fiducial in a structured light system, the method estimates the fiducial location and its expected uncertainty. Improved matchpoint precision has application in a number of calibration tasks, and uncertainty estimates can be used to significantly improve overall calibration results. A simple parametric model captures the relationship between the known fiducial and its corresponding position, shape, and intensity on the image plane. For each match-point pair, these unknown model parameters are recovered using maximum likelihood estimation to determine a sub-pixel center for the fiducial. The uncertainty of the match-point center is estimated by performing forward error analysis on the expected image noise. Uncertainty estimates used in conjunction with the accurate matchpoints can improve calibration accuracy for multi-view systems.
This paper describes a vision based 3D real-virtual interaction which enables realistic avatar motion control, and in which the virtual camera is controlled by the body posture of the user. The human motion analysis m...
详细信息
This paper describes a vision based 3D real-virtual interaction which enables realistic avatar motion control, and in which the virtual camera is controlled by the body posture of the user. The human motion analysis method is implemented by blob tracking. A physically-constrained motion synthesis method is implemented to generate realistic motion from a limit number of blobs. We address our framework to utilize virtual scene contexts as a priori knowledge. In order to make the virtual scene more realistically beyond the limitation of the real world sensing, we use a framework to augment the reality in the virtual scene by simulating various events of the real world. Concretely, we suppose that a virtual environment can provide action information for the avatar. 3rd-person viewpoint control coupled with body postures is also realized to directly access virtual objects.
Traditional image based hand tracking uses a single Kalman filter to estimate and predict the hand state (position, velocity, and acceleration). However, this approach may fail in the case of large maneuvers and clutt...
详细信息
Traditional image based hand tracking uses a single Kalman filter to estimate and predict the hand state (position, velocity, and acceleration). However, this approach may fail in the case of large maneuvers and cluttered measurements. In this paper we propose to use the interacting multiple model (IMM) filter to catch a maneuver and the probabilistic data association (PDA) method to process noisy measurements and false alarms. A theoretical framework of image based hand tracking by IMM-PDA is set up. Experiment results from several video segments show that IMM-PDA can successfully track hand motions in a natural conversational environment.
This paper addresses the problem of tracking a moving person with a single, omnidirectional camera. An appearance-based tracking system is described which uses a self-acquired appearance model and a Kalman filter to e...
详细信息
This paper addresses the problem of tracking a moving person with a single, omnidirectional camera. An appearance-based tracking system is described which uses a self-acquired appearance model and a Kalman filter to estimate the position of the person. Features corresponding to "depth cues" are first extracted from the panoramic images, then an artificial neural network is trained to estimate the distance of the person from the camera. The estimates are combined using a discrete Kalman filter to track the position of the person over time. The ground truth information required for training the neural network and the experimental analysis was obtained from another vision system, which uses multiple webcams and triangulation to calculate the true position of the person. Experimental results show that the tracking system is accurate and reliable, and that its performance can be further improved by learning multiple, person-specific appearance models.
The use of prior information by learning from training data is used increasingly in image analysis and computervision. The high dimensionality of the parameter spaces and the complexity of the probability distributio...
详细信息
The use of prior information by learning from training data is used increasingly in image analysis and computervision. The high dimensionality of the parameter spaces and the complexity of the probability distributions however often makes the exact learning of priors an impossible problem, requiring an excessive amount of training data that is seldom realizable in practise. In this paper we propose a weaker form of prior estimation which tries to learn the boundaries of impossible events from examples. This is equivalent to estimating the support of the prior distribution or the manifold of possible events. The idea is to model the set of possible events by algebraic inequalities. Learning proceeds by selecting those inequalities that show a consistent sign when applied to the training data set. Every such inequality "carves" out a region of impossible events in the parameter space. The manifold of possible events estimated in this way will in general represent the qualitative properties of the events. We give example of this in the problems of restoration of handwritten characters and automatically tracked body locations
A new efficient matching algorithm dedicated to catadioptric sensors is proposed in this paper. The presented approach is designed to overcome the varying resolution of the mirror. The aim of this work is to provide a...
详细信息
A new efficient matching algorithm dedicated to catadioptric sensors is proposed in this paper. The presented approach is designed to overcome the varying resolution of the mirror. The aim of this work is to provide a matcher that gives reliable results similar to the ones obtained by classical operators on planar projection images. The matching is based on a dynamical size windows extraction, computed from the viewing angular aperture of the neighborhood around the points of interest. An angular scaling of this angular aperture provides a certain number of different neighborhood resolution around the same considered point. A combinatory cost method is introduced in order to determine the best match between the different angular neighborhood patches of two interest points. Results are presented on sparse matched corner points, that can be used to estimate the epipolar geometry of the scene in order to provide a dense 3D map of the observed environment.
Many nano-scale sensing techniques and image processing applications are characterized by noisy, or corrupted, image data. Unlike typical camera-based computervision imagery where noise can be modeled quite well as a...
详细信息
Many nano-scale sensing techniques and image processing applications are characterized by noisy, or corrupted, image data. Unlike typical camera-based computervision imagery where noise can be modeled quite well as additive, zero-mean white or Gaussian noise, nano-scale images suffer from low intensities and thus mainly from Poisson-like noise. In addition, noise distributions can not be considered symmetric due to the limited gray value range of sensors and resulting truncation of over- and underflows. In this paper we adapt B-spline channel smoothing to meet the requirements imposed by this noise characteristics. Like PDE-based diffusion schemes it has a close connection to robust statistics but, unlike diffusion schemes, it can handle non-zero-mean noises. In order to account for the multiplicative nature of Poisson noise the variance of the smoothing kernels applied to each channel is properly adapted. We demonstrate the properties of this technique on noisy nano-scale images of silicon structures and compare to anisotropic diffusion schemes that were specially adapted to this data.
Images of nano-structures are often noisy. On the other hand, in many settings there is quite a lot of model knowledge regarding the observed structures. This paper proposes a method for segmenting an image using a ge...
详细信息
Images of nano-structures are often noisy. On the other hand, in many settings there is quite a lot of model knowledge regarding the observed structures. This paper proposes a method for segmenting an image using a geometric model of the the observed structure. The resulting segmentation is guaranteed to be globally optimal, for an explicitly specified score function. This property provides a great deal of robustness to the algorithm. The algorithm presented explores a pre-defined space of segmentations using a branch-and-bound algorithm. It eliminates those parts of the space that are provably poor and explores in further detail the more promising parts of the space. An example of a segmentation that can be obtained in this way is a straight line segmentation of an image into 2 regions that minimizes the intensity variation within the regions. Results showing extraction of specific nano-structures are presented. A trivial variation on the algorithm can find a maximum a-posteriori probability estimate of the segmentation when there exists an a-priori distribution over the segmentations and the objective function is interpreted as the likelihood of the image given the segmentation.
We address the detection and analysis of gestural hand motion oscillation and symmetries in natural speech. First, we extract hand motion trajectory signals from video dataset. Second, we present our windowed correlat...
详细信息
We address the detection and analysis of gestural hand motion oscillation and symmetries in natural speech. First, we extract hand motion trajectory signals from video dataset. Second, we present our windowed correlation coefficient approach for gestural symmetry extraction. The signs and magnitudes of the correlation coefficients in the cardinal directions of the subject's torso characterize the symmetries. Third, we present a wavelet-based approach that extracts the time-frequency properties of hand motion oscillation. By analyzing these frequency properties durations of homogeneous gestural oscillations are detected. Finally, we apply our approach to a real video dataset captured in candid conversation. We relate the hand motion oscillatory gestures and symmetric gestures to the phases of speech and multimodal discourse analysis. We demonstrate the ability of our algorithm to extract gestural symmetries and oscillations and show how symmetric gestures and oscillatory gestures correspond to natural discourse structure.
暂无评论