More and more researchers focus their studies on multi-view activity recognition, because a fixed view could not provide enough information for recognition. In this paper, we use multi-view features to recognize six k...
详细信息
ISBN:
(纸本)9783540763857
More and more researchers focus their studies on multi-view activity recognition, because a fixed view could not provide enough information for recognition. In this paper, we use multi-view features to recognize six kinds of gymnastic activities. Firstly, shape-based features are extracted from two orthogonal cameras in the form of R transform. Then a multi-view approach based on Fused HMM is proposed to combine different features for similar gymnastic activity recognition. Compared with other activity models, our method achieves better performance even in the case of frame loss.
In this paper, we present an automated mosaicing technique for generating mosaics from microscope images. The proposed algorithm has two stages. The first stage concentrates on finding out the overlapped portions by p...
详细信息
ISBN:
(纸本)9781601320438
In this paper, we present an automated mosaicing technique for generating mosaics from microscope images. The proposed algorithm has two stages. The first stage concentrates on finding out the overlapped portions by pattern matching, while the second stage achieves the task of getting a single large image by eliminating the overlapped portion. The microscopic images captured are assumed to be orthonormal and distortion free.
This paper investigates a concept for modelling complex data based on sub-models. The task of building and choosing optimal models is addressed in a generic information theoretic fashion. We propose an algorithm based...
详细信息
This paper investigates a concept for modelling complex data based on sub-models. The task of building and choosing optimal models is addressed in a generic information theoretic fashion. We propose an algorithm based on minimum description length to find an optimal subdivision of the data into sub-parts, each adequate for linear modelling. This results in an overall more compact model configuration called a model clique and in better generalization behavior. The algorithm is applied to active appearance models, active shape models and eigenimages and is evaluated on 4 different data sets. Experiments indicate that model cliques exhibit better generalization behavior than single models and mimic intuitive sub-division of data. (c) 2006 patternrecognition Society. Published by Elsevier Ltd. All rights reserved.
The responses of neurons in the primary visual cortex (V I) to stimulus inside the receptive field (RF) can be markedly modulated by stimuli outside the classical receptive field. The modulation, relying on contextual...
详细信息
The responses of neurons in the primary visual cortex (V I) to stimulus inside the receptive field (RF) can be markedly modulated by stimuli outside the classical receptive field. The modulation, relying on contextual configurations, yields excitatory and inhibitory activities. The VI neurons compose a functional network by lateral interactions and accomplish specific visual tasks in a dynamic and flexible fashion. Well-organized structures and conspicuous image locations are more salient and thus can pop out perceptually from the background. The excitatory and inhibitory activities give different visual physiological interpretations to the two kinds of saliencies. A model of contour extraction, inspired by visual cortical mechanisms of perceptual grouping, is presented. We unify the dual processes of spatial facilitation and surround inhibition to extract salient contours from complex scenes, and in this way coherent spatial configurations and region boundaries could stand out from their surround. The proposed method can selectively retain object contours, and meanwhile can dramatically reduce non-meaningful elements resulting from a texture background. This work gives a clear understanding for the roles of the inhibition and facilitation in grouping, and provides a biologically motivated computational strategy for contour extraction in computervision. (c) 2007patternrecognition Society. Published by Elsevier Ltd. All rights reserved.
Smart camera is a camera that can not only see but also think and act. A smart camera is an embedded vision system which captures and processes image to extract application-specific information in real time. The brain...
详细信息
ISBN:
(纸本)9783540763857
Smart camera is a camera that can not only see but also think and act. A smart camera is an embedded vision system which captures and processes image to extract application-specific information in real time. The brain of a smart camera is a special processing module that performs application specific information processing. The design of a smart camera as an embedded system is challenging because video processing has insatiable demand for performance and power, but at the same time embedded systems place considerable constraints on the design. We present our work to develop GestureCam, an FPGA-based smart camera built from scratch that can recognize simple hand gestures. The first completed version of GestureCam has shown promising real-time performance and is being tested in several desktop HCI (Human computer Interface) applications.
In this paper we propose a probabilistic observation model for stereo vision systems which avoids explicit data association between observations and the map by marginalizing the observation likelihood over all the pos...
详细信息
ISBN:
(纸本)9783540728467
In this paper we propose a probabilistic observation model for stereo vision systems which avoids explicit data association between observations and the map by marginalizing the observation likelihood over all the possible associations. We define observations as sets of landmarks composed of their 3D locations, assumed to be normally distributed, and their SIFT descriptors. Our model has been integrated into a particle filter to test its performance in map building and global localization, as illustrated by experiments with a real robot.
In this study, we aim to determine if iris recognition accuracy might be improved by correcting for the refractive effects of the human eye when the optical axes of the eye and camera are misaligned. We undertake this...
详细信息
This paper introduces a new class of switching vector median filter. The proposed algorithm first uses four directional masks to analyze the color difference between the central pixel and its neighborhood pixels in th...
详细信息
ISBN:
(纸本)9780819469502
This paper introduces a new class of switching vector median filter. The proposed algorithm first uses four directional masks to analyze the color difference between the central pixel and its neighborhood pixels in the RGB color space and classify each color pixel into noisy pixel or noise-free one, and then employs the standard vector median filtering operations in the detected noisy locations to restore the corrupted pixels and leave the noise-free ones unchanged. The simulation results show that the proposed method excellently suppresses impulsive noise as well as preserving the image details well, and significantly outperforms the existing vector filtering solutions in terms of both the objective measures and the perceptual visual quality.
In many applications of computervision and patternrecognition which use graph-based knowledge representation, it is of great interest to be able to extract the K largest cliques in a graph, but most methods are gear...
详细信息
ISBN:
(纸本)9783540729020
In many applications of computervision and patternrecognition which use graph-based knowledge representation, it is of great interest to be able to extract the K largest cliques in a graph, but most methods are geared either towards extracting the single clique of maximum size, or enumerating all cliques, without following any particular order. In this paper we present a novel approach for partial clique enumeration, that is, the extraction of the K largest cliques of a graph. Our approach is based on a continuous formulation of the clique problem developed by Motzkin and Straus, and is able to avoid extracting the same clique multiple times. This is done by casting the problem into a game-theoretic framework and iteratively rendering unstable the solutions that have already been extracted.
Apply is a Domain-Specific Language for image processing and low-level computervision. Apply allows programmers to write kernel operations that focus on the computation for a single pixel location. The compiler gener...
详细信息
暂无评论