ISBN: 9783642202667 (print)
The proceedings contain 84 papers. The topics discussed include: new measure of Boolean factor analysis quality; mechanisms of adaptive spatial integration in a neural model of cortical motion processing; self-organized short-term memory mechanism in spiking neural network; approximation of functions by multivariable Hermite basis: a hybrid method; using pattern recognition to predict driver intent; neural networks committee for improvement of metal's mechanical properties estimates; logarithmic multiplier in hardware implementation of neural networks; a robust learning model for dealing with missing values in many-core architectures; a model of saliency-based selective attention for machine vision inspection application; grapheme-phoneme translator for Brazilian Portuguese; and improvement of inventory control under parametric uncertainty and constraints.
Image matching is a fundamental task for computer vision. However, due to the local ambiguities induced by repetitive patterns in most man-made objects or scenes, matching images of repetitive patterns is still a c...
Signatures continue to be an important biometric because they remain widely used as a means of personal verification, and therefore an automatic verification system is needed. Manual signature-based authentication of a large number of documents is a difficult and time-consuming task. Consequently, for many years, in the field of protected communication and financial applications, we have observed an explosive growth in biometric personal authentication systems that are closely connected with measurable unique physical characteristics (e.g. hand geometry, iris scan, fingerprints or DNA) or behavioural features. Substantial research has been undertaken in the field of signature verification involving English signatures, but to the best of our knowledge, very few works have considered non-English signatures such as Chinese, Japanese, Arabic etc. In order to convey the state-of-the-art in the field to researchers, in this paper we present a survey of non-English and non-Latin signature verification systems.
A fundamental problem in image processing is image segmentation. The conventional image segmentation methods all suffer, to some extent, from the problem of inaccurate segmentation. A slightly easier and more approachable problem, interactive segmentation, has also received a lot of attention over the years. In this paper, we propose a fast and simple interactive image segmentation technique. The segmentation process is conducted in two modules. First, the original image is pre-segmented by a low-level segmentation method. Second, some portion of the object of interest is marked. The proposed method identifies all the similar-colored objects present in the image. In this work, a color feature is used to measure the closeness between two regions, and the label of each unmarked region is decided accordingly.
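The labeling step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' method: it assumes each pre-segmented region is summarized by a mean RGB color, that closeness is Euclidean distance in color space, and that a region counts as part of the object when it lies within a hypothetical `threshold` of any marked region. The names `label_regions`, `mean_colors`, `marked_ids`, and `threshold` are all invented for this example.

```python
from math import dist

def label_regions(mean_colors, marked_ids, threshold=30.0):
    """Label every region as "object" or "background" by color closeness.

    mean_colors: dict mapping region id -> (r, g, b) mean color (assumed feature)
    marked_ids:  ids of the regions the user marked as object of interest
    threshold:   hypothetical maximum color distance for a region to join the object
    """
    labels = {}
    for region, color in mean_colors.items():
        # Distance to the closest marked region; a marked region is at distance 0.
        closest = min(dist(color, mean_colors[m]) for m in marked_ids)
        labels[region] = "object" if closest <= threshold else "background"
    return labels

# Four regions from an assumed pre-segmentation; the user marked region 0.
regions = {0: (250, 10, 10), 1: (240, 20, 15), 2: (10, 10, 200), 3: (20, 30, 220)}
result = label_regions(regions, marked_ids=[0])
```

Here regions 0 and 1 are similar reds and end up labeled as object, while the two blue regions fall outside the threshold. A real system would likely use a richer color feature than a single mean.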
ISBN: 9781457703942 (print)
Unsupervised categorization of objects is a fundamental problem in computer vision. While appearance-based methods have become popular recently, other important cues like functionality are largely neglected. Motivated by psychological studies giving evidence that human demonstration has a facilitative effect on categorization in infancy, we propose an approach for object categorization from depth video streams. To this end, we have developed a method for capturing human motion in real-time. The captured data is then used to temporally segment the depth streams into actions. The set of segmented actions is then categorized in an unsupervised manner, through a novel descriptor for motion capture data that is robust to subject variations. Furthermore, we automatically localize the object that is manipulated within a video segment, and categorize it using the corresponding action. For evaluation, we have recorded a dataset that comprises depth data with registered video sequences for 6 subjects, 13 action classes, and 174 object manipulations.
ISBN: 9781457703942 (print)
Graph cuts are widely used in many fields of computer vision to minimize certain classes of energies in low-order polynomial time. These specific classes depend on the way chosen to build the graphs representing the problems to solve. We study here all possible ways of building graphs and the associated energies minimized, leading to the exhaustive family of energies minimizable exactly by a graph cut. To do this, we consider the issue of coding pixel labels as states of the graph, i.e. the choice of state interpretations. The family obtained comprises many new classes, in particular energies that do not satisfy the submodularity condition, including energies that are not even permuted-submodular. A generating subfamily is studied in detail; in particular, we propose a canonical form to represent Markov random fields, which proves useful to recognize energies in this subfamily in linear complexity almost surely, and then to build the associated graph in quasilinear time. A few experiments are performed to illustrate the new possibilities offered.
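For readers unfamiliar with the submodularity condition mentioned above, the sketch below checks it for a single binary pairwise term. It uses the standard textbook form of the condition, E(0,0) + E(1,1) <= E(0,1) + E(1,0), which the abstract itself does not spell out; the function name `is_submodular` and the two example cost tables are illustrative assumptions.

```python
def is_submodular(pairwise):
    """Check the standard submodularity condition for a binary pairwise term.

    pairwise[a][b] is the cost of giving labels a and b (each 0 or 1)
    to two neighbouring pixels.
    """
    return pairwise[0][0] + pairwise[1][1] <= pairwise[0][1] + pairwise[1][0]

# A smoothness term that penalizes different labels is submodular.
smoothing = [[0, 1], [1, 0]]
# A term that penalizes equal labels violates the condition.
repulsive = [[1, 0], [0, 1]]

smoothing_ok = is_submodular(smoothing)
repulsive_ok = is_submodular(repulsive)
```

The second table is an example of the kind of term that the classical graph construction cannot represent directly.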
ISBN: 9781457703942 (print)
Recovering 3D geometry from a single view of an object is an important and challenging problem in computer vision. Previous methods mainly focus on one specific class of objects without large topological changes, such as cars, faces, or human bodies. In this paper, we propose a novel single view reconstruction algorithm for symmetric piecewise planar objects that is not restricted to particular object classes. Symmetry is ubiquitous in man-made and natural objects and provides rich information for 3D reconstruction. Given a single view of a symmetric piecewise planar object, we first find all the symmetric line pairs. The geometric properties of symmetric objects are used to narrow down the search space. Then, based on the symmetric lines, a depth map is recovered through a Markov random field. Experimental results show that our algorithm can efficiently recover the 3D shapes of different objects with significant topological variations.
ISBN: 9781457703942 (print)
In this paper we consider the problem of aligning multiple non-rigid surface mesh sequences into a single temporally consistent representation of the shape and motion. A global alignment graph structure is introduced which uses shape similarity to identify frames for inter-sequence registration. Graph optimisation is performed to minimise the total non-rigid deformation required to register the input sequences into a common structure. The resulting global alignment ensures that all input sequences are resampled with a common mesh structure which preserves the shape and temporal correspondence. Results demonstrate temporally consistent representation of several public databases of mesh sequences for multiple people performing a variety of motions with loose clothing and hair.
ISBN: 9781457703942 (print)
This paper proposes a new template matching method that is robust to outliers and fast enough for real-time operation. The template and image are densely transformed into binary code form by projecting and quantizing histograms of oriented gradients. The binary codes are matched by a generic method of robust similarity applicable to additive match measures, such as L_p and Hamming distances. The robust similarity map is computed efficiently via a proposed Inverted Location Index structure that stores pixel locations indexed by their values. The method is validated experimentally on large image patch datasets. Challenging applications, such as intra-category object detection, object tracking, and multimodal image matching, are demonstrated.
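The idea of storing pixel locations indexed by their values can be illustrated with a small sketch. This is not the paper's Inverted Location Index or its robust similarity: it assumes codes are small integers, and it only counts exact per-pixel code matches for each template placement, which is one simple additive match measure. The names `build_location_index` and `match_counts` are hypothetical.

```python
from collections import defaultdict

def build_location_index(codes):
    """Map each code value to the list of (row, col) pixels that hold it."""
    index = defaultdict(list)
    for r, row in enumerate(codes):
        for c, value in enumerate(row):
            index[value].append((r, c))
    return index

def match_counts(index, template, image_shape):
    """For every template placement, count template pixels whose code matches exactly."""
    rows, cols = image_shape
    t_rows, t_cols = len(template), len(template[0])
    votes = [[0] * (cols - t_cols + 1) for _ in range(rows - t_rows + 1)]
    for dr, t_row in enumerate(template):
        for dc, value in enumerate(t_row):
            # Only image pixels carrying this code are visited.
            for r, c in index.get(value, ()):
                top, left = r - dr, c - dc
                if 0 <= top <= rows - t_rows and 0 <= left <= cols - t_cols:
                    votes[top][left] += 1
    return votes

image = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [0, 5, 6, 1],
]
template = [[5, 6]]
index = build_location_index(image)
votes = match_counts(index, template, (3, 4))
```

Each entry of `votes` is the number of matching pixels when the template's top-left corner sits at that position. The benefit of the index is that each template pixel touches only the image locations with the same code instead of scanning the whole image.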
ISBN: 9781457703942 (print)
The classic Bag-of-Features (BOF) model and its extensions use a single value to represent a visual code. This strategy ignores the relations between visual codes. In this paper, we explore these relations and propose a new algorithm for image classification. It consists of two main parts: 1) construct the codebook graph, wherein a visual code is linked with other codes; 2) describe each local feature using a pair of related codes, corresponding to an edge of the graph. Our approach contains richer information than previous BOF models. Moreover, we demonstrate that these models are special cases of ours. Various coding and pooling algorithms can be embedded into our framework to obtain better performance. Experiments on different kinds of image classification databases demonstrate that our approach consistently achieves excellent performance compared with various BOF models.
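The two parts listed above can be sketched in a few lines. The abstract does not say how codes are linked or how an edge is chosen for a feature, so both rules here are assumptions made for illustration: each code is linked to its `k` nearest codes, and a feature is described by the edge whose two endpoint codes have the smallest summed distance to it. The function names are hypothetical.

```python
from math import dist

def build_codebook_graph(codebook, k=1):
    """Link every code to its k nearest other codes (assumed linking rule).

    Returns a set of undirected edges (i, j) with i < j.
    """
    edges = set()
    for i, code in enumerate(codebook):
        others = sorted(
            (j for j in range(len(codebook)) if j != i),
            key=lambda j: dist(code, codebook[j]),
        )
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges

def encode_feature(feature, codebook, edges):
    """Describe a feature by the edge whose two codes are jointly closest to it."""
    return min(
        edges,
        key=lambda e: dist(feature, codebook[e[0]]) + dist(feature, codebook[e[1]]),
    )

# A toy 2D codebook with two well-separated pairs of codes.
codebook = [(0.0, 0.0), (1.0, 0.0), (10.0, 10.0), (11.0, 10.0)]
edges = build_codebook_graph(codebook, k=1)
edge = encode_feature((0.4, 0.1), codebook, edges)
```

An image descriptor could then be a histogram over edges rather than over single codes.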