the authors present a GA optimization technique for cosinebased k-nearest neighbors classification that improves predictive accuracy in a class-balanced manner while simultaneously enabling knowledge discovery. the GA...
详细信息
In this paper, we are interested in the sender's name extraction in fax cover pages through a machinelearning scheme. For this purpose, two analysis methods are implemented to work in parallel. the first one is b...
详细信息
In this paper, we are interested in the sender's name extraction in fax cover pages through a machinelearning scheme. For this purpose, two analysis methods are implemented to work in parallel. the first one is based on image document analysis (OCR recognition, physical block selection), the other on text analysis (word feature extraction, local grammar rules). Our main contribution consisted in introducing a neural network to find an optimal combination of the two approaches. Tests carried on real fax images show that the neural network improves performance compared to an empirical combination function and to each method used separately.
We propose an unsupervised approach to select representative face samples (models) from raw videos and build an appearance-based face recognition system. the approach is based on representing the face manifold in a lo...
详细信息
We propose an unsupervised approach to select representative face samples (models) from raw videos and build an appearance-based face recognition system. the approach is based on representing the face manifold in a low-dimensional space using the locally linear embedding (LLE) algorithm and then performing K-means clustering. We define the face models as the cluster centers. Our strategy is motivated by the efficiency of LLE to recover meaningful low-dimensional structures hidden in complex and high dimensional data such as face images. Two other well-known unsupervised learning algorithms (Isomap and SOM) are also considered. We compare and assess the efficiency of these different schemes on the CMU MoBo database which contains 96 face sequences of 24 subjects. the results clearly show significant performance enhancements over traditional methods such as the PCA-based one.
Feature learning aims at automatic optimization of features to be used in the classification process. We consider the situation where, given a parameterized algorithm for extracting the features from the data, the opt...
详细信息
there is increasing interest in the development of reliable, rapid and non-intrusive security control systems. Among the many approaches, biometrics such as palmprints provide highly effective automatic mechanisms for...
详细信息
there is increasing interest in the development of reliable, rapid and non-intrusive security control systems. Among the many approaches, biometrics such as palmprints provide highly effective automatic mechanisms for use in personal identification. this paper presents a new method for extracting features from palmprints using the competitive coding scheme and angular matching. the competitive coding scheme uses multiple 2-D Gabor filters to extract orientation information from palm lines. this information is then stored in a feature vector called the competitive code. the angular matching with an effective implementation is then defined for comparing the proposed codes, which can make over 9,000 comparisons within 1s. In our testing database of 7,752 palmprint samples from 386 palms, we can achieve a high genuine acceptance rate of 98.4% and a low false acceptance rate of 3/spl times/10/sup -6/%. the execution time for the whole process of verification, including preprocessing, feature extraction and final matching, is 1s.
Research on cooperative, adaptive intelligentsystems, involves studying, developing and evaluatingarchitectures and methods to solve complex problemsusing adaptive and cooperative systems. these systemsmay range from ...
详细信息
Research on cooperative, adaptive intelligentsystems, involves studying, developing and evaluatingarchitectures and methods to solve complex problemsusing adaptive and cooperative systems. these systemsmay range from simple software modules (such as aclustering or a classification algorithm) to physicalsystems (such as autonomous robots, machines orsensors).the main characteristic of these systems is that theyare adaptive and cooperative. By adaptive, it is meantthat the systems have a learning ability that makes themadjust their behaviour or performance to cope withchanging situations. the systems are willing tocooperate together to solve complex problems or toachieve common *** patternrecognition, there are notable contributionson the use of multiple classifiers. the most dominantdecomposition model used is an ensemble of classifiers(identical structures) that are trained differently. Mostof the innovations are in the combining methods. thereare weighting and voting approaches, probabilisticapproaches and approximate and fuzzy logicapproaches. In the area of sensor fusion, there havebeen some interesting ideas for fusing the data anddecisions of the sensors. However, most of thesecombining schemes are usually applied as a postprocessing *** this work are concerned with investigatingarchitectures and methods of aggregating decisions in amulti-classifier or multi-agent environment. Newarchitectures that allow active cooperation will bedeveloped. the classifiers (or agents) have to knowsome knowledge about others in the system. Differentforms of cooperation will be reported. In order for thesearchitectures to allow for dynamic decision fusion, theaggregation procedures have to have the flexibility toadapt to changes in the input and output and adjust toimprove on the final output. Changes will be learned bymeans of extracting features using feature *** of these architectures to problems inclassification of data, distributed datamining a
We discuss some estimates for the misclassification rate of a classification tree in terms of the size of the learning set, following some ideas introduced in [3]. We develop some mathematical ideas of [3], extending ...
详细信息
ISBN:
(纸本)3540405046
We discuss some estimates for the misclassification rate of a classification tree in terms of the size of the learning set, following some ideas introduced in [3]. We develop some mathematical ideas of [3], extending the analysis to the case with an arbitrary finite number of classes.
this paper is concerned withthe application of fuzzy neural networks to fault diagnosis systems for rotary machines. In practical fault diagnosis, it is very difficult to improve the recognition rate of pattern recog...
详细信息
this paper is concerned withthe application of fuzzy neural networks to fault diagnosis systems for rotary machines. In practical fault diagnosis, it is very difficult to improve the recognition rate of patternrecognition, especially when the sample data are similar. To solve these difficulties, a fault diagnosis system using fuzzy neural networks is proposed in this research. A fault diagnosis system with fuzzy neural networks is based on a series of standard fault pattern pairings between fault symptoms and fault. Fuzzy neural networks are trained to memorize these standard pattern pairs. Unlike other neural networks, fuzzy neural networks adopt bi-directional association. they make use of information from boththe fault symptoms and the fault patterns, which can improve recognition rate greatly. When an unknown sample becomes the input for a trained fault diagnosis system, the fault diagnosis system can make fault diagnosis by bi-directional association of fuzzy neural networks. through experiments with a rotor testing table and applications in monitoring and fault diagnosis of water pump sets of oil plant, it is verified that fuzzy neural networks have a well distinguished ability and are effective to perform fault diagnosis of rotary machines.
Most recent document standards rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope w...
详细信息
ISBN:
(纸本)3540405046
Most recent document standards rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. Only a few models have been proposed for handling structured documents, and the design of such systems is still an open problem. We present here a new model for structured document retrieval which allows to compute and to combine the scores of document parts. It is based on bayesian networks and allows for learningthe model parameters in the presence of incomplete data. We present an application of this model for ad-hoc retrieval and evaluate its performances on a small structured collection. the model can also be extended to cope with other tasks such as interactive navigation in structured documents or corpus.
We present a model-based method for accurate extraction of pedestrian silhouettes from video sequences. Our approach is based on two assumptions, 1) there is a common appearance to all pedestrians, and 2) each individ...
详细信息
ISBN:
(纸本)0769519504
We present a model-based method for accurate extraction of pedestrian silhouettes from video sequences. Our approach is based on two assumptions, 1) there is a common appearance to all pedestrians, and 2) each individual looks like him/herself over a short amount of time. these assumptions allow us to learn pedestrian models that encompass both a pedestrian population appearance and the individual appearance variations. Using our models, we are able to produce pedestrian silhouettes that have fewer noise pixels and missing parts. We apply our silhouette extraction approach to the NIST gait data set and show that under the gait recognition task, our model-based sulhouettes result in much higher recognition rates than silhouettes directly extracted from background subtraction, or any non-model-based smoothing schemes.
暂无评论