This paper investigates the effects of confidence transformation (CT) of the character classifier outputs in handwritten Chinese text recognition. The classifier outputs are transformed to confidence values in three c...
详细信息
This paper investigates the effects of confidence transformation (CT) of the character classifier outputs in handwritten Chinese text recognition. The classifier outputs are transformed to confidence values in three confidence types, namely, sigmoid, soft max and Dempster-Shafer theory of evidence (D-S evidence). The confidence parameters are optimized by minimizing the cross-entropy (CE) loss function (both binary and multi-class) on a validation dataset, where we add non-character samples to enhance the outlier rejection capability in text recognition. Experimental results on the CASIA-HWDB database show that confidence transformation improves the handwritten text recognition performance significantly and adding non-characters for confidence parameter estimation is beneficial. Among the confidence types, the D-S evidence performs best.
The variability in cortical morphology across subjects makes it difficult to develop a general atlas of cortical sulci. In this paper, we present a data-driven technique for automatically learning cortical folding pat...
详细信息
ISBN:
(数字)9783642184215
ISBN:
(纸本)9783642184208
The variability in cortical morphology across subjects makes it difficult to develop a general atlas of cortical sulci. In this paper, we present a data-driven technique for automatically learning cortical folding patterns from MR brain images. A local image feature-based model is learned using machine learning techniques, to describe brain images as a collection of independent, co-occurring, distinct, localized image features which may not be present in all subjects. The choice of feature type (SIFT, KLT, Harris-affine) is explored with regards to identifying cortical folding patterns while also uncovering their group-related variability across subjects. The model is built on lateral volume renderings from the ICBM dataset, and applied to hemisphere classification in order to identify patterns of lateralization based on each feature type.
This paper proposes a new model of an Evolving Spiking Neural Network (ESNN) for spatio-temporal data (STD) classification problems. The proposed ESNN model incorporates an additional layer for capturing both spatial ...
详细信息
This paper proposes a new model of an Evolving Spiking Neural Network (ESNN) for spatio-temporal data (STD) classification problems. The proposed ESNN model incorporates an additional layer for capturing both spatial and temporal components of the STD and then transforms them into high dimensional spiking patterns. These patterns are learned and classified in the evolving classification layer of the ESNN. A fast time-to-first-spike learning algorithm is used that enables the new model to be more suitable for learning from the STD streams in an adaptive and incremental manner. The proposed method is evaluated on a benchmark sign language video that is spatio-temporal in nature. The results show that the proposed method is able to capture important spatio-temporal information from the STD stream. This results in significantly higher classification accuracy than the traditional time-delay MLP neural network model. Future directions for the development of ESNN models for STD are discussed.
In this study, a novel approach is proposed for mental stress recognition through automatic analysis of eye video sequences. The proposed system consists of five stages: video capturing, fuzzy image processing, signal...
详细信息
In this study, a novel approach is proposed for mental stress recognition through automatic analysis of eye video sequences. The proposed system consists of five stages: video capturing, fuzzy image processing, signal processing, feature extraction and, classification. The pupil parameters including Pupil Diameter (PD) and Pupil Dilation Acceleration (PDA) are measured using softcomputing techniques wherein the eye region is detected using the genetic algorithm (GA), and a fuzzy filter is designed for noise reduction. Edge detection is performed based on fuzzy reasoning and linking is done using Hough transform. Then, signal processing technique is applied to the pupil parameters to extract their most relevant features. Extracted features are imported into the learning system to classify the affective states between "stress" and "relaxed". The Fuzzy SVM (FSVM) is applied to this classification process. In order to induce the stress in subjects, a Stroop color-word test is designed. Also, the results obtained from the pupil parameters are compared with two other physiological signals including Electrocardiogram (ECG) and Photoplethysmogram (PPG). The experimental results indicate the pupil parameters have great potential for stress recognition compared to the other two physiological signals and, proposed stress recognition system is promising.
The recent technology of image processing forms the basic principles of research entitled “A Novel Approach for Face recognition and Age estimation using Local Binary pattern, Discriminative approach using Two layere...
详细信息
The recent technology of image processing forms the basic principles of research entitled “A Novel Approach for Face recognition and Age estimation using Local Binary pattern, Discriminative approach using Two layered Back Propagation Network” has been developed to overcome the inconveniences faced by the organizations in recognizing the exact person. The proposed system sustains a high recognition rate in a wide range of resolution levels and it breaks the other alternative methods. Skin patches are also one of the features of our proposed work. We propose a face detection algorithm for different lighting conditions. Human Skin patches is also one of the parameter in the algorithm. The new methods using Local Binary pattern, Discriminative approach, facial algorithm and two layered back propagation algorithm for identifying the face and as well as age estimation. The Texture features and Global features are extracted from the image in different scales. The Gradient Orientation Pyramid can be formed for calculating the Age Progression and Age Estimation. The proposed method having high calculation speed compared with the existing method using Back propagation network with single layer. The dataset are taken from FG-NET and Morph Dataset. The performance comparison has been done using different datasets.
In this paper, we propose a PCA-based spectral band compression and multispectral palmprint recognition method. This method first exploits PCA to compress original multispectral bands to a smaller number of 'bands...
详细信息
In this paper, we propose a PCA-based spectral band compression and multispectral palmprint recognition method. This method first exploits PCA to compress original multispectral bands to a smaller number of 'bands' and then uses the compressed bands to classify palmprint images. The experimental results show that our proposed PCA-based spectral band compression and recognition method can use very low-dimensional data to represent the original multispectral palmprint images and obtain a high classification accuracy.
Very often, the recognition of a pattern is accompanied by a cognitive process of interpretation and understanding. In the arts and sciences, as well as in our daily lives, we learned patterns from nature and create n...
详细信息
Very often, the recognition of a pattern is accompanied by a cognitive process of interpretation and understanding. In the arts and sciences, as well as in our daily lives, we learned patterns from nature and create new patterns for various applications. Weave pattern is one of the most important artificial patterns in our daily lives and there are numerous applications. To manipulate the weave patterns, texton indexing and prioritization are needed to perform, which is associated with a cognitive process of interpretation and understanding of pattern. In this regard, we use an interdisciplinary approach to help selecting weave texture patterns using tailored features and algorithms, taking into account essential features or rules of pattern design. The features and algorithms are designed based on the object-attribute-relation (OAR) model and cognitive informatics model. Three essential features of weave pattern are proposed, i.e. the complexity of patterns in production process, visual structural appearance and cognitive features to track for weave pattern. Our experiments on a wide variety of weave patterns show that the proposed approach is capable of effectively prioritizing weave texture patterns.
In the patternrecognition of Finger Motion's EMG signal, the Stability and Efficiency are both the problem. The paper proposes a new method of patternrecognition of EMG signal. The method uses AR model in th...
详细信息
In the patternrecognition of Finger Motion's EMG signal, the Stability and Efficiency are both the problem. The paper proposes a new method of patternrecognition of EMG signal. The method uses AR model in the modern signal processing theory and numerical variance calculation to compress and make the feature extraction of the EMG. To make the classification of the eigenvalues of the EMG, these eigenvalues have been inputted to the MATLAB to build up a improved multilayer BP neural networks. For the recognition of three different kinds of finger motion's EMG signals, the experiment obtained more higher accuracy. It shows that the method is efficient.
The wide application of General Purpose Graphic Processing Units (GPGPUs) results in large manual efforts on porting and optimizing algorithms on them. However, most existing automatic ways of generating GPGPU code fa...
详细信息
The wide application of General Purpose Graphic Processing Units (GPGPUs) results in large manual efforts on porting and optimizing algorithms on them. However, most existing automatic ways of generating GPGPU code fail to conduct optimization strategies regarding a specific computation and to reuse constantly evolving manual optimizations. In this paper, we present a computation pattern driven approach for computation-specific GPGPU code generation and optimization, which in turn reuses manual optimizations to a certain extent. We suggest language extensions to OpenMP, high-level data structure attributes, in order to assist the process of computation pattern matching and to help give users intuitive performance tuning parameters in the view of data structure attributes. We illustrate the feasibility of this approach through three important computation dwarfs, which are dense matrix, sparse matrix, and structured mesh computation in scientific computing. We also build a prototype OpenMP-to-CUDA translator that consists of computation patternrecognition and code template instantiation. The experimental results demonstrate the performance benefits of computation pattern driven method. To our best knowledge, it is the first work on reusing manual optimizations for GPGPUs with computation pattern driven approach.
In real-world image understanding and retrieval applications, there exists a large number of images containing"verb-object" semantic. The most existing image annotation approaches which mainly focus on annot...
详细信息
ISBN:
(纸本)9781450306164
In real-world image understanding and retrieval applications, there exists a large number of images containing"verb-object" semantic. The most existing image annotation approaches which mainly focus on annotating images with "object" concepts may not well describe the image semantics. In this paper, we propose a novel image annotation approach by learning "verb-object" concepts. The "verb-object" concept learning method is developed based on the assumption that the classifiers of the "verb-object" concepts which contain the same object usually share a common structure. We formulate each "verb-object" concept classifier as a combination of a private part and a common part shared by all the "verb-object" concepts containing the same object. These classifiers are learned simultaneously through a joint optimization process. Experiments on a Web image data set containing 22,812 images with 28 concepts demonstrate that the proposed approach can achieve promising performance compared to the baseline method. Copyright 2011 ACM.
暂无评论