In this paper, we propose a method by engaging the one class support vector machine (OC-SVM) in the identification of diffractive optically variable images (DOVIs). OC-SVM, as a special SVM, can solve the problems of ...
详细信息
In this paper, we propose a method by engaging the one class support vector machine (OC-SVM) in the identification of diffractive optically variable images (DOVIs). OC-SVM, as a special SVM, can solve the problems of high-dimensional data sets and small sample size (SSS) with positive and negative unbalance training data. Image feature matrix is built by extracting image features from texture aspects. OC-SVM can be trained with the high-dimensional matrix directly, and does not have to reduce the dimensionality of feature matrix as the usual methods. The experiment results show the effectiveness of the proposed approach against linear discriminant analysis. Considering time cost and correct classification rate, OC-SVM is suitable for the identification of DOVIs.
Work in opinion mining and classification often assumes the incoming documents to be opinionated. Opinion mining system makes false hits while attempting to compute polarity values for non-subjective or factual senten...
详细信息
Work in opinion mining and classification often assumes the incoming documents to be opinionated. Opinion mining system makes false hits while attempting to compute polarity values for non-subjective or factual sentences or documents. It becomes imperative to decide whether a given document contains subjective information or not as well as to identify which portions of the document are subjective or factual. In this work a theme detection technique has been evolved for more generic domain independent subjectivity detection that classifies sentences with binary feature: opinionated or non-opinionated. Theme detection technique examines sentence level opinion and finally accumulates the opinion clues to reach the discourse level subjectivity. The subjectivity detection system has been evaluated on the multi perspective question answering (MPQA) corpus as well as on Bengali corpus. The system evaluation has shown the precision and recall values of 76.08 and 83.33 for English and 72.16 and 76.00 for Bengali respectively.
Eye detection is a well studied problem for the constrained face recognition problem, where we find controlled distances, lighting, and limited pose variation. A far more difficult scenario for eye detection is the un...
详细信息
Eye detection is a well studied problem for the constrained face recognition problem, where we find controlled distances, lighting, and limited pose variation. A far more difficult scenario for eye detection is the unconstrained face recognition problem, where we do not have any control over the environment or the subject. In this paper, we take a look at two different approaches for eye detection under difficult acquisition circumstances, including low-light, distance, pose variation, and blur. A new machinelearning approach and several correlation filter approaches, including a new adaptive variant, are compared. We present experimental results on a variety of controlled data sets (derived from FERET and CMU PIE) that have been re-imaged under the difficult conditions of interest with an EMCCD based acquisition system. The results of our experiments show that our new detection approaches are extremely accurate under all tested conditions, and significantly improve detection accuracy compared to a leading commercial detector. This unique evaluation brings us one step closer to a better solution for the unconstrained face recognition problem.
Gait disorder is one symptom of neurodegenerative disease. Using wearable motion sensors to monitor the motor function of patients with neurodegenerative disease has attracted more attention. Research has shown that m...
详细信息
Gait disorder is one symptom of neurodegenerative disease. Using wearable motion sensors to monitor the motor function of patients with neurodegenerative disease has attracted more attention. Research has shown that machinelearning techniques can be applied to the classification of neurodegenerative diseases from the gait data recorded by footswitches. In order to identify the most valuable features from 10 raw temporal variables extracted from gait cycles to improve the classification performance, we examine four types of feature selection and construction methods, namely, maximum signal-to-noise ratio based feature selection method, maximum signal-to-noise ratio combined with minimum correlation based feature selection method, maximum prediction power combined with minimum correlation based feature selection method and principal component analysis. Results show that using a set of four features, a relatively high prediction performance has been achieved with classification accuracies ranging from 79.04% to 93.96%. The continual increase of the number of features does not significantly contribute to the improvement of classification performance. This is consistent with clustering-based feature analysis.
Enzymes are proteins that catalyze bio-chemical reactions in different ways and play important roles in metabolic pathways. The exponential rise in sequences of new enzymes has necessitated developing methods that acc...
详细信息
Enzymes are proteins that catalyze bio-chemical reactions in different ways and play important roles in metabolic pathways. The exponential rise in sequences of new enzymes has necessitated developing methods that accurately predict their function. To address this problem, approaches that cluster enzymes based on their sequence and structural similarity have been applied, but are known to fail for dissimilar proteins that perform the same function. In this paper, we present a machinelearning approach to accurately predict the main function class of enzymes based on a unique set of 73 sequence-derived features. Our features can be extracted using freely available online tools. We used different multi-class classifiers to categorize enzyme protein sequences into one of the NC-IUBMB defined six main function classes. Amongst the classifiers, Random Forest reported the best results with an overall accuracy of 88% and precision and recall in the range of 84% to 93% and 82% to 93% respectively. Our results compare favorably with existing methods, and in some cases report better performance. Random Forest has been proven to be a very efficient datamining algorithm. This paper is first in exploring their application to enzyme function prediction. The datasets can be accessed online at the location: .
The problem of PU learning, i.e., learning classifiers with positive and unlabelled examples (but not negative examples), is very important in information retrieval and datamining. We address this problem through a n...
详细信息
ISBN:
(纸本)9781424429165
The problem of PU learning, i.e., learning classifiers with positive and unlabelled examples (but not negative examples), is very important in information retrieval and datamining. We address this problem through a novel approach: reducing it to the problem of learning classifiers for some meaningful multivariate performance measures. In particular, we show how a powerful machinelearning algorithm, Support Vector machine, can be adapted to solve this problem. The effectiveness and efficiency of the proposed approach have been confirmed by our experiments on three real-world datasets.
This paper presents a general framework to automatically generate rules that produce given spatial patterns in complex systems. The proposed framework integrates Genetic Algorithms with Artificial Neural Networks or S...
详细信息
This paper presents a general framework to automatically generate rules that produce given spatial patterns in complex systems. The proposed framework integrates Genetic Algorithms with Artificial Neural Networks or Support Vector machines. Here, it is tested on a well known 3-values, 6-neighbors, k-totalistic cellular automata rule called the "burning paper" rule. Results are encouraging and should pave the way for the use of the proposed framework in real-life complex systems models.
Due to increasing anticipated average life and health expenditure ambient assisted living (AAL) systems attract the attention of researchers. To successfully build and deploy AAL systems knowledge from different field...
详细信息
Local Binary pattern (LBP) is a powerful means of texture description that has achieved great success in face analysis area. In this paper, we propose a face recognition approach using boosted LBP-feature based classi...
详细信息
ISBN:
(纸本)9781424421961
Local Binary pattern (LBP) is a powerful means of texture description that has achieved great success in face analysis area. In this paper, we propose a face recognition approach using boosted LBP-feature based classifiers. The multi-class problem of face recognition is transformed into a two-class one of intra- and extra-class by classifying every pair of face image as intra-class or extra-class ones. The cascade framework, is used to overcome the problem of overwhelmingly large number of samples and grossly imbalance of the positive and negative samples. By boot-strapping negative examples, sub-training spaces (random subsets) are randomly generated, and then weak classifiers are learned using every sub-training space (random subset). The weak classifiers are combined into a strong one by improving recognition accuracy. Experimental results on FERET database show competitive performance.
It is crucial for TCM (Traditional Chinese Medicine) post-hepatitis cirrhosis diagnosis to accurately identify the syndrome. Meanwhile, the selection of features which are relevant to a certain TCM post-hepatitis cirr...
详细信息
ISBN:
(纸本)9781424421961
It is crucial for TCM (Traditional Chinese Medicine) post-hepatitis cirrhosis diagnosis to accurately identify the syndrome. Meanwhile, the selection of features which are relevant to a certain TCM post-hepatitis cirrhosis syndrome not only improves the performance of the classifiers, but also provides well measure for treatment. Therefore, in this paper, we analyze the classical ART2(Adaptive Resonance Theory 2) neural network such as the problem of pattern drifting and the same phase data with different amplitudes. Based on this, here, a novel network named SWART2 is proposed by taking dispersion testing and centroid computation learning, and introducing the weighted and supervised mechanism, which aims at improving ART2's ability of classification greatly for post-hepatitis cirrhosis diagnosis. Experimental results in this paper showed that the ***2 performed better than classical ART2.
暂无评论