Greiner and Zhou (1988) presented ELR, a discriminative parameter-learning algorithm that maximizes conditional likelihood (CL) for a fixed Bayesian belief network (BN) structure, and demonstrated that it often produc...
详细信息
Greiner and Zhou (1988) presented ELR, a discriminative parameter-learning algorithm that maximizes conditional likelihood (CL) for a fixed Bayesian belief network (BN) structure, and demonstrated that it often produces classifiers that are more accurate than the ones produced using the generative approach (OFE), which finds maximal likelihood parameters. this is especially true when learning parameters for incorrect structures, such as naive Bayes (NB). In searching for algorithms to learn better BN classifiers, this paper uses ELR to learn parameters of more nearly correct BN structures - e.g., of a general Bayesian network (GBN) learned from a structure-learning algorithm by Greiner and Zhou (2002). While OFE typically produces more accurate classifiers with GBN (vs. NB), we show that ELR does not, when the training data is not sufficient for the GBN structure learner to produce a good model. Our empirical studies also suggest that the better the BN structure is, the less advantages ELR has over OFE, for classification purposes. ELR learning on NB (i.e., with little structural knowledge) still performs about the same as OFE on GBN in classification accuracy, over a large number of standard benchmark datasets.
A novel classification algorithm, OCEC, based on evolutionary computation for datamining is proposed. It is compared to GA-based and non GA-based algorithms on 8 datasets from the UCI machinelearning repository. Res...
详细信息
ISBN:
(纸本)0780374886
A novel classification algorithm, OCEC, based on evolutionary computation for datamining is proposed. It is compared to GA-based and non GA-based algorithms on 8 datasets from the UCI machinelearning repository. Results show OCEC can achieve higher prediction accuracy, smaller number of rules and more stable performance.
the original k-means clustering algorithm is designed to work primarily on numeric data sets. this prohibits the algorithm from being directly applied to categorical data clustering in many datamining applications. T...
详细信息
the original k-means clustering algorithm is designed to work primarily on numeric data sets. this prohibits the algorithm from being directly applied to categorical data clustering in many datamining applications. the k-modes algorithm [Z. Huang, Clustering large data sets with mixed numeric and categorical value, in: Proceedings of the First Pacific Asia Knowledge Discovery and dataminingconference. World Scientific, Singapore, 1997, pp. 21-34] extended the k-means paradigm to cluster categorical data by using a frequency-based method to update the cluster modes versus the k-means fashion of minimizing a numerically valued cost. However, as is the case with most data clustering algorithms, the algorithm requires a pre-setting or random selection of initial points (modes) of the clusters. the differences on the initial points often lead to considerable distinct cluster results. In this paper we present an experimental study on applying Bradley and Fayyad's iterative initial-point refinement algorithm to the k-modes clustering to improve the accurate and repetitiveness of the clustering results [cf. P. Bradley, U. Fayyad, Refining initial points for k-mean clustering, in: Proceedings of the 15thinternationalconference on machinelearning, Morgan Kaufmann, Los Altos, CA, 1998]. Experiments show that the k-modes clustering algorithm using refined initial points leads to higher precision results much more reliably than the random selection method without refinement, thus making the refinement process applicable to many datamining applications with categorical data. (C) 2002 Elsevier Science B.V. All rights reserved.
this paper researched a neural networks based knowledge discovery and datamining (KDDM) methodology based on granular computing, neural computing, fuzzy computing, Linguistic computing, and patternrecognition. A gra...
详细信息
the Inhibition-Compensation learning Scheme (ICLS) has been proposed as a way of enhancing the performance of the Moving Window Classifier In this paper, the effect of ICLS on three n-tuple based classification techni...
详细信息
ISBN:
(纸本)076951695X
the Inhibition-Compensation learning Scheme (ICLS) has been proposed as a way of enhancing the performance of the Moving Window Classifier In this paper, the effect of ICLS on three n-tuple based classification techniques has been investigated. Pre-segmented handwritten characters from the NIST database have been used as the patterndata. Results show that approximately 2-6% gain in classification accuracy can be achieved in the OCR task domain with no adverse effect on the classification throughput.
the proceedings contain 49 papers. the special focus in this conference is on Application of Discovery to Natural Science, Knowledge Discovery from Unstructured and Semi-structured data. the topics include: Mathematic...
ISBN:
(纸本)3540001883
the proceedings contain 49 papers. the special focus in this conference is on Application of Discovery to Natural Science, Knowledge Discovery from Unstructured and Semi-structured data. the topics include: Mathematics based on learning;datamining with graphical models;on the eigenspectrum of the gram matrix and its relationship to the operator eigenspectrum;in search of the horowitz factor;learning structure from sequences, with applications in a digital library;discovering frequent structured patterns from string databases;discovery in hydrating plaster using machinelearning methods;revising qualitative models of gene regulation;structure extraction using summaries;model complexity and algorithm selection in classification;experiments with projection learning;improved dataset characterisation for meta-learning;racing committees for large datasets;from ensemble methods to comprehensible models;learningthe causal structure of overlapping variable sets;extraction of logical rules from data by means of piecewise-linear neural networks;structuring neural networks through bidirectional clustering of weights;toward drawing an atlas of hypothesis classes;datascape survey using the cascade model;learning hierarchical skills from observation;image analysis for detecting faulty spots from microarray images;inferring gene regulatory networks from time-ordered gene expression data using differential equations;modeling state transition of typhoon image sequences by spatio-temporal clustering;structure-sweetness relationships of aspartame derivatives by GUHA;a hybrid approach for Chinese named entity recognition;extraction of word senses from human factors in knowledge discovery;event pattern discovery from the stock market bulletin and email categorization using fast machinelearning algorithms.
In this paper, we synthesize the main findings of three repeat purchase modelling case studies using real-life direct marketing data. Historically, direct marketing - more recently, targeted web marketing - has been o...
详细信息
ISBN:
(纸本)9729805067
In this paper, we synthesize the main findings of three repeat purchase modelling case studies using real-life direct marketing data. Historically, direct marketing - more recently, targeted web marketing - has been one of the most popular domains for the exploration of the feasibility and the viable use of novel business intelligence techniques. Many a datamining technique has been field tested in the direct marketing domain. this can be explained by the (relatively) low-cost availability of recency, frequency, monetary (RFM) and several other customer relationship data, the (relatively) well-developed understanding of the task and the domain, the clearly identifiable costs and benefits, and because the results can often be readily applied to obtain a high return on investment. the purchase incidence modelling cases reported on in this paper were in the first place undertaken to trial run state-of-the-art supervised Bayesian learning multilayer perceptron (MLP) and least squares support vector machine (LS-SVM) classifiers. For each of the cases, we also aimed at exploring the explanatory power (relevance) of the available RFM and other customer relationship related variable operationalizations for predicting purchase incidence in the context of direct marketing.
this article presents a survey of models of rough neurocomputing that have their roots in rough set theory. Historically, rough neurocomputing has three main threads: training set production, calculus of granules, and...
详细信息
Functional Electrical Stimulation (FES) is an effective and developing method to restore functions for paraplegic patients. In this research, we focused on the switching problem of FES, which is one of the obstacles t...
详细信息
Functional Electrical Stimulation (FES) is an effective and developing method to restore functions for paraplegic patients. In this research, we focused on the switching problem of FES, which is one of the obstacles that prevent FES from further practical uses. An adaptive switching for FES control for the lower limbs' activities of hemiplegic patients was developed, based on the consideration that, lower limbs' activities need the synchronization of limbs of both sides. Electromyogram (EMG) signals detected from normal side of hemiplegic patients were used to recognize the activities that the patients intend to do. the recognition results were utilized as the switching signals. However, motion patterns represented and analyzed by EMG are distinctive of individual variations and characteristic alternation, which inevitably result in classification errors in EMG analyzing. To overcome these problems, a feed-forward artificial neural network (ANN) was embedded in an on-line process to form an analyzing system that can adapt to individual characteristics and trace the nonstationary factor. Furthermore, in order to enable the analyzing system to recognize right timings from the EMG-described dynamical processes of activities, such as standing-up and walking, a practical training-set construction method that utilizes additional reference data was proposed. the proposed switching system was applied to a FES system that supports standing-up and walking for a hemiplegics subject, to verify the effectiveness. (C) 2002 Elsevier Science B.V. All rights reserved.
In patternrecognition, the goal of classification can be achieved from two different types of learning strategy-discriminative teaming and informative learning. Discriminative learning focuses on extracting the discr...
详细信息
ISBN:
(纸本)9810475241
In patternrecognition, the goal of classification can be achieved from two different types of learning strategy-discriminative teaming and informative learning. Discriminative learning focuses on extracting the discriminative information between classes. Informative learning emphasizes the learning of the class information such as class densities. We review major discriminative learning methods, namely, principal component analysis (PCA), linear discriminant analysis (LDA), minimum classification error (MCE) training algorithm and support vector machine (SVM) and one informative learning method-Gaussian mixture models (GMM). We also discuss the combination of the two types of learning and give the corresponding experiments results.
暂无评论