Classification is a machine learning technique whose objective is the prediction of the class membership of data instances. there are numerous models Currently available for performing classification. among which deci...
详细信息
ISBN:
(纸本)9783642043932
Classification is a machine learning technique whose objective is the prediction of the class membership of data instances. there are numerous models Currently available for performing classification. among which decision trees and artificial neural networks. In this article we describe the implementation of a new lazy classification model called similarity classifier. Given an out-of-sample instance, this model predicts its class by finding the training instances that are similar to it, and returning the most frequent class among these instances. the classifier was implemented using Weka's data mining API, and is available for download. Its performance. according to accuracy and speed metrics, compares relatively well withthat of well-established classifiers such as nearest neighbor models or support vector machines. For this reason, the similarity classifier can become a useful instrument in a data mining practitioner's tool set.
In this paper, we present a more effective approach to clustering with eXtended Classifier System (XCS) which is divided into two phases. the first;phase is the XCS learning process with rule compact, during which we ...
详细信息
ISBN:
(纸本)9783642043932
In this paper, we present a more effective approach to clustering with eXtended Classifier System (XCS) which is divided into two phases. the first;phase is the XCS learning process with rule compact, during which we alter the XCS mechanisms and propose a new way to calculate rewards. After learning, the rules are evolved to form the filial population consisting of rules with homogeneous data distribution. the second phase is merging the learnt rules to generate final clusters. We achieve this by modelling the rules as, sub-graphs and merging the sub-graphs based oil some criteria similar to CHAMELEON. Experimental results validate the effectiveness oil a number of datasets, which contain clusters of different shapes, densities and distances.
An attempt has been made to employ evolving Takagi-Sugeno algorithm (eTS) to built models assisting property valuation on the basis of actual data drawn from cadastral system, registry of sales transactions, and a cad...
详细信息
ISBN:
(纸本)9783642043932
An attempt has been made to employ evolving Takagi-Sugeno algorithm (eTS) to built models assisting property valuation on the basis of actual data drawn from cadastral system, registry of sales transactions, and a cadastral map. Seven methods of feature selection were applied an evaluated. the eTS performance was compared to three algorithms implemented in KEEL, including decision trees for regression. neural network. and Support vector machine. the results confirmed the advantages of the eTS algorithm.
Analysis of data without labels is commonly subject to scrutiny by unsupervised machine learning techniques. Such techniques provide more meaningful representations, useful for better understanding of a problem at han...
详细信息
ISBN:
(纸本)9783642043932
Analysis of data without labels is commonly subject to scrutiny by unsupervised machine learning techniques. Such techniques provide more meaningful representations, useful for better understanding of a problem at hand, than by looking only at the data itself. Although abundant expert knowledge exists in many areas where unlabelled data is examined, such knowledge is rarely incorporated into automatic analysis. Incorporation of expert, knowledge is frequently a matter of combining multiple data sources front disparate hypothetical spaces. In cases where such spaces belong to different, data types, this task becomes even more challenging. In this paper we present a novel immune-inspired method that enables the fusion of such disparate types of data. for a specific set of problems. We show that our method provides a better visual understanding of only hypothetical space withthe help of data from another hypothetical space. We believe that, our model has implications for the field of exploratory data analysis and knowledge discovery.
this work is related to the KEEL1 (Knowledge Extraction based oil Evolutionary learning) tool, a non-commercial software that supports data management, design of experiments and all educational section. the KEEL softw...
详细信息
ISBN:
(纸本)9783642043932
this work is related to the KEEL1 (Knowledge Extraction based oil Evolutionary learning) tool, a non-commercial software that supports data management, design of experiments and all educational section. the KEEL software tool is devoted to assess evolutionary algorithms for data Mining problems including regression, classification, clustering, pattern mining and so on. these features implies all advantage for the research and educational Field. the aim of this contribution is to present some guidelines for including new algorithms in KEEL helping the researchers to make their methods easily accessible for other authors and to compare the results of many approaches already included within the KEEL software. By providing a source code template, the developer does not need to take into account the basic requirements of the KEEL software tool, and lie or she has only to focus in the designing and encoding of his or hers approach.
Writer identification is to determine the writer from unknown handwritings, which has been becoming an active research topic in pattern recognition field. this paper proposes a new hybrid method combining Gabor functi...
详细信息
ISBN:
(纸本)9783642043932
Writer identification is to determine the writer from unknown handwritings, which has been becoming an active research topic in pattern recognition field. this paper proposes a new hybrid method combining Gabor function and mesh fractal dimension for off-line, text-in dependent writer identification. Compared to the existing Gabor-based method for off-line, text-independent writer identification, this new method is capable of extracting more useful features to distinguish the handwritings. Experimental results show that this new method can achieve much better identification results than the existing Gabor-based method.
Since the amount of information is rapidly growing, there is an overwhelming interest in efficient network computing systems including Grids, public-resource computing systems, P2P systems and Cloud computing. In this...
详细信息
ISBN:
(纸本)9783642043932
Since the amount of information is rapidly growing, there is an overwhelming interest in efficient network computing systems including Grids, public-resource computing systems, P2P systems and Cloud computing. In this paper we take a detailed look at the problem of modeling and optimization of network computing systems for parallel decision tree induction methods. Firstly, we present a comprehensive discussion Oil mentioned induction methods with a special focus on their parallel versions. Next, we propose a generic optimization model of a network computing system that can be used for distributed implementation of parallel decision trees. To illustrate our work we provide results of numerical experiments showing that the distributed approach enables significant improvement of the system throughput.
this paper presents a multidisciplinary study on the application of statistical and neural models for analysing data oil immissions of atmospheric Pollution ill urban areas. data was collected from the network of poll...
详细信息
ISBN:
(纸本)9783642043932
this paper presents a multidisciplinary study on the application of statistical and neural models for analysing data oil immissions of atmospheric Pollution ill urban areas. data was collected from the network of pollution measurement stations in the Spanish Autonomous Region of Castile-Leon. Four Pollution parameters and a Pollution measurement station in the city of Burgos were used to carry Out the Study in 2007, during a period Of just over six months. Pollution data are compared, their values are interrelated and relationships are established not only withthe Pollution variables, but also with different weeks of the year. the aim of this Study is to classify the levels of atmospheric Pollution in relation to the days of the week, trying to differentiate between working days and non-working days.
the goal of this paper is to investigate which requirements engineering techniques have been applied in the development of Multi-Agent Systems (MAS) and how they were applied. We performed a systematic review of 58 of...
详细信息
ISBN:
(纸本)9783642043932
the goal of this paper is to investigate which requirements engineering techniques have been applied in the development of Multi-Agent Systems (MAS) and how they were applied. We performed a systematic review of 58 of a total of 835 papers found in scientific digital libraries. the results show that most of the proposals for dealing with requirements (79%) use already defined methods or techniques from other software development paradigms and that 69% of these techniques are based on the goal-oriented paradigm. A total of 95% of the reviewed papers focus on techniques for analyzing requirements, and only 45% of them explicitly consider some type of elicitation technique. Finally, only 5% of the papers give some empirical evidence about the effectiveness of their approaches by conducting empirical studies. the results of our study are particularly important in the determination of current research activities in Requirements engineering for MAS and in the identification of research gaps for further investigation.
In large and complex aerodynamic systems the overall performance of a design is mainly defined by interactions between design areas rather than by single design regions. therefore it is necessary to identify these int...
详细信息
ISBN:
(纸本)9783642043932
In large and complex aerodynamic systems the overall performance of a design is mainly defined by interactions between design areas rather than by single design regions. therefore it is necessary to identify these interactions in order to be able to understand and imp rove the designs. However. detecting and modeling those interactive effects between distant design areas is a very challenging task which usually requires a detailed understanding of the flow patterns and dedicated expert knowledge. In this paper we apply the information theoretic concept of interaction information to aerodynamic design data in order to detect and quantify interaction effects between distant design regions. Information graphs are suggested in order to provide the results to the aerodynamic engineer in a graphical form. In order to show the feasibility of this approach, the information theoretic quantities are applied to the data of a 2D wing assembly as well as to the 3D turbine blade design data.
暂无评论