Choosing appropriate classificationalgorithms for a given data set is very important and useful in practice but also is full of challenges. In this paper, a method of recommending classificationalgorithms is propose...
详细信息
Choosing appropriate classificationalgorithms for a given data set is very important and useful in practice but also is full of challenges. In this paper, a method of recommending classificationalgorithms is proposed. Firstly the feature vectors of data sets are extracted using a novel method and the performance of classificationalgorithms on the data sets is evaluated. Then the feature vector of a new data set is extracted, and its k nearest data sets are identified. Afterwards, the classificationalgorithms of the nearest data sets are recommended to the new data set. The proposed data set feature extraction method uses structural and statistical information to characterize data sets, which is quite different from the existing methods. To evaluate the performance of the proposed classificationalgorithmrecommendation method and the data set feature extraction method, extensive experiments with the 17 different types of classificationalgorithms, the three different types of data set characterization methods and all possible numbers of the nearest data sets are conducted upon the 84 publicly available UCI data sets. The results indicate that the proposed method is effective and can be used in practice. (C) 2012 Elsevier Ltd. All rights reserved.
暂无评论