Classification based on association rules is a common and easily understand algorithm for text classification. To improve its classification accuracy, the key is to generate more effective rules. Sometimes, it will ov...
详细信息
Support Vector machine (SVM) is sensitive to noises and outliers. For reducing the effect of noises and outliers, we propose a novel SVM for suppressing error function. The error function is limited to the interval of...
详细信息
ISBN:
(纸本)9780769538877
Support Vector machine (SVM) is sensitive to noises and outliers. For reducing the effect of noises and outliers, we propose a novel SVM for suppressing error function. The error function is limited to the interval of [0, 1]. The separation hypersurface is simplified and the margin of hypersurface is widened. Experimental results show that our proposed method is able to simultaneously increase the classification efficiency and the generalization ability of the SVM.
Markov chains, with Markov property as its essence, are widely used in the fields such as information theory, automatic control, communication techniques, genetics, computer sciences, economic administration, educatio...
详细信息
Markov chains, with Markov property as its essence, are widely used in the fields such as information theory, automatic control, communication techniques, genetics, computer sciences, economic administration, education administration, and market forecasts. While using Markov chains to predict the future events, we must test the Markov property of random variable sequences of the past statistic data. Only when the random variable sequences satisfy the Markov property, can the prediction could be precise. This paper discusses the concept of Markov property and its features, studies its test method, and by example demonstrates the effectiveness of this prediction method.
Support Vector machine (SVM) is a classification technique of machinelearning based on statistical learning theory. A quadratic optimization problem needs to be solved in the algorithm, and with the increase of the s...
详细信息
It has been shown that the fuzzy integral is an effective tool for the fusion of multiple classifiers. Of primary importance in the development of the system is the choice of the measure which embodies the importance ...
详细信息
It has been shown that the fuzzy integral is an effective tool for the fusion of multiple classifiers. Of primary importance in the development of the system is the choice of the measure which embodies the importance of subsets of classifiers. In this paper we propose a method for a dynamic fuzzy measure which will change following the pattern to be classified (data dependent). This method uses the neural network which has good study ability. Our experiment results show that this method make the classification accurate improve.
This paper presents an approach to instance selection for the nearest neighbor rule which aims to obtain a condensed set with high condensing rate and prediction accuracy. By making an improvement on MCS algorithm and...
详细信息
This paper presents an approach to instance selection for the nearest neighbor rule which aims to obtain a condensed set with high condensing rate and prediction accuracy. By making an improvement on MCS algorithm and allowing certain error rate on the training set, a condensed set with high condensing rate and satisfying prediction accuracy is obtained. The condensed set is order-independent of the training instances and insensitive to noise. Comparative experiments have been conducted on real data sets, and the results show its superiority to MCS and FCNN in terms of condensing rate and prediction accuracy.
Coherent point drift(CPD), a sophistic non-rigid point sets registration method, is successfully applied in computer vision, medical image analysis, to name a few. Its registration error, however, is affected greatly ...
详细信息
Coherent point drift(CPD), a sophistic non-rigid point sets registration method, is successfully applied in computer vision, medical image analysis, to name a few. Its registration error, however, is affected greatly by three free parameters. One of them, width parameter of Gaussian kernel function, is studied in this paper to tune registration error of the CPD method. Before computing width parameter by a heuristic algorithm, the given data is regulated using minmax and standard normalization in effort to remove heterogeneity among the features of the data. Several experiments are designed on the available six datasets to examine the effectiveness of CPD based on the refined width parameter. Experimental comparison indicated that the refined width parameter from the normalized data can reduce registration error of the CPD method, e.g., by 13.76% on bat dataset.
Deep Web can provide us a great amount of high quality information. In order to make full use of the information, it is becoming urgent to establish Deep Web data integration system, in which Deep Web interface integr...
详细信息
Fuzzy measure and integral are widely used in Multiple Classifier System (MCS). But the number of coefficients involved in the fuzzy integral model grows exponentially with the number of classifiers to be aggregated. ...
详细信息
Fuzzy measure and integral are widely used in Multiple Classifier System (MCS). But the number of coefficients involved in the fuzzy integral model grows exponentially with the number of classifiers to be aggregated. The main difficulty is to identify all these coefficients. This paper does an attempt Using 2-additrve fuzzy measure in Multiple Classifier System. Our conclusion is that when different interactions exist in different classifiers the complexity of the computation can be significantly reduced by 2-order additive measure.A simple example is included to illustrate the 2-order additive measure.
This paper is to discuss the reduction of computation complexity in decision tree generation for the numerical-valued attributes. The proposed method is based on the partition impurity. The partition impurity minimiza...
详细信息
This paper is to discuss the reduction of computation complexity in decision tree generation for the numerical-valued attributes. The proposed method is based on the partition impurity. The partition impurity minimization is used to select the expanded attribute for generation the sub-node during the tree growth. After inducing the unstable cut-points of numerical-attributes, it is analytically proved that the partition impurity minimization can always be obtained at the unstable cut-points. It implies that the computation on stable cut-points may not be considered during the tree growth. Since the stable cut-points are far more than unstable cut-points, the experimental results show that the proposed method can reduce the computational complexity greatly.
暂无评论