Active learning is a hot topic in machinelearning field. The main task of active learning is to automatically select the representative instances for efficiently reducing the sample complexity. This paper presents a ...
详细信息
Email is a kind of semi-structured document, some important attributes are contained in its structure, and especially using spam-specific features could improve the email classification results. In this paper, we appl...
详细信息
The support vectors play an important role in the training to find the optimal hyper-plane. For the problem of many non-support vectors and a few support vectors in the classification of SVM, a method to reduce the sa...
详细信息
Data extraction in Web is to obtain the desired information to users in Web pages. For a more accurately valuable data extraction, this paper proposes a new method called data extraction based on index path in Web (DE...
详细信息
This paper presents a PSO-based method for learning similarity measure of nominal features for case based reasoning classifiers (i.e. CBR classifiers). The symbolic features considered here takes completely unordered ...
详细信息
By incorporating domination principle in inconsistent decision systems based on dominance relations, we define the concept of distribution function for a decision system to directly reflect the inconsistent degree of ...
详细信息
Markov chains, with Markov property as its essence, are widely used in the fields such as information theory, automatic control, communication techniques, genetics, computer sciences, economic administration, educatio...
详细信息
Text Categorization (TC) is an important component in many information organization and information management tasks. In many TC applications, the case-base grows at a fast rate and this causes inefficiency in the cas...
详细信息
Fuzzy Integral is widely accepted and applied in multi-classifier fusion to express the importance of individual classifiers and the interaction among classifiers. In this fusion model, there are two keys to determine...
详细信息
Decision tree is one of the most popular and widely used classification models in machinelearning. The discretization of continuous-valued attributes plays an important role in decision tree generation. In this paper...
详细信息
暂无评论