Active learning is a hot topic in machinelearning field. The main task of active learning is to automatically select the representative instances for efficiently reducing the sample complexity. This paper presents a ...
详细信息
Email is a kind of semi-structured document, some important attributes are contained in its structure, and especially using spam-specific features could improve the email classification results. In this paper, we appl...
详细信息
When the training dataset is very large, the learning process of potential support vector machine takes up so large memory that the training speed is very slow. To accelerate the training speed of the potential suppor...
详细信息
The support vectors play an important role in the training to find the optimal hyper-plane. For the problem of many non-support vectors and a few support vectors in the classification of SVM, a method to reduce the sa...
详细信息
Data extraction in Web is to obtain the desired information to users in Web pages. For a more accurately valuable data extraction, this paper proposes a new method called data extraction based on index path in Web (DE...
详细信息
Text Categorization (TC) is an important component in many information organization and information management tasks. In many TC applications, the case-base grows at a fast rate and this causes inefficiency in the cas...
详细信息
By incorporating domination principle in inconsistent decision systems based on dominance relations, we define the concept of distribution function for a decision system to directly reflect the inconsistent degree of ...
详细信息
This paper presents a PSO-based method for learning similarity measure of nominal features for case based reasoning classifiers (i.e. CBR classifiers). The symbolic features considered here takes completely unordered ...
详细信息
Recommender systems are an important component of many websites. Two of the most popular approaches are based on matrix factorization (MF) and Markov chains (MC). MF methods learn the general taste of a user by factor...
详细信息
Many current and future NASA missions are capable of collecting enormous amounts of data, of which only a small portion can be transmitted to Earth. Communications are limited due to distance, visibility constraints, ...
详细信息
暂无评论