Support Vector machine (SVM) is a classification technique of machinelearning based on statistical learning theory. A quadratic optimization problem needs to be solved in the algorithm, and with the increase of the s...
详细信息
The prosperity of electronic commerce has changed the traditional trading behaviors. More and more people are willing to perform Internet shopping. At the same time, consumers experience information overload and look ...
详细信息
ISBN:
(纸本)0780384032
The prosperity of electronic commerce has changed the traditional trading behaviors. More and more people are willing to perform Internet shopping. At the same time, consumers experience information overload and look for help in selecting from an overwhelming array of products. In order to overcome such a problem one option is to develop a personalized online assistance to retrieve product information that really matters for the customers. In this paper, we present a method that combines the genetic algorithm and k nearest neighbor technology to reason about the customer's personal preferences from his/her profile and then provide the most appropriate products to meet his/her needs. Our experimental results are showing that our systems have a bright future.
Deep Web can provide us a great amount of high quality information. In order to make full use of the information, it is becoming urgent to establish Deep Web data integration system, in which Deep Web interface integr...
详细信息
Dominance-based rough set approach(DRSA) can handle the attributes with preference orders, and therefore it has been widely applied in multi-criteria decision making problems. In real applications, the collected infor...
详细信息
Dominance-based rough set approach(DRSA) can handle the attributes with preference orders, and therefore it has been widely applied in multi-criteria decision making problems. In real applications, the collected information is updated from time to time which results in dynamic information systems, especially when the attributes or objects are inserted or deleted. The traditional DRSA needs to update the set approximations whenever the information systems change, which decreases the method efficiency greatly. For classification problems with multiple criteria, this paper presents incremental algorithms to update set approximations when an object is inserted or deleted, which is expected to be more efficient than computing the approximations from the scratch. The related theoretical results are presented with proofs, and illustrative examples are also given to support the effectiveness of the proposed incremental method.
In real world problems, the collected data vary from time to time, and therefore, the approximations of a concept by a variable precision rough set model(VPRS) should be correspondingly updated. This paper focuses on ...
详细信息
In real world problems, the collected data vary from time to time, and therefore, the approximations of a concept by a variable precision rough set model(VPRS) should be correspondingly updated. This paper focuses on developing incremental method to update set approximations of VPRS based on dominance relations. Under dynamic environments where an object is inserted or deleted, we present the updating principles and then develop the incremental method for updating approximation sets. The related theoretical results are presented with proofs, and illustrative examples are also given to support the effectiveness of the proposed method.
Classification for large datasets is a classical problem in machinelearning. In this paper, we focus on effevtive classification algorithm for large datasets and imbalanced datasets. First, to deal with imbalanced da...
详细信息
Classification for large datasets is a classical problem in machinelearning. In this paper, we focus on effevtive classification algorithm for large datasets and imbalanced datasets. First, to deal with imbalanced dataset, we define the weight according to the size of positive and negative dataset. Then, a fast learning algorithm on large datasets called a core set weighted support vector machines(CSWSVM) is proposed. In the proposed approach, the corresponding core set(CS) can be solved by employing the core vector machine(CVM) or generalized CVM(GCVM), and then the weighted support vector machines(WSVM) can be used to implement classification for imbalanced datasets. Experimental results on UCI and USPS datasets demonstrate that the proposed method is effective.
According to the definition of dominance relation, an object x is said to dominate another object y only when x dominates y on all attributes, which is too strict especially when the number of attributes is large. To ...
详细信息
According to the definition of dominance relation, an object x is said to dominate another object y only when x dominates y on all attributes, which is too strict especially when the number of attributes is large. To cope with this problem, the extended dominance-based rough set model has been developed by introducing a parameter to the concept of traditional dominance relationship in the reported literature. However, in this extended model, the definitions of lower and upper approximations are the same to the traditional model, which may affect the decision making process. In this paper, we introduce the idea of variable precision to the extended dominance rough set model for better fault tolerance ability. The impact of the parameter on decision results with respect to testing accuracy is studied. Finally, an example is given and the experimental results on UCI data are also shown to support the effectiveness of the proposed method.
Traditional rough set theory(TRS) is based on the concept of equivalence relation to define upper and lower approximation sets of a given target concept, and therefore uncertainties in information systems can be repre...
详细信息
Traditional rough set theory(TRS) is based on the concept of equivalence relation to define upper and lower approximation sets of a given target concept, and therefore uncertainties in information systems can be represented. By using equivalence relations, TRS only considers whether attribute values are distinguished or not, regardless of the preference information contained in attribute values. Rough sets based on dominance relations effectively solve this problem and can deal with preference-ordered data. In these dominance-based approaches, the computational cost of the dominance classes greatly affects the efficiency of attribute reduction and rule extraction. This paper presents an efficient method of computing dominance classes in an ordered information system by rapidly reducing the search space. Based on the definition of dominance class, the inferior class of an object is gradually removed from the universe with the increase of the attributes in the computation process. Experiments on ten UCI data sets show that the proposed algorithm obviously improves the efficiency of computing dominance classes, especially for large-scale data.
Dominance relation rough set approach(DRSA) is a useful mathematical tool to deal with preference-ordered data. The main idea is using dominance relations to replace equivalent relations in classical rough set theory....
详细信息
Dominance relation rough set approach(DRSA) is a useful mathematical tool to deal with preference-ordered data. The main idea is using dominance relations to replace equivalent relations in classical rough set theory. However, the definition of conventional dominance relation is very strict which may limit its application to information systems with relative large number of attributes. In this paper, we relax the conditions in the definition of dominance relation and introduce the concept of extended dominance relation. The proprieties of this new concept are also discussed and it is found that all the properties of classical dominance relation are still satisfied.
Pathfinding is a typical task in many computer games, and its performance will affect the quality of game AI. In order to enhance the efficiency of multi-task pathfinding, case-based reasoning has been introduced in t...
详细信息
Pathfinding is a typical task in many computer games, and its performance will affect the quality of game AI. In order to enhance the efficiency of multi-task pathfinding, case-based reasoning has been introduced in traditional A* algorithm, called the CBMT method. The method needs to select representative paths which can cover the whole map to build a compact case base, which is difficult in large maps. Besides, repeatedly searching for similar cases for each pathfinding task would be a time consuming process. To address these problems, we provide a kd-tree case storage structure and case retrieval mechanical in the CBMT method. The pre-stored cases(previously found paths) are generated randomly and incrementally. The original flat storage structure of the cases is changed into the kd-tree structure. Since the searching space can be reduced by branch pruning in case retrieval, the pathfinding efficiency has been improved obviously, and the number of searched nodes is also reduced.
暂无评论