Based on FP-tree algorithm, this article introduced the method of multi-thread processing and a Multi-Threaded Paralleled frequent item-set mining Algorithm - MTPA was proposed .It has been applied to an enterprise hu...
详细信息
Through analyzing the security problems which exist in general on-line payment systems, this paper proposes a scheme based on WPKI (Wireless Public key Infrastructure) and further designs and implements a gas station ...
详细信息
Through analyzing the security problems which exist in general on-line payment systems, this paper proposes a scheme based on WPKI (Wireless Public key Infrastructure) and further designs and implements a gas station on-line payment system. By using WPKI technology which includes SSL (Secure Sockets Layer) protocol, digital signature and digital certificate, the designed on-line payment system can achieve data privacy, authenticity, integrity and non-repudiation during on-line payment process.
In this paper, we present a novel deterministic heuristic and a new genetic algorithm to solve the problem of optimal triangulation of Bayesian networks. The heuristic, named MinFillWeight, aims to select variables mi...
详细信息
ISBN:
(纸本)9781424453979
In this paper, we present a novel deterministic heuristic and a new genetic algorithm to solve the problem of optimal triangulation of Bayesian networks. The heuristic, named MinFillWeight, aims to select variables minimizing the multiplication of the weights on nodes of fill-in edges. The genetic algorithm, named GA-MFW, uses a new rank-reserving crossover operator and a 2-fold mutation mechanism utilizing the MinFillWeight heuristic. Experiments on representative benchmark show that the deterministic heuristic and the stochastic algorithm have good performance and stability to various problems.
To solve the problem of searching for an optimal elimination ordering of Bayesian networks, a novel effective heuristic, MinSum Weight, and an ACS approach incorporated with multi-heuristic mechanism are proposed. The...
To solve the problem of searching for an optimal elimination ordering of Bayesian networks, a novel effective heuristic, MinSum Weight, and an ACS approach incorporated with multi-heuristic mechanism are proposed. The ACS approach named MHC-ACS utilizes a set of heuristics to direct the ants moving in the search space. The cooperation of multiple heuristics helps ants explore more regions. Moreover, the most appropriate heuristic will be identified and be reinforced with the evolution of the whole system. Experiments demonstrate that MHC-ACS has a better performance than other swarm intelligence methods.
Given a set of lists, where items of each list are sorted by the ascending order of their values, the objective of this paper is to figure out the common items that appear in all of the lists efficiently. This problem...
详细信息
Given a set of lists, where items of each list are sorted by the ascending order of their values, the objective of this paper is to figure out the common items that appear in all of the lists efficiently. This problem is sometimes known as common items extraction from sorted lists. To solve this problem, one common approach is to scan all items of all lists sequentially in parallel until one of the lists is exhausted. However, we observe that if the overlap of items across all lists is not high, such sequential access approach can be significantly improved. In this paper, we propose two algorithms, MergeSkip and MergeESkip, to solve this problem by taking the idea of skipping as many items of lists as possible. As a result, a large number of comparisons among items can be saved, and hence the efficiency can be improved. We conduct extensive analysis of our proposed algorithms on one real dataset and two synthetic datasets with different data distributions. We report all our findings in this paper.
Schema matching is important for schema integration and thus has got great attention. In this paper, we present an ontology-based algorithm to match a larger number of interface schemas, which can hand both simple and...
详细信息
Currently, the research for the extraction of information in deep web is pretty active. Although many researchers already adopted ontology in the data extraction, many problems still exist. This paper proposed an onto...
详细信息
Currently, the research for the extraction of information in deep web is pretty active. Although many researchers already adopted ontology in the data extraction, many problems still exist. This paper proposed an ontology evolution based method for mining in the data area. Not only will this method solve the problem when the website only consists of one record, but it also can identify he meaning of data that has no labels. With the evolution of ontology, the extraction of data records is being more accurate. Experiments indicate that this method could improve the accuracy and efficiency of data extraction.
This paper proposed a complex ontology evolution based method of extracting data, and also completely designed an extraction system, which consists of four important components: Resolver, Extractor, Consolidator and t...
详细信息
This paper proposed a complex ontology evolution based method of extracting data, and also completely designed an extraction system, which consists of four important components: Resolver, Extractor, Consolidator and the ontology construction components. The system gives priority to the construction of mini-ontology. When the user submits query keywords to the deep web query interface, the returned result will pass through the prior three components;after that, the final execution result will be returned to user in a unified form. This paper adopted an extraction method that is different from the general ontology extraction. More specifically, the ontology used in extraction here is dynamic evolution, which can adapt various data source better. Experimental results proved that this method could effectively extract the data in the query result pages.
Image retrieval based on region is one of the most promising and active research directions in recent year's CBIR, while region segmentation, feature selection and feature extraction of region are key issues. Howe...
详细信息
SimRank is a well known algorithm which conducts link analysis to measure similarity between each pair of nodes (nodepair). But it suffers from high computational cost, limiting its usage in large-scale datasets. More...
详细信息
SimRank is a well known algorithm which conducts link analysis to measure similarity between each pair of nodes (nodepair). But it suffers from high computational cost, limiting its usage in large-scale datasets. Moreover, Links between nodes are changing over time. It may be desirable to quickly approximate the similarity score between certain nodepair without performing a large-scale computation on the entire graph. In our approach we propose a method to efficiently estimate the similarity score using only a small subgraph of the entire graph. We call this novel algorithm “Local-SimRank”. The experimental results conducted on real datasets and synthetic dataset show that our algorithm efficiently produces good approximations to the global SimRank scores. Meanwhile, we prove that the Local-SimRank score LS(a, b) is always less than original SimRank score S(a, b) mathematically.
暂无评论