ISBN: 9780769533520 (print)
Mining large, dense-packed, high-dimensional data remains a challenge for frequent closed itemset mining in association analysis, even though frequent closed itemset mining is an efficient way to reduce the complexity of mining frequent itemsets. This paper proposes a distributed algorithm to address the challenge of discovering frequent closed itemsets in such data. The algorithm partitions the search space of frequent closed itemsets into independent, non-overlapping subspaces that can be mined independently to generate frequent closed itemsets. It generates frequent closed itemsets in dense-priority order: denser or more frequent closed itemsets are generated first. The experimental results show that the algorithm extracts frequent closed itemsets efficiently from large data.
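The abstract does not say how the search space is split, so the following is only a minimal sketch of one common way to obtain independent, non-overlapping subspaces: group itemsets by their smallest item. The function names (`closure`, `mine_subspace`, `mine_closed`) are hypothetical, the enumeration is brute force, and `min_support` is assumed to be at least 1.

```python
from itertools import combinations


def closure(itemset, transactions):
    """Return the intersection of all transactions that contain itemset."""
    covering = [t for t in transactions if itemset <= t]
    return frozenset.intersection(*covering)


def mine_subspace(first_item, items, transactions, min_support):
    """Find frequent closed itemsets whose smallest item is first_item."""
    larger = [i for i in items if i > first_item]
    found = {}
    for k in range(len(larger) + 1):
        for rest in combinations(larger, k):
            candidate = frozenset((first_item,) + rest)
            support = sum(1 for t in transactions if candidate <= t)
            if support < min_support:
                continue
            closed = closure(candidate, transactions)
            # A closure that gains a smaller item belongs to another subspace.
            if min(closed) == first_item:
                found[closed] = support
    return found


def mine_closed(transactions, min_support):
    """Mine every subspace separately and merge the disjoint results."""
    transactions = [frozenset(t) for t in transactions]
    items = sorted(set().union(*transactions))
    result = {}
    for item in items:  # each subspace could be handled by a separate worker
        result.update(mine_subspace(item, items, transactions, min_support))
    return result


def dense_priority(result):
    """Order closed itemsets so that the most frequent come first."""
    return sorted(result.items(), key=lambda kv: (-kv[1], sorted(kv[0])))
```

Because every closed itemset is reported only by the subspace of its smallest item, the partial results never overlap and can be merged without deduplication.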
ISBN: 9783031402913; 9783031402920 (print)
Skyline computation finds the best data points under multiple criteria and supports location-based decision-making. As the input data grows, however, these computations become slower and more challenging. To address this issue, we propose an efficient algorithm that uses Apache Spark, a platform for distributed processing, to make area skyline computation faster and more scalable. Our algorithm consists of three main phases: calculating distances between data points, generating distance tuples, and computing the skyline. In the second phase, we apply a technique called local partial skyline extraction, which reduces the amount of data that must be sent from each executor (a parallel processing unit) to the driver (the central coordinating process). The driver then computes the final skyline from the received data and creates filters to eliminate irrelevant points. Our experiments show that the algorithm significantly reduces both the data size and the computation time of the area skyline.
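The idea of extracting a local skyline on each executor before sending data to the driver can be sketched in plain Python. This is an illustration under stated assumptions, not the paper's area-skyline method: the partitions are ordinary lists standing in for Spark partitions, points are plain tuples rather than distance tuples, smaller values are assumed to be better in every dimension, and the function names are hypothetical.

```python
def dominates(p, q):
    """True if p is no worse than q everywhere and strictly better somewhere."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def skyline(points):
    """Keep the points that no other point dominates (quadratic scan)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def distributed_skyline(partitions):
    """Compute local skylines per partition, then merge them centrally."""
    # Executor side: each partition discards its locally dominated points.
    local_skylines = [skyline(part) for part in partitions]
    # Driver side: only the surviving candidates are gathered and compared.
    candidates = [p for part in local_skylines for p in part]
    return skyline(candidates)
```

A point dominated inside its own partition can never be in the global skyline, so dropping it early is safe and shrinks what the driver has to receive.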
This article studies the problem of painting an obstacle-free rectangular region with a swarm of mobile robots. Initially, the robots are deployed randomly within the target area, subject to the condition that the distribution is $d^*$-dense, where $d^* = \sqrt{3}\,d/2$ and a robot can see up to a distance $d$. Here $d^*$-dense means that if all the robots are projected onto a horizontal line, the distance between two consecutive robots is at most $d^*$. The popular CORDA computational model has not previously been considered in the field of area coverage by swarm robots; this work addresses that gap, and the proposed algorithm assumes the CORDA model. The robots follow a completely distributed algorithm to paint the region. They need not be synchronous, but they are assumed to have equal velocities. The algorithm can nevertheless support robots with different speeds: if $r$ is the given upper bound on the ratio of the speeds of any two robots, then the initial distribution has to be $D^*$-dense, where $D^* = \frac{d}{4}\sqrt{(3 - r)(r + 5)}$ for $1 \le r < 3$.
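The density condition can be checked numerically. The sketch below assumes the bounds in the abstract read as d* = √3·d/2 and D* = (d/4)·√((3 − r)(r + 5)); under that reading, D* reduces to d* when r = 1 (equal speeds). The function names are hypothetical, and the projection onto a horizontal line is taken to mean keeping only each robot's x-coordinate.

```python
import math


def d_star(d):
    """Density bound for equal speeds, with visibility range d."""
    return math.sqrt(3) * d / 2


def big_d_star(d, r):
    """Density bound when the speed ratio is at most r, for 1 <= r < 3."""
    if not 1 <= r < 3:
        raise ValueError("r must satisfy 1 <= r < 3")
    return d / 4 * math.sqrt((3 - r) * (r + 5))


def is_dense(x_coords, gap):
    """True if consecutive projected robots are at most gap apart."""
    xs = sorted(x_coords)
    return all(b - a <= gap for a, b in zip(xs, xs[1:]))
```

The bound shrinks as r grows, so a swarm with more uneven speeds has to start from a tighter deployment.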
ISBN: 9783319271613; 9783319271606 (print)
With the exponential growth of graph-structured data in recent years, parallel distributed techniques play an increasingly important role in processing large-scale graphs. Because vertices in graph data are strongly connected, a high communication cost for transferring boundary data is unavoidable in distributed techniques. How to divide a large graph into several partitions with low coupling and balanced scale therefore becomes a critical problem. Most research in the literature studies vertex partitioning methods, which leads us to consider edge partitioning as an alternative approach. In this paper, we propose a distributed graph partitioning algorithm based on edge partitioning, named VSEP. A novel vertex permutation method is used to partition large graphs iteratively. Experimental results indicate that VSEP reduces the number of times vertices are cut by about 10% to 20% compared with a state-of-the-art algorithm while retaining the scale balance.
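To make "the number of times vertices are cut" concrete, the sketch below computes that quantity and the per-partition edge load for a given edge assignment. It does not implement VSEP or its vertex permutation method; it only shows one usual way to score an edge partitioning, and the function name and input layout are assumptions made for illustration.

```python
from collections import defaultdict


def vertex_cut_stats(edge_to_part):
    """Return (total vertex cuts, edges per partition) for an edge assignment.

    edge_to_part maps each edge (u, v) to the partition id that stores it.
    A vertex that appears in k partitions is counted as cut k - 1 times.
    """
    parts_of = defaultdict(set)   # vertex -> partitions holding one of its edges
    load = defaultdict(int)       # partition -> number of edges stored
    for (u, v), part in edge_to_part.items():
        parts_of[u].add(part)
        parts_of[v].add(part)
        load[part] += 1
    cuts = sum(len(parts) - 1 for parts in parts_of.values())
    return cuts, dict(load)
```

For example, splitting the 4-cycle 1-2-3-4 into two partitions of two adjacent edges each cuts two vertices and leaves both partitions with the same load.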
ISBN: 9780769535432 (print)
Web mining applies data mining, artificial intelligence, chart technology, and related techniques to web data, traces users' visiting characteristics, and then extracts their usage patterns. This article studies a web mining algorithm based on usage mining and presents the design approach for an algorithm applied to e-commerce websites. Web usage mining applies data mining technology to the web server's log files. It can discover users' browsing patterns and correlations between web pages. Web usage mining supports website design, personalized services, and other business decision-making. The algorithm is simple, effective, and easy to implement, and it suits the web usage mining needs of building a low-cost B2C website.
ISBN: 9781424458950 (print)
Taking the forward reasoning algorithm as an example, this paper describes how to design a distributed algorithm with web service technology. Based on the message-passing programming model, the partitioning, communication, and task scheduling of the distributed algorithm are discussed. Finally, the distributed algorithm is implemented on the .NET platform and tested using parallel performance measures, and the results are analyzed.
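A rough sketch of how forward reasoning might be partitioned follows. It is written in Python rather than on the .NET platform the paper uses, and it only simulates message passing with ordinary function calls. The rule format (a tuple of premises plus one conclusion), the split of rules across workers, and the function names are all assumptions made for illustration.

```python
def worker_step(rules, known):
    """One worker's task: return conclusions its rules can newly derive."""
    return {
        conclusion
        for premises, conclusion in rules
        if conclusion not in known and set(premises) <= known
    }


def distributed_forward_chain(facts, rule_partitions):
    """Repeat rounds of worker calls until no worker derives a new fact."""
    known = set(facts)
    while True:
        new_facts = set()
        for rules in rule_partitions:
            # In a real deployment this would be a remote service call that
            # sends the current facts and receives the derived ones.
            new_facts |= worker_step(rules, known)
        if not new_facts:
            return known
        known |= new_facts
```

Each round corresponds to one exchange of messages: the coordinator sends out the known facts, the workers reply with what they derived, and the loop stops at a fixed point.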
ISBN: 9781510855076 (print)
The class of connected row-convex (CRC) constraints generalizes several tractable classes of constraints and is expressive enough to model problems in domains such as temporal reasoning, geometric reasoning, and scene labelling. This paper presents the first distributed deterministic algorithm for CRC constraints. Our distributed (partial) path consistency algorithm efficiently transforms a CRC constraint network into an equivalent constraint network in which all constraints are minimal (i.e., they are the tightest constraints) and all solutions can be generated in a backtrack-free manner. Compared with the state-of-the-art distributed algorithm for CRC constraints, which is randomized, our algorithm is guaranteed to generate a solution for satisfiable CRC constraint networks and is applicable to large networks in real distributed systems. The experimental evaluations show that our algorithm outperforms the state-of-the-art algorithm in both practice and theory.
ISBN: 9781479949342 (print)
Because power generation from renewable resources is unstable and customer demand varies over time, power supply and customer demand are rarely equal. To meet customer demand, power is transmitted from primary power generation to secondary power generation, which causes high power loss. To solve this problem, this paper proposes a distributed algorithm. With this algorithm, the micro-grids can exchange power with their neighbors so as to minimize the total power loss of the smart grid. Moreover, the communication overhead (bandwidth) is reduced compared with a centralized algorithm. Through computer simulations, we demonstrate that the proposed algorithm achieves near-optimal results in reducing the average power loss per micro-grid and significantly reduces the communication overhead compared with the centralized approach.
ISBN: 9781509014828 (print)
In this paper, we propose an $O(N)$-time distributed algorithm for computing the betweenness centralities of all nodes in a network, where $N$ is the number of nodes. Our distributed algorithm is designed under the CONGEST model, which is widely used in the distributed computing community and limits each message to $O(\log N)$ bits. To the best of our knowledge, this is the first linear-time deterministic distributed algorithm for computing betweenness centralities in the published literature. We also give a lower bound of $\Omega(D + N/\log N)$ for distributively computing betweenness centrality under the CONGEST model, where $D$ is the diameter of the network. This implies that our distributed algorithm is nearly optimal.
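For readers unfamiliar with the quantity being computed, here is a short centralized sketch of betweenness centrality using Brandes' algorithm on an unweighted, undirected graph. It is not the paper's distributed CONGEST algorithm; it only shows what each node's value means. The adjacency-dictionary input and the function name are assumptions made for illustration.

```python
from collections import deque


def betweenness(adj):
    """Betweenness centrality of every node; adj maps node -> neighbor list."""
    bc = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s, counting shortest paths (sigma).
        dist = {s: 0}
        sigma = {v: 0 for v in adj}
        sigma[s] = 1
        preds = {v: [] for v in adj}
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate dependencies from the farthest nodes back toward s.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # In an undirected graph each pair of endpoints is visited twice.
    return {v: value / 2 for v, value in bc.items()}
```

On the path a-b-c, only the middle node lies on a shortest path between two other nodes, so it is the only one with a non-zero score.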
ISBN: 9781424456383 (print)
Constructing sensor barriers to detect intruders crossing restricted regions, such as country borders, is one of the major application categories for wireless sensor networks. In this paper, we present a distributed algorithm to find the maximum number of disjoint sensor barriers in wireless sensor networks. Our solution works for any sensor deployment, for any size and shape of the covered region, and even for heterogeneous sensor nodes. In particular, our algorithm is distributed and works in an asynchronous communication environment. It exploits the properties of the wireless channel and has lower complexity than other algorithms. For a deployment of $n$ sensors, our algorithm uses $O(n^2)$ messages and $O(n^2)$ time.