Maximal Clique Enumeration (MCE) is a fundamental graph mining problem and is useful as a primitive in identifying dense structures in a graph. Due to the high computational cost of MCE, parallel methods are imperativ...
详细信息
Maximal Clique Enumeration (MCE) is a fundamental graph mining problem and is useful as a primitive in identifying dense structures in a graph. Due to the high computational cost of MCE, parallel methods are imperative for dealing with large graphs. We present shared-memory parallel algorithms for MCE, with the following properties: (1) the parallel algorithms are provably work-efficient relative to a state-of-the-art sequential algorithm, (2) the algorithms have a provably small parallel depth, showing they can scale to a large number of processors, and (3) our implementations on a multicore machine show good speedup and scaling behavior with increasing number of cores and are substantially faster than prior shared-memory parallel algorithms for MCE;for instance, on certain input graphs, while prior works either ran out of memory or did not complete in five hours, our implementation finished within a minute using 32 cores. We also present work-efficient parallel algorithms for maintaining the set of all maximal cliques in a dynamic graph that is changing through the addition of edges.
This work suggests parallel algorithms for solving a sparse system of N - linear equations in N - unknowns by Jacobi method on Extended Fibonacci Cube EFC1(n) [3]. Where n is the degree of EFC1(n) and N is the number ...
详细信息
ISBN:
(纸本)9781467345286
This work suggests parallel algorithms for solving a sparse system of N - linear equations in N - unknowns by Jacobi method on Extended Fibonacci Cube EFC1(n) [3]. Where n is the degree of EFC1(n) and N is the number of processors of EFC1(n). Two parallel versions of the algorithm are discussed. The single pass of the first algorithm involves 2 (N - 1) data communications in N steps. Whereas the second algorithm achieves the same number of data communications in N/2 + logN steps. Further each pass of both algorithms have 3N/2 + 1 additions, N/2 - 1 subtractions, N - 1 multiplications and N divisions.
We consider a system of pseudomonotone equilibrium problem and split common fixed point problem in the framework of real Hilbert spaces. We propose a modified extragradient method with line searching technique for app...
详细信息
We consider a system of pseudomonotone equilibrium problem and split common fixed point problem in the framework of real Hilbert spaces. We propose a modified extragradient method with line searching technique for approximating a common element in the sets of solutions of the two nonlinear problems. The convergence result is proved without prior knowledge of the Lipschitz-like constants of the equilibrium bifunctions and the norm of the bounded linear operator of the split common fixed point problem. We further provide some application and numerical example to show the importance of the obtained results in the paper.
Term pattern matching is the problem of finding all pattern matches in a subject term, given a set of patterns. Finding efficient algorithms for this problem is an important direction for research [21]. We present a n...
详细信息
ISBN:
(数字)9783030853150
ISBN:
(纸本)9783030853150;9783030853143
Term pattern matching is the problem of finding all pattern matches in a subject term, given a set of patterns. Finding efficient algorithms for this problem is an important direction for research [21]. We present a new set automaton solution for the term pattern matching problem that is based on match set derivatives where each function symbol in the subject pattern is visited exactly once. The algorithm allows for various traversal patterns over the subject term and is particularly suited to search the subject term in parallel.
Rotor/stator interaction plays a crucial role in predicting the aerodynamic performance of high-speed multi-row axial compressor. Although mixing plane method has been widely used in steady simulations of multi-row si...
详细信息
Rotor/stator interaction plays a crucial role in predicting the aerodynamic performance of high-speed multi-row axial compressor. Although mixing plane method has been widely used in steady simulations of multi-row single passage flows, the exchange of spanwise non-matched profiles of flow variable between adjacent blade rows is still unknown. Besides, quite little work was devoted to sliding mesh method for unsteady simulations of multi-row full-annulus compressor flows. In this work, mixing plane method and sliding mesh method were developed and presented, respectively, in the environment of parallel computer platforms. Particular attentions were paid to the accurate exchange of spanwise non-matched profiles of fluxes in mixing plane method, and the fast search algorithms of spanwise and circumferential non-matched meshes in sliding mesh method. The developed parallel rotor/stator interaction methods were validated with two multi-row high-speed axial compressors, i.e. NASA Stage 35 and first three stages of NASA 74A, both of which were calculated with steady flow simulation of one blade passage per row and unsteady flow simulation of full-annulus passages per row. Results highlighted that both the developed mixing plane method and sliding mesh method were able to predict the aerodynamic performance curve accurately and satisfy the interface flux conservation tightly. The flow wake and shock wave were observed to transfer continuously across rotor/stator interface with sliding mesh method in the unsteady full-annulus simulation. This work is valuable for the further development of advanced aerodynamic design and accurate performance prediction of multi-row axial compressors. (c) 2021 Elsevier Masson SAS. All rights reserved.
This paper research is focused on the use of a matching algorithm, from graph theory, over an emergency medical system. The application architecture is divided across multiple modules, communicating between them in re...
详细信息
ISBN:
(纸本)9781665440004
This paper research is focused on the use of a matching algorithm, from graph theory, over an emergency medical system. The application architecture is divided across multiple modules, communicating between them in real time: ambulance level module, hospital level modules, central module (medical center, in charge of deciding the hospital/s compatible to the accident), traffic center module (in charge of deciding the route towards the chosen hospital). The experimental results, for both implementation approaches, sequential and parallel, have proven that for Blossom algorithm, a more scalable solution regarding the number of simultaneous accidents, is represented by the parallel implementation.
The finite-time dynamical method is an effective tool studying phase transition in many-body systems. We design a parallel version of this method based on replica exchange whose probability is deduced from Langevin eq...
详细信息
The finite-time dynamical method is an effective tool studying phase transition in many-body systems. We design a parallel version of this method based on replica exchange whose probability is deduced from Langevin equation. The configuration swaps between different replicas enhance the importance sampling in parallel and control the fluctuation, leading to the conspicuous improvement for the simulation efficiency. The computational performance with configuration exchange is above 3 times as high as the calculation effect without the exchange on five current CPU cores. In addition, this parallel algorithm is used to simulate the two- and three-dimensional Ising model and obtain compatible exponents with that from theoretical solutions and numerical simulations from references, verifying the reliability of the scheme. (C) 2021 Elsevier B.V. All rights reserved.
In a minimum partial set cover problem (MinPSC), given a ground set E with n elements, a collection S of subsets of E with vertical bar S vertical bar = m, a cost function c : S -> R+, and an integer k = 1 is the a...
详细信息
In a minimum partial set cover problem (MinPSC), given a ground set E with n elements, a collection S of subsets of E with vertical bar S vertical bar = m, a cost function c : S -> R+, and an integer k <= n, the goal of MinPSC is to find a minimum cost sub-collection of S that covers at least k elements of E. In this paper, we design a parallel algorithm for MinPSC which yields a solution with approximation ratio at most f/1-2 epsilon in O(1/epsilon log mn/epsilon) rounds, where f is the maximum number of sets containing a common element, and 0 < epsilon < 1/2 is a constant. We also design a parallel algorithm for a special MinPSC problem, the minimum power partial cover problem (MinPPC), which achieves approximation ratio at most (3+2 epsilon)(alpha)/1-2 epsilon in O(1/epsilon log mn/epsilon log(2) m) rounds, where alpha >= 1 is the attenuation factor of power. (C) 2021 Elsevier Inc. All rights reserved.
The approximate single-source shortest-path problem is as follows: given a graph with nonnegative edge weights and a designated source vertex s, return estimates of the distances from s to each other vertex such that ...
详细信息
ISBN:
(纸本)9781450369794
The approximate single-source shortest-path problem is as follows: given a graph with nonnegative edge weights and a designated source vertex s, return estimates of the distances from s to each other vertex such that the estimate falls between the true distance and (1 + epsilon) times the distance. This paper provides the first nearly work-efficient parallel algorithm with sublinear span (also called depth) for the approximate shortest-path problem on directed graphs. Specifically, for constant epsilon and polynomially-bounded edge weights, our algorithm has work (O) over tilde (m) and span n(1/2+o(1)). Several algorithms were previously known for the case of undirected graphs, but none of the techniques seem to translate to the directed setting. The main technical contribution is the first nearly linear-work algorithm for constructing hopsets on directed graphs. A (beta, epsilon)-hopset is a set of weighted edges (sometimes called shortcuts) which, when added to the graph, admit beta-hop paths with weight no more than (1+epsilon) times the true shortest-path distances. There is a simple sequential algorithm that takes as input a directed graph and produces a linear-cardinality hopset with beta = (O) over tilde(root n), but its running time is quite highspecifically (O) over tilde (m root n). Our algorithm is the first more efficient algorithm that produces a directed hopset with similar characteristics. Specifically, our sequential algorithm runs in (O) over tilde (m) time and constructs a hopset with (O) over tilde (n) edges and beta = n(1/2+o(1)). A parallel version of the algorithm has work (O) over tilde (m) and span n(1/2+o(1)).
The use of drones can be a valuable solution for the problem of delivering goods for many reasons. In fact, they can be efficiently employed in time-critical situations when there is a traffic jam on the roads, to ser...
详细信息
ISBN:
(纸本)9781665449359
The use of drones can be a valuable solution for the problem of delivering goods for many reasons. In fact, they can be efficiently employed in time-critical situations when there is a traffic jam on the roads, to serve customers in hard-toreach places, or simply to expand the business. However, due to limited battery capacities and the fact that drones can serve a single customer at a time, a drone-based delivery system (DBDS) aims to minimize the drones' energy usage for completing a route from the depot to the customer and go back to the depot for new deliveries. In general, the shortest delivery route could not be the optimal choice since external factors like the wind (which varies with time) can affect energy consumption. Previous work has mainly considered simplified DBDSs assuming architectures with a single drone and with static costs on paths. Moreover, in these non-centralized architectures, the drones themselves compute the routes on the fly employing their onboard processing resources, making this choice costly. In this paper we develop a centralized system for computing energy-efficient time-varying routes for drones in a multi-depot multi-drone delivery system. Specifically, we propose a novel centralized parallel algorithm called parallel Shortest Route Update (PSRU) that, over time, updates the drones' delivery routes avoiding the whole recomputation from scratch. A comprehensive evaluation proves that PSRU is up to 4.5x faster than the state-of-the-art algorithms.
暂无评论