This paper introduces the first gpu-based parallel algorithms to solve the fastest path duration (FPD) problem in temporal graphs. Fastest path duration in temporal graphs is a well studied problem that has multiple u...
详细信息
ISBN:
(纸本)9798400717932
This paper introduces the first gpu-based parallel algorithms to solve the fastest path duration (FPD) problem in temporal graphs. Fastest path duration in temporal graphs is a well studied problem that has multiple use cases in information diffusion, epidemic spreading, and route planning in public transportation. Given a temporal graph G in which each edge associates with a departure time and duration time, and a source vertex s, the Fastest Path Duration (FPD) problem is to compute the journey times from s to all the rest of the vertices in G. The existing multi-core algorithm for FPD by Delling et al. exhibits limited parallelism. In general, many parallel algorithms suffer from doing redundant work while pruning certain computations. Our research focuses on multiple algorithmic ways to avoid redundant work and perform pruning of computations effectively. We introduce three novel gpu-based parallel algorithms for FPD, namely Level Order (LO), Multiple Breadth First Search (MBFS), and Local Work-lists (LW) and implement them on a gpu architecture machine. Our algorithms demonstrate an average speedup of approximately 165 times and a maximum speedup of up to 1383 times over the current state-of-the-art algorithms. This paper provides a comprehensive explanation of various algorithm designs, their optimizations for gpuS, and an extensive evaluation of their performance across various temporal graph scenarios.
This paper presents an algorithm for fast sorting of large lists using modern gpus. The method achieves high speed by efficiently utilizing the parallelism of the CPU throughout the whole algorithm. Initially, gpu-bas...
详细信息
This paper presents an algorithm for fast sorting of large lists using modern gpus. The method achieves high speed by efficiently utilizing the parallelism of the CPU throughout the whole algorithm. Initially, gpu-based bucketsort or quicksort splits the list into enough sublists;then to be sorted in parallel using merge-sort. The algorithm is of complexity n log n, and for lists of 8 M elements and using a single Geforce 8800 GTS-512, it is 2.5 times as fast as the bitonic sort algorithms, with standard complexity of n(log n)(2), which for a long time was considered to be the fastest for CPU sorting. It is 6 times faster than single CPU quicksort, and 10% faster than the recent gpu-based radix sort. Finally, the algorithm is further parallelized to utilize two graphics cards, resulting in yet another 1.8 times speedup. (c) 2008 Elsevier Inc. All rights reserved.
暂无评论