Suffix arrays and the corresponding longest common prefix (lcp) array have wide applications in bioinformatics, information retrieval and data compression. In this work, we propose and theoretically analyze new parall...
详细信息
ISBN:
(纸本)9781479955008
Suffix arrays and the corresponding longest common prefix (lcp) array have wide applications in bioinformatics, information retrieval and data compression. In this work, we propose and theoretically analyze new parallelalgorithms for computing the lcp array given the suffix array as input. Most of our algorithms have a work and depth (parallel time) complexity related to the lcp values of the input. We also present a slight variation of Karkkainen and Sanders' skew algorithm that requires linear work and poly-logarithmic depth in the worst case. We present a comprehensive experimental study of our parallelalgorithms along with existing parallel and sequential lcpalgorithms. On a variety of real-world and artificial strings, we show that on a 40-core shared-memory machine our fastest algorithm is up to 2.3 times faster than the fastest existing parallel algorithm, and up to 21.8 times faster than the fastest sequential lcp algorithm.
暂无评论