The longest common subsequence (LCS) problem on a pair of strings is a classical problem in string algorithms. Its extension, the semilocal LCS problem, provides a more detailed comparison of the input strings, withou...
详细信息
ISBN:
(纸本)9781450390682
The longest common subsequence (LCS) problem on a pair of strings is a classical problem in string algorithms. Its extension, the semilocal LCS problem, provides a more detailed comparison of the input strings, without any increase in asymptotic running time. Several semi-local LCS algorithms have been proposed previously;however, to the best of our knowledge, none have yet been implemented. In this paper, we explore a new hybrid approach to the semi-local LCS problem. We also propose a novel bit-parallel LCS algorithm. In the experimental part of the paper, we present an implementation of several existing and new parallel LCS algorithms and evaluate their performance.
The dynamical properties of many natural phenomena can be related to their support fractal dimension. A relevant example is the connection between flood peaks produced in a river basin, as observed in flood hydrograph...
详细信息
ISBN:
(纸本)9783030390815
The dynamical properties of many natural phenomena can be related to their support fractal dimension. A relevant example is the connection between flood peaks produced in a river basin, as observed in flood hydrographs, and the multi-fractal spectrum of the river itself, according to the Multifractal Instantaneous Unit Hydrograph (MIUH) theory. Typically, the multifractal analysis of river networks is carried out by sampling large collections of points belonging to the river basin and analyzing the fractal dimensions and the Lipschitz-Holder exponents of singularities through numerical procedures which involve different degrees of accuracy in the assessment of such quantities through different methods (box-counting techniques, the generalized correlation integral method by Pawelzik and Schuster (1987), the fixed-mass algorithms by Badii and Politi (1985), being some relevant examples). However, the higher accuracy in the determination of the fractal dimensions requires considerably higher computational times. For this reason, we recently developed a parallel version of some of the cited multifractal methods described above by using the MPI parallel library, by reaching almost optimal speed-ups in the computations. This will supply a tool for the assessment of the fractal dimensions of river networks (as well as of several other natural phenomena whose embedding dimension is 2 or 3) on massively parallel clusters or multi-core workstations.
We present parallel algorithms and data structures for three fundamental operations in Numerical Linear Algebra: (i) Gaussian and CountSketch random projections and their combination, (ii) computation of the Gram matr...
详细信息
We present parallel algorithms and data structures for three fundamental operations in Numerical Linear Algebra: (i) Gaussian and CountSketch random projections and their combination, (ii) computation of the Gram matrix, and (iii) computation of the squared row norms of the product of two matrices, with a special focus on "tall-and-skinny" matrices, which arise in many applications. We provide a detailed analysis of the ubiquitous CountSketch transform and its combination with Gaussian random projections, accounting for memory requirements, computational complexity and workload balancing. We also demonstrate how these results can be applied to column subset selection, least squares regression and leverage scores computation. These tools have been implemented in pylspack, a publicly available Python package(1) whose core is written in C++ and parallelized with OpenMP and that is compatiblewith standard matrix data structures of SciPy and NumPy. Extensive numerical experiments indicate that the proposed algorithms scale well and significantly outperform existing libraries for tall-and-skinny matrices.
Finding the centrality measures of nodes in a graph is a problem of fundamental importance due to various applications from social networks, biological networks, and transportation networks. Given the large size of su...
详细信息
Finding the centrality measures of nodes in a graph is a problem of fundamental importance due to various applications from social networks, biological networks, and transportation networks. Given the large size of such graphs, it is natural to use parallelism as a recourse. Several studies show how to compute the various centrality measures of nodes in a graph on parallel architectures, including multi-core systems and GPUs. However, as these graphs evolve and change, it is pertinent to study how to update the centrality measures on changes to the underlying graph. In this article, we show novel parallel algorithms for updating the betweenness- and closeness-centrality values of nodes in a dynamic graph. Our algorithms process a batch of updates in parallel by extending the approach of handling a single update for betweenness- and closeness-centrality. For the latter, we also introduce techniques based on traversals of the block-cut tree of a graph. Besides, our algorithms incorporate mechanisms to exploit the structural properties of graphs for enhanced performance. We implement our algorithms on two parallel architectures: an Intel 24-core CPU and an Nvidia Tesla V100 GPU. To the best of our knowledge, we are the first to show GPU algorithms for the above two problems. In addition, we conduct detailed experiments to study the impact of various parameters associated with our algorithms and their implementation. Our results on a collection of real-world graphs indicate that our algorithms achieve a significant speedup over corresponding state-of-the-art algorithms.
Cycles are one of the fundamental subgraph patterns and being able to enumerate them in graphs enables important applications in a wide variety of fields, including finance, biology, chemistry, and network science. Ho...
详细信息
Previous work on Dynamic Complexity has established that there exist dynamic constant-time parallel algorithms for regular tree languages and context-free languages under label or symbol changes. However, these algori...
详细信息
In this work, we design, analyze, and optimize sequential and shared-memory parallel algorithms for partitioned local depths (PaLD). Given a set of data points and pairwise distances, PaLD is a method for identifying ...
详细信息
We give parallel algorithms for string diagrams represented as structured cospans of ACSets. Specifically, we give linear (sequential) and logarithmic (parallel) time algorithms for composition, tensor product, constr...
详细信息
Semisort is a fundamental algorithmic primitive widely used in the design and analysis of efficient parallel algorithms. It takes input as an array of records and a function extracting a key per record, and reorders t...
详细信息
The densest subgraph problem has received significant attention, both in theory and in practice, due to its applications in problems such as community detection, social network analysis, and spam detection. Due to the...
详细信息
暂无评论