ISBN (print): 9798400704161
Dynamic trees are a well-studied and fundamental building block of dynamic graph algorithms dating back to the seminal work of Sleator and Tarjan [STOC'81, (1981), pp. 114-122]. The problem is to maintain a tree subject to online edge insertions and deletions while answering queries about the tree, such as the heaviest weight on a path. In the parallel batch-dynamic setting, the goal is to process batches of edge updates work-efficiently in low (polylog n) span. Two work-efficient algorithms are known: batch-parallel Euler Tour Trees by Tseng et al. [ALENEX'19, (2019), pp. 92-106] and parallel Rake-Compress (RC) Trees by Acar et al. [ESA'20, (2020), pp. 2:1-2:23]. Both, however, are randomized and work-efficient only in expectation. Several downstream results that use these data structures (and indeed, to the best of our knowledge, all known work-efficient parallel batch-dynamic graph algorithms) are therefore also randomized. In this work, we give the first deterministic work-efficient solution to the problem. Our algorithm maintains a parallel RC-Tree on n vertices subject to batches of k edge updates deterministically in worst-case O(k log(1 + n/k)) work and O(log n log log k) span on the Common-CRCW PRAM. We also show how to improve the span of the randomized algorithm from O(log n log* n) to O(log n). Lastly, as a result of our new deterministic algorithm, we also derandomize several downstream results that make use of parallel batch-dynamic dynamic trees, for which previously the only efficient solutions were randomized.
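As a concrete point of reference for the structure being maintained, the following sequential Java sketch builds the Euler tour of a rooted tree, the sequence that batch-parallel Euler Tour Trees keep in a balanced search structure so that links and cuts become sequence splits and joins. It is illustrative only; all names are placeholders, and it does not reflect the RC-tree algorithm of the paper.

```java
import java.util.*;

// Minimal sketch: the Euler tour of a rooted tree, i.e. the sequence that
// Euler Tour Trees maintain under link/cut by splitting and joining it.
// Sequential and illustrative only; names are placeholders.
public class EulerTourSketch {
    // adjacency lists of an undirected tree
    static List<Integer> eulerTour(List<List<Integer>> adj, int root) {
        List<Integer> tour = new ArrayList<>();
        Deque<int[]> stack = new ArrayDeque<>();      // {vertex, next-child index}
        boolean[] seen = new boolean[adj.size()];
        stack.push(new int[]{root, 0});
        seen[root] = true;
        tour.add(root);
        while (!stack.isEmpty()) {
            int[] top = stack.peek();
            int v = top[0];
            if (top[1] < adj.get(v).size()) {
                int u = adj.get(v).get(top[1]++);
                if (!seen[u]) {                        // descend into child u
                    seen[u] = true;
                    tour.add(u);
                    stack.push(new int[]{u, 0});
                }
            } else {                                   // all children done: return to parent
                stack.pop();
                if (!stack.isEmpty()) tour.add(stack.peek()[0]);
            }
        }
        return tour;
    }

    public static void main(String[] args) {
        // tree with edges 0-1, 0-2, 1-3
        int n = 4;
        List<List<Integer>> adj = new ArrayList<>();
        for (int i = 0; i < n; i++) adj.add(new ArrayList<>());
        int[][] edges = {{0, 1}, {0, 2}, {1, 3}};
        for (int[] e : edges) { adj.get(e[0]).add(e[1]); adj.get(e[1]).add(e[0]); }
        System.out.println(eulerTour(adj, 0));         // prints [0, 1, 3, 1, 0, 2, 0]
    }
}
```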
ISBN (print): 9781450391467
Enumerating simple cycles has important applications in computational biology, network science, and financial crime analysis. In this work, we focus on parallelising the state-of-the-art simple cycle enumeration algorithms of Johnson and Read-Tarjan, along with their applications to temporal graphs. To our knowledge, we are the first to parallelise these two algorithms in a fine-grained manner. We are also the first to demonstrate experimentally a linear performance scaling. Such scaling is made possible by our decomposition of long sequential searches into fine-grained tasks, which are then dynamically scheduled across CPU cores, enabling optimal load balancing. Furthermore, we show that coarse-grained parallel versions of the Johnson and the Read-Tarjan algorithms that exploit edge- or vertex-level parallelism are not scalable. On a cluster of four multi-core CPUs with 256 physical cores, our fine-grained parallel algorithms are, on average, an order of magnitude faster than their coarse-grained parallel counterparts. The performance gap between the fine-grained and the coarse-grained parallel algorithms widens as we use more CPU cores. When using all 256 CPU cores, our parallel algorithms enumerate temporal cycles, on average, 260x faster than the serial algorithm of Kumar and Calders. Code repository: https://***/IBM/parallel-cycle-enumeration
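The sequential Java sketch below shows only the backtracking skeleton that both the Johnson and the Read-Tarjan algorithms refine: each directed simple cycle is reported exactly once, rooted at its smallest vertex. The blocked-vertex pruning of Johnson's algorithm, the temporal constraints, and the fine-grained task decomposition of the paper are deliberately omitted, and all names are illustrative.

```java
import java.util.*;

// Backtracking skeleton for simple cycle enumeration in a directed graph.
// Each cycle is reported once, rooted at its smallest vertex. The blocked-set
// pruning of Johnson's algorithm (and any parallel decomposition) is omitted.
public class SimpleCycles {
    static void enumerate(List<List<Integer>> adj, List<List<Integer>> out) {
        int n = adj.size();
        boolean[] onPath = new boolean[n];
        Deque<Integer> path = new ArrayDeque<>();
        for (int root = 0; root < n; root++) {
            path.push(root);
            onPath[root] = true;
            dfs(root, root, adj, onPath, path, out);
            onPath[root] = false;
            path.pop();
        }
    }

    static void dfs(int root, int v, List<List<Integer>> adj, boolean[] onPath,
                    Deque<Integer> path, List<List<Integer>> out) {
        for (int u : adj.get(v)) {
            if (u < root) continue;                 // cycles through u were rooted earlier
            if (u == root) {
                List<Integer> cycle = new ArrayList<>(path);
                Collections.reverse(cycle);         // path is a stack; restore root-first order
                out.add(cycle);
            } else if (!onPath[u]) {
                path.push(u);
                onPath[u] = true;
                dfs(root, u, adj, onPath, path, out);
                onPath[u] = false;
                path.pop();
            }
        }
    }

    public static void main(String[] args) {
        // directed edges: 0->1, 1->2, 2->0, 1->0
        List<List<Integer>> adj = List.of(List.of(1), List.of(2, 0), List.of(0));
        List<List<Integer>> cycles = new ArrayList<>();
        enumerate(adj, cycles);
        System.out.println(cycles);                 // prints [[0, 1, 2], [0, 1]]
    }
}
```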
We present a new parallel algorithm for finding a maximal matching of a graph. The time required by our algorithm is O(T_D(n) log n) and the number of processors used is P_D(n), where T_D(n) and P_D(n) are the time and num...
ISBN (print): 9783319321523; 9783319321516
Graph-based computations are used in many applications. The increasing size and complexity of the analyzed data make graph analysis a challenging task. In this paper we present a performance evaluation of a Java implementation of the Graph500 benchmark. It has been developed with the help of the PCJ (Parallel Computations in Java) library for parallel and distributed computations in Java. PCJ is based on the PGAS (Partitioned Global Address Space) programming paradigm, in which all communication details such as threads or network programming are hidden. We present the Java implementation details of the first and second kernels of the Graph500 benchmark. The results are compared with existing MPI implementations of the Graph500 benchmark, showing the good scalability of the PCJ library.
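For context, Graph500 kernel 1 builds the graph data structure from an edge list and kernel 2 runs breadth-first search from sampled roots. The plain Java sketch below captures only this sequential core; it does not use the PCJ API, and the PGAS distribution across threads and nodes discussed in the paper is omitted.

```java
import java.util.*;

// Sequential core of the Graph500 kernels: kernel 1 builds adjacency lists from
// an edge list, kernel 2 runs BFS from a root and returns the parent array.
// The PCJ/PGAS distribution of the paper is not shown here.
public class Graph500Core {
    static List<List<Integer>> buildGraph(int n, int[][] edges) {   // kernel 1
        List<List<Integer>> adj = new ArrayList<>();
        for (int i = 0; i < n; i++) adj.add(new ArrayList<>());
        for (int[] e : edges) {
            adj.get(e[0]).add(e[1]);
            adj.get(e[1]).add(e[0]);
        }
        return adj;
    }

    static int[] bfs(List<List<Integer>> adj, int root) {           // kernel 2
        int[] parent = new int[adj.size()];
        Arrays.fill(parent, -1);
        parent[root] = root;
        ArrayDeque<Integer> frontier = new ArrayDeque<>(List.of(root));
        while (!frontier.isEmpty()) {
            int v = frontier.poll();
            for (int u : adj.get(v)) {
                if (parent[u] == -1) {               // first visit: record BFS parent
                    parent[u] = v;
                    frontier.add(u);
                }
            }
        }
        return parent;
    }

    public static void main(String[] args) {
        int[][] edges = {{0, 1}, {1, 2}, {0, 3}, {3, 4}};
        int[] parent = bfs(buildGraph(5, edges), 0);
        System.out.println(Arrays.toString(parent)); // prints [0, 0, 1, 0, 3]
    }
}
```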
ISBN (print): 9781450301787
Active messages have proven to be an effective approach for certain communication problems in high performance computing. Many MPI implementations, as well as runtimes for Partitioned Global Address Space languages, use active messages in their low-level transport layers. However, most active message frameworks have low-level programming interfaces that require significant programming effort to use directly in applications and that also prevent optimization opportunities. In this paper we present AM++, a new user-level library for active messages based on generic programming techniques. Our library allows message handlers to be run in an explicit loop that can be optimized and vectorized by the compiler and that can also be executed in parallel on multicore architectures. Runtime optimizations, such as message combining and filtering, are also provided by the library, removing the need to implement that functionality at the application level. Evaluation of AM++ with distributed-memory graph algorithms shows the usability benefits provided by these library features, as well as their performance advantages.
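The following toy Java sketch mirrors the active-message pattern the paper builds on: messages are buffered and their handlers run in an explicit drain loop rather than being dispatched one at a time, which is what AM++ exposes to the compiler and to multicore execution. It imitates the style only; it is not the AM++ API (AM++ is a C++ library), and all names here are invented for illustration.

```java
import java.util.*;
import java.util.function.BiConsumer;

// Toy active-message pattern: handlers run in an explicit drain loop over a
// message buffer instead of being invoked per message on arrival. Illustrative
// only; this is not the AM++ API.
public class ActiveMessageSketch {
    static class MessageQueue<M> {
        private final ArrayDeque<M> buffer = new ArrayDeque<>();
        void send(M msg) { buffer.add(msg); }                // buffer, do not dispatch yet
        void drain(BiConsumer<M, MessageQueue<M>> handler) { // explicit handler loop
            while (!buffer.isEmpty()) handler.accept(buffer.poll(), this);
        }
    }

    public static void main(String[] args) {
        // BFS-style level propagation driven by "visit(vertex, level)" messages;
        // the handler may post new messages back into the queue it drains.
        List<List<Integer>> adj = List.of(List.of(1, 2), List.of(3), List.of(3), List.of());
        int[] level = new int[adj.size()];
        Arrays.fill(level, Integer.MAX_VALUE);

        MessageQueue<int[]> mq = new MessageQueue<>();
        mq.send(new int[]{0, 0});                            // visit vertex 0 at level 0
        mq.drain((msg, q) -> {
            int v = msg[0], lvl = msg[1];
            if (lvl < level[v]) {
                level[v] = lvl;
                for (int u : adj.get(v)) q.send(new int[]{u, lvl + 1});
            }
        });
        System.out.println(Arrays.toString(level));          // prints [0, 1, 1, 2]
    }
}
```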
We develop a distributed-memory parallel algorithm for performing batch updates on streaming graphs, where vertices and edges are continuously added or removed. Our algorithm leverages distributed sparse matrices as t...
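Since the abstract above is truncated, the sketch below only illustrates the general shape of a batch update applied to an in-memory adjacency structure (a batch of edge insertions and deletions applied in one call); the distributed sparse-matrix representation the paper actually uses is not modeled, and all names are placeholders.

```java
import java.util.*;

// Applying a batch of streaming-graph updates (edge insertions/deletions) to an
// in-memory adjacency-set graph. Placeholder for illustration only; the paper's
// distributed sparse-matrix representation is not modeled here.
public class BatchUpdateSketch {
    record Update(int src, int dst, boolean insert) {}

    static void applyBatch(Map<Integer, Set<Integer>> adj, List<Update> batch) {
        for (Update u : batch) {
            if (u.insert()) {
                adj.computeIfAbsent(u.src(), k -> new HashSet<>()).add(u.dst());
                adj.computeIfAbsent(u.dst(), k -> new HashSet<>()).add(u.src());
            } else {
                Set<Integer> a = adj.get(u.src()), b = adj.get(u.dst());
                if (a != null) a.remove(u.dst());
                if (b != null) b.remove(u.src());
            }
        }
    }

    public static void main(String[] args) {
        Map<Integer, Set<Integer>> adj = new HashMap<>();
        applyBatch(adj, List.of(new Update(0, 1, true), new Update(1, 2, true)));
        applyBatch(adj, List.of(new Update(0, 1, false), new Update(2, 3, true)));
        System.out.println(adj); // typically prints {0=[], 1=[2], 2=[1, 3], 3=[2]}
    }
}
```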
We introduce TeraHAC, a (1+ε)-approximate hierarchical agglomerative clustering (HAC) algorithm which scales to trillion-edge graphs. Our algorithm is based on a new approach to computing (1+ε)-approximate HAC, which is a novel combination of the nearest-neighbor chain algorithm and the notion of (1+ε)-approximate HAC. Our approach allows us to partition the graph among multiple machines and make significant progress in computing the clustering within each partition before any communication with other partitions is needed. We evaluate TeraHAC on a number of real-world and synthetic graphs of up to 8 trillion edges. We show that TeraHAC requires over 100x fewer rounds compared to previously known approaches for computing HAC. It is up to 8.3x faster than SCC, the state-of-the-art distributed algorithm for hierarchical clustering, while achieving 1.16x higher quality. In fact, TeraHAC essentially retains the quality of the celebrated HAC algorithm while significantly improving the running time.
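To make the object being computed concrete, here is a small sequential Java sketch of exact graph-based HAC with average linkage: repeatedly merge the cluster pair with the highest linkage, defined as cut(A, B) / (|A| * |B|), until no positive-weight pair remains. It is a single-machine baseline for illustration only; the (1+ε)-approximation, the nearest-neighbor chain interplay, and the distributed partitioning that define TeraHAC are not shown.

```java
import java.util.*;

// Sequential graph-based HAC with average linkage: repeatedly merge the pair of
// clusters with the highest linkage, where linkage(A,B) = cut(A,B) / (|A|*|B|).
// Exact and single-machine; TeraHAC's approximate, distributed machinery is not
// represented here.
public class HacSketch {
    public static void main(String[] args) {
        // similarity graph given as a symmetric weight matrix
        double[][] w = {
            {0, 0.9, 0.1, 0.0},
            {0.9, 0, 0.2, 0.0},
            {0.1, 0.2, 0, 0.8},
            {0.0, 0.0, 0.8, 0},
        };
        int n = w.length;
        List<Set<Integer>> clusters = new ArrayList<>();
        for (int i = 0; i < n; i++) clusters.add(new HashSet<>(Set.of(i)));

        while (clusters.size() > 1) {
            int bestA = -1, bestB = -1;
            double best = -1;
            for (int a = 0; a < clusters.size(); a++)
                for (int b = a + 1; b < clusters.size(); b++) {
                    double cut = 0;
                    for (int i : clusters.get(a))
                        for (int j : clusters.get(b)) cut += w[i][j];
                    double linkage = cut / (clusters.get(a).size() * clusters.get(b).size());
                    if (linkage > best) { best = linkage; bestA = a; bestB = b; }
                }
            if (best <= 0) break;                       // remaining clusters are disconnected
            System.out.println("merge " + clusters.get(bestA) + " + " + clusters.get(bestB)
                    + " at linkage " + best);
            clusters.get(bestA).addAll(clusters.get(bestB));
            clusters.remove(bestB);
        }
    }
}
```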
Computation of a maximal matching of a graph based on Depth First Search (DFS) tree computation was introduced by Datta and Sen (Parallel Algorithms and Applications, 5, 1995, 161-164). They showed that the approach gives efficient parallel algorithms for maximal matching on graphs for which a DFS tree can be computed efficiently. They also presented a parallel scheme to compute a maximal matching in O(T(n) log n) time using O(P(n)) processors, where T(n) and P(n) are the time and the number of processors required to compute a DFS tree of a graph in parallel. We present here an improved technique to compute a maximal matching in parallel based on DFS tree computation. The algorithm takes O(T(n)) time and O(P(n)) processors.
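For reference, the sketch below shows the object both papers compute, a maximal matching, built here by a trivial sequential greedy pass over the edges: it merely fixes what maximal means (every remaining edge touches a matched vertex). The DFS-tree-driven parallel schemes of Datta and Sen and of this paper are not reproduced.

```java
import java.util.*;

// Greedy maximal matching: scan edges once, take an edge whenever both endpoints
// are still unmatched. This is the sequential baseline for the object computed by
// the DFS-tree-based parallel schemes discussed above (which are not shown here).
public class GreedyMaximalMatching {
    static List<int[]> maximalMatching(int n, int[][] edges) {
        boolean[] matched = new boolean[n];
        List<int[]> matching = new ArrayList<>();
        for (int[] e : edges) {
            if (!matched[e[0]] && !matched[e[1]]) {
                matched[e[0]] = matched[e[1]] = true;
                matching.add(e);
            }
        }
        return matching;   // maximal: every remaining edge touches a matched vertex
    }

    public static void main(String[] args) {
        int[][] edges = {{0, 1}, {1, 2}, {2, 3}, {3, 0}};
        for (int[] e : maximalMatching(4, edges))
            System.out.println(e[0] + " - " + e[1]);   // prints 0 - 1 and 2 - 3
    }
}
```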
In most real-world networks, nodes/vertices tend to be organized into tightly-knit modules known as communities or clusters, such that nodes within a community are more likely to be connected or related to one another than they are to the rest of the network. Community detection in a network (graph) is aimed at finding a partitioning of the vertices into communities. The goodness of the partitioning is commonly measured using modularity. Maximizing modularity is an NP-complete problem. In 2008, Blondel et al. introduced a multi-phase, multi-iteration heuristic for modularity maximization called the Louvain method. Owing to its speed and ability to yield high-quality communities, the Louvain method continues to be one of the most widely used tools for serial community detection. Distributed multi-GPU systems pose significant challenges and opportunities for efficient execution of parallel applications. Graph algorithms, in particular, are known to be harder to parallelize on such platforms, due to irregular memory accesses, low computation-to-communication ratios, and load-balancing problems that are especially hard to address on multi-GPU systems. In this paper, we present our ongoing work on a distributed-memory implementation of the Louvain method on heterogeneous systems. We build on our prior work parallelizing the Louvain method for community detection on traditional CPU-only distributed systems without GPUs. Corroborated by an extensive set of experiments on multi-GPU systems, we demonstrate performance competitive with the existing distributed-memory CPU-based implementation, up to 3.2x speedup using 16 nodes of OLCF Summit relative to two nodes, and up to 19x speedup relative to the NVIDIA RAPIDS cuGraph implementation on a single NVIDIA V100 GPU from a DGX-2 platform, while achieving high-quality solutions comparable to the original Louvain method. To the best of our knowledge, this work represents the first effort for community detection on distributed multi-GPU systems.
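Since the Louvain method is driven entirely by modularity, the following small Java sketch computes the modularity Q = sum over communities c of [ in_c/(2m) - (tot_c/(2m))^2 ] for a given partition of an unweighted, undirected graph. It shows only the objective being maximized, not the multi-phase local-moving heuristic or the distributed multi-GPU implementation described above.

```java
import java.util.*;

// Modularity of a partition of an unweighted, undirected graph:
//   Q = sum over communities c of [ in_c/(2m) - (tot_c/(2m))^2 ]
// where in_c counts both directions of internal edges and tot_c is the total
// degree of community c. This is only the objective Louvain maximizes; the
// multi-phase heuristic and the multi-GPU implementation are not shown.
public class ModularitySketch {
    static double modularity(int n, int[][] edges, int[] community) {
        double m = edges.length;                       // number of undirected edges
        double[] in = new double[n], tot = new double[n];
        int[] degree = new int[n];
        for (int[] e : edges) { degree[e[0]]++; degree[e[1]]++; }
        for (int v = 0; v < n; v++) tot[community[v]] += degree[v];
        for (int[] e : edges)
            if (community[e[0]] == community[e[1]]) in[community[e[0]]] += 2;
        double q = 0;
        for (int c = 0; c < n; c++)
            q += in[c] / (2 * m) - Math.pow(tot[c] / (2 * m), 2);
        return q;
    }

    public static void main(String[] args) {
        // two triangles joined by a single bridge edge
        int[][] edges = {{0,1},{1,2},{0,2},{3,4},{4,5},{3,5},{2,3}};
        int[] community = {0, 0, 0, 1, 1, 1};          // one community per triangle
        System.out.println(modularity(6, edges, community)); // prints ~0.357
    }
}
```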