Computing strongly connected components (SCC) is among the most fundamental problems in graph analytics. Given the large size of today's real-world graphs, parallel SCC implementation is increasingly important. SCC is challenging in the parallel setting and is particularly hard on large-diameter graphs. Many existing parallel SCC implementations can be even slower than Tarjan's sequential algorithm on large-diameter graphs. To tackle this challenge, we propose an efficient parallel SCC implementation using a new parallel reachability approach. Our solution is based on a novel idea referred to as vertical granularity control (VGC). It breaks the synchronization barriers to increase parallelism and hide scheduling overhead. To use VGC in our SCC algorithm, we also design an efficient data structure called the parallel hash bag. It uses parallel dynamic resizing to avoid redundant work in maintaining frontiers (vertices processed in a round). We implement the parallel SCC algorithm by Blelloch et al. (J. ACM, 2020) using our new parallel reachability approach. We compare our implementation to state-of-the-art systems, including GBBS, iSpan, Multi-step, and our highly optimized Tarjan's (sequential) algorithm, on 18 graphs, including social, web, k-NN, and lattice graphs. On a machine with 96 cores, our implementation is the fastest on 16 out of 18 graphs. On average (geometric mean) over all graphs, our SCC is 6.0× faster than the best previous parallel code (GBBS), 12.8× faster than Tarjan's sequential algorithm, and 2.7× faster than the best existing implementation on each graph. We believe that our techniques are of independent interest. We also apply our parallel hash bag and VGC scheme to other graph problems, including connectivity and least-element lists (LE-lists). Our implementations improve the performance of the state-of-the-art parallel implementations for these two problems.
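To make the frontier-based reachability pattern these techniques target concrete, here is a minimal round-synchronous reachability sketch in C++ with OpenMP. It is not the paper's VGC/hash-bag implementation: the frontier is a plain vector rebuilt each round, and the implicit barrier between rounds plus the frontier-maintenance cost are exactly the overheads that VGC and the parallel hash bag are designed to reduce. The CSR field names (`offsets`, `targets`) are illustrative.

```cpp
#include <atomic>
#include <cstdint>
#include <vector>

// Minimal CSR graph; field names are illustrative.
struct Graph {
  std::vector<uint32_t> offsets;  // size n+1
  std::vector<uint32_t> targets;  // size m
};

// Round-synchronous parallel reachability from `source`.
// Each round processes the current frontier in parallel and
// collects the next frontier; the barrier between rounds is
// the scheduling overhead that VGC aims to hide.
std::vector<uint8_t> reachable(const Graph& g, uint32_t source) {
  const uint32_t n = (uint32_t)g.offsets.size() - 1;
  std::vector<std::atomic<uint8_t>> visited(n);
  for (auto& v : visited) v.store(0, std::memory_order_relaxed);
  visited[source].store(1);

  std::vector<uint32_t> frontier = {source};
  while (!frontier.empty()) {
    std::vector<uint32_t> next;
#pragma omp parallel
    {
      std::vector<uint32_t> local;  // per-thread output buffer
#pragma omp for nowait
      for (long i = 0; i < (long)frontier.size(); ++i) {
        uint32_t u = frontier[i];
        for (uint32_t e = g.offsets[u]; e < g.offsets[u + 1]; ++e) {
          uint32_t v = g.targets[e];
          uint8_t expected = 0;  // claim v exactly once via CAS
          if (visited[v].compare_exchange_strong(expected, 1))
            local.push_back(v);
        }
      }
#pragma omp critical
      next.insert(next.end(), local.begin(), local.end());
    }
    frontier.swap(next);
  }
  std::vector<uint8_t> out(n);
  for (uint32_t i = 0; i < n; ++i) out[i] = visited[i].load();
  return out;
}
```

On a large-diameter graph this loop runs many near-empty rounds, so the per-round barrier and frontier rebuild dominate, which is the failure mode the paper's approach addresses.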
Parallel Givens sequences for solving the General Linear Model (GLM) are developed and analyzed. The block updating GLM estimation problem is also considered. The solution of the GLM employs as its main computational device the Generalized QR Decomposition, where one of the two matrices is initially upper triangular. The proposed Givens sequences efficiently exploit the initial triangular structure of the matrix and special properties of the solution method. The complexity analysis of the sequences is based on an Exclusive-Read Exclusive-Write (EREW) Parallel Random Access Machine (PRAM) model with limited parallelism. Furthermore, the number of operations performed by a Givens rotation is determined by the size of the vectors used in the rotation. Under these assumptions, one conclusion drawn is that a sequence which applies the smallest number of compound disjoint Givens rotations to solve the GLM estimation problem does not necessarily have the lowest computational complexity. The various Givens sequences and their computational complexity analyses will be useful when addressing the solution of other similar factorization problems.
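As a reference point for the rotations these sequences are built from, the sketch below computes and applies a single standard Givens rotation that annihilates one entry of a 2-vector. This is textbook material (the stable formulation from Golub & Van Loan), not the paper's compound-rotation scheduling.

```cpp
#include <cmath>
#include <vector>

// Compute c, s such that [c s; -s c]^T * [a; b] = [r; 0].
// Numerically stable formulation: divide by the larger entry.
void givens(double a, double b, double& c, double& s) {
  if (b == 0.0) { c = 1.0; s = 0.0; return; }
  if (std::fabs(b) > std::fabs(a)) {
    double t = -a / b;
    s = 1.0 / std::sqrt(1.0 + t * t);
    c = s * t;
  } else {
    double t = -b / a;
    c = 1.0 / std::sqrt(1.0 + t * t);
    s = c * t;
  }
}

// Apply the rotation to rows i and k of a dense row-major matrix.
// The cost is linear in the row length, which is why the complexity
// model above charges each rotation by the size of the vectors involved.
void applyGivens(std::vector<double>& m, int ncols, int i, int k,
                 double c, double s) {
  for (int j = 0; j < ncols; ++j) {
    double t1 = m[i * ncols + j], t2 = m[k * ncols + j];
    m[i * ncols + j] = c * t1 - s * t2;
    m[k * ncols + j] = s * t1 + c * t2;
  }
}
```

Rotations acting on disjoint row pairs commute and can be applied concurrently, which is what makes scheduling compound disjoint rotations on an EREW PRAM meaningful.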
Solving the deterministic equivalent formulation of two-stage stochastic programs using interior point algorithms requires the solution of linear systems of the form (AD²Aᵀ)x = b.
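For context, the following is the standard normal-equations system that arises in primal-dual interior point methods, together with the textbook dual block-angular structure of the two-stage deterministic equivalent; this is reconstructed from standard stochastic programming notation rather than quoted from the (truncated) abstract.

```latex
% Standard IPM normal equations: D is the diagonal primal-dual
% scaling matrix, updated at every interior point iteration.
\[
  (A D^{2} A^{\top})\,\Delta y = b, \qquad
  D = \operatorname{diag}(d_1,\dots,d_n).
\]
% Deterministic equivalent of a two-stage stochastic program with
% scenarios 1..N: first-stage matrix A_0, technology matrices T_i,
% and recourse matrices W_i give A its dual block-angular structure,
% which specialized solvers exploit.
\[
  A =
  \begin{pmatrix}
    A_0    &     &        &     \\
    T_1    & W_1 &        &     \\
    \vdots &     & \ddots &     \\
    T_N    &     &        & W_N
  \end{pmatrix}.
\]
```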
Although granular materials have always been an important part of our everyday life, their characteristics and behavior are still only rudimentarily understood. Numerical simulation has therefore gained increasing importance as a means to obtain deeper insight into the properties of granular media. One simulation approach is rigid body dynamics. In contrast to particle-based approaches, it fully resolves the granular particles as geometric objects and incorporates frictional contact dynamics. However, due to its complexity and the lack of large-scale parallelization, rigid body dynamics could so far not be used for very large simulation scenarios. In this paper we demonstrate massively parallel granular media simulations by means of a parallel rigid body dynamics algorithm. We validate the algorithm for granular gas simulations and demonstrate its scalability on up to 131,072 processor cores. Additionally, we show several parallel granular material simulations with both spherical and non-spherical granular particles.
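To illustrate the difference in kind from particle-based (DEM-style penalty force) methods, here is a highly simplified serial time-stepping skeleton for rigid body dynamics: bodies are resolved geometrically (spheres only here), and contacts are resolved jointly at the velocity level. Rotation and friction are omitted, and all names are illustrative; the paper's contribution is the large-scale parallelization of this kind of solver, not shown here.

```cpp
#include <cmath>
#include <vector>

struct Body {
  double pos[3], vel[3];
  double radius, invMass;   // invMass = 0 marks a fixed body
};

struct Contact {
  int a, b;
  double n[3];              // contact normal, from a towards b
};

// Geometric contact detection: bodies are resolved as objects
// (here spheres), not smeared into penalty force fields.
static std::vector<Contact> detectContacts(const std::vector<Body>& bodies) {
  std::vector<Contact> cs;
  for (size_t i = 0; i < bodies.size(); ++i)
    for (size_t j = i + 1; j < bodies.size(); ++j) {
      double d[3], dist2 = 0.0;
      for (int k = 0; k < 3; ++k) {
        d[k] = bodies[j].pos[k] - bodies[i].pos[k];
        dist2 += d[k] * d[k];
      }
      double r = bodies[i].radius + bodies[j].radius;
      if (dist2 < r * r && dist2 > 0.0) {
        double dist = std::sqrt(dist2);
        cs.push_back({(int)i, (int)j,
                      {d[0] / dist, d[1] / dist, d[2] / dist}});
      }
    }
  return cs;
}

// Velocity-level contact resolution: Gauss-Seidel sweeps over the
// coupled contacts, applying non-penetration impulses. This joint
// solve distinguishes rigid body dynamics from the independent
// pairwise spring-dashpot forces of particle-based methods.
static void solveContacts(std::vector<Body>& bodies,
                          const std::vector<Contact>& cs, int iters = 10) {
  for (int it = 0; it < iters; ++it)
    for (const Contact& c : cs) {
      Body &A = bodies[c.a], &B = bodies[c.b];
      double wsum = A.invMass + B.invMass;
      if (wsum == 0.0) continue;               // two fixed bodies
      double vrel = 0.0;                       // closing velocity along n
      for (int k = 0; k < 3; ++k) vrel += (B.vel[k] - A.vel[k]) * c.n[k];
      if (vrel >= 0.0) continue;               // already separating
      double p = -vrel / wsum;                 // normal impulse
      for (int k = 0; k < 3; ++k) {
        A.vel[k] -= p * A.invMass * c.n[k];
        B.vel[k] += p * B.invMass * c.n[k];
      }
    }
}

void step(std::vector<Body>& bodies, double dt) {
  for (auto& b : bodies)
    if (b.invMass > 0.0) b.vel[2] -= 9.81 * dt;  // gravity
  solveContacts(bodies, detectContacts(bodies));
  for (auto& b : bodies)                          // advance positions
    for (int k = 0; k < 3; ++k) b.pos[k] += b.vel[k] * dt;
}
```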
ISBN: (print) 9781450362177
We design a generic method to reduce the task of finding weighted matchings to that of finding short augmenting paths in unweighted graphs. This method enables us to provide efficient implementations for approximating weighted matchings in the massively parallel computation (MPC) model and in the streaming model. For the MPC and the multi-pass streaming model, we show that any algorithm computing a (1 − δ)-approximate unweighted matching in bipartite graphs can be translated into an algorithm that computes a (1 − ε(δ))-approximate maximum weighted matching. Furthermore, this translation incurs only a constant factor (that depends on ε > 0) overhead in the complexity. Instantiating this with the current best MPC algorithm for unweighted matching yields a (1 − ε)-approximation algorithm for maximum weighted matching that uses O_ε(log log n) rounds, O(m/n) machines per round, and Õ(n poly(log n)) memory per machine. This improves upon the previous best approximation guarantee of (1/2 − ε) for weighted graphs. In the context of single-pass streaming with random edge arrivals, our techniques yield a (1/2 + c)-approximation algorithm, thus breaking the natural barrier of 1/2.
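The reduction itself is more delicate than an abstract can convey. As a rough illustration of the general shape of weighted-to-unweighted reductions, the hypothetical sketch below buckets edges into geometric weight classes and invokes a black-box unweighted matcher per class, heaviest first. This folklore scheme does not achieve the paper's (1 − ε(δ)) guarantee and is not their construction; all names (`Edge`, `Matcher`, `matchByWeightClasses`, `eps`) are illustrative.

```cpp
#include <algorithm>
#include <cmath>
#include <functional>
#include <vector>

struct Edge { int u, v; double w; };
// Black-box unweighted matcher: takes an edge list, returns a matching.
using Matcher = std::function<std::vector<Edge>(const std::vector<Edge>&)>;

std::vector<Edge> matchByWeightClasses(int n, const std::vector<Edge>& edges,
                                       double eps, const Matcher& matcher) {
  double wmax = 0.0;
  for (const Edge& e : edges) wmax = std::max(wmax, e.w);
  if (wmax <= 0.0) return {};

  // Geometric weight classes: class c holds weights in
  // (wmax/(1+eps)^(c+1), wmax/(1+eps)^c]. Edges lighter than
  // eps*wmax/n can change the total weight by at most eps*wmax.
  int levels =
      (int)std::ceil(std::log((double)n / eps) / std::log(1 + eps)) + 1;
  std::vector<std::vector<Edge>> buckets(levels);
  for (const Edge& e : edges) {
    if (e.w <= eps * wmax / n) continue;       // negligible weight
    int c = (int)std::floor(std::log(wmax / e.w) / std::log(1 + eps));
    buckets[std::min(c, levels - 1)].push_back(e);
  }

  std::vector<char> used(n, 0);
  std::vector<Edge> result;
  for (const auto& bucket : buckets) {         // heaviest class first
    std::vector<Edge> alive;                   // drop blocked edges
    for (const Edge& e : bucket)
      if (!used[e.u] && !used[e.v]) alive.push_back(e);
    for (const Edge& e : matcher(alive))       // unweighted call
      if (!used[e.u] && !used[e.v]) {
        used[e.u] = used[e.v] = 1;
        result.push_back(e);
      }
  }
  return result;
}
```

The appeal of this interface is that any improvement to the unweighted matcher transfers immediately, which is the same modularity the paper's stronger reduction provides.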
We propose a parallel preconditioner for the Newton method in the computation of the leftmost eigenpairs of large and sparse symmetric positive definite matrices. A sequence of preconditioners, starting from an enhanced approximate inverse RFSAI (Bergamaschi and Martinez, 2012) and enriched by a BFGS-like update formula, is proposed to accelerate the preconditioned conjugate gradient solution of the linearized Newton system for Au = q(u)u, where q(u) is the Rayleigh quotient. In a previous work (Bergamaschi and Martinez, 2013), the sequence of preconditioned Jacobians was proven to remain close to the identity matrix if the initial preconditioned Jacobian is so. Numerical results on matrices arising from various realistic problems with sizes up to 1.5 million unknowns demonstrate the efficiency and scalability of the proposed low-rank update of the RFSAI preconditioner. The overall RFSAI-BFGS preconditioned Newton algorithm shows efficiency comparable to that of a well-established eigenvalue solver on all the test problems.
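For orientation, the standard objects involved are sketched below in the usual notation; the exact Jacobian and update used in the paper may differ in details (projections, scalings) not recoverable from the abstract.

```latex
% Rayleigh quotient and the nonlinear eigenvalue residual.
\[
  q(u) = \frac{u^{\top} A u}{u^{\top} u}, \qquad
  F(u) = A u - q(u)\,u = 0 .
\]
% Linearized Newton system solved by PCG at step k: find the
% correction s_k with J_k s_k = -F(u_k), where (up to projection
% terms) J_k = A - q(u_k) I, then set u_{k+1} = u_k + s_k.
\[
  J_k \, s_k = -\bigl(A u_k - q(u_k)\,u_k\bigr), \qquad
  u_{k+1} = u_k + s_k .
\]
```

The preconditioner sequence tracks the slowly varying Jacobians J_k, which is why keeping each preconditioned Jacobian close to the identity is the relevant property.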
The compensation of the scale factor imposes significant computation overhead on the CORDIC algorithm. In this paper we present two algorithms and the corresponding architectures (one for both rotation and vectoring modes and the other for rotation mode only) that perform the scale factor compensation in parallel with the classical CORDIC iterations. With these methods, the scale factor compensation overhead is reduced to a couple of iterations for any word length. The architectures presented have been optimized for both conventional and redundant arithmetic.
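As a baseline for what is being optimized, here is a conventional floating-point CORDIC rotation sketch that compensates the scale factor with a final multiplication by 1/K. That serial post-multiply (or the equivalent extra scaling iterations) is exactly the overhead the paper hides by compensating in parallel with the main iterations.

```cpp
#include <cmath>
#include <cstdio>

// Conventional CORDIC in rotation mode: rotates (1, 0) by `angle`
// using shift-add micro-rotations. Every micro-rotation stretches
// the vector by sqrt(1 + 2^-2i); the product of these factors is
// the scale factor K ~= 1.6468, compensated here by a final
// multiply with 1/K (a constant in hardware).
void cordicRotate(double angle, int iters, double& cosOut, double& sinOut) {
  double x = 1.0, y = 0.0, z = angle;
  double invK = 1.0;
  for (int i = 0; i < iters; ++i) {
    double t = std::ldexp(1.0, -i);            // 2^-i
    invK /= std::sqrt(1.0 + t * t);            // accumulate 1/K
    double d = (z >= 0.0) ? 1.0 : -1.0;        // rotation direction
    double xn = x - d * y * t;
    double yn = y + d * x * t;
    z -= d * std::atan(t);                     // subtract atan(2^-i)
    x = xn; y = yn;
  }
  cosOut = x * invK;                           // scale factor compensation
  sinOut = y * invK;
}

int main() {
  double c, s;
  cordicRotate(0.5, 32, c, s);
  std::printf("cos(0.5)=%.6f sin(0.5)=%.6f\n", c, s);  // ~0.877583 0.479426
}
```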
The paper recalls the period 1988-1993, when the research on parallel algorithms and their implementation started in Karl-Marx-Stadt (renamed to Chemnitz in 1990). We consider the research group formed at this time and ...
The purpose of this short note is to show that the problem of reconstructing a directed forest from a collection of leaf-to-root paths can be done efficiently in parallel by reducing the problem to integer sorting. Specifically, given M, the total length of the paths in the collection, and n, the number of distinct node labels, our algorithm reconstructs the corresponding forest (if such a forest exists) in O(M/p) time using p ≤ M/n processors, or, with a different time bound, using M/n < p < M processors, and O(M) space on the EREW PRAM.
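A sequential rendering of the reduction makes the idea concrete: every consecutive pair on a path asserts one (child, parent) edge, so sorting and deduplicating the pairs either yields each node's unique parent or exposes a contradiction. In the paper the sort is a parallel integer sort, which is where the stated bounds come from; the sketch below is serial and the names are illustrative.

```cpp
#include <algorithm>
#include <optional>
#include <vector>

// Reconstruct parent pointers from leaf-to-root paths, or return
// std::nullopt if no forest is consistent with the collection.
// Nodes are labeled 0..n-1; parent[root] == root by convention here.
std::optional<std::vector<int>> reconstructForest(
    int n, const std::vector<std::vector<int>>& paths) {
  std::vector<std::pair<int, int>> edges;      // (child, parent) claims
  for (const auto& path : paths)
    for (size_t i = 0; i + 1 < path.size(); ++i)
      edges.emplace_back(path[i], path[i + 1]);

  // The reduction: sorting groups all claims about the same child
  // together (the paper performs this step with parallel integer sort).
  std::sort(edges.begin(), edges.end());
  edges.erase(std::unique(edges.begin(), edges.end()), edges.end());

  std::vector<int> parent(n);
  for (int v = 0; v < n; ++v) parent[v] = v;   // default: own root
  for (size_t i = 0; i < edges.size(); ++i) {
    if (i > 0 && edges[i].first == edges[i - 1].first)
      return std::nullopt;                     // two parents for one child
    parent[edges[i].first] = edges[i].second;
  }
  // A complete checker would also verify acyclicity (e.g., by pointer
  // jumping in the parallel setting); omitted here for brevity.
  return parent;
}
```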
ISBN: (print) 9781450392686
Genetic programming (GP) has been applied to image classification and has achieved promising results. However, most GP-based image classification methods are only applied to small-scale image datasets because of their high computation cost. Efficient acceleration techniques are needed when extending GP-based image classification methods to large-scale datasets. Considering that fitness evaluation is the most time-consuming phase of the GP evolution process and is highly parallelizable, this paper proposes a CPU multiprocessing and GPU parallelization approach to perform this phase and thus effectively accelerate GP for image classification. Through various experiments, the results show that the highly parallelized approach can significantly accelerate GP-based image classification without performance degradation. The training time of the GP-based image classification method is reduced from several weeks to tens of hours, enabling it to run on large-scale image datasets.
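The CPU-side half of such a scheme reduces to evaluating the population's fitness in parallel, since individuals are independent of one another. A minimal OpenMP sketch is below; the paper additionally offloads evaluation work to the GPU, which is not shown, and all names here are illustrative.

```cpp
#include <functional>
#include <vector>

struct Individual {            // a GP program plus its fitness slot
  // ... program representation omitted ...
  double fitness = 0.0;
};

// Evaluate every individual on the full dataset in parallel.
// Individuals are independent, so fitness evaluation -- the dominant
// cost of a GP generation -- parallelizes with no synchronization
// beyond the implicit barrier at the end of the loop.
void evaluatePopulation(
    std::vector<Individual>& population,
    const std::function<double(const Individual&)>& fitnessOnDataset) {
#pragma omp parallel for schedule(dynamic)
  for (long i = 0; i < (long)population.size(); ++i)
    population[i].fitness = fitnessOnDataset(population[i]);
}
```

Dynamic scheduling matters here because GP trees vary widely in size, so per-individual evaluation times are uneven.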