检索结果-内蒙古大学图书馆

A performance predictor for implementation selection of parallelized static and temporal graph algorithms

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2022年第2期34卷

作者： Rehman, Akif Ahmad, Masab Khan, Omer Univ Connecticut Elect & Comp Engn Storrs CT USA

Task-based execution of graph workloads allows various ordered and unordered implementations, with tasks representing dependencies between graph vertices and edges. This work explores graph algorithms in the context of ordered and unordered task-based implementations, that trade-off work-efficiency with parallelism. The monotonicity of convergent graph solutions is the reason behind the trade-off between work-efficiency and parallelism. This trade-off results in variable performance-based choices within and across different machines (CPUs and GPUs), graph algorithms, implementations (ordered, relaxed, and unordered). Input graphs also augment this choice space, with this work analyzing temporally changing graphs in addition to the static graphs explored by prior works. These algorithmic and architectural choices are first explored in this work, and it is seen that different graph workload-input combinations perform optimally on diverse architectural configurations. The resulting choice space is analyzed and this work represents it in the form of characteristic variables that correlate with each choice space. Using these characteristic variables, this work proposes analytical and neural network models to correlate these choice spaces to find the best performing implementation. The variables and the prediction models proposed in this work are also integrated with a state-of-the-art performance predictor on a multiaccelerator setup, and shows geometric performance gains of 54% on a CPU, 14% on a GPU, and 31.5% in a multiaccelerator setup over baseline implementations without performance prediction.

关键词： ordered algorithms parallel graph algorithms temporal graphs unordered algorithms

来源：评论

学校读者我要写书评

暂无评论

A parallel algorithm for maximal matching based on depth first search

引用

parallel algorithms and Applications 1995年第3-4期5卷 161-164页

作者： Datta, Alak K. Sen, Ranjan K. Department of Mathematics Indian Institute of Technology Kharagpur 721302 India Department of Computer Science and Engineering Indian Institute of Technology Kharagpur 721302 India

We present a new parallel algorithm for finding a maximal matching of a graph. The time required by our algorithm is O(TD(n)log n) and the number of processors used is PD(n), where TD(n) and PD(n) are the time and num... 详细信息

We present a new parallel algorithm for finding a maximal matching of a graph. The time required by our algorithm is O(T_D(n)log n) and the number of processors used is P_D(n), where T_D(n) and P_D(n) are the time and number of processors needed for a Depth First Search (DFS) of the graph. © 1995, Taylor & Francis Group, LLC. All rights reserved.

关键词： Depth-first-search Interval graphs Matching parallel graph algorithms Planar graphs Vertex cover

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：