检索结果-内蒙古大学图书馆

Parallel Communication-Avoiding Algorithm for triangular matrix inversion on Homogeneous and Heterogeneous Platforms

引用

INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 2015年第4期43卷 631-655页

作者： Mahfoudhi, Ryma Mahjoub, Zaher Nasri, Wahid Univ Tunis El Manar Fac Sci Tunis Tunis 2092 Tunisia Higher Sch Sci & Tech Tunis Tunis 1008 Tunisia

We address in this paper the parallelization of a recursive algorithm for large scale triangular matrix inversion based on the 'Divide and Conquer' (D&C) paradigm. A set of different versions of an original sequential algorithm are first presented. A theoretical performance study permits to establish an accurate comparison between the designed algorithms. Afterwards, we develop in the second part of the paper, an optimal parallel avoiding-communication algorithm for a given number of available homogeneous and heterogeneous processors. To reach this target, we use a so called 'non equitable and incomplete' version of the D&C paradigm consisting in recursively decomposing the original problem into two sub-problems of non equal sizes, then decomposing only one sub-problem in the same previous manner. The theoretical study is validated by a series of experiments achieved on three target platforms, namely an 8-core shared memory machine, a distributed memory cluster and a heterogeneous CPU-GPU cluster. The obtained results permit to illustrate the interest of the contribution.

关键词： Communication-avoiding Cost optimality Distributed memory machine Divide and Conquer GPU Homogeneous platform Heterogeneous platform Load balancing Parallel algorithm Recursive algorithm Scheduling Shared memory machine triangular matrix inversion

来源：评论

学校读者我要写书评

暂无评论

Parallel Communication-Free Algorithm for triangular matrix inversion on Heterogenoues Platform

Parallel Communication-Free Algorithm for Triangular Matrix ...

引用

Federated Conference on Computer Science and Information Systems (FedCSIS)

作者： Mahfoudhi, Ryma Mahjoub, Zaher Nasri, Wahid Univ Tunis El Manar Fac Sci Tunis Manar 2 Tunis 2092 Tunisia Higher Sch Sci & Techn Tunis Tunis 1008 Tunisia

ISBN: (纸本)9788360810484

We address in this paper the parallelization of a recursive algorithm for triangular matrix inversion (TMI) based on the 'Divide and Conquer' (D&C) paradigm. A series of different versions of an original sequential algorithm are first presented. A theoretical performance study permits to establish an accurate comparison between the designed algorithms. Afterwards, we develop an optimal parallel communication-free algorithm targeting a heterogeneous environment involving processors of different speeds. For this purpose, we use a non equitable and incomplete version of the D&C paradigm consisting in recursively decomposing the original TMI problem in two subproblems of non equal sizes, then decomposing only one subproblem and so on. The theoretical study is validated by a series of experiments achieved on two platforms, namely an 8-core shared memory machine and a distributed memory cluster of 16 nodes. The obtained results permit to illustrate the interest of the contribution.

关键词： communication free divide and conquer heterogeneous platform parallel algorithm recursive algorithm triangular matrix inversion

来源：评论

学校读者我要写书评

暂无评论

Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers

引用

PARALLEL COMPUTING 2001年第13期27卷 1767-1782页

作者： Nasri, W Mahjoub, Z Fac Sci Tunis Dept Informat Tunis 1060 Tunisia

This paper studies the parallelization of a recursive algorithm for triangular matrix inversion (TMI), using the "divide and conquer" paradigm. For a (large scale) matrix of size n = m2(k) (m,k greater than or equal to 1) and p = 2(q) (less than or equal to n/2) available processors, we first construct an adequate 2-phases task segmentation and inducing a balanced layered task graph. Then. we design a greedy scheduling leading to a cost optimal parallel algorithm, i.e. whose efficiency is equal to 1 for large n. The practical interest of the contribution is proven through an experimental study of two versions of the original algorithm on an IBM SP1 distributed memory multiprocessor. (C) 2001 Published by Elsevier Science B.V.

关键词： divide and conquer parallel algorithm recursive schemes scheduling load balancing triangular matrix inversion

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：