A technique for partitioning and mapping algorithms into VLSI systolic arrays is presented in this paper. Algorithm partitioning is essential when the size of a computational problem is larger than the size of the VLSI array intended for that problem. Computational models are introduced for systolic arrays and iterative algorithms. First, we discuss the mapping of algorithms into arbitrarily large VLSI arrays. This mapping is based on the idea of algorithm transformations. Then, we present an approach to algorithm partitioning which is also based on algorithm transformations. Our approach to the partitioning problem is to divide the algorithm index set into bands and to map these bands into the processor space. The partitioning and mapping technique developed throughout the paper is summarized as a six-step procedure. A computer program implementing this procedure was developed, and some results obtained with this program are presented.
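The band-partitioning idea can be illustrated with a small sketch: the algorithm index set is cut into bands no wider than the processor array, and index points within a band are placed on processors. The linear placement `j % num_procs` below is an illustrative mapping for a linear array, not the paper's specific transformation.

```python
def partition_into_bands(n, band_width):
    """Split the j-axis of an n-wide index set into bands of the given width."""
    return [(start, min(start + band_width, n))
            for start in range(0, n, band_width)]

def map_band_to_processors(band, n_i, num_procs):
    """Assign each index point (i, j) of one band to a processor (j mod P)."""
    j_lo, j_hi = band
    return {(i, j): j % num_procs
            for i in range(n_i)
            for j in range(j_lo, j_hi)}

bands = partition_into_bands(8, 4)   # an 8-wide index set on a 4-processor array
print(bands)                         # [(0, 4), (4, 8)]
```

Each band then fits the fixed-size array; bands are executed one after another, which is what makes the approach applicable when the problem exceeds the array size.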
This paper addresses the problem of designing a family of potential processor arrays for the execution of the so-called Jacobi algorithms. It extends the more familiar problem of designing a single fixed-size processor array for a particular program and is parametrised with respect to size in two ways. Firstly, the program is no longer a particular one but is a member of a set of related programs. Secondly, the processor array itself is now also parametrised with respect to its dimension and size. There are thus three parameters involved: one to identify the program, one to select the program's size, and one for the possible dimensions/sizes of the array implementation. The approach proposed in this paper is to use the design model and methods which have been used so far for the 'one array for one program' design problem and to provide, instead of a processor array, a parameter-controlled generic processor and a program to generate the control for the execution of a selected program on a specific array of such processors. This allows a user to compose an array out of a number of these generic processors and to generate the control signals that actually execute the selected program. The control signals propagate down the array and instruct each processor how to process the incoming data. The control is hierarchical in the sense that a processor decodes and processes the incoming control signals so as to fix its internal behaviour. The more processors are used, the less sequential the execution of the program will be. The generic processor uses Cordic arithmetic for its processing part, and in addition it consists of a communication part and an internal memory bank. Communication between processors is asynchronous, while the internal timing is clocked.
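The Cordic arithmetic at the heart of such a generic processor computes rotations with only shifts, adds, and a final scale correction. A floating-point software sketch of the rotation mode is given below; a hardware processing part would use fixed-point shift-and-add, and the iteration count here is an illustrative choice.

```python
import math

def cordic_rotate(x, y, theta, iterations=32):
    """Rotate the vector (x, y) by angle theta (radians) using CORDIC.

    Each step rotates by +/- atan(2^-k); the final multiplication by K
    undoes the accumulated magnitude gain of the pseudo-rotations.
    Converges for |theta| below about 1.74 rad.
    """
    K = 1.0
    z = theta
    for k in range(iterations):
        K /= math.sqrt(1.0 + 2.0 ** (-2 * k))      # accumulate gain correction
        d = 1 if z >= 0 else -1                     # rotation direction
        x, y = x - d * y * 2.0 ** -k, y + d * x * 2.0 ** -k
        z -= d * math.atan(2.0 ** -k)               # remaining angle
    return x * K, y * K
```

In the array context, each generic processor would apply such rotations to incoming data streams under the direction of the propagated control signals.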
We demonstrate the feasibility of a distributed implementation of the Goldberg-Tarjan algorithm for finding the maximum flow in a network. Unlike other parallel implementations of this algorithm, where the network graph is partitioned among many processors, we partition the algorithm among processors arranged in a pipeline. The network graph data are distributed among the processors according to local requirements. The partitioned algorithm is implemented on six processors within a 15-processor pipelined message-passing multicomputer operating at 5 MHz. We used randomly generated networks with integer capacities as examples. Performance estimates based upon a six-processor pipelined implementation indicated a speedup between 4.8 and 5.9 over a single processor.
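For reference, the Goldberg-Tarjan push-relabel algorithm that is partitioned across the pipeline can be sketched in its plain sequential form (this is the single-processor baseline, not the distributed implementation described above; the adjacency-matrix representation is an illustrative choice).

```python
def max_flow_push_relabel(capacity, s, t):
    """Sequential Goldberg-Tarjan push-relabel maximum flow.

    capacity: n x n matrix of edge capacities; returns the max-flow value.
    """
    n = len(capacity)
    flow = [[0] * n for _ in range(n)]
    height = [0] * n
    excess = [0] * n
    height[s] = n                         # source starts at height n
    for v in range(n):                    # saturate all edges out of s
        if capacity[s][v] > 0:
            flow[s][v] = capacity[s][v]
            flow[v][s] = -capacity[s][v]
            excess[v] += capacity[s][v]
            excess[s] -= capacity[s][v]
    active = [v for v in range(n) if v not in (s, t) and excess[v] > 0]
    while active:
        u = active[0]
        pushed = False
        for v in range(n):                # push along admissible edges
            residual = capacity[u][v] - flow[u][v]
            if residual > 0 and height[u] == height[v] + 1:
                delta = min(excess[u], residual)
                flow[u][v] += delta
                flow[v][u] -= delta
                excess[u] -= delta
                excess[v] += delta
                if v not in (s, t) and v not in active:
                    active.append(v)
                pushed = True
                if excess[u] == 0:
                    break
        if not pushed:                    # relabel: lift u just above its
            height[u] = 1 + min(          # lowest residual neighbour
                height[v] for v in range(n)
                if capacity[u][v] - flow[u][v] > 0)
        if excess[u] == 0:
            active.pop(0)
    return sum(flow[s][v] for v in range(n))
```

The locality of push and relabel operations, each touching only one vertex and its neighbours, is what makes the algorithm amenable to the pipelined partitioning reported in the abstract.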
With the objective of minimizing the total execution time of a parallel program on a distributed memory parallel computer, this paper discusses the selection of an optimal supernode shape of a supernode transformation (also known as tiling). We identify three parameters of a supernode transformation: supernode size, relative side lengths, and cutting hyperplane directions. For supernode transformations on algorithms with perfectly nested loops and uniform dependences, we prove the optimality of a constant linear schedule vector and give a necessary and sufficient condition for optimal relative side lengths. We also prove that the total running time is minimized by a cutting hyperplane direction matrix from a particular subset of all valid directions, and we discuss the cases where this subset is unique. The results are derived in continuous space and should be considered approximate. Our model does not include cache effects and assumes an unbounded number of available processors, a communication cost approximated by a constant, uniform dependences, and loop bounds known at compile time. A comprehensive example is discussed with an application of the results to the Jacobi algorithm.
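A supernode (tiling) transformation of a Jacobi-style stencil can be sketched as follows. The rectangular tiles with sides `bi`, `bj` correspond to the size and relative-side-length parameters; the axis-aligned cutting hyperplanes are an illustrative choice, since the paper's optimal directions need not be axis-aligned.

```python
def tiled_jacobi_step(a, bi, bj):
    """One Jacobi averaging pass over the interior of grid a,
    visiting the iteration space tile by tile (tile sides bi x bj).

    Because Jacobi reads only the old grid and writes a new one,
    the tiled traversal produces the same result as the untiled loop.
    """
    n = len(a)
    out = [row[:] for row in a]
    for ti in range(1, n - 1, bi):            # tile origin along i
        for tj in range(1, n - 1, bj):        # tile origin along j
            for i in range(ti, min(ti + bi, n - 1)):
                for j in range(tj, min(tj + bj, n - 1)):
                    out[i][j] = 0.25 * (a[i - 1][j] + a[i + 1][j]
                                        + a[i][j - 1] + a[i][j + 1])
    return out
```

On a distributed-memory machine each tile would be assigned to a processor, and the tile shape trades off computation per tile against the boundary data exchanged between tiles, which is the cost balance the paper's analysis optimizes.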