检索结果-内蒙古大学图书馆

parallelization in an HPF language processor

NEC RESEARCH & DEVELOPMENT 1998年第4期39卷 414-421页

作者： Hayashi, Y Sakon, S Seo, Y Suehiro, K Tamura, M Murai, H NEC Corp Ltd Comp Software Div 1 Tokyo Japan NEC Corp Ltd C&C Media Res Labs Tokyo Japan

We have developed an HPF (High Performance Fortran) language processor for SX-4 series, aimed at distributed memory multiprocessor systems. HPF is a de facto standard data-parallel language mainly aimed at distributed memory multiprocessor systems. HPF allows users to develop parallelized programsdby only specifying how to map data onto processors. The HPF compiler partitions computation among processors based on specified mapping information, and generates necessary data transfers. Therefore, both how to map computation onto processors and how to achieve high-speed data transfer are important for the efficient implementation of HPF compilers. This paper describes automatic parallelization and data transfer technology in NEC's HPF language processor. This paper also discusses the utilization of shared memory parallelization and vectorization on SX-4 and SX-5 series.

关键词： HPF (High Performance Fortran) distributed memory parallelization data transfer data-parallel language compiler language processor

来源：评论

学校读者我要写书评

暂无评论

Compiling array references with affine functions for data-parallel programs

引用

JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 1998年第4期14卷 695-723页

作者： Wei, WH Shih, KP Sheu, JP Natl Cent Univ Dept Comp Sci & Informat Engn Chungli 320 Taiwan

An important research topic is parallelizing of compilers to generate local memory access sequences and communication sets while compiling a data-parallel language into an SPMD (Single Program Multiple data) program. In this paper, we present a scheme to efficiently enumerate local memory access sequences and to evaluate communication sets. We use a class table to store information that is extracted from array sections and data distribution patterns. Given array references and data distributions, we can utilize the class table to generate communication sets in closed forms. Furthermore, we derive the algorithms for sending and receiving necessary data between processors. An algorithm for generating the class table is presented, and the time complexity of this algorithm is O(s), where s is the array section stride. The technique of generating communication sets for one index variable has been implemented on a DEC Alpha 3000 workstation. The experimental results confirm the advantage of our scheme, especially when the array section stride is larger than the block size. Finally, we adapt our approach to handle array references with multiple index variables. The time complexity for constructing the whole class table is O(s(2)).

关键词： communication set data-parallel language distributed memory multicomputers HPF parallelizing compilers SPMD

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：