Run-time data redistribution can enhance algorithm performance on distributed-memory machines. Explicit redistribution of data can be performed between algorithm phases when a different data decomposition is expected to deliver increased performance for a subsequent phase of computation. Redistribution, however, represents increased program overhead, as algorithm computation is suspended while data are exchanged among processor memories. In this paper, we present a technique that minimizes the amount of data exchanged for BLOCK to CYCLIC(c) (or vice versa) redistributions over an arbitrary number of dimensions. Preserving the semantics of the target (destination) distribution pattern, the technique manipulates the data-to-logical-processor mapping of the target pattern. When implemented on an IBM SP, the mapping technique demonstrates redistribution performance improvements of approximately 40% over traditional data-to-processor mapping. Relative to the traditional mapping technique, the proposed method affords greater flexibility in specifying precisely which data elements are redistributed and which elements remain on-processor.
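To make the BLOCK and CYCLIC(c) distributions concrete, the following is a minimal 1-D sketch (not the paper's algorithm, and all names are illustrative) that computes each element's owning processor under both patterns and counts how many elements would have to move in a naive BLOCK to CYCLIC(c) redistribution:

```python
# Illustrative sketch: BLOCK vs CYCLIC(c) data-to-processor mappings for a
# 1-D array of n elements over p processors. Counts elements whose owner
# changes under a naive BLOCK -> CYCLIC(c) redistribution. The function
# names and parameters are assumptions for illustration only.

def block_owner(i, n, p):
    """Processor owning element i under a BLOCK distribution."""
    block = (n + p - 1) // p          # ceiling block size
    return i // block

def cyclic_owner(i, c, p):
    """Processor owning element i under a CYCLIC(c) distribution."""
    return (i // c) % p

def elements_moved(n, c, p):
    """Count elements that change owner in a BLOCK -> CYCLIC(c) redistribution."""
    return sum(1 for i in range(n)
               if block_owner(i, n, p) != cyclic_owner(i, c, p))

if __name__ == "__main__":
    n, c, p = 16, 2, 4
    print([block_owner(i, n, p) for i in range(n)])
    print([cyclic_owner(i, c, p) for i in range(n)])
    print(elements_moved(n, c, p))   # most, but not all, elements move
```

For n=16, c=2, p=4, twelve of the sixteen elements change owner; the paper's contribution is to shrink exactly this kind of exchange volume by remapping data to logical processors.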
ISBN:
(print) 3540593934
A set of so-called cortical images, motivated by the function of simple cells in the primary visual cortex of mammals, is computed from each of two input images, and an image pyramid is constructed for each cortical image. The two sets of cortical image pyramids are matched synchronously and an optimal mapping of one image onto the other is determined. The method was implemented on the Connection Machine CM-5 of the University of Groningen in the data-parallel programming model and applied to the problem of face recognition.
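As a rough illustration of the pyramid construction step (not the paper's cortical-image method; the averaging scheme and function names are assumptions), an image pyramid can be built by repeated 2x2 averaging and downsampling:

```python
# Illustrative sketch: build an image pyramid by repeated 2x2 block
# averaging, halving each dimension per level. Images are plain nested
# lists of floats; this is a generic Gaussian-style pyramid, not the
# cortical-image pipeline described in the abstract.

def downsample(img):
    """Halve each dimension by averaging non-overlapping 2x2 blocks."""
    h, w = len(img), len(img[0])
    return [[(img[2*r][2*c] + img[2*r][2*c+1] +
              img[2*r+1][2*c] + img[2*r+1][2*c+1]) / 4.0
             for c in range(w // 2)]
            for r in range(h // 2)]

def build_pyramid(img, levels):
    """Return [img, downsample(img), ...] with `levels` entries."""
    pyr = [img]
    for _ in range(levels - 1):
        pyr.append(downsample(pyr[-1]))
    return pyr

if __name__ == "__main__":
    img = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
    pyr = build_pyramid(img, 3)
    print([len(level) for level in pyr])   # side length at each level
```

Matching then proceeds coarse-to-fine: a correspondence found at a small level constrains the search at the next, larger level.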
The results are presented of an investigation into the use of the data-parallel programming approach on four different massively-parallel computers: the MasPar MP-1 and MP-2 and the Thinking Machines CM-200 and CM-5. A code to calculate inviscid compressible flow, originally written in FORTRAN 77 for a traditional vector computer, has been rewritten entirely in Fortran 90 to take advantage of the compilers available on the massively-parallel computers. It is shown that the discretization of the governing equations on a regular mesh is well adapted to data-parallelism. For a typical test problem of supersonic flow through a ramped duct, computational speeds have been achieved using these massively-parallel computers that are superior to those obtained using a single processor of a Cray Y-MP. In addition, this study has enabled the question of code portability between the different computers to be assessed.
We describe a system that allows programmers to take advantage of both control and data parallelism through multiple intercommunicating data-parallel modules. This programming environment extends C-type stream I/O to include intermodule communication channels. The programmer writes each module as a separate data-parallel program, then develops a channel linker specification describing how to connect the modules together. A channel linker we have developed loads the separate modules onto the parallel machine and binds the communication channels together as specified. We present performance data demonstrating that a mixed control- and data-parallel solution can yield better performance than a strictly data-parallel solution. The system described currently runs on the Intel iWarp multicomputer.
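The module-and-channel structure can be sketched in miniature: below, two threads connected by a queue stand in for separate data-parallel modules bound by a channel linker on iWarp. This is a hedged illustration of the communication pattern only; all names and the end-of-stream convention are assumptions, not the system's actual API.

```python
# Illustrative sketch: two "modules" linked by a stream channel. A producer
# module writes values to its output channel; a consumer module reads them
# and reduces to a sum. Threads and a Queue stand in for separate
# data-parallel programs and the channel linker's bound channels.
import queue
import threading

def producer(chan, n):
    """First module: emits squared values on its output channel."""
    for i in range(n):
        chan.put(i * i)
    chan.put(None)  # end-of-stream marker (an assumed convention)

def consumer(chan, out):
    """Second module: sums everything read from its input channel."""
    total = 0
    while True:
        item = chan.get()
        if item is None:
            break
        total += item
    out.append(total)

channel = queue.Queue()
results = []
t1 = threading.Thread(target=producer, args=(channel, 5))
t2 = threading.Thread(target=consumer, args=(channel, results))
t1.start(); t2.start()
t1.join(); t2.join()
print(results[0])   # 0 + 1 + 4 + 9 + 16 = 30
```

The design point the abstract makes is that the two modules run concurrently (control parallelism) while each could internally be data-parallel, which is how a mixed solution can outperform a purely data-parallel one.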