检索结果-内蒙古大学图书馆

6TH CONF ON PARALLEL PROCESSING FOR SCIENTIFIC COMPUTING

作者： RAJOPADHYE, S MUDDARANGEGOWDA, M

来源：评论

学校读者我要写书评

暂无评论

DATA REDUCTION AND FAST ROUTING - A STRATEGY FOR EFFICIENT ALGORITHMS FOR MESSAGE-PASSING PARALLEL computers

引用

ALGORITHMICA 1992年第1期7卷 77-89页

作者： SANZ, JLC CYPHER, R 1. Department K54/802 IBM Almaden Research Center 650 Harry Road 95120 San Jose CA USA

This paper presents several algorithms for solving problems using massively parallel simd hypercube and shuffle-exchange computers. The algorithms solve a wide variety of problems, but they are related because they all use a common strategy. Specifically, all of the algorithms use a divide-and-conquer approach to solve a problem with N inputs using a parallel computer with P processors. The structural properties of the problem are exploited to assure that fewer than N data items are communicated during the division and combination steps of the divide-and-conquer algorithm. This reduction in the amount of data that must be communicated is central to the efficiency of the algorithm. This paper addresses four problems, namely the multiple-prefix, data-dependent parallel-prefix, image-component-labeling, and closest-pair problems. The algorithms presented for the data-dependent parallel-prefix and closest-pair problems are the fastest known when N greater-than-or-equal-to P and the algorithms for the multiple-prefix and image-component-labeling problems are the fastest known when N is sufficiently large with respect to P.

关键词： PARALLEL ALGORITHMS simd computers HYPERCUBES ROUTING

来源：评论

学校读者我要写书评

暂无评论

DOMAIN DECOMPOSITION ALGORITHMS OF SCHWARZ TYPE, DESIGNED FOR MASSIVELY PARALLEL computers

DOMAIN DECOMPOSITION ALGORITHMS OF SCHWARZ TYPE, DESIGNED FO...

引用

5TH INTERNATIONAL SYMP ON DOMAIN DECOMPOSITION METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS

作者： BJORSTAD, PE SKOGEN, MD Univ of Bergen Bergen Norway

ISBN: (纸本)0898712882

We discuss implementation of additive Schwarz type algorithms on simd computers. A recursive, additive algorithm is compared with a two-level scheme. These methods are based on a subdivision of the domain into thousands of micro-patches that can reflect local properties, coupled with a coarser, global discretization where the `macro' behavior is reflected. The two-level method shows very promising flexibility, convergence and performance properties when implemented on a massively parallel simd computer.

关键词： DOMAIN DECOMPOSITION SCHWARZ ALGORITHM MASSIVELY PARALLEL ALGORITHMS ELLIPTIC PARTIAL DIFFERENTIAL EQUATIONS simd computers

来源：评论

学校读者我要写书评

暂无评论

ACHIEVING SPEEDUPS FOR APL ON AN simd DISTRIBUTED MEMORY MACHINE

引用

INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 1990年第2期19卷 111-127页

作者： GREENLAW, R SNYDER, L UNIV NEW HAMPSHIRE DEPT COMP SCIDURHAMNH 03824 UNIV WASHINGTON DEPT COMP SCI & ENGNSEATTLEWA 98195

The potential speedup for simd parallel implementations of APL programs is considered. Both analytical and (simulated) empirical studies are presented. The approach is to recognize that nearly 95% of the operators appearing in APL programs are either scalar primitive, reduction or indexing and so the performance of these operators gives a good estimate of the amount of speedup a full program might receive. Substantial speedups are demonstrated for these operators and the empirical evidence accords with the analytical estimates.

关键词： APL DATA PARALLEL PARALLELISM PARALLEL PROGRAMMING simd computers

来源：评论

学校读者我要写书评

暂无评论

PERMUTATIONS ON ILLIAC-IV-TYPE NETWORKS

引用

IEEE TRANSACTIONS ON computers 1986年第7期35卷 662-669页

作者： RAGHAVENDRA, CS KUMAR, VKP Department of Electrical Engineering—Systems University of Southern California

Performing permutations of data on simd computers efficiently is important for high-speed execution of parallel algorithms. In this correspondence we consider realizing permutations such as perfect shuffle, matrix transpose, bit-reversal, the class of bit-permute- complement (BPC), the class of Omega, and inverse Omega permutations on N = 2n processors with Illiac IV-type interconnection network, where each processor is connected to processors at distances of ± 1 and ± N. The minimum number of data transfer operations required for realizing any of these permutations on such a network is shown to be 2(N − 1). We provide a general three-phase strategy for realizing permutations and derive routing algorithms for performing perfect shuffle, Omega, Inverse Omega, bit reversal, and matrix-transpose permutations in 2(N − 1) steps. Our approach is quite simple, and unlike previous approaches, makes efficient use of the topology of the Illiac IV-type network to realize these permutations using the optimum number of data transfers. Our strategy is quite powerful: any permutation can be realized using this strategy in 3(N − 1) steps.

关键词： Bit-permute-complement permutations Omega permutations simd computers interconnection network parallel algorithms permutations

来源：评论

学校读者我要写书评

暂无评论

THE MEASUREMENT OF PERFORMANCE ON A HIGHLY PARALLEL SYSTEM

引用

IEEE TRANSACTIONS ON computers 1983年第1期32卷 32-37页

作者： PARKINSON, D LIDDELL, HM DAP Support Unit Queen Mary College University of London

The problems of measuring the performance of a highly parallel multiple processor system, such as the 4096 element ICL Distributed Array Processor are presented in relation to the conventional methods used for serial processors; this is preceded by a brief description of the DAP hardware in order to. provide a framework for the discussion, together with some of the resulting implications for algorithm design. The importance of choosing algorithms for parallel computation in such a way as to make the best use of the parallelism of the hardware for the problem to be solved is discussed, and examples are given of parallel and hybrid algorithms—in the latter a mixture of serial and parallel techniques are used. A method of comparison of performance at the problem solving level is presented, which is illustrated by results obtained by DAP users studying problems which arise in a wide range of application areas.

关键词： Associative processors simd computers distributed array processor multiple processor systems parallel algorithms parallel computation performance measurement

来源：评论

学校读者我要写书评

暂无评论

QUOTIENT NETWORKS

引用

IEEE TRANSACTIONS ON computers 1982年第4期31卷 288-295页

作者： FISHBURN, JP FINKEL, RA UNIV WISCONSIN DEPT COMP SCIMADISONWI 53706

A large-network algorithm solves a problem of size N on a network of N processors. We present a method for transforming certain large networks into quotient networks that emulate those large networks with fewer processors. Large-network algorithms are easily modified to execute on the quotient network. The emulations result in no loss in execution efficiency. Quotient networks allow algorithms to be designed assuming any number of processors and executed efficiently at a great savings in hardware cost.

关键词： Interconnection networks simd computers large problem size/ machine size parallel FFT theory of parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：