检索结果-内蒙古大学图书馆

PERFORMANCE EVALUATION OF PRACTICAL parallel COMPUTER model LogPQ

International Journal of Foundations of Computer Science 2001年第3期12卷 325-340页

作者： TAKAYOSHI TOUYAMA SUSUMU HORIGUCHI Dept. of Electrical and Electronics Engineering Nippon Institute of Technology 4-1 Gakuendai Miyashiro Saitama 345-8501 Japan Graduate School of Information Science Japan Advanced Institute of Science of Technology 1-1 Asahidai Tatsunokuchi Ishikawa 923-1292 Japan

The present super computer will be replaced by a massively parallel computer consisting of a large number of processing elements which satisfy the continuous increasing depend for computing power. Practical parallel computing model has been expected to develop efficient parallel algorithms on massively parallel computers. Thus, we have presented a practical parallel computation model LogPQ by taking account of communication queues into the LogP model. This paper addresses the performance of a parallel matrix multiplication algorithm using LogPQ and LogP models. The parallel algorithm is implemented on Cray T3E and the parallel performances are compared with on the old machine CM-5. This shows that the communication network of T3E has superior buffering behavior than CM-5, in which we don't need to prepare extra buffering on T3E. Although, a little effect remains for both of the send and receive bufferings. On the other hand, the effect of message size remains, which shows the necessity of the overhead and gap proportional to the message size.

关键词： Massively parallel computers parallel computation model LogP model Cray T3E CM-5 MPI

来源：评论

学校读者我要写书评

暂无评论

A parallel computation model and programming language for the description of collaboration among objects

引用

Systems and Computers in Japan 1998年第5期28卷

作者： Naoyasu Ubayashi Atsuo Ohki Yasushi Kuno Graduate School of Systems Management University of Tsukuba 3-29-1 Otsuka Bunkyo Tokyo Japan 112

Several parallel computation models including the Actor model have been proposed. Since these models have only primitive constructs for parallel computation, it is not easy to build a model in terms of what kind of roles objects in the real world play and how they collaborate with each other. When we create a parallel system model with such frameworks, it is difficult to understand the system behavior as a whole. To solve this problem, we propose a new model, the Producer model, and its description language Produce/1. In the Producer model, parallel objects collaborate with each other under the coordination of producer objects. In the Actor model, objects build network topologies and send messages to each other. On the other hand, in the Producer model, we distinguish these two kinds of computations, through the execution of the former by producer objects, and of the latter by actor objects. By reading procedures of producer objects, we can understand collaboration among objects easily. © 1997 Scripta Technica, Inc. Syst Comp Jpn, 28(5): 33–43, 1997

关键词： parallel computation model object oriented description of collaboration among objects message passing mechanism

来源：评论

学校读者我要写书评

暂无评论

An efficient implementation for the BROADCAST Instruction of BSR⁺

引用

IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 1999年第8期10卷 852-863页

作者： Xiang, LM Ushijima, K Akl, SG Stojmenovic, I Kyushu Univ Dept Comp Sci & Commun Engn Higashi Ku Fukuoka 8128581 Japan Queens Univ Dept Comp & Informat Sci Kingston ON K7L 3N6 Canada Univ Ottawa Dept Comp Sci Ottawa ON K1N 9B4 Canada

BSR (Broadcasting with Selective Reduction) is a PRAM more powerful than any CRCW PRAM. in order to extend the Broadcast Instruction of BSR and make it more useful for a large class of applications, this article permits it to use a general form of selection, specifically, an arbitrary relational expression. BSR with general selection is denoted by BSR+. Thus, BSR or BSR with L criteria (k > 1) is BSR+ in a special case. An efficient implementation for the Broadcast Instruction of BSR+ is proposed;requiring (1/k)th of the circuits used by the best previous implementation of BSR with k criteria. Of all PRAMs, BSR+ is the most powerful in computation.

关键词： parallel computation model broadcasting with selective reduction single selection multiple selection general selection

来源：评论

学校读者我要写书评

暂无评论

Research on the Optimal parallel Algorithms of Broadcast-Class Problems

引用

Journal of Computer Science & Technology 1998年第5期13卷 455-463页

作者：李晓峰寿标郑世荣 DepartmentofComputerScience UniversityofScienceandTecnologyofChinaHefei230027PR.China

Speedup is considered as the criterion of determining whether a parallel algorithm is optimal. But broadcast-class problems, existing only on parallel computer system, have no sequential algorithms at all. Speedup standard becomes invalid here. Through this research on broadcast algorithms under several typical parallel computation models,a model-independent evaluation standard min C2 is developed, which can be not only used to determine an optimal broadcasting algorithm, but also normalized to apply to any parallel algorithm. As a new idea, min C2 will lead to a new way in this field.

关键词： Optimal parallel algorithm broadcast-class problem parallel computation model min C^2

来源：评论

学校读者我要写书评

暂无评论

AUTOMATIC TASK GRAPH GENERATION TECHNIQUES

引用

parallel Processing Letters 1995年第4期5卷 527-538页

作者： M. COSNARD M. LOI Laboratoire de l'Informatique du Parallélisme Ecole Normale Supérieure de Lyon CNRS Lyon 69364 France This work was supported in part by the Eureka Eurotops project and the EEC Human Capital Mobility MAP project. This work was supported in part by the Région Rhône-Alpes.

We present a model of parallel computation, the parameterized task graph, which is a compact, problem size independent, representation of some frequently used directed acyclic task graphs. Techniques automating the construction of such a representation, starting from an annotated sequential program are proposed. We show that many important properties of the task graph such as the computational load of the nodes and the communication volume of the edges can be automatically deduced in a problem size independent way.

关键词： parallel computation model task graphs affine dependences computational load communication volume

来源：评论

学校读者我要写书评

暂无评论

Performance predictions for parallel diagonal-implicitly iterated Runge-Kutta methods 95

Performance predictions for parallel diagonal-implicitly ite...

引用

Proceedings of the ninth workshop on parallel and distributed simulation

作者： Thomas Rauber Gudula Rünger Computer Science Department Universität des Saarlandes

ISBN: (纸本)9780818671203

Many simulations in the natural sciences and engineering require the numerical solution of nonlinear differential equations. For this class of numerical methods, we propose an appropriate parallel computation model on distributed memory machines that supports the prediction of execution times. As a case study, we investigate the parallel implementation of the diagonal-implicitly iterated Runge-Kutta method, a solution method for stiff systems of ordinary differential equations. An implementation on the Intel iPSC/860 confirms the accuracy of the prediction model.

关键词： nonlinear differential equations digital simulation performance evaluation Runge-Kutta methods distributed memory machines parallel diagonal-implicitly iterated Runge-Kutta methods simulations Intel iPSC/860 parallel algorithms parallel computation model prediction model

来源：评论

学校读者我要写书评

暂无评论

PLACE TRANSITION NETS WITH DEBIT ARCS

引用

INFORMATION PROCESSING LETTERS 1992年第1期41卷 25-33页

作者： STOTTS, PD GODFREY, P UNIV MARYLAND DEPT COMP SCICOLLEGE PKMD 20742

We add an extension called debit arcs to traditional place/transition nets. A debit arc incident upon a transition represents an always true precondition;when the transition fires, a token is subtracted from the place issuing the debit arc, creating an antioken if no tokens are present to substract. We show that two different policies on how tokens and antitokens annihilate produce two classes of automata with different recognition powers.

关键词： PETRI NETS PLACE TRANSITION NETS AUTOMATA THEORY FORMAL LANGUAGES parallel computation model COLORED NETS HIGH-LEVEL NETS

来源：评论

学校读者我要写书评

暂无评论

AN INTRODUCTION TO FIFO NETS - MONOGENEOUS NETS - A SUBCLASS OF FIFO NETS

引用

THEORETICAL COMPUTER SCIENCE 1985年第2-3期35卷 191-214页

作者： MEMMI, G FINKEL, A LRI F-91405 ORSAYFRANCE UNIV PARIS 11 CTR ORSAYF-91405 ORSAYFRANCE THOMSON CSF LCRF-91401 ORSAYFRANCE

We introduce a new model of parallel computation, the FIFO nets. We show how it can simulate Petri nets and coloured Petri nets and prove that a restriction of it (alphabetical FIFO nets) has the power of Turing machines. Furthermore, we define monogeneous FIFO nets and use the coverability graph for proving that it is decidable whether or not a monogeneous net is bounded and whether or not its language is regular.

关键词： FIFO nets parallel computation model program machines regular languages monogeneous net boundedness coverability graph deterministic languages of Petri nets

来源：评论

学校读者我要写书评

暂无评论

SYNTHESIS OF DECISION-FREE CONCURRENT SYSTEMS FOR PRESCRIBED RESOURCES AND PERFORMANCE

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 1980年第6期6卷 525-530页

作者： MURATA, T Department of Information Engineering University of Illinois

This paper presents a method for synthesizing or growing live and safe marked graph models of decision-free concurrent comutations. The approach is modular in the sense that subsystems r represented by arcs (and nodes) are added one by one without the need of redesigning the entire system. The foliowing properties of marked graph models can be prescribed in the synthesis: liveness (absence of deadlocks), safeness (absence of overflows), the number of reachability classes, the maximum resource (temporary storage) requirement, computation rate (performance), as well as the numbers of arcs and states.

关键词： Prescribed Perfornance Absence Of Overflows Deadlock Freeness Decision Free Concurrent Systems Marked Graphs Maximum Resources Modular Synthesis parallel computation model

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：