Fork95 is an imperative parallel programming language intended to express algorithms for synchronous shared-memory machines (PRAMs). It is based on ANSI C and offers additional constructs to hierarchically divide processor groups into subgroups and to manage shared and private address subspaces. Fork95 makes the assembly-level synchronicity of the underlying hardware available to the programmer at the language level. Nevertheless, it supports locally asynchronous computation where desired by the programmer. We present a one-pass compiler, fcc, which compiles Fork95 and C programs to the SB-PRAM machine. The SB-PRAM is a lock-step synchronous, massively parallel multiprocessor currently being built at Saarbrücken University, with a physically shared memory and uniform memory access time. We examine three important types of parallel computation frequently used for the parallel solution of real-world problems. While farming and parallel divide-and-conquer are directly supported by Fork95 language constructs, pipelining can easily be expressed using existing language features; an additional language construct for pipelining is not required.
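As a rough illustration of the divide-and-conquer pattern the abstract says Fork95 supports directly (there via group-splitting constructs rather than threads), here is a plain-C pthreads sketch; all names in it are ours, and it is not Fork95 syntax:

```c
/* Hypothetical plain-C sketch of the parallel divide-and-conquer pattern;
 * Fork95 itself would express this by splitting the current processor
 * group into subgroups, one per recursive branch. */
#include <pthread.h>
#include <stdio.h>

typedef struct { const int *a; int lo, hi; long sum; } Task;

static void *dc_sum(void *arg) {
    Task *t = arg;
    if (t->hi - t->lo <= 1024) {             /* small range: solve sequentially */
        t->sum = 0;
        for (int i = t->lo; i < t->hi; i++) t->sum += t->a[i];
    } else {                                 /* split: one half per "subgroup" */
        int mid = t->lo + (t->hi - t->lo) / 2;
        Task left  = { t->a, t->lo, mid, 0 };
        Task right = { t->a, mid, t->hi, 0 };
        pthread_t child;
        pthread_create(&child, NULL, dc_sum, &left);
        dc_sum(&right);                      /* current thread takes one half */
        pthread_join(child, NULL);           /* rejoin, like leaving a subgroup */
        t->sum = left.sum + right.sum;
    }
    return NULL;
}

int main(void) {
    static int a[1 << 16];
    for (int i = 0; i < (1 << 16); i++) a[i] = 1;
    Task root = { a, 0, 1 << 16, 0 };
    dc_sum(&root);
    printf("sum = %ld\n", root.sum);         /* expect 65536 */
    return 0;
}
```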
A suite of High Performance Fortran (HPF) coding examples of practical scientific algorithms is examined in detail, with the idea that on these simple but non-trivial examples we can fairly well understand the issues related to different data distributions, different parallel constructs, and different programming styles (static versus dynamic allocation). The coding examples include 2D stencil solutions of PDEs, the N-body problem, LU factorization, several vector/matrix library routines, and 2D and 3D array redistribution. The performance of the HPF codes is compared to hand-written Fortran codes with message-passing libraries. From 1997 to 1998, HPF compilers improved significantly, such that HPF codes perform as well as Fortran+MPI codes for all the examples investigated here. However, many important peculiarities of HPF coding still exist. (C) 1999 Elsevier Science B.V. All rights reserved.
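For concreteness, here is a minimal sequential C sketch of the kind of 2D stencil PDE computation named above; in HPF the same update would be a single array assignment under a data-distribution directive, and the grid size and iteration count here are arbitrary choices of ours:

```c
/* Illustrative Jacobi sweep over a 2D grid (five-point stencil). */
#include <stdio.h>
#define N 64

static double u[N][N], v[N][N];

int main(void) {
    /* boundary condition: fix the top edge at 1.0, interior starts at 0 */
    for (int j = 0; j < N; j++) u[0][j] = v[0][j] = 1.0;

    for (int iter = 0; iter < 100; iter++) {
        /* each interior point becomes the average of its four neighbours */
        for (int i = 1; i < N - 1; i++)
            for (int j = 1; j < N - 1; j++)
                v[i][j] = 0.25 * (u[i-1][j] + u[i+1][j] +
                                  u[i][j-1] + u[i][j+1]);
        /* copy back; a data-parallel language does this as one array assignment */
        for (int i = 1; i < N - 1; i++)
            for (int j = 1; j < N - 1; j++)
                u[i][j] = v[i][j];
    }
    printf("u[1][N/2] = %f\n", u[1][N/2]);
    return 0;
}
```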
Neural network hardware using a time-shared bus and integer-representation architecture has already been fabricated and reported from the design viewpoint. However, no performance evaluation of this hardware has yet been presented. Computation speed, scalability, and learning accuracy of the hardware are evaluated theoretically and experimentally using the Back Propagation (BP) algorithm. In addition, a mirror-weight assignment technique is proposed for high-speed computation in BP. NETTalk, an English-pronunciation-reasoning task, has been chosen as the target application for BP. In the experiment, recently developed neuro-hardware based on the above architecture and its parallel programming language are used. An outline of the language is described along with the BP programming. Mirror-weight assignment allows a maximum speed of 55.0 MCUPS (Million Connections Updated Per Second) using 256 neurons in the hidden layer (the numbers of neurons in the input and output layers are fixed at 203 and 26, respectively, in NETTalk). In addition, if scalability is defined as a function of the number of neurons in the hidden layer, the machine retains a high scalability of 0.5 when this maximum speed is used. No degradation in learning accuracy occurs when experimental results computed using the neuro-hardware are compared with those obtained by a floating-point-representation architecture (workstation). The experiment indicates that the present integer-representation design of the neuro-hardware is sufficient for NETTalk. Performance has also been evaluated theoretically. For evaluation purposes, it is assumed that most of the execution time is taken up by bus cycles. On the basis of this assumption, an analytical model of computation speed and scalability is proposed. Analytical predictions agree well with experimental results.
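A back-of-the-envelope check of the reported figures, under the assumption (ours, for illustration only) that MCUPS counts each weight once per training pattern and that bias terms are ignored:

```c
/* Rough sanity check of the 55.0 MCUPS figure for the 203-256-26 NETTalk
 * network. Assumption (ours): "connections updated" means each weight is
 * touched once per pattern. */
#include <stdio.h>

int main(void) {
    const long n_in = 203, n_hid = 256, n_out = 26;   /* NETTalk layer sizes */
    const double mcups = 55.0;                        /* reported peak speed */

    long conns = n_in * n_hid + n_hid * n_out;        /* weights in the net */
    /* conns / (mcups * 1e6) seconds = conns / mcups microseconds */
    double us_per_pattern = conns / mcups;

    printf("connections: %ld\n", conns);                   /* 58624 */
    printf("time per pattern: %.1f us\n", us_per_pattern); /* ~1065.9 us */
    return 0;
}
```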
The distributed computer system described in this paper is a set of computer nodes interconnected in an interconnection network via packet-switching links. The nodes communicate with each other by means of message-passing protocols. This paper presents the implementation of rendezvous facilities as high-level primitives provided by a parallel programming language to support interprocess communication and synchronisation.
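A minimal sketch, assuming an Ada-style rendezvous semantics (the sender blocks until the receiver has accepted the message, so the exchange also acts as a synchronisation point); this is our plain-C illustration, not the paper's implementation:

```c
#include <pthread.h>
#include <stdio.h>

typedef struct {
    pthread_mutex_t m;
    pthread_cond_t  cv;
    int msg;
    int full;          /* 1 while a message is waiting to be received */
} Rendezvous;

static Rendezvous r = { PTHREAD_MUTEX_INITIALIZER, PTHREAD_COND_INITIALIZER, 0, 0 };

static void rv_send(int msg) {
    pthread_mutex_lock(&r.m);
    r.msg = msg; r.full = 1;
    pthread_cond_broadcast(&r.cv);
    while (r.full)                      /* block until the receiver takes it */
        pthread_cond_wait(&r.cv, &r.m);
    pthread_mutex_unlock(&r.m);
}

static int rv_recv(void) {
    pthread_mutex_lock(&r.m);
    while (!r.full)                     /* block until a message arrives */
        pthread_cond_wait(&r.cv, &r.m);
    int msg = r.msg; r.full = 0;
    pthread_cond_broadcast(&r.cv);      /* release the blocked sender */
    pthread_mutex_unlock(&r.m);
    return msg;
}

static void *receiver(void *arg) {
    (void)arg;
    printf("received %d\n", rv_recv());
    return NULL;
}

int main(void) {
    pthread_t t;
    pthread_create(&t, NULL, receiver, NULL);
    rv_send(42);                        /* returns only after receipt */
    pthread_join(t, NULL);
    return 0;
}
```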
Effectively using shared-memory multiprocessors requires substantial programming effort. We present the programming language COOL (Concurrent Object-Oriented Language), which was designed to exploit coarse-grained parallelism at the task level in shared-memory multiprocessors. COOL's primary design goals are efficiency and expressiveness. By efficiency we mean that the language constructs should be efficient to implement and that a program should not have to pay for features it does not use. By expressiveness we mean that the language should flexibly support different concurrency patterns, thereby allowing various decompositions of a problem. COOL emphasizes the integration of concurrency and synchronization with data abstraction to ease the task of creating modular and efficient parallel programs. It is an extension of C++, which was chosen because it supports abstract data type definitions and is widely used.
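To illustrate the idea of integrating synchronization with data abstraction, here is a plain-C, monitor-style counter (our sketch, not COOL syntax; COOL attaches such synchronization to C++ objects):

```c
/* Every operation on the counter goes through functions that lock its
 * internal mutex, so callers get monitor-like safety without writing
 * any locking code themselves. */
#include <pthread.h>
#include <stdio.h>

typedef struct {
    pthread_mutex_t m;
    long value;
} Counter;

void counter_init(Counter *c) { pthread_mutex_init(&c->m, NULL); c->value = 0; }

void counter_add(Counter *c, long d) {
    pthread_mutex_lock(&c->m);      /* synchronization lives inside the ADT */
    c->value += d;
    pthread_mutex_unlock(&c->m);
}

long counter_get(Counter *c) {
    pthread_mutex_lock(&c->m);
    long v = c->value;
    pthread_mutex_unlock(&c->m);
    return v;
}

static Counter shared;

static void *worker(void *arg) {
    (void)arg;
    for (int i = 0; i < 100000; i++) counter_add(&shared, 1);
    return NULL;
}

int main(void) {
    counter_init(&shared);
    pthread_t t[4];
    for (int i = 0; i < 4; i++) pthread_create(&t[i], NULL, worker, NULL);
    for (int i = 0; i < 4; i++) pthread_join(t[i], NULL);
    printf("%ld\n", counter_get(&shared));  /* always 400000 */
    return 0;
}
```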
We are convinced that the combination of data-parallel languages and MIMD hardware can make an important contribution to high-speed computing. In this paper, we describe the implementation of two compilers for the data-parallel programming language Dataparallel C. One compiler generates code for Intel and nCUBE hypercube multicomputers; the other generates code for Sequent multiprocessors. We have compiled and executed a suite of Dataparallel C programs, and we present their execution times and speedups on the Intel iPSC/2, the nCUBE 3200, and the Sequent Symmetry.
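As an illustration of the execution model behind a data-parallel language, here is a sketch (names and distribution choice are ours, not Dataparallel C syntax) of the per-processor loop a compiler might emit for the data-parallel statement c = a + b:

```c
/* One statement is conceptually executed by every virtual processor at
 * once; the compiler lowers it to a loop over the elements each physical
 * processor owns (owner-computes rule). */
#include <stdio.h>

#define N      16   /* virtual processors / array elements */
#define NPROC   4   /* physical processors, emulated sequentially here */

int a[N], b[N], c[N];

/* code the compiler might emit for processor `me` */
void emitted_code(int me) {
    for (int i = me; i < N; i += NPROC)  /* cyclic distribution (one choice) */
        c[i] = a[i] + b[i];
}

int main(void) {
    for (int i = 0; i < N; i++) { a[i] = i; b[i] = 10 * i; }
    for (int p = 0; p < NPROC; p++)      /* sequential stand-in for NPROC nodes */
        emitted_code(p);
    for (int i = 0; i < N; i++) printf("%d ", c[i]);
    printf("\n");
    return 0;
}
```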
Programming multiprocessor parallel architectures is a complex task. This paper describes a block-structured scientific programming language, BLAZE, designed to simplify this task. BLAZE contains array arithmetic, ‘forall’ loops, and APL-style accumulation operators, which allow natural expression of fine-grained parallelism. It also employs an applicative, or functional, procedure invocation mechanism, which makes it easy for compilers to extract coarse-grained parallelism using machine-specific program restructuring. Thus BLAZE should allow one to achieve highly parallel execution on multiprocessor architectures, while still providing the user with conceptually sequential control flow.
A central goal in the design of BLAZE is portability across a broad range of parallel architectures. The multiple levels of parallelism present in BLAZE code in principle allow a compiler to extract the types of parallelism appropriate for the given architecture, while neglecting the remainder. This paper describes the features of BLAZE and shows how the language would be used in typical scientific programming.
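A plain-C rendering (ours; BLAZE syntax differs) of what two of the features named above mean: a ‘forall’ whose iterations are independent, and an accumulation that is an associative reduction:

```c
/* The first loop has no cross-iteration dependences, which is exactly the
 * property a forall asserts and a parallelizing compiler exploits. The
 * second is an APL-style accumulation (+/ over an array): associative,
 * so it can be evaluated as a parallel tree rather than left to right. */
#include <stdio.h>

#define N 8

int main(void) {
    double x[N], y[N];
    for (int i = 0; i < N; i++) x[i] = i + 1;

    /* forall i in [0..N): y[i] = x[i] * x[i] */
    for (int i = 0; i < N; i++)
        y[i] = x[i] * x[i];

    /* accumulation: sum = +/ y */
    double sum = 0.0;
    for (int i = 0; i < N; i++)
        sum += y[i];

    printf("sum of squares 1..%d = %.0f\n", N, sum);  /* 204 */
    return 0;
}
```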