The difficulty of multithreaded programming remains a major obstacle for programmers to fully exploit multicore chips. Transactional memory (TM) has been proposed as an abstraction capable of ameliorating the challenges of traditional lock-based parallel programming. Hardware transactional memory (HTM) systems implement the mechanisms needed to provide transactional semantics efficiently. To keep the hardware simple, current HTM designs apply fixed policies tuned for the most commonly expected application behaviour, and many of these proposals explicitly assume that commits will be far more frequent than aborts in future transactional workloads. This paper shows that some applications developed under the TM programming model are by nature prone to experience many conflicts. As a result, aborted transactions can become common and may seriously hurt performance. Our characterization, performed with truly transactional benchmarks on the LogTM system, shows that certain programs composed of large transactions indeed suffer very high abort rates. Thus, if TM is to unburden developers from the programmability-performance trade-off, HTM systems must deliver good performance in the presence of frequent aborts, which requires more flexible data versioning policies as well as more sophisticated recovery schemes.
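The abort-and-retry cost discussed above can be sketched with a minimal software model of optimistic, versioned memory. This is illustrative only (class and field names are hypothetical), not LogTM's actual hardware mechanism: a transaction reads a value and its version, computes, and commits only if the version is still current; a conflicting writer forces an abort and the work is repeated.

```python
import threading

class VersionedCell:
    """Toy versioned memory cell: a commit succeeds only if the version
    observed at read time is still current (optimistic concurrency)."""
    def __init__(self, value=0):
        self.value = value
        self.version = 0
        self._lock = threading.Lock()

    def read(self):
        with self._lock:
            return self.value, self.version

    def try_commit(self, new_value, read_version):
        with self._lock:
            if self.version != read_version:
                return False          # conflict detected: abort
            self.value = new_value
            self.version += 1
            return True

def transactional_increment(cell, times, stats):
    """Each increment is a tiny transaction; every abort re-executes it,
    so high abort rates translate directly into wasted work."""
    for _ in range(times):
        while True:                    # retry loop
            v, ver = cell.read()
            if cell.try_commit(v + 1, ver):
                stats['commits'] += 1
                break
            stats['aborts'] += 1
```

The longer the transaction, the more work each abort discards, which is why programs built from large, conflict-prone transactions are hit hardest.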
The paper focuses on the implementation of an edge point chaining algorithm under the data parallel programming model. This implementation is not a straightforward transposition of classical algorithms developed so far; indeed, all of those are based on a raster (video) scan of the image and are thus sequential by nature. Therefore, a new data parallel algorithm has been designed. The principle of the data parallel implementation is detailed; the implementation technique is analogous to the parallel region growing algorithm.
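One common way to chain edge points in a data parallel, region-growing style is synchronous label propagation: every edge pixel starts with a unique label, and in each step all pixels simultaneously adopt the minimum label among their 8-connected edge neighbours until a fixpoint. The sketch below assumes this scheme; it illustrates the data parallel idea but is not necessarily the paper's exact algorithm.

```python
def chain_edge_points(edge, max_iters=100):
    """Data-parallel edge chaining sketch via label propagation.
    `edge` is a 2-D list of 0/1 flags; connected edge pixels end up
    sharing one chain label."""
    h, w = len(edge), len(edge[0])
    # Unique initial label per edge pixel; None for non-edge pixels.
    labels = [[r * w + c if edge[r][c] else None for c in range(w)]
              for r in range(h)]
    for _ in range(max_iters):
        changed = False
        new = [row[:] for row in labels]   # synchronous (SIMD-style) update
        for r in range(h):
            for c in range(w):
                if labels[r][c] is None:
                    continue
                best = labels[r][c]
                for dr in (-1, 0, 1):      # scan the 8-neighbourhood
                    for dc in (-1, 0, 1):
                        rr, cc = r + dr, c + dc
                        if 0 <= rr < h and 0 <= cc < w and labels[rr][cc] is not None:
                            best = min(best, labels[rr][cc])
                if best != labels[r][c]:
                    new[r][c] = best
                    changed = True
        labels = new
        if not changed:                    # fixpoint reached
            break
    return labels
```

Because every pixel updates from its neighbours independently in each step, the inner loops map directly onto a data parallel machine, with no sequential raster scan.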
We present several heterogeneous partitioning algorithms for parallel numerical applications. The goal is to adapt the partitioning to dynamic and unpredictable load changes on the nodes. The methods are based on existing homogeneous algorithms such as orthogonal recursive bisection, parallel strips, and scattering. We apply these algorithms to a parallel numerical application running on a network of heterogeneous workstations. The behavior of the individual methods in a system with dynamic load changes and heterogeneous nodes is investigated. In addition, the new methods are compared with the conventional methods for homogeneous partitioning.
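A heterogeneous variant of the parallel-strips method can be sketched as follows: instead of equal-width strips, strip widths are made proportional to each node's currently measured speed, so faster nodes receive more columns. This is a minimal illustration of the idea, not the paper's exact method:

```python
def heterogeneous_strips(total_columns, speeds):
    """Split a grid's columns into parallel strips whose widths are
    proportional to each node's measured speed. The last node absorbs
    the rounding remainder so all columns are assigned exactly once."""
    total_speed = sum(speeds)
    widths, assigned = [], 0
    for i, s in enumerate(speeds):
        if i == len(speeds) - 1:
            w = total_columns - assigned   # remainder goes to the last node
        else:
            w = round(total_columns * s / total_speed)
        widths.append(w)
        assigned += w
    return widths
```

Repartitioning on a load change just means calling this again with fresh speed estimates and migrating the boundary columns.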
Some important issues in engineering the requirements of a distributed software system and methods that facilitate software system design for distributed or parallel implementations are discussed. The issues are presented from a knowledge engineering perspective and are divided into four levels: acquisition; representation; structuring; and design. The acquisition level entails the methods for eliciting system requirements data (attributes and relationships of software entities) from the end-user group using a model of context classes. The representation level deals with the language paradigm for representing the attributes and relationships of the software entities. The structuring level addresses methods for rearranging and grouping the software objects of the context classes into related clusters. The design level deals with methods for mapping or transforming the clusters of software objects into specification modules to facilitate distributed design. To this end, the design level uses an object-based paradigm for specifying the attributes and abstract behavior of the objects within the modules.
ISBN (print): 9780818670886
In this paper we study the problem of scheduling parallel loops at compile time for a heterogeneous network of machines. We consider heterogeneity in three aspects of parallel programming: program, processor, and network. A heterogeneous program has parallel loops with different amounts of work in each iteration; heterogeneous processors have different speeds; and a heterogeneous network has different communication costs between processors. We propose a simple yet comprehensive model for use in compiling for a network of processors, and develop compiler algorithms for generating optimal and sub-optimal loop schedules that address load balancing, communication optimizations, and network contention. Experiments show that a significant performance improvement is achieved using our techniques.
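The load-balancing part of such a compile-time schedule can be sketched with a greedy earliest-finish-time heuristic: iterations with heterogeneous costs are assigned, largest first, to the processor that would finish them soonest given its speed. This sketch covers load balancing only; the paper's model additionally accounts for communication cost and network contention, which are omitted here.

```python
def schedule_iterations(costs, speeds):
    """Greedy static schedule: costs[i] is the work in iteration i
    (heterogeneous program), speeds[j] is processor j's speed
    (heterogeneous processors). Returns the per-processor iteration
    lists and finish times."""
    finish = [0.0] * len(speeds)
    assignment = [[] for _ in speeds]
    # Largest-first assignment (LPT-style) tends to balance better.
    for it in sorted(range(len(costs)), key=lambda i: -costs[i]):
        p = min(range(len(speeds)),
                key=lambda j: finish[j] + costs[it] / speeds[j])
        finish[p] += costs[it] / speeds[p]
        assignment[p].append(it)
    return assignment, finish
```

The schedule's makespan is `max(finish)`; a communication-aware model would add a transfer term to the candidate finish time before taking the minimum.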
Efforts to support high performance computing (HPC) applications' requirements in the context of cloud computing have motivated us to design HPC Shelf, a cloud computing services platform to build and deploy large-scale parallel computing systems. We introduce Alite, the contextual contract system of HPC Shelf, to select component implementations according to requirements of the host application, target parallel computing platform characteristics (e.g., clusters and MPPs), quality of service (QoS) properties, and cost restrictions. It is evaluated through a small-scale case study employing two complementary component-based frameworks. The first one aims to represent components that implement linear algebra computations based on the BLAS interface. In turn, the second one aims to represent parallel computing platforms on the IaaS cloud offered by Amazon EC2 Service.
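The essence of contextual contract resolution can be sketched as constraint filtering plus cost minimization: keep the component implementations whose contract matches the target platform and meets the QoS floor, then pick the cheapest. The field names below are illustrative, not Alite's actual contract language:

```python
def select_implementation(candidates, platform, min_qos):
    """Toy contextual-contract resolution: filter candidate component
    implementations by target platform and QoS requirement, then choose
    the one with minimum cost. Returns None if no candidate qualifies."""
    feasible = [c for c in candidates
                if platform in c['platforms'] and c['qos'] >= min_qos]
    if not feasible:
        return None
    return min(feasible, key=lambda c: c['cost'])
```

In a real system the "platform" term would itself be structured (cluster topology, accelerator availability, instance types), but the select-by-constraints-then-optimize shape stays the same.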
This paper discusses the implementation and evaluation of the reduction of a dense matrix to bidiagonal form on the Trident processor. The standard Golub and Kahan Householder bidiagonalization algorithm, which is rich in matrix-vector operations, and the LAPACK subroutine xGEBRD, which is rich in a mixture of vector, matrix-vector, and matrix operations, are simulated on the Trident processor. We show how to use the Trident parallel execution units, ring, and communication registers to effectively perform the vector, matrix-vector, and matrix operations needed for bidiagonalizing a matrix. The number of clock cycles per FLOP is used as a metric to evaluate the performance of the Trident processor. Our results show that increasing the number of Trident lanes proportionally decreases the number of cycles needed per FLOP. On a 32K×32K matrix with 128 Trident lanes, using matrix-vector operations in the standard Golub and Kahan algorithm yields a speedup of around 1.5× over using vector operations. However, using matrix operations in the xGEBRD subroutine gives a speedup of around 3× over vector operations, and around 2× over using matrix-vector operations in the standard Golub and Kahan algorithm.
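For reference, the Golub and Kahan reduction alternates Householder reflections from the left (zeroing a column below the diagonal) and from the right (zeroing a row beyond the superdiagonal). The pure-Python sketch below shows the textbook structure for an m×n matrix with m ≥ n; it is a readability-oriented reconstruction, not the Trident or LAPACK implementation, and the inner sums are exactly the matrix-vector work the paper maps onto the parallel lanes.

```python
import math

def householder(x):
    """Return (v, beta) with (I - beta * v * v^T) x = alpha * e1,
    choosing the sign of alpha to avoid cancellation."""
    alpha = -math.copysign(math.sqrt(sum(xi * xi for xi in x)), x[0])
    v = x[:]
    v[0] -= alpha
    norm2 = sum(vi * vi for vi in v)
    beta = 0.0 if norm2 == 0 else 2.0 / norm2
    return v, beta

def bidiagonalize(A):
    """Golub-Kahan reduction of an m x n matrix (m >= n, list of lists)
    to upper bidiagonal form via two-sided Householder reflections."""
    m, n = len(A), len(A[0])
    B = [row[:] for row in A]
    for k in range(n):
        # Left reflection: zero B[k+1:, k] (column below the diagonal).
        v, beta = householder([B[i][k] for i in range(k, m)])
        for j in range(k, n):
            s = sum(v[i - k] * B[i][j] for i in range(k, m))
            for i in range(k, m):
                B[i][j] -= beta * s * v[i - k]
        if k < n - 2:
            # Right reflection: zero B[k, k+2:] (row past the superdiagonal).
            v, beta = householder([B[k][j] for j in range(k + 1, n)])
            for i in range(k, m):
                s = sum(v[j - k - 1] * B[i][j] for j in range(k + 1, n))
                for j in range(k + 1, n):
                    B[i][j] -= beta * s * v[j - k - 1]
    return B
```

Since the reflections are orthogonal, the Frobenius norm of the matrix is preserved, which gives a convenient correctness check.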
Increasing system complexity of SOC applications leads to an increasing demand for powerful embedded DSP processors. To increase the computational power of DSP processors, the number of pipeline stages has been increased to reach higher clock frequencies, and the number of instructions executed in parallel has been increased to raise the computational bandwidth. To program the parallel units, the VLIW (very long instruction word) has been introduced. Programming all parallel units at the same time, however, either leads to an expanded program memory port or to the limitation that only a few units can be used in parallel. To overcome this limitation, this paper proposes a scalable long instruction word (xLIW). The xLIW concept allows full usage of the available units in parallel with optimal code density. An included instruction buffer reduces the power dissipation at the program memory ports during loop handling. The xLIW concept is part of a development project for a configurable DSP.
In order to improve the performance of applications on OpenMP/JIAJIA, we present a new abstraction, the Array Relation Vector (ARV), to describe the relation between the data elements of two consistent shared arrays accessed in one computation phase. Based on the ARV, we use array grouping to eliminate the pseudo distribution of small shared data and to improve page locality. Experimental results show that ARV-based array grouping can greatly improve the performance of applications with non-contiguous data access and strict access affinity on an OpenMP/JIAJIA cluster. For applications with small shared arrays, array grouping noticeably improves performance when the number of processors is small.
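The page-locality benefit of grouping two related arrays can be sketched with toy addresses: when `a` and `b` are placed separately, a phase that reads `a[k]` and `b[k]` together touches two pages; when their elements are interleaved into one grouped array, the pair lands on one page. The sizes and layout below are illustrative, not JIAJIA's actual page management:

```python
def pages_per_access(addr_pair, page_size):
    """Number of distinct DSM pages touched when one computation phase
    reads the given pair of (abstract, element-granular) addresses."""
    return len({addr // page_size for addr in addr_pair})

N, PAGE = 8, 4  # toy sizes: 8-element arrays, 4 elements per page

# Separate placement: a occupies addresses [0, N), b occupies [N, 2N),
# so phase k touches a[k] and b[k] on two different pages.
separate = [(k, N + k) for k in range(N)]

# Grouped placement: a[k] and b[k] interleaved into one array, so each
# phase touches two adjacent addresses on the same page.
grouped = [(2 * k, 2 * k + 1) for k in range(N)]
```

Halving the pages touched per phase cuts the page faults and coherence traffic a software DSM must pay for, which is where the reported speedups come from.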