检索结果-内蒙古大学图书馆

Wide-area parallel programming using the remote method invocation model

Concurrency and Computation: Practice and Experience 2000年第8期12卷

作者： Rob van Nieuwpoort Jason Maassen Henri E. Bal Thilo Kielmann Ronald Veldema Department of Computer Science Vrije Universiteit De Boelelaan 1081 HV Amsterdam The Netherlands

Java's support for parallel and distributed processing makes the language attractive for metacomputing applications, such as parallel applications that run on geographically distributed (wide-area) systems. To obtain actual experience with a Java-centric approach to metacomputing, we have built and used a high-performance wide-area Java system, called Manta. Manta implements the Java Remote Method Invocation (RMI) model using different communication protocols (active messages and TCP/IP) for different networks. The paper shows how wide-area parallel applications can be expressed and optimized using Java RMI. Also, it presents performance results of several applications on a wide-area system consisting of four Myrinet-based clusters connected by ATM WANs. We finally discuss alternative programming models, namely object replication, JavaSpaces, and MPI for Java. Copyright © 2000 John Wiley & Sons, Ltd.

关键词： Java RMI metacomputing parallel programming

来源：评论

学校读者我要写书评

暂无评论

Challenges - Designing next-generation middleware systems

引用

COMMUNICATIONS OF THE ACM 2002年第6期45卷 39-42页

作者： Tripathi, A Univ Minnesota Dept Comp Sci Minneapolis MN 55455 USA

This framework promises new classes of service, especially in terms of security, for policy-based development of distributed and collaborative applications.

关键词： Middleware ARCHITECTURE Interoperability JavaBeans Operating systems New classes parallel programming Distributed software Distribute actor

来源：评论

学校读者我要写书评

暂无评论

Design and prototype of a performance tool interface for OpenMP

引用

JOURNAL OF SUPERCOMPUTING 2002年第1期23卷 105-128页

作者： Mohr, B Malony, AD Shende, S Wolf, F ZAM Res Ctr Julich Julich Germany Univ Oregon Dept Comp & Informat Sci Eugene OR 97403 USA

This paper proposes a performance tools interface for OpenMP, similar in spirit to the MPI profiling interface in its intent to define a clear and portable API that makes OpenMP execution events visible to runtime performance tools. We present our design using a source-level instrumentation approach based on OpenMP directive rewriting. Rules to instrument each directive and their combination are applied to generate calls to the interface consistent with directive semantics and to pass context information (e.g., source code locations) in a portable and efficient way. Our proposed OpenMP performance API further allows user functions and arbitrary code regions to be marked and performance measurement to be controlled using new OpenMP directives. To prototype the proposed OpenMP performance interface, we have developed compatible performance libraries for the Expert automatic event trace analyzer [17, 18] and the TAU performance analysis framework [13]. The directive instrumentation transformations we define are implemented in a source-to-source translation tool called OPARI. Application examples are presented for both Expert and TAU to show the OpenMP performance interface and OPARI instrumentation tool in operation. When used together with the MPI profiling interface (as the examples also demonstrate), our proposed approach provides a portable and robust solution to performance analysis of OpenMP and mixed-mode (OpenMP+MPI) applications.

关键词： performance analysis parallel programming OpenMP

来源：评论

学校读者我要写书评

暂无评论

A comparison of three programming models for adaptive applications on the Origin2000

引用

JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2002年第2期62卷 241-266页

作者： Shan, HZ Singh, JP Oliker, L Biswas, R Princeton Univ Dept Comp Sci Princeton NJ 08544 USA Univ Calif Berkeley Lawrence Berkeley Lab Natl Energy Res Sci Comp Ctr Berkeley CA 94720 USA NASA Adv Supercomp Div Ames Res Ctr Moffett Field CA 94035 USA

Adaptive applications have computational workloads and communication patterns that change unpredictably at runtime, requiring dynamic load balancing to achieve scalable performance on parallel machines. Efficient parallel implementations of such adaptive applications is therefore a challenging task. In this paper, we compare the performance of and the programming effort required for two major classes of adaptive applications under three leading parallel programming models on an SGI Origin2000 system, a machine that supports all three models efficiently. Results indicate that the three models deliver comparable performance;however, the implementations differ significantly beyond merely using explicit messages versus implicit loads/stores even though the basic parallel algorithms are similar. Compared with the message-passing (using MPI) and SHMEM programming models, the cache-coherent shared address space (CC-SAS) model provides substantial ease of programming at both the conceptual and program orchestration levels, often accompanied by performance gains. However, CC-SAS currently has portability limitations and may suffer from poor spatial locality of physically distributed shared data on large numbers of processors. (C) 2002 Elsevier Science (USA).

关键词： parallel programming shared address space message passing dynamic mesh adaptation N-body problem

来源：评论

学校读者我要写书评

暂无评论

High-level language support for user-defined reductions

引用

JOURNAL OF SUPERCOMPUTING 2002年第1期23卷 23-37页

作者： Deitz, SJ Chamberlain, BL Snyder, L Univ Washington Seattle WA 98195 USA

The optimized handling of reductions on parallel supercomputers or clusters of workstations is critical to high performance because reductions are common in scientific codes and a potential source of bottlenecks. Yet in many high-level languages, a mechanism for writing efficient reductions remains surprisingly absent. Further, when such mechanisms do exist, they often do not provide the flexibility a programmer needs to achieve a desirable level of performance. In this paper, we present a new language construct for arbitrary reductions that lets a programmer achieve a level of performance equal to that achievable with the highly flexible, but low-level combination of Fortran and MPI. We have implemented this construct in the ZPL language and evaluate it in the context of the initialization of the NAS MG benchmark. We show a 45 times speedup over the same code written in ZPL without this construct. In addition, performance on a large number of processors surpasses that achieved in the NAS implementation showing that our mechanism provides programmers with the needed flexibility.

关键词： user-defined reductions parallel programming high-level languages scientific computing

来源：评论

学校读者我要写书评

暂无评论

Zoltan data management services for parallel dynamic applications

引用

COMPUTING IN SCIENCE & ENGINEERING 2002年第2期4卷 90-U1页

作者： Devine, K Boman, E Heaphy, R Hendrickson, B Vaughan, C Sandia Natl Labs Computat Comp & Math Ctr Zoltan Project Albuquerque NM 87185 USA

The Zoltan library is a collection of data management services for parallel, unstructured, adaptive, and dynamic applications that is available as open-source software. It simplifies the load-balancing, data movement, unstructured-communication, and memory usage difficulties that arise in dynamic applications such as adaptive finite-element methods, particle methods, and crash simulations. Zoltan's data-structure-neutral design also lets a wide range of applications use it without imposing restrictions on application data structures. Its object-based interface provides a simple and inexpensive way for application developers to use the library and researchers to make new capabilities available under a common interface

关键词： Open source software Application software Software libraries Packaging machines parallel programming Partitioning algorithms Memory management Testing Educational institutions

来源：评论

学校读者我要写书评

暂无评论

Using chaos-parallel evolutionary programming to solve the flow-shop scheduling problem

Using chaos-parallel evolutionary programming to solve the f...

引用

World Congress on Intelligent Control and Automation (WCICA)

作者： Liu Xingwei Pan Yongxiang Gao Hongmei Xi''an University of Technology China

In the paper, the chaos-parallel evolutionary programming algorithm is presented to solve the flow-shop scheduling problem. First, the individuals of each sub-population in the parallel evolutionary programming are found in the search space by use of the ergodicity properties of chaos states, then each sub-population evolves independently and the best individuals are exchanged between them periodically. Simulation results demonstrate that the new algorithm is efficient for optimizing large scale manufacturing process and the better results can be achieved on both the calculating time and optimizing rate.

关键词： Chaos Genetic programming Job shop scheduling Space technology parallel programming Scheduling algorithm Large-scale systems Manufacturing processes

来源：评论

学校读者我要写书评

暂无评论

Portable runtime support for graph-oriented parallel and distributed programming

Portable runtime support for graph-oriented parallel and dis...

引用

International Symposium on parallel Architectures, Algorithms and Networks (ISPAN)

作者： J. Cao Y. Liu L. Xie B. Mao K. Zhang National Key Laboratory for Novel Software Technology Nanjing University Nanjing China Department of Computing Hong Kong Polytechnic University Hong Kong China Department of Computer Science University of Texas Dallas Richardson TX USA

ISBN: (纸本)0769509363

In this paper, we describe the design and implementation of a portable run-time system for GOP, a graph-oriented programming framework aiming at providing high-bevel abstractions for configuring and programming cooperative parallel processes. The runtime system provides an interface with a library of programming primitives to the low-level facilities required to support graph-oriented communications and synchronization. The implementation is on top of the parallel Virtual Machine (PVM) in a local area network of Sun workstations. Issues related to the implementation of graph operations in a distributed environment are discussed. Performance of the runtime system is evaluated by estimating the overheads associated with using GOP primitives as opposed to PVM.

关键词： parallel programming programming profession Concurrent computing Distributed computing Runtime library Data structures Distributed control Message passing Dynamic programming Portable computers

来源：评论

学校读者我要写书评

暂无评论

The parallel cellular programming model

The parallel cellular programming model

引用

Euromicro Workshop on parallel and Distributed Processing

作者： P.J. Cagnard Computer Science Theory Laboratory Swiss Federal Institute of Technology Lausanne Switzerland

We present a synchronous parallel programming model designed for massively parallel fine grained applications such as cellular automata, finite element methods or partial differential equations. In this model we assume that the number of parallel processes in a program is much larger than the number of processors of the machine on which it is run. We present the computational model and the communication model. We introduce the virtual cellular machine, an abstract machine implementing this programming model which requires means to simulate efficiently the execution of many processes on a single processor; and to use the available communication bandwidth efficiently. Finally, we show an example program written in a prototype language designed for programming the virtual machine.

关键词： parallel programming Concurrent computing Electrical capacitance tomography Computational modeling Prototypes parallel processing Computer science Laboratories Automatic programming Application software

来源：评论

学校读者我要写书评

暂无评论

A multi-locking mechanism on shared object DSM 9

A multi-locking mechanism on shared object DSM

引用

9th International Conference on parallel and Distributed Systems (ICPADS 2002)

作者： Wong, AKL Zhu, WP Univ Queensland Sch Info Tech & Elec Eng St Lucia Qld 4072 Australia

ISBN: (纸本)0769517609

Shared object Distributed Shared Memory (DSM) minimizes the problem of false sharing by allowing programmer to control the sharing size. This shared object approach for distributed parallel programming works well in task parallelism but not in data parallelism. When the data of a shared object is being modified, a lock on that object must be enforced to exclude any concurrent access on that same object. If the shared data within an object is large, internal false sharing would become a problem. We present a multi-locking mechanism for shared object DSM which allows multiple locks be applied to the different data sets of a shared object and thus enhances its concurrency power.

关键词： Australia Computer science Concurrent computing Force control Hardware parallel processing parallel programming programming profession Scattering Size control

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：