We present a parallel implementation of the Car-Parrinello method (1985) for first principles molecular dynamics simulations, in the parallel Virtual Machine (PVM, version 3.3.4) environment on the CRAY-T3D computer. ...
详细信息
We present a parallel implementation of the Car-Parrinello method (1985) for first principles molecular dynamics simulations, in the parallel Virtual Machine (PVM, version 3.3.4) environment on the CRAY-T3D computer. parallelism is achieved at the Single Program Multiple Data (SPMD) level, dividing among processors the calculations on different columns of a large matrix, containing the electronic wavefunctions of a condensed matter system object of the simulation, written in a plane-wave basis. Fast Fourier Transforms (FFTs) of these electronic states are the heaviest computations in the program. Communications are avoided an most of the code, except where the columns must be orthogonalized among each other. Even if it is not possible to make parallel the whole code, we have reached a parallel implementation of the most time-consuming part. We get an excellent behaviour of this part, and a satisfactory global behaviour.
Presents a platform for developing and evaluating control systems for emerging multimedia networks. The platform allows us to closely approximate the functional and dynamic behavior of network control systems. By prov...
详细信息
Presents a platform for developing and evaluating control systems for emerging multimedia networks. The platform allows us to closely approximate the functional and dynamic behavior of network control systems. By providing support for real-time visualization and interactive emulation, it can be used to study multimedia networks in various scenarios, such as different load patterns, network sizes and management operations. The current implementation runs on a KSR-1 and an SP2 parallel processor which are connected to a graphics workstation via ATM links.
We analyze the scheduling aspects of database queries submitted to an abstract model of a very large distributed system. The essential elements of this model are: (a) a finite number of identical processing nodes with...
详细信息
We analyze the scheduling aspects of database queries submitted to an abstract model of a very large distributed system. The essential elements of this model are: (a) a finite number of identical processing nodes with limited storage capacity, (b) a finite number of queries to be serviced, (c) a very large read-only data set that is shared by all queries and (d) a fixed inter-node communication latency. This framework models an important class of applications that use distributed processing of very large data sets. Examples of these applications exist in the very large database and multimedia problem domains. To meet the objective of minimizing flow time of queries while exploiting inter-query locality, various heuristics are proposed and evaluated through extensive simulation.
Most experimental studies of the performance of parallelsimulation protocols use speedup or number of events processed per unit time as the performance metric. Although helpful in evaluating the usefulness of paralle...
详细信息
ISBN:
(纸本)9780818675393
Most experimental studies of the performance of parallelsimulation protocols use speedup or number of events processed per unit time as the performance metric. Although helpful in evaluating the usefulness of parallelsimulation for a given simulation model, these metrics tell us little about the efficiency of the simulation protocol used. In this paper, we describe an Ideal simulation Protocol (ISP), based on the concept of critical path, which experimentally computes the best possible execution time for a simulation model on a given parallel architecture. Since ISP computes the bound by actually executing the model on the given parallel architecture, it is much more realistic than that computed by a uniprocessor critical path analysis. The paper illustrates, using parameterized synthetic benchmarks, how an ISP-based performance evaluation can lead to much better insights into the performance of parallelsimulation protocols than what would be gained from speedup graphs alone.
Advances in massively parallel platforms are increasing the prospects for high performance discrete event simulation. Still the difficulty in parallel programming persists and there is increasing demand for high level...
详细信息
ISBN:
(纸本)9780818675393
Advances in massively parallel platforms are increasing the prospects for high performance discrete event simulation. Still the difficulty in parallel programming persists and there is increasing demand for high level support for building discrete event models to execute on such platforms. We present a parallel DEVS-based (Discrete Event System Specification) simulation environment that can execute on distributed memory multicomputer systems with benchmarking results of a class of high resolution, large scale ecosystem models. Underlying the environment is a parallel container class library for hiding the details of message passing technology while providing high level abstractions for hierarchical, modular DEVS models. The C++ implementation working on the Thinking Machines CM-5 demonstrates that the desire for high level modeling support need not be irreconcilable with sustained high performance.
Heterogeneous network computing allows the development of a single complex application using a distributed network of machines; these machines may differ in terms of CPU and memory capacity and/or architecture and spe...
详细信息
Heterogeneous network computing allows the development of a single complex application using a distributed network of machines; these machines may differ in terms of CPU and memory capacity and/or architecture and specialized functions. We present a modeling technique, based on generalized stochastic Petri nets (GSPNs), for the performance analysis of applications targeted to this class of systems (heterogeneous applications). We illustrate the use of the proposed technique by modeling and analyzing the CASA 3D-REACT heterogeneous application.
This paper presents a methodology for the design of microelectronic embedded systems, which include hardware and software for open-loop and closed-loop control in the area of automotive bus and control systems. An int...
详细信息
This paper presents a methodology for the design of microelectronic embedded systems, which include hardware and software for open-loop and closed-loop control in the area of automotive bus and control systems. An integrated approach to specification and design, analysis and simulation of the overall system has been developed. As embedded systems often require real-time capabilities, the environment presented gives special consideration to these constraints. An example for a concrete application will be given, which shows, how the design of real-time systems is supported by this design methodology using a Rapid Prototyping environment.
A new Conservative algorithm for both parallel and sequential simulation of networks is described. The technique is motivated by the construction of a high performance simulator for ATM networks. It permits very fast ...
详细信息
ISBN:
(纸本)9780818675393
A new Conservative algorithm for both parallel and sequential simulation of networks is described. The technique is motivated by the construction of a high performance simulator for ATM networks. It permits very fast execution of models of ATM systems, both sequentially and in parallel. A simple analysis of the performance of the system is made. Initial performance results from parallel and sequential implementations are presented and compared with comparable results from an optimistic TimeWarp based simulator. It is shown that the conservative simulator performs well when the "density" of messages in the simulated system is high, a condition which is likely to hold in many interesting ATM scenarios.
The Annai integrated tool environment helps exploit the inherent power of distributed-memory parallel computers with standardized languages and convenient programming support. Portable application development is suppo...
详细信息
The Annai integrated tool environment helps exploit the inherent power of distributed-memory parallel computers with standardized languages and convenient programming support. Portable application development is supported in High Performance Fortran and/or with explicit message-passing, using MPI as the machine interface. Integration within a unified tool environment allows the performance monitor and analyzer (PMA) component to interact with source code browsers and the loaded executable on the same terms as when using the parallel debugger. Data distribution and other program information furnished by the parallelization support and compilation systems is also exploited for additional insight. Powerful, directed analysis and interactive graphical summaries address scalability, while detailed charts of the time-varying behavior of individual processes and communication events can also be browsed when desired, always retaining essential reference to the original program source code.
暂无评论