ISBN: (print) 1892512459
Most current cycle-stealing systems, especially those designed for the Internet, require programmers to adopt the master-worker parallel programming paradigm. The master-worker paradigm is extremely limiting from an abstraction point of view, as it requires a single master thread to expose all of the parallelism that the application hopes to exploit. In this paper we describe how we have extended our Internet cycle-stealing system, G2, to support more general parallel programming paradigms in which any task can create subtasks and in which tasks become self-contained units of abstraction. The major problem addressed is how to allow tasks to wait for the results of their subtasks without tying up the resources of volunteer machines. By drawing a programming analogy with the asynchronous implementation of web services in ***, we create a familiar yet powerful programming model for creating parallel programs using a variety of paradigms.
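The idea of a task waiting for its subtasks without occupying a worker can be illustrated with a continuation-passing sketch. This is only a toy model under stated assumptions, not the G2 API: the names `Scheduler`, `fork_join`, `range_sum`, and `parallel_range_sum` are hypothetical, and a single in-process queue stands in for the pool of volunteer machines. A task that needs subtask results registers a callback and returns immediately, so the worker loop is free to run other tasks.

```python
from collections import deque


class Scheduler:
    """Toy single-queue scheduler (hypothetical; not the G2 interface)."""

    def __init__(self):
        self.ready = deque()  # tasks waiting to run: (fn, args, on_done)

    def submit(self, fn, args, on_done):
        self.ready.append((fn, args, on_done))

    def fork_join(self, subtasks, combine, on_done):
        """Queue subtasks and return at once; when the last one finishes,
        pass combine(results) to on_done. No task ever blocks."""
        results = [None] * len(subtasks)
        remaining = [len(subtasks)]

        def make_callback(index):
            def callback(value):
                results[index] = value
                remaining[0] -= 1
                if remaining[0] == 0:
                    on_done(combine(results))
            return callback

        for index, (fn, args) in enumerate(subtasks):
            self.submit(fn, args, make_callback(index))

    def run(self):
        # Stand-in for a worker loop: each task runs to completion.
        while self.ready:
            fn, args, on_done = self.ready.popleft()
            fn(self, *args, on_done)


def range_sum(sched, lo, hi, on_done):
    """Any task may create subtasks, not just a single master."""
    if hi - lo <= 4:
        on_done(sum(range(lo, hi)))
        return
    mid = (lo + hi) // 2
    sched.fork_join(
        [(range_sum, (lo, mid)), (range_sum, (mid, hi))], sum, on_done
    )


def parallel_range_sum(lo, hi):
    sched = Scheduler()
    out = []
    sched.submit(range_sum, (lo, hi), out.append)
    sched.run()
    return out[0]
```

In this sketch the parent task's state lives entirely in the callback closure, which is the property that lets a waiting task release its worker.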
ISBN: (print) 0769518753
In this paper we present first experiences concerning the integration of MPI-based numerical software into an advanced programming environment for building parallel and distributed high-performance applications, which is under development in the context of Italian national research projects. This programming environment, named ASSIST, is based on a combination of the concepts of structured parallel programming and component-based programming. Some activities within the projects are devoted to the definition, implementation and testing of a methodology for the integration of a parallel numerical library into ASSIST. The goal is to provide a set of efficient, accurate and reliable tools that can be easily used as building blocks for high-performance scientific applications. We focus on the integration of existing and widely used MPI-based numerical library modules. To this end, we propose a general approach to embedding MPI computations into the ASSIST basic programming unit. This approach has been tested using the MPICH implementation of MPI for networks of workstations. Some modifications have been applied to the MPICH process startup procedure in order to make it compliant with the ASSIST environment. Results of experiments concerning the integration of routines from a well-known FFT package are discussed.
With the increase in both available computing power and the size and complexity of computer applications, existing programming methodologies and technologies for parallel and distributed computing have demonstrated their inadequacy. New techniques have therefore been designed, and are currently being developed, that aim at providing the user/programmer with higher-level programming methodologies, environments and run-time supports. In this work, we take into account some of these new technologies and discuss their features, both positive and negative. Finally, drawing on our experience in structured parallel programming environment design, we try to summarize which features have to be included in the programming environments of the near future, namely those that answer (or try to answer) today's pressing demand for new, efficient, easy-to-use high-performance programming environments. (C) 2003 Elsevier B.V. All rights reserved.
ISBN: (print) 076952026X
Typical Grid computing scenarios involve many distributed hardware and software components. The more components that are involved, the more likely it is that one of them may fail. In order for Grid computing to succeed, there must be a simple mechanism to determine which component failed and why. Instrumentation of all Grid applications and middleware is an important part of the solution to this problem. However, it must be possible to control and adapt the amount of instrumentation data produced, so as not to be flooded by this data. In this paper we describe a scalable, high-performance instrumentation activation mechanism that addresses this problem.
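The general notion of controlling how much instrumentation data is produced can be sketched as a registry of named instrumentation points that are switched on and off at run time. This is a minimal illustration under stated assumptions, not the mechanism the paper describes: the class `Instrumentation`, its methods, and the event names are all hypothetical, and an in-memory list stands in for whatever sink would collect the data.

```python
class Instrumentation:
    """Hypothetical run-time switchable instrumentation registry."""

    def __init__(self):
        self.active = set()   # names of currently enabled instrumentation points
        self.events = []      # collected (name, data) records

    def activate(self, name):
        self.active.add(name)

    def deactivate(self, name):
        self.active.discard(name)

    def record(self, name, **data):
        # Inactive points cost one set lookup and produce no data.
        if name in self.active:
            self.events.append((name, data))


def transfer(instr, nbytes):
    """Example instrumented operation (hypothetical)."""
    instr.record("transfer.start", nbytes=nbytes)
    instr.record("transfer.end", nbytes=nbytes)
    return nbytes
```

For example, activating only `"transfer.end"` keeps one record per call while the start events are silently dropped, which is the kind of adaptation the abstract refers to.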
ISBN: (print) 0769520464
The X4CP32 is a parallel/reconfigurable microprocessor with two programming levels. Although it is a general-purpose microprocessor, it has the reliable performance of a reconfigurable architecture. This paper presents its architecture and programming levels, and discusses the powerful interaction between parallel programming and reconfiguration. It shows two performance-optimized implementations of matrix multiplication using both parallel and reconfigurable paradigms and a parallel implementation of miner intelligent agents.
ISBN: (print) 0769520669
The systems software necessary to operate large-scale parallel computers presents a variety of research and development issues. One approach is to consider systems software as a collection of interacting components with well-defined, published interfaces. The Scalable Systems Software SciDAC project is currently exploring the feasibility of architecting systems software this way. In this paper we present a prototype process manager component for such a system. We describe the component abstractly in terms of its functionality and the interface by which its functionality may be invoked. We propose a precise syntax for this interface and describe one implementation of the process manager component, based on an existing scalable process management system called MPD. We conclude with some experiences using this process manager component in conjunction with other systems software components on a medium-sized Linux cluster.
ISBN: (print) 0780377796
This paper describes a distributed-memory parallel implementation of a time-domain integral-equation (TDIE) based scheme for analyzing a broad class of EMC/EMI problems. The specific TDIE scheme used is the multilevel plane-wave time-domain (PWTD) accelerated marching-on-in-time (MOT) solver previously used by our group only on serial computing platforms. The proposed parallel MOT-PWTD solver is applied to sample EMC/EMI problems, and the efficiency of the algorithm is investigated via speedup analysis.
ISBN: (print) 9781581135886
ARMI is a communication library that provides a framework for expressing fine-grain parallelism and mapping it to a particular machine using shared-memory and message-passing library calls. The library is an advanced implementation of the RMI protocol and handles low-level details such as scheduling incoming communication and aggregating outgoing communication to coarsen parallelism when necessary. These details can be tuned for different platforms to allow user codes to achieve the highest performance possible without manual modification. ARMI is used by STAPL, our generic parallel library, to provide a portable, user-transparent communication layer. We present the basic design as well as the mechanisms used in the current Pthreads/OpenMP and MPI implementations, or a combination thereof. Performance comparisons between ARMI and explicit use of Pthreads or MPI are given on a variety of machines, including an HP V2200, SGI Origin 3800, IBM Regatta-HPC and IBM RS6000 SP cluster.
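Aggregating outgoing communication to coarsen parallelism can be sketched as per-destination buffering with a tunable batch size. The following is a minimal illustration under stated assumptions and does not reproduce ARMI's interface: `AggregatingSender`, `max_batch`, and the `transport` callable are hypothetical names, and the transport is simply any function that accepts a destination and a list of messages.

```python
from collections import defaultdict


class AggregatingSender:
    """Hypothetical sender that batches small messages per destination."""

    def __init__(self, transport, max_batch=4):
        self.transport = transport        # callable(dest, list_of_messages)
        self.max_batch = max_batch        # tunable per platform
        self.pending = defaultdict(list)  # dest -> buffered messages

    def send(self, dest, message):
        buffer = self.pending[dest]
        buffer.append(message)
        # Ship one larger message once enough small ones have accumulated.
        if len(buffer) >= self.max_batch:
            self.flush(dest)

    def flush(self, dest=None):
        """Send buffered messages for one destination, or for all if dest is None."""
        targets = [dest] if dest is not None else list(self.pending)
        for target in targets:
            batch = self.pending.pop(target, [])
            if batch:
                self.transport(target, batch)
```

Raising `max_batch` trades latency for fewer, larger transfers, which is the kind of platform-specific tuning the abstract mentions.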
ISBN: (print) 0780377796
The ever-rising power levels and operating frequencies of integrated circuits call for a detailed investigation of the electromagnetic radiation from heatsinks. Broadband simulations are required to fully characterize heatsink emissions. In this paper, an FFT-accelerated time-domain integral equation solver is utilized to accurately and efficiently characterize numerous heatsink designs. The effects of various grounding configurations on the radiated emissions are investigated. The proposed modeling approach is validated by comparing the results obtained to those from independent simulation tools. Sample run times demonstrate the efficiency of the solver.