Network processors today consists of multiple parallel processors (microengines) with support for multiple threads to exploit packet level parallelism inherent in network workloads. With such concurrency, packet order...
详细信息
The proceedings contain 81 papers. The topics discussed include: scalability for Petaflops systems;chip multi-threading and the SPARC evolution;the multicore programming challenge;replication-based partial dynamic sch...
详细信息
ISBN:
(纸本)9783540768364
The proceedings contain 81 papers. The topics discussed include: scalability for Petaflops systems;chip multi-threading and the SPARC evolution;the multicore programming challenge;replication-based partial dynamic scheduling heterogeneous network processors;the optimum location of delay latches between dynamic pipeline stages;a novel fault-tolerant parallel algorithm;the design on SEU-tolerant information processing system of the on-board-computer;balancing thread partition for efficiency exploiting speculative thread-level parallelism;design and implementation of a high-speed reconfigurable modular arithmetic unit;virtual disk monitor based on multi-core EFI;an optimal design method for de-synchronous circuit based on control graph;and property-preserving composition of distributed system components.
Non-dedicated loosely coupled systems are popular platforms for cluster- and grid-based parallel processing, fundamentally because they have good cost-performance ratios and are scalable. However, these platforms repr...
详细信息
Non-dedicated loosely coupled systems are popular platforms for cluster- and grid-based parallel processing, fundamentally because they have good cost-performance ratios and are scalable. However, these platforms represent highly dynamic environments in which performance and efficiency can be seriously impacted by changes in environmental conditions. This is especially significant where the runtime configuration has been determined statically, either at compilation time or at the start of execution. This paper introduces the concept of agile parallel processing in which the application manages several aspects of its own run-time behaviour, including deployment granularity. This approach reduces the emphasis on the preconfiguration of components, and relies instead on inbuilt learning and discovery capabilities. To facilitate investigation into the extent to which a self-managing approach can be beneficial to parallel processing, an experimental framework has been developed. The framework provides a range of services such as dynamic worker discovery and performance calibration, and policy-controlled facilities such as resource management and adaptation to suit environmental conditions. The framework integrates these services with the parallel application code. The operation and performance of policy-based dynamic deployment scheduling in dynamic environments is analysed in detail.
We mix two approaches of the fault-tolerance: robustness and stabilization. Using these approaches, we propose leader election algorithms that tolerate both transient and crash failures. Our goal is to show the implem...
详细信息
ISBN:
(纸本)9783540766261
We mix two approaches of the fault-tolerance: robustness and stabilization. Using these approaches, we propose leader election algorithms that tolerate both transient and crash failures. Our goal is to show the implement ability of the robust self- and/or pseudo-stabilizing leader election in various systems with weak reliability and synchrony assumptions. We try to propose, when it is possible, communication-efficient implementations. Also, we exhibit some assumptions required to obtain robust stabilizing leader election algorithms. Our results show that the gap between robustness and stabilizing robustness is not really significant when we consider fix-point problems such as leader election.
Replicating data objects onto servers across a system can alleviate access delays. The selection of data objects and servers requires solving a constraint optimization problem, which is NP-complete in general. A major...
详细信息
Free-surface flows occur in several problems in hydrodynamics, Such as fuel or water sloshing in tanks, waves breaking in ships, offshore platforms, harbours and coastal areas. The computation of such highly nonlinear...
详细信息
Free-surface flows occur in several problems in hydrodynamics, Such as fuel or water sloshing in tanks, waves breaking in ships, offshore platforms, harbours and coastal areas. The computation of such highly nonlinear flows is challenging since free-surfaces commonly present merging, fragmentation and breaking parts. leading to the use of interface-capturing Eulerian approaches. In Such methods the surface between two fluids is captured by the use of a marking function which is transported in a flow field. In this work we present a three-dimensional parallel edge-based incompressible SUPG/PSPG finite element method to cope with free-surface problems with volume-of-fluid (VOF) extensions to track the evolving free Surface. The pure advection equation for the scalar marking function was solved by a fully implicit parallel edge-based SUPG finite element formulation. We studied variants of this formulation, considering the effects of discontinuity capturing and a particular tangent transformation designed to increase interface sharpness. Global mass conservation is enforced adding or removing mass proportionally to the absolute value of the normal velocity of the interface. We introduce a parallel dynamic deactivation algorithm to solve the marking function equation only in a small region around the interface. The implementation is targeted to distributed memory systems with cache-based processors. The performance and accuracy of the proposed solution method were tested with several validation problems. Copyright (c) 2007 John Wiley & Sons, Ltd.
The mean-time-between-failure of current high-performance computer systems is much shorter than the running times of many computational applications, whereas those applications are the main workload for those systems....
详细信息
ISBN:
(纸本)9783540768364
The mean-time-between-failure of current high-performance computer systems is much shorter than the running times of many computational applications, whereas those applications are the main workload for those systems. Currently, checkpoint/restart is the most commonly used scheme for such applications to tolerate hardware failures. But this scheme has its performance limitation when the number of processors becomes much larger. In this paper, we propose a novel fault-tolerant parallel algorithm FPAPR. First, we introduce the basic idea of FPAPR. Second, we specify the details of how to implement a FPAPR program by using two NPB kernels as examples. Third, we theoretically analyze the overhead of FPAPR, and find out that the overhead of FPAPR decreases with the increase of the number of processors. At last, the experimental results on a 512-CPU cluster show the overhead introduced by the algorithm is very small.
Data Grid provides integrated view of distributed data scattered across networks. Current Data Grid systems are centrally controlled. In this paper, we present a structured P2P based Data Grid model (P-DataGrid Model,...
详细信息
ISBN:
(纸本)9783540768364
Data Grid provides integrated view of distributed data scattered across networks. Current Data Grid systems are centrally controlled. In this paper, we present a structured P2P based Data Grid model (P-DataGrid Model, PDG) which makes use of construction and routing algorithms of P-Grid a structured P2P system. PDG is organized as virtual multi-branch tree with binary tree as main body. Formal description of PDG is firstly introduced. Then we discuss the realization issues of PDG such as establishment of model, data storage service, information service, etc. Among these issues, our emphasis is on joining of nodes, registration and location of replica. Furthermore, we analyze the successful probability of location. Constructing Data Grid on structured P2P overlay can bring great advantages of scalability, decentralized control and reliability.
Due to the complexity associated with developing parallel applications, scientists and engineers rely on high-level software libraries such as PETSc, ScaLAPACK and PESSL to ease this task. Such libraries assist develo...
详细信息
Most of the current trust models in peer-to-peer (P2P) systems are identity based, which means that in order for one peer to trust another, it needs to know the other peer's identity. Hence, there exists an inhere...
详细信息
暂无评论