This paper surveys the program dependence analysis technique for parallel and/or distributed programs and its applications from the viewpoint of softwareengineering. We present primary program dependences which may e...
详细信息
ISBN:
(纸本)0818678763
This paper surveys the program dependence analysis technique for parallel and/or distributed programs and its applications from the viewpoint of softwareengineering. We present primary program dependences which may exist in a parallel and/or distributed program, a general approach to define, analyze, and represent these program dependences formally, and applications of an explicit program dependence based representation for parallel and/or distributed programs in various softwareengineering activities. We also suggest some research problems on this direction.
Desktop Grids are rapidly gaining popularity as a cost-effective computing platform for the execution of applications with extensive computing needs. As opposed to grids and clusters, these systems are characterized b...
详细信息
ISBN:
(纸本)9781424437511
Desktop Grids are rapidly gaining popularity as a cost-effective computing platform for the execution of applications with extensive computing needs. As opposed to grids and clusters, these systems are characterized by having a non-dedicated infrastructure. These unique characteristics need to be considered in developing resource management strategies for Desktop Grids. Several frameworks for the performance evaluation of resource management strategies have been suggested for grids. However, similar projects for Desktop Grids are still lacking. This paper presents MGST the first performance testing framework for Desktop Grids. We discuss the design of the tool and show how it can be used to analyze and improve the performance of an existing Desktop Grid scheduling policy.
This paper presents a modeling and implementation method for complex robotic systems using multiple robots. Based on conceptual condition-event models of robotic activities, extended Petri nets are adopted as a tool f...
详细信息
ISBN:
(纸本)9781467327435;9781467327428
This paper presents a modeling and implementation method for complex robotic systems using multiple robots. Based on conceptual condition-event models of robotic activities, extended Petri nets are adopted as a tool for expressing independent and cooperative robotic tasks. Net models representing inter-task cooperation with synchronous interaction are implemented on hierarchical and distributed control architecture. The higher level controller executes a conceptual Petri net model representing cooperative behaviors in the system, and parallel activities of the local controllers are efficiently coordinated through the transmission of subtask commands and the reception of event status. Detailed software implementation of the inter-task cooperation in an example robotic cell, is provided.
We present a new greedy b-MATCHING algorithm suitable for running on a GPU. Our algorithm differs from previous efforts at designing parallel algorithms for this problem in that it does not use software locks and that...
详细信息
ISBN:
(纸本)9781538655559
We present a new greedy b-MATCHING algorithm suitable for running on a GPU. Our algorithm differs from previous efforts at designing parallel algorithms for this problem in that it does not use software locks and that it also exploits substantially more of the available concurrency. We achieve this by allowing the same vertex to concurrently match with several other vertices and also by letting multiple vertices simultaneously match with the same target vertex. We have compared our algorithm using a Pascal P100 GPU with the previous best shared memory algorithm for this problem both when running on a 16 core Xeon E5 and on a Xeon Phi. On average our algorithm outperforms the Xeon E5 by a factor of 4.6 and the Xeon Phi by a factor of 2.3. We also show that our algorithm using an NVIDIA DGX-1 multi-GPU system is highly competitive compared to a distributed memory implementation running on one of the top ten computers from the current TOP500 list.
Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large pools of unused memo...
详细信息
ISBN:
(纸本)9798350387117;9798350387124
Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large pools of unused memory and increase the utilization of idle computing resources. Prior work attempted to increase the throughput and efficiency of super-computing systems through workload co-location and resource disaggregation. However, these methods fall short of providing a solution that can be applied to existing systems without major hardware modifications and performance losses. In this paper, we improve the utilization of supercomputers by employing the new cloud paradigm of serverless computing. We show how serverless functions provide fine-grained access to the resources of batch-managed cluster nodes. We present an HPC-oriented Function-as-a-Service (FaaS) that satisfies the requirements of high-performance applications. We demonstrate a software resource disaggregation approach where placing functions on unallocated and underutilized nodes allows idle cores and accelerators to be utilized while retaining near-native performance. Full Paper Version: https://***/abs/2401.10852 HPC FaaS Implementation: https://***/spcl/rFaaS
Cyber-physical systems (CPS) are computer systems with integrated software and physical components that ideally seamlessly interact with the real world and each other. While the use of distributed CPS has rapidly grow...
详细信息
ISBN:
(纸本)9781665438193
Cyber-physical systems (CPS) are computer systems with integrated software and physical components that ideally seamlessly interact with the real world and each other. While the use of distributed CPS has rapidly grown over the past decade, so has the need for developing efficient methods to ascertain reliability of these systems by validating their correctness. Since exhaustively validating correctness of a distributed CPS is usually not feasible nor possible, many modern validation methods involve run-time verification of distributed CPS based on safety properties. Our work focuses on developing time and resource efficient assurance techniques that can run in parallel with the execution of these systems to ensure reliability.
We discuss here the emergent Web based distributed environments for HPCC on the NII with the focus on Java as an enabling technology. We start with a review of the past, presence and the near term future of the 'J...
详细信息
ISBN:
(纸本)0818675829
We discuss here the emergent Web based distributed environments for HPCC on the NII with the focus on Java as an enabling technology. We start with a review of the past, presence and the near term future of the 'Java phenomenon', exposed here in the background of some related previous approaches towards a distributed interpretative virtual machine architecture.
Embedded designers now have the capability of offloading software routines into custom applicationspecific hardware blocks. This paper evaluates a domain-specific design system for configurable computing platforms tha...
详细信息
ISBN:
(纸本)0769523129
Embedded designers now have the capability of offloading software routines into custom applicationspecific hardware blocks. This paper evaluates a domain-specific design system for configurable computing platforms that combine processors and configurable fabrics. Although the proposed work uses floating-point and communication primitives as the specific domain tasks evaluated, the reconfigurable computing platform and the design challenges addressed will become increasingly common in a number of embedded system environments. This paper explores the added cost of hardware resources, area, and power of moving software library routines into hardware blocks in a configurable embedded system based on the MicroBlaze soft processor.
We consider the automatic generation of parallel templates as basis for the implementation of n(2)-problem solvers on parallelsystems. The automatic template generator relies on the embedding of a virtual systolic lo...
详细信息
ISBN:
(纸本)0818680431
We consider the automatic generation of parallel templates as basis for the implementation of n(2)-problem solvers on parallelsystems. The automatic template generator relies on the embedding of a virtual systolic loop on the given machine's network and optimizes hyper-systolic data communication by means of Simulated Annealing techniques. We have generated a parallel template to efficiently program an n-body code on the parallel SIMD system Quadrics.
We present a general Multi-Agent System framework for distributed data mining based on a Peer-to-Peer model. Agent protocols are implemented through message-based asynchronous communication. The framework adopts a dyn...
详细信息
ISBN:
(纸本)1595934804;9781595934802
We present a general Multi-Agent System framework for distributed data mining based on a Peer-to-Peer model. Agent protocols are implemented through message-based asynchronous communication. The framework adopts a dynamic load balancing policy that is particularly suitable for irregular search algorithms. A modular design allows a separation of the general-purpose system protocols and software components from the specific data mining algorithm. The experimental evaluation has been carried out on a parallel frequent subgraph mining algorithm, which has shown good scalability performances. Copyright 2007 ACM.
暂无评论