We present DADS, the first distributed, online, scalable, and cost-effective dynamic slicer for continuously-running distributed programs with respect to user-specified budget constraints. DADS is distributed by desig...
详细信息
ISBN:
(纸本)9781450370431
We present DADS, the first distributed, online, scalable, and cost-effective dynamic slicer for continuously-running distributed programs with respect to user-specified budget constraints. DADS is distributed by design to exploit distributed and parallel computing resources. With an online analysis, it avoids tracing hence the associated time and space costs. Most importantly, DADS achieves and maintains practical scalability and cost-effectiveness tradeoffs according to a given budget on analysis time by continually and automatically adjusting the configuration of its analysis algorithm on the fly via reinforcement learning. Against eight real-world Java distributedsystems, we empirically demonstrated the scalability and cost-effectiveness merits of DADS. The open-source tool package of DADS with a demo video is publicly available.
Advances in next generation sequencing technologies has resulted in the generation of unprecedented volume of sequence data. DNA segments are combined into a reconstruction of the original genome using computer softwa...
详细信息
ISBN:
(纸本)9781538655559
Advances in next generation sequencing technologies has resulted in the generation of unprecedented volume of sequence data. DNA segments are combined into a reconstruction of the original genome using computer software called genome assemblers. Therefore, assembly now presents new challenges in terms of data management, query, and analysis due the huge number of read sequences and computing intensive CPU-memory algorithms. This restriction reduces the chances to uniformly cover space for exploring statistics, k-mer, software or eukaryotic genomes assembly. To address these issues, we present ArrOW, a cloud-based de novo Assembly clOud Workflow that explores the potential of provenance analytics and parallel computation provided by scientific workflow management systems as SciCumulus. We evaluate the overall performance of ArrOW using up to 256 cores in the Amazon AWS cloud. ArrOW reaches improvements up to 88.3% executing 1,000 reads of genomics datasets. We also highlight how data provenance analytics improved the efficiency for recovering assembling features of genomes.
Inverse problems arise in various areas of science and engineering. These problems are not only difficult to solve numerically, but they also require a large amount of computer resources both in time and memory. It is...
详细信息
ISBN:
(纸本)9781479984480
Inverse problems arise in various areas of science and engineering. These problems are not only difficult to solve numerically, but they also require a large amount of computer resources both in time and memory. It is therefore not surprising that inverse problems are often solved using techniques from high-performance computing. We consider the parallelization of an inverse problem in the field of geothermal reservoir engineering. In this particular scientific application, the underlying software package is already parallelized using the shared-memory programming paradigm OpenMP. Here, we present an extension of this parallelization to distributed memory enabling a hybrid OpenMP/MPI parallelization. The situation is different from the standard way of hybrid parallel programming because the data structures of the OpenMP-parallelized code differ from those in the serial implementation. We exploit this transformation of the data structures in our distributed-memory strategy for parallelizing an ensemble Kalman filter, a particular method for the solution of inverse problems. We describe this novel parallelization strategy, introduce a performance model, and present timing results on a compute cluster using nodes with 2 sockets, each equipped with Intel Xeon X5675 Westmere EP processors with 6 cores. All timing results are obtained with a pure MPI parallelization without using any OpenMP threads.
On-chip parallelism with GPU accelerators is now ubiquitous and has received significant attention in the past few years. GPU is becoming an integral part of mainstream computing systems with highly parallel, multithr...
详细信息
ISBN:
(纸本)9780769546766
On-chip parallelism with GPU accelerators is now ubiquitous and has received significant attention in the past few years. GPU is becoming an integral part of mainstream computing systems with highly parallel, multithreaded, many-core processors of great computational power and high memory bandwidth. Finding the best tradeoff between performance and power efficiency is more challenging than mere performance tuning. To find the principles of power-aware programming with GPU accelerators, we abstract a set of primitives from program statements. These power consumption values of primitives are helpful for power estimation during high-level program development.
This paper introduces Speedcode, an online programming platform that aims to improve the accessibility of software performance-engineering education. At its core, Speedcode provides a platform that lets users gain han...
详细信息
ISBN:
(纸本)9798350364613;9798350364606
This paper introduces Speedcode, an online programming platform that aims to improve the accessibility of software performance-engineering education. At its core, Speedcode provides a platform that lets users gain hands-on experience in software performance engineering and parallel programming by completing short programming exercises. Speedcode challenges users to develop fast multicore solutions for short programming problems and evaluates their code's performance and scalability in a quiesced cloud environment. Speedcode supports parallel programming using OpenCilk, task-parallel computing platform that is open-source and easy to program, teach and use for research. Speedcode aims to reduce barriers to learning and teaching software performance engineering. It allows users to run and evaluate their code on modern multicore machines from their own computer without installing any software. This provides users an easy introduction to the topic, and enables teachers to more easily incorporate lessons on software performance engineering into their courses without incurring the onerous overhead of needing to setup computing environments for their students.
The number of security-breaking attempts originated inside the organizations are increasing steadily. Attacks made in this way, usually done by 'authorized' users of the system, cannot be immediately located. ...
详细信息
The number of security-breaking attempts originated inside the organizations are increasing steadily. Attacks made in this way, usually done by 'authorized' users of the system, cannot be immediately located. As the idea of filtering the traffic at the 'entrance door' (by firewalls, for instance) is not completely successful, the use of other technologies should be considered to increase the defense capacity of a site. Therefore, the introduction of mobile agents to provide the computational security by constantly moving around within the internal infoways of an organization is presented as a natural solution to prevent both external and the internal sources of intrusion. This work presents an evaluation of the use of mobile agents mechanisms to add mobility features to the process of monitoring intrusion in computational systems. A modular approach is proposed, where independent small agents will monitor the communication paths. This approach presents significant advantages in terms of minimizing overhead, increasing scalability and flexibility and providing fault tolerance.
Recent advancements in FPGA technology have allowed manufacturers to place general-purpose processors alongside user-configurable logic gates on a single chip. At first glance, these integrated devices would seem to b...
详细信息
This paper describes the hardware and software architecture of the Data Center for Education (DCE) built up at CentraleSupelec as well as our educational objectives. Students experiment in the context of their studies...
详细信息
ISBN:
(纸本)9781665497473
This paper describes the hardware and software architecture of the Data Center for Education (DCE) built up at CentraleSupelec as well as our educational objectives. Students experiment in the context of their studies with clusters of CPU and servers of GPU. They have different technical backgrounds from beginners to experts in computer science. The lectures have also different experimental requirements: from single host long running jobs for machine learning to multiple hosts short running HPC algorithms benchmarks, with lab works and projects that need to share the same pool of machines.
Power and energy consumption are important concerns in the design of high performance and mobile computing systems, but have not been widely considered in the design of parallel and distributed simulations. The import...
详细信息
ISBN:
(纸本)9781467378222
Power and energy consumption are important concerns in the design of high performance and mobile computing systems, but have not been widely considered in the design of parallel and distributed simulations. The importance of these factors is discussed and metrics for power and energy overhead in parallel and distributed simulations are proposed. Factors affecting the energy consumed by synchronization algorithms and software architectures are examined. An experimental study is presented examining energy consumption of the well-known Chandy/Misra/Bryant algorithm executing on a peer-to-peer mobile computing platform and compared with a centralized client-server approach using the YAWNS synchronization algorithm. Initial results concerning queueing network simulations are also presented. The results of this study suggest that existing distributed simulation algorithms require a significant amount of additional energy compared to a sequential execution. Further, different synchronization algorithms can yield different energy consumption behaviors.
Resource monitoring in distributedsystems is required to understand the 'health' of the overall system and to help identify particular problems, such as dysfunctional hardware, a faulty, system or application...
详细信息
ISBN:
(纸本)9780769534725
Resource monitoring in distributedsystems is required to understand the 'health' of the overall system and to help identify particular problems, such as dysfunctional hardware, a faulty, system or application software. Desirable characteristics for monitoring systems are the ability to connect to any number of different types of monitoring agents and to provide different views of the system, based on a client's particular preferences. This paper outlines and discusses the ongoing activities within the GridRM wide-area resource-monitoring project.
暂无评论