the distributed Array Query and Visualization (DAQV) project aims to develop systems and tools that facilitate interacting withdistributed programs and data structures. Arrays distributed across the processes of a pa...
详细信息
ISBN:
(纸本)0818685794
the distributed Array Query and Visualization (DAQV) project aims to develop systems and tools that facilitate interacting withdistributed programs and data structures. Arrays distributed across the processes of a parallel or distributed application are made available to external clients via well-defined interfaces and protocols. Our design considers the broad issues of language targets, models of interaction, and abstractions for data access, while our implementation attempts to provide a general framework that can be adapted to a range of application scenarios. the paper describes the second generation of DAQV work and places it in the context of the more general distributed array access problem. Current applications and future work are also described.
With increasing development of applications for heterogeneous, distributedcomputing grids, the focus of performance analysis hall shifted from a posteriori optimization on homogeneous parallel systems to application ...
详细信息
ISBN:
(纸本)0818685794
With increasing development of applications for heterogeneous, distributedcomputing grids, the focus of performance analysis hall shifted from a posteriori optimization on homogeneous parallel systems to application tuning for heterogeneous resources with time varying availability. this shift has profound implications for performance instrumentation and analysis techniques. Autopilot is a new infrastructure for dynamic performance tuning of heterogeneous computational grids based on closed loop control. this paper describes the Autopilot model of distributed sensors, actuators, and decision procedures, reports preliminary performance benchmarks, and presents a case study in which the Autopilot library is utilized in the development of an adaptive parallel input/output system.
the author discusses the role of high-level parallel programming and its applications in the scalable exploitation of ubiquitous parallelism. He cites the 7thinternationalsymposium on High-Level parallel Programming...
详细信息
the author discusses the role of high-level parallel programming and its applications in the scalable exploitation of ubiquitous parallelism. He cites the 7thinternationalsymposium on High-Level parallel Programming and Applications held on July 3 and 4, 2014 in Doelenzaal of the University of Amsterdam, Holland where it received 34 submissions from which 15 contributions were selected for presentation at the symposium and for inclusion in the special issue.
In this paper we present and compare two algorithms to transform and optimize the Finite Difference Time Domain (FDTD) method computations in irregular computational areas for multiprocessor systems. Both algorithms a...
详细信息
ISBN:
(纸本)9780769534725
In this paper we present and compare two algorithms to transform and optimize the Finite Difference Time Domain (FDTD) method computations in irregular computational areas for multiprocessor systems. Both algorithms art, based oil a genetic approach with different design of chromosomes, In the first case, a chromosome represents a complete merging algorithm as a sequence of merging functions that are used to reduce a number of macro nodes. In the second case, a chromosome corresponds to an assignment of each macro node to specified processor. We have tested a mixed fitness function, which can switch between two sub functions: cut min value and difference between the maximally and minimally loaded macro nodes in an individual. As our experiments have shown, it allows to adjust the behavior of a genetic algorithm to the requirements of the computational system. We have tested both algorithms for multiprocessor systems with shared and distributed memory, with MPI and RDMA communication.
the paper concerns methodology of organizing execution control in parallel and distributed programs based on special control infrastructure provided in the system, which monitors application global states. In such inf...
详细信息
ISBN:
(纸本)9780769534725
the paper concerns methodology of organizing execution control in parallel and distributed programs based on special control infrastructure provided in the system, which monitors application global states. In such infrastructure, automatic collecting the information from parallel processes about their local states and constructing global application states can be done on-line without tedious intervention of a programmer. Global If predicates can be next evaluated to examine the obtained global states and control decisions on program behaviour can be taken. Application processes can use the information about global predicate fulfillment to control their execution in a way, which is desired from a global point of view. there are man possible ways a process can access the information about the global predicate fulfillment. We discuss them it? detail, dividing the described variants into two main groups: synchronous and asynchronous. We present an implementation of an asynchronous variant Implementation of other variants, which present very interesting perspectives is also discussed.
In this paper we present an extension to MONARC, a generic simulator for large scale distributed systems, which allows realistic evaluation of various actual distributed system technologies based on real-world monitor...
详细信息
ISBN:
(纸本)9780769534725
In this paper we present an extension to MONARC, a generic simulator for large scale distributed systems, which allows realistic evaluation of various actual distributed system technologies based on real-world monitored data supplied by MonALISA. the field of modelling and simulation was long-time seen as a viable solution to develop new algorithms and technologies and to enable the development of large-scale distributed systems, where analytical validations are prohibited by the nature of the encountered problems. this paper presents a novel approach to combining two distributed systems domains, monitoring and simulation, highlighting a realistic solution to the problem of accurately evaluating various distributed systems technologies using simulation. We also present a simulation study which demonstrates the interoperability between the simulation framework and the monitoring instrument, demonstrating important properties of the US LHCNet research network in the context of the LHC experiments in CERN.
Specialized encryption processors offer both low latency and high throughput at the expense of higher cost. A modern x86 system that encompasses several compute architectures (SISD/SIMD) might be able to perform well ...
详细信息
ISBN:
(纸本)9781479959198
Specialized encryption processors offer both low latency and high throughput at the expense of higher cost. A modern x86 system that encompasses several compute architectures (SISD/SIMD) might be able to perform well compared to a dedicated encryption unit at the fraction of the cost. this paper presents how one might accelerate AES ECB 128 bit using modern commodity hardware found in today's x86 computers. Focus architecture is AMD A6 5400K, coupled with a discrete GPU AMD R7 250. Benchmark results compare CPU OpenSSL execution, CPU AES-NI acceleration, integrated, discrete GPU and heterogeneous combinations of the above processing units. We present multiple test results and attempt to explain some inconsistencies of what would be expected.
this paper studies the QoS-aware replica placement problem. Although there has been much work on replica placement problem, most of them concerns average system performance and ignores quality assurance issue. Quality...
详细信息
ISBN:
(纸本)9781424403431
this paper studies the QoS-aware replica placement problem. Although there has been much work on replica placement problem, most of them concerns average system performance and ignores quality assurance issue. Quality assurance is very important, especially in heterogeneous environments. We propose a new heuristic algorithm that determines the positions of replicas in order to satisfy the quality requirements imposed by data requests. the experimental results indicate that the proposed algorithm finds a near-optimal solution effectively and efficiently for algorithm can also adapt to various parallel and distributed environments.
CCGrid' 2007, the 7th IEEE internationalsymposium on Cluster computing and the Grid was held in Rio de Janeiro, Brazil, on May 14 17, 2007. CCGrid has become a premium conference of truly international coverage, ...
详细信息
CCGrid' 2007, the 7th IEEE internationalsymposium on Cluster computing and the Grid was held in Rio de Janeiro, Brazil, on May 14 17, 2007. CCGrid has become a premium conference of truly international coverage, bringing together researchers and practitioners, enabling them to share their insight, results, and experience in the multi-faceted areas of Grid and cluster computing. the whole conference experience consisted of more than its strong technical program and also included three keynote speeches by scholars. there were seven workshops on emerging topics like the IEEE Technical Committee on Scalable computing Doctoral symposium, Agent-based Grid computing and Context-Awareness and Mobility in Grid computing. the paper by K. EL Maghraoui focused on adaptability for iterative MPI applications as adaptability enables a parallel application's execution system to split or merge processes, thereby modifying the executed program's granularity.
this paper presents a High Performance computing-based application for 3D structural analysis of buildings. Since the solution of a large linear system of sparse equations supposes the most time-consuming phase, sever...
详细信息
this paper presents a High Performance computing-based application for 3D structural analysis of buildings. Since the solution of a large linear system of sparse equations supposes the most time-consuming phase, several public domain parallel numerical libraries, with state-of-the-art capabilities, have been tested. the parallel application developed allows reducing the analysis time and simulating larger structures. Nevertheless, structural engineers rarely have available high cost parallel machines. thus, a Grid Structural Analysis service, that integrates the parallel application, has been implemented, taking advantage of computers geographically distributed in Internet. this service makes it possible to simulate in a realistic way, and concurrently, a high number of different structural alternatives of large dimension buildings during their design stage, without considering structural simplifications or investing in expensive computers. (c) 2006 Elsevier Ltd. and Civil-Comp Ltd. All rights reserved.
暂无评论