ISBN: (print) 0780390741
This paper deals with the challenging problem of performance analysis for Java programs. We describe procedures and requirements for instrumenting, monitoring, and analyzing distributed Java codes, and introduce Aksum, a highly customizable and flexible system for performance analysis that helps programmers semi-automatically locate and understand performance problems in parallel and distributed Java programs. We also describe a sophisticated agent architecture, part of Aksum, for static and dynamic instrumentation of Java programs. Experiments are presented for a widely distributed application running on a heterogeneous set of machines with different operating systems to illustrate the usefulness of our approach.
ISBN: (print) 0780390741
This contribution presents our experiences from developing an advanced course in Grid computing, aimed at application and infrastructure developers. The course was intended for computer science students with extensive programming experience and previous knowledge of distributed systems, parallel computing, computer networking, and security. The presentation includes brief overviews of all topics covered in the course, a list of the literature used, and descriptions of the mandatory computer assignments performed using Globus Toolkit 2 and 3. A summary of our experiences from the course and some suggestions for future directions conclude the presentation.
ISBN: (print) 0780390741
Dynamic partitioning of adaptive applications and migration of excess workload from overloaded processors to underloaded processors during execution are critical techniques for distributed computing. Distributed systems differ from traditional parallel systems in that they consist of heterogeneous resources connected by shared networks, which prevents existing schemes from benefiting large-scale applications. In particular, the cost of workload migration is significant when the excess workload is transferred across heterogeneous distributed platforms. This paper introduces a novel distributed data migration scheme for large-scale adaptive applications. The major contributions of the paper are: (1) a novel hierarchical data migration scheme that considers the heterogeneous and dynamic features of distributed computing environments; and (2) a linear programming algorithm that effectively reduces the overhead of migrating excess workload across heterogeneous distributed platforms. Experimental results show that the proposed migration scheme outperforms commonly used schemes in reducing both the communication cost and the application execution time.
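To make the idea of moving excess workload from overloaded to underloaded processors concrete, here is a minimal sketch. It is not the paper's hierarchical scheme and it does not use linear programming; it is a simple greedy heuristic, and the function name `plan_migration` and the assumption that each processor's fair share is proportional to a known relative speed are illustrative only.

```python
def plan_migration(loads, speeds):
    """Return (src, dst, amount) moves that balance load in proportion to speed.

    Greedy illustration only: it ignores network cost, which the paper's
    linear programming formulation is designed to reduce.
    """
    total_load = sum(loads)
    total_speed = sum(speeds)
    # Assumed fair share: load proportional to relative processor speed.
    targets = [total_load * s / total_speed for s in speeds]

    surplus = [[i, loads[i] - targets[i]]
               for i in range(len(loads)) if loads[i] > targets[i]]
    deficit = [[i, targets[i] - loads[i]]
               for i in range(len(loads)) if loads[i] < targets[i]]

    moves = []
    si = di = 0
    while si < len(surplus) and di < len(deficit):
        # Move as much as the sender can spare or the receiver can take.
        amount = min(surplus[si][1], deficit[di][1])
        moves.append((surplus[si][0], deficit[di][0], amount))
        surplus[si][1] -= amount
        deficit[di][1] -= amount
        if surplus[si][1] <= 1e-12:
            si += 1
        if deficit[di][1] <= 1e-12:
            di += 1
    return moves
```

For example, `plan_migration([90, 10, 20], [1, 1, 2])` moves 20 units from processor 0 to processor 1 and 40 units from processor 0 to processor 2. A cost-aware scheme would additionally weight each candidate move by the bandwidth and latency of the link it crosses.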
ISBN: (print) 1424403073
This short paper describes the cooperative caching architecture of pCFS [5], a shared disk cluster file system (CFS) that aims to achieve high performance across a broad spectrum of I/O-intensive applications, ranging from computational access to large data sets to video streaming and databases, and that includes an extended API for parallel I/O access. pCFS is targeted at small to medium-sized clusters where data is stored in Fibre Channel shared devices on a Storage Area Network (SAN). It exploits two interconnect fabrics: a SAN to access on-disk data, and a LAN, used both for the exchange of control information (related to locking and cache management) and for cooperative caching dataflow.
ISBN: (print) 0780390741
One of the challenges in high-performance computing is to provide users with reliable, remote data access in a distributed, heterogeneous environment. The increasing popularity of high-speed wide area networks and centralized data repositories leads to the possibility of direct high-speed access to remote data sets from within a parallel application. In this paper we describe SEMPLAR, a library for remote, parallel I/O that combines the standard programming interface of MPI-IO with the remote storage functionality of the SDSC Storage Resource Broker (SRB). SEMPLAR relies on parallel TCP streams to maximize the remote data throughput in a design that preserves the parallelism of the access all the way from the storage to the application. We provide I/O performance results for a high-performance computing workload on three different clusters. On the NCSA TeraGrid cluster, the ROMIO perf benchmark attained an aggregate read bandwidth of 291 Mbps with 18 processors. The NAS btio benchmark achieved an aggregate write bandwidth of 74 Mbps with 16 processors. The benchmark results are encouraging and show that SEMPLAR provides applications with scalable, high-bandwidth I/O across wide area networks.
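The aggregate figures above can be put on a per-processor basis with a short calculation. This is only a back-of-the-envelope sketch: it assumes the aggregate bandwidth is shared equally among processors, which the abstract does not state, and the helper name `per_process_mbps` is made up for illustration.

```python
def per_process_mbps(aggregate_mbps, processors):
    """Average bandwidth per processor, assuming an equal share for each."""
    return aggregate_mbps / processors


# Figures quoted in the abstract.
read_share = per_process_mbps(291, 18)   # ROMIO perf, aggregate read
write_share = per_process_mbps(74, 16)   # NAS btio, aggregate write
print(f"read:  {read_share:.2f} Mbps per processor")
print(f"write: {write_share:.3f} Mbps per processor")
```

Under that equal-share assumption, the read benchmark averages about 16.17 Mbps per processor and the write benchmark 4.625 Mbps per processor.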
ISBN: (print) 1424403073
The aIOLi project aims to optimize I/O accesses within a cluster by providing a simple POSIX API, thus avoiding the constraint of using a dedicated parallel I/O library. This paper introduces an extension of aIOLi that addresses the issue of disjoint accesses generated by different concurrent applications in a cluster. In such a context, performance, fairness, and response time are the criteria among which good tradeoffs have to be found. A test composed of two concurrent IOR benchmarks showed improvements on read accesses by a factor ranging from 3.5 to 35 with POSIX calls and from 3.3 to 5 with ROMIO.
Parameter studies, genetic algorithms, and Monte Carlo type calculations are examples of pleasantly parallel computational tasks. Such tasks can be computed effectively on computer clusters or grids. In this work, we consider a weight minimization problem for a laminated composite structure in the post-buckling region. The design variables are the number of layers and the layer orientations, chosen from a discrete set of allowable angles. Optimization is carried out using a deterministic search process, in which the lay-up configurations are generated iteratively in the design space from the selected design points of the population at the preceding cycle. Computation is performed using the NorduGrid grid computing platform. We also briefly review some general grid concepts and the use of grids in the optimization of laminated composite structures. (C) 2005 Elsevier Ltd. All rights reserved.
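The sketch below illustrates why such a problem is pleasantly parallel: each candidate lay-up can be evaluated independently of the others. It is not the paper's deterministic search and it does not use NorduGrid. It enumerates a small design space exhaustively and evaluates it with a local thread pool. The angle set `ALLOWED_ANGLES` and the objective `toy_cost` are assumptions standing in for the real structural analysis.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product

# Assumed discrete set of layer orientations in degrees (not from the paper).
ALLOWED_ANGLES = (0, 45, -45, 90)


def toy_cost(layup):
    """Hypothetical stand-in for the structural analysis of one lay-up."""
    return len(layup) + 0.01 * sum(abs(angle) for angle in layup)


def evaluate_all(n_layers, workers=4):
    """Evaluate every lay-up with n_layers layers; return (best_cost, best_layup)."""
    candidates = list(product(ALLOWED_ANGLES, repeat=n_layers))
    # Each evaluation is independent, so the work can be farmed out freely.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        costs = list(pool.map(toy_cost, candidates))
    return min(zip(costs, candidates))
```

On a grid, the call to `pool.map` would be replaced by job submissions to remote resources, with one job per candidate or per batch of candidates. The structure of the loop stays the same because no evaluation depends on another.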
ISBN: (print) 0780390741
Grids are more heterogeneous and dynamic than traditional parallel or distributed systems, both in terms of processors and of interconnects. A grid communication system must handle many issues: first, it must run on networks that are not yet determined when the application is launched, including user-space interconnects; second, it must transparently run on different networks at the same time; third, it should yield performance close to that of specialized communication systems. In this paper we present NETIBIS, a new Java communication system that provides a uniform interface for any underlying intercluster or intracluster network. NETIBIS solves the heterogeneity issues posed by Grid computing by dynamically constructing network protocol stacks out of drivers, which are self-contained building blocks for flexible configuration, with limited functionality per driver. We describe the design and implementation of the major NETIBIS drivers for serialization, multicast, reliability, and various underlying networks. We also describe various optimizations for performance, such as layer collapsing for the GM driver. We evaluate the performance of NETIBIS on several platforms, including a European grid.
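The idea of building a protocol stack at run time out of small single-purpose drivers can be sketched as follows. This is an illustration in Python, not the NETIBIS API (NETIBIS itself is a Java system). Every class and function name here is hypothetical, and the drivers are deliberately trivial.

```python
import json
import zlib


class LoopbackDriver:
    """Bottom of the stack: records frames instead of sending them on a network."""

    def __init__(self):
        self.wire = []

    def send(self, payload: bytes):
        self.wire.append(payload)


class ChecksumDriver:
    """Prefixes a CRC32 so a receiver could detect corrupted frames."""

    def __init__(self, lower):
        self.lower = lower

    def send(self, payload: bytes):
        crc = zlib.crc32(payload).to_bytes(4, "big")
        self.lower.send(crc + payload)


class JsonDriver:
    """Serializes Python objects to UTF-8 JSON bytes."""

    def __init__(self, lower):
        self.lower = lower

    def send(self, obj):
        self.lower.send(json.dumps(obj).encode("utf-8"))


# Hypothetical registry mapping configuration names to driver classes.
REGISTRY = {"json": JsonDriver, "crc": ChecksumDriver}


def build_stack(names, bottom):
    """Compose drivers top-down: ["json", "crc"] gives Json(Checksum(bottom))."""
    stack = bottom
    for name in reversed(names):
        stack = REGISTRY[name](stack)
    return stack
```

A caller would build the stack from a configuration list, for example `top = build_stack(["json", "crc"], LoopbackDriver())`, and then call `top.send({"a": 1})`. Because each driver only knows about the one below it, swapping the bottom driver for a different network leaves the upper layers unchanged.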
ISBN: (print) 0780390741
This paper describes the experience of teaching a new undergraduate grid computing course to students across the State of North Carolina in Fall 2004. The course is aimed specifically at the undergraduate level and has a large number of geographically distributed participating sites. New materials have been developed for undergraduate students, including a coherent set of grid computing programming assignments. All the materials developed are available online. The course also had internationally known guest speakers who gave state-of-the-art presentations. Apart from developing grid computing materials, a number of issues arose in using the grid computing software and coordinating the activities, which are described here.