In the past two years the ATLAS Collaboration at the LHC has collected a large volume of data and published a number of groundbreaking papers. The grid-based ATLAS distributed computing infrastructure played a crucial role in enabling timely analysis of the data. We present a study of the performance and usage of the ATLAS grid as a platform for physics analysis in 2011. This includes studies of general properties as well as timing properties of user jobs (wait time, run time, etc.). These studies are based on mining data archived by the PanDA workload management system.
ISBN: 076952138X (print)
Computational and data grids consist of the coordinated utilization of large sets of diverse, geographically distributed resources for high-performance computation. To make better use of these computing entities, a substantial amount of monitoring data is collected for a variety of tasks. The large number of heterogeneous resources available in grids makes this task challenging, and the growing volume of monitoring data makes mass data processing a further problem. In this paper, a monitoring system for grid environments called GridView is presented. It gathers a wide range of important information from a large computing facility. To support more efficient performance evaluation, a comprehensive computation capacity (C3) model is set up. GridView also provides a mass data management mechanism and a mass data visualization technique. Because it operates in a grid environment, GridView must account for dynamic changes and the heterogeneous characteristics of different platforms.
ISBN: 9781509036691 (print)
A distributed system has an inherent problem of unevenly distributed load. One possible solution to this problem is load balancing, for which it is very important to have up-to-date information about the load status of the nodes comprising the system. This work proposes a Distributed Minimum Spanning Tree based Information Exchange (DMSTIE) strategy. The strategy collects the load information of the nodes through the edges formed during the Distributed Minimum Spanning Tree (DMST) process. Based on this information about the system state, various load balancing approaches can be applied to transfer extra load to underutilized nodes. An MST-based approach makes the information collection, and eventually the dispatch of load, efficient in terms of communication and computation.
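The idea of gathering load information along spanning-tree edges can be illustrated with a small sketch. This is not the DMSTIE algorithm from the paper: it assumes a centralized Kruskal construction as a stand-in for the distributed MST process, and the function names (`kruskal_mst`, `collect_loads`), the example graph, and the load values are all hypothetical. It only shows why a tree keeps the message count low: each non-root node forwards one message toward the root.

```python
from collections import defaultdict


def kruskal_mst(num_nodes, edges):
    """Return MST edges as (u, v, weight) from a list of (weight, u, v) tuples.

    Centralized stand-in (assumption) for the distributed MST construction.
    """
    parent = list(range(num_nodes))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for weight, u, v in sorted(edges):
        root_u, root_v = find(u), find(v)
        if root_u != root_v:
            parent[root_u] = root_v
            tree.append((u, v, weight))
    return tree


def collect_loads(tree, loads, root=0):
    """Gather every node's load at `root`, forwarding along tree edges only.

    Returns (load table known at the root, number of messages sent).
    """
    adjacency = defaultdict(list)
    for u, v, _ in tree:
        adjacency[u].append(v)
        adjacency[v].append(u)

    # Discover nodes from the root; a parent is always discovered before its children.
    parent = {root: None}
    order = [root]
    stack = [root]
    while stack:
        node = stack.pop()
        for neighbour in adjacency[node]:
            if neighbour not in parent:
                parent[neighbour] = node
                order.append(neighbour)
                stack.append(neighbour)

    # Convergecast: children report to parents, so walk the discovery order backwards.
    known = {node: {node: loads[node]} for node in order}
    messages = 0
    for node in reversed(order):
        if parent[node] is not None:
            known[parent[node]].update(known[node])
            messages += 1
    return known[root], messages


# Hypothetical 4-node system: (link cost, node, node) and per-node load.
example_edges = [(1, 0, 1), (4, 0, 2), (2, 1, 2), (3, 2, 3), (5, 1, 3)]
example_loads = [0.9, 0.2, 0.5, 0.1]
example_tree = kruskal_mst(4, example_edges)
load_table, message_count = collect_loads(example_tree, example_loads)
```

With the load table assembled at one node, any load balancing policy can then choose which overloaded nodes should hand work to underutilized ones.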
This paper presents a High Performance Computing-based application for 3D structural analysis of buildings. Since solving a large sparse linear system of equations is the most time-consuming phase, several public domain parallel numerical libraries with state-of-the-art capabilities have been tested. The parallel application developed reduces the analysis time and allows larger structures to be simulated. Nevertheless, structural engineers rarely have access to high-cost parallel machines. Thus, a Grid Structural Analysis service that integrates the parallel application has been implemented, taking advantage of computers geographically distributed across the Internet. This service makes it possible to simulate realistically, and concurrently, a large number of structural alternatives for large buildings during their design stage, without resorting to structural simplifications or investing in expensive computers. (c) 2006 Elsevier Ltd. and Civil-Comp Ltd. All rights reserved.
ISBN: 0769521673 (print)
Grid computing provides the basic software infrastructure for integrating geographically distributed resources and services through standardized grid services. One of the key challenges in enabling the broader use of grid services beyond the domain of scientific computing is the ability to perform complex tasks that require modeling and coordinating the enactment of a number of distributed grid services. Workflow technology is a good candidate for supporting grid service flows. However, traditional workflow is static, and thus unable to exploit the dynamic information available in the grid or respond to the dynamic nature of the grid. In this paper, we present a framework that provides adaptive management of grid service flows. The framework is based on an Adaptive Grid Service Flow Model and is supported by an Event-Trigger-Rule (ETR) technology that triggers rules in a distributed fashion to adapt a grid service flow to the dynamic grid environment and the changing requirements of a grid application.
ISBN: 9780769535012 (print)
Grid technology opens the way to building collaborative environments that enable distributed multi-organizational teams to jointly use computing resources. Automatic resource and service discovery must therefore keep pace with the dynamic nature of grid elements. Through monitoring, hardware and software failures can be found and solved in time, and analyzing the gathered data can help to find performance bottlenecks. In this paper, we analyze the aims and main tasks of grid monitoring and the advantages of GMA (Grid Monitoring Architecture), and then propose a Java-based design pattern for a Grid Monitoring System (JGMS) on the basis of GMA. The JGMS uses special agents to handle the communication between Producers and Consumers in different grid nodes. As a result, it can work on different platforms, pass through firewalls, and be easy to deploy and use.
ISBN: 9781424452781 (print)
Cloud computing, as a newly emerging technology, is an innovation providing dynamically scalable and virtualized resources as services. In this paper, we introduce our recent effort to build a service-oriented distributed computational system based on Cloud concepts, named the Distributed Computational Service Cloud. This kind of Cloud hosts scalable grid services, which are implemented as Web services compliant with the Web Services Resource Framework (WSRF) [1], enabling high-performance and distributed computing. We evaluate the Cloud using a scalable decision tree service, which provides a computationally intensive data mining algorithm. The architecture of the system and the details of the distributed decision tree construction form the core of this paper.
ISBN: 0769524052 (print)
Grid computing provides a virtual framework for controlled sharing of resources across institutional boundaries. Recently, trust has been recognized as an important factor for scheduling in grids. Trust is a complex subject relating to properties such as the reliability, honesty, and competence of the trusted entity. Computing trust values becomes more difficult in a grid because the nodes are independent and distributed, and with a security-aware task execution model, task scheduling is crucial to achieving high performance. In this paper, we present a trust model in which each node is assigned a trust value that reflects its transaction experiences. An eigenvector is used to calculate the trust value, and the distributed EigenTrust algorithm is modified according to the characteristics of the grid. Furthermore, trust managers are set up to guide scheduling, and security-driven algorithms are proposed to ensure the security of the executions. Simulations are performed to evaluate the performance of the algorithms.
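As a rough illustration of eigenvector-based trust, the sketch below runs the basic, centralized EigenTrust-style power iteration: local ratings are row-normalized and global trust is the fixed point of repeatedly propagating trust through those ratings. It is not the modified distributed algorithm the paper proposes; the function name `global_trust`, the default parameters, and the rating matrix are assumptions made up for the example.

```python
def global_trust(local_ratings, iterations=200, tolerance=1e-10):
    """Approximate global trust as the principal left eigenvector of the
    row-normalized local rating matrix, using power iteration.

    local_ratings[i][j] is node i's accumulated rating of node j (hypothetical data).
    """
    n = len(local_ratings)

    # Clip negative ratings to zero and normalize each row to sum to 1.
    normalized = []
    for row in local_ratings:
        clipped = [max(value, 0.0) for value in row]
        total = sum(clipped)
        if total == 0:
            # A node with no positive experience trusts everyone equally.
            normalized.append([1.0 / n] * n)
        else:
            normalized.append([value / total for value in clipped])

    trust = [1.0 / n] * n  # start from uniform trust
    for _ in range(iterations):
        # Node j's new trust is the trust-weighted sum of what others say about j.
        updated = [
            sum(normalized[i][j] * trust[i] for i in range(n)) for j in range(n)
        ]
        if max(abs(a - b) for a, b in zip(updated, trust)) < tolerance:
            return updated
        trust = updated
    return trust


# Hypothetical ratings among three grid nodes; node 2 is rated poorly by the others.
example_ratings = [
    [0, 4, 1],
    [3, 0, 1],
    [5, 5, 0],
]
example_trust = global_trust(example_ratings)
```

A scheduler could then prefer nodes with higher global trust for security-sensitive tasks, which is the role the abstract assigns to the trust managers.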
Many members of large science collaborations already have specialized grids available to advance their research, driven by the need for more computing resources for data analysis. This has forced the Collider Detector at Fermilab (CDF) collaboration to move beyond the use of dedicated resources and start exploiting grid resources. Nowadays, the CDF experiment increasingly relies on glidein-based computing pools for data reconstruction, Monte Carlo production, and user data analysis, serving over 400 users through the central analysis farm (CAF) middleware on top of the Condor batch system and the CDF grid infrastructure. Condor is designed as a distributed architecture, and its glidein mechanism of pilot jobs is ideal for abstracting grid computing by creating a virtual private computing pool. We present the first production use of the generic pilot-based Workload Management System (glideinWMS), which is an implementation of the pilot mechanism based on the Condor distributed infrastructure. CDF grid computing uses glideinWMS for its data reconstruction on the FNAL campus grid, and for user analysis and Monte Carlo production across the Open Science Grid (OSG). We review this computing model and the setup used, including the CDF-specific configuration within the glideinWMS system, which provides powerful scalability and makes grid computing work like a local batch environment, with the ability to handle more than 10000 running jobs at a time.
Debugging parallel programs is one of the most tedious jobs in programming scalable multiprocessor architectures. Due to the distributed resources of these machines, programming is often architecture dependent. Most development tools still reflect this dependency even during the analysis phase of parallel programs. This paper presents the distributed debugger DETOP, which offers a global name space and hides architectural features like the mapping of processes. DETOP is part of the integrated tool environment TOPSYS, implemented on iPSC hypercubes, networks of SPARCstations, and partly on Transputer and PowerPC based systems.