computational grids are emerging as a new infrastructure for high performance computing. Since the resources in a Grid can be heterogeneous and distributed, mesh-based applications require a mesh partitioner that cons...
详细信息
In this paper, we describe the design and implementation of two mechanisms for fault-tolerance and recovery for complex scientific workflows on computational grids. We present our algorithms for over-provisioning and ...
详细信息
ISBN:
(纸本)9781424442379
In this paper, we describe the design and implementation of two mechanisms for fault-tolerance and recovery for complex scientific workflows on computational grids. We present our algorithms for over-provisioning and migration, which are our primary strategies for fault-tolerance. We consider application performance models, resource reliability models, network latency and bandwidth and queue wait times for batch-queues on compute resources for determining the correct fault-tolerance strategy. Our goal is to balance reliability and performance in the presence of soft real-time constraints like deadlines and expected success probabilities, and to do it in a way that is transparent to scientists. We have evaluated our strategies by developing a Fault-Tolerance and Recovery (FTR) service and deploying it as a part of the Linked Environments for Atmospheric Discovery (LEAD) production infrastructure. Results from real usage scenarios in LEAD show that the failure rate of individual steps in workflows decreases from about 30% to 5% by using our fault-tolerance strategies.
We here discuss how to run Bag-of-Tasks applications on computational grids. Bag-of-Tasks applications (those parallel applications whose tasks are independent) are both relevant and amendable for execution on grids. ...
详细信息
ISBN:
(纸本)0769520170
We here discuss how to run Bag-of-Tasks applications on computational grids. Bag-of-Tasks applications (those parallel applications whose tasks are independent) are both relevant and amendable for execution on grids. However, few users currently execute their Bag-of-Tasks applications on grids. We investigate the reason for this state of affairs and introduce MyGrid, a system designed to overcome the identified difficulties. MyGrid provides a simple, complete and secure way for a user to run Bag-of-Tasks applications on all resources she has access to. Besides putting together a complete solution useful for real users, MyGrid embeds two important research contributions to grid computing. First, we introduce some simple working environment abstractions that hide machine configuration heterogeneity from the user. Second, we introduce Work Queue with Replication (WQR), a scheduling heuristics that attains good performance without relying on information about the grid or the application, although consuming a few more cycles. Note that not depending on information makes WQR much easier to deploy in practice.
To optimize the placement of wind turbines using a genetic algorithm for the fixed size of wind farm, the appropriate computational grids are the basis of the succeeding work. The optimized scheme was tightly restrict...
详细信息
ISBN:
(纸本)9781424447022
To optimize the placement of wind turbines using a genetic algorithm for the fixed size of wind farm, the appropriate computational grids are the basis of the succeeding work. The optimized scheme was tightly restricted by the rationality and accuracy of computational grids. In this paper, based on the consideration of actual wind and wake characteristics of wind turbines, the (a) shape of the grids, (b) arranging the direction of the grids, and (c) the density of the grids were introduced to study the effect of computation grids on the optimization results. Furthermore, the grids' division method in the scheme's optimization of wind turbines placement under different conditions was discussed to increase the power capacity of the wind farm to obtain the maximum benefit of the investment.
Remote sensing image processing is a very demanding procedure in terms of data manipulation and computing power. Grid computing is a possible solution when the required computing performance or data sharing is not ava...
详细信息
ISBN:
(纸本)9789608457720
Remote sensing image processing is a very demanding procedure in terms of data manipulation and computing power. Grid computing is a possible solution when the required computing performance or data sharing is not available at the user's site. Two scenarios of using Service grids were analyzed in our papers [17, 181. This paper discusses another scenario of using computational grids. According to this scenario a prototype code for satellite image classification was designed, implemented and tested.
Wireless sensor networks (WSNs) have been greatly developed and emerged their significance in a wide range of important applications such as acquisition and process information from the physical world. The evolvement ...
详细信息
ISBN:
(纸本)9783642115257
Wireless sensor networks (WSNs) have been greatly developed and emerged their significance in a wide range of important applications such as acquisition and process information from the physical world. The evolvement of Grid computing has been based on coordination of distributed and shared resources. A Sensor Grid network can integrate these two leading technologies enabling real-time sensor data collection, the sharing of computational and storage grid resources for sensor data processing and management. Several issues have occurred from this integration which dispute the modern design of sensor grids. In order to address these issues, in this paper we propose a sensor grid architecture supporting it by a testbed which focuses on the design issues and on the improvement of our sensor grid architecture design.
Scheduling independent tasks in computational grids commonly arises in many Grid-enabled large scale applications. Much of current research in this domain is focused on the improvement of the efficiency of the Grid sc...
详细信息
ISBN:
(纸本)9780769539676
Scheduling independent tasks in computational grids commonly arises in many Grid-enabled large scale applications. Much of current research in this domain is focused on the improvement of the efficiency of the Grid schedulers, both at global and local levels, which is the basis for Grid systems to leverage large computing capacities. However, unlike traditional scheduling, in Grid systems security requirements are very important to scheduling tasks/applications to Grid resources. The objective is thus to achieve efficient and secure allocation of tasks to machines. In this paper we propose a new model for secure scheduling at the Grid sites by combining game-theoretic and genetic-based meta-heuristic approaches. The game-theoretic model takes into account the realistic feature that Grid users usually perform independently of each other. The scheduling problem is then formalized as a noncooperative non-zero sum game with Nash equilibria as the solutions. The game cost function is minimized, at global and user levels, by using four genetic-based hybrid metaheuristics. We have evaluated the proposed model through a static benchmark of instances, for which we have measured two basic metrics, namely the makespan and flowtime. The obtained results suggest that it is more resilient for the Grid users ( and local schedulers) to tolerate some job delays defined as additional scheduling cost due to security requirements instead of taking a risk of allocating at unreliable resources.
High-performance computing increasingly occurs on "computational grids" composed of heterogeneous and geographically distributed systems of computers, networks, and storage devices that collectively act as a...
详细信息
High-performance computing increasingly occurs on "computational grids" composed of heterogeneous and geographically distributed systems of computers, networks, and storage devices that collectively act as a single "virtual" computer. One of the great challenges for this environment is to provide efficient access to data that is distributed across remote data servers in a grid. In this paper, we describe our solution, a framework we call armada. The framework allows applications and dataset providers to flexibly compose graphs of processing modules that describe the distribution, application interfaces, and processing required of the dataset before computation. The armada runtime system then restructures the graph, and places the processing modules at appropriate hosts to reduce network traffic. (C) 2002 Elsevier Science B.V. All rights reserved.
One of the key steps in economy based grid resource allocation is to make reasonable prices for the grid resources. For resources' prices decide the resource flow in the Grid, further influence Grid resources'...
详细信息
ISBN:
(纸本)0769523153
One of the key steps in economy based grid resource allocation is to make reasonable prices for the grid resources. For resources' prices decide the resource flow in the Grid, further influence Grid resources' trading efficiency. This article, instead of discussing the pricing algorithm, argues an autonomous pricing mechanism that leaves the prices to be decided by the Grid traders within their trading process. We firstly point out the factors that influence the resources' prices;then make out the trading flow for resource consumer agents and provider agents. By doing these, the two trading agents can decide their price due to the dynamic changes of the Grid environment without any manmade interferences. Since price is the core of market economy, the autonomous pricing strategy provides us with a foundation to use the invisible hand of market to allocate the Grid resource.
暂无评论