It is envisaged that the grid infrastructure will be a large-scale distributed software system that will provide high-end computational and storage capabilities to differentiated users. A number of distributed computi...
详细信息
Replication is a key technique for improving fault tolerance but can introduce considerable performance overhead under some circumstances. To explore the tradeoff between performance and failure resilience, we develop...
详细信息
ISBN:
(纸本)9781424442379
Replication is a key technique for improving fault tolerance but can introduce considerable performance overhead under some circumstances. To explore the tradeoff between performance and failure resilience, we develop a calculus that takes into consideration the I/O characteristics of applications and failure behavior of distributed storage nodes. With the developed evaluation model, we then prescribe a file system replication strategy that maximizes the utilization of computational resources for long-running and compute-intensive grid applications.
Scalability constitutes a key property of Peer-to-Peer overlay networks. Recent advances that improve scalability include super-peer infrastructures and network coordinates. As an integral part of an upcoming middlewa...
详细信息
ISBN:
(纸本)9781424442379
Scalability constitutes a key property of Peer-to-Peer overlay networks. Recent advances that improve scalability include super-peer infrastructures and network coordinates. As an integral part of an upcoming middleware for distributed computing, we have developed a mechanism that builds and maintains a super-peer overlay network in a decentralized, self-organized way, using network coordinates to facilitate inter-peer distance estimation. We discuss the benefits of super-peer infrastructures in the context of Desktop grids and present the outcome of experiments conducted on PlanetLab to observe the constructed topology's behavior in a practical environment.
Amazon's spot instances allow customers to bid on unused Amazon EC2 capacity and run those instances for as long as their bid exceeds the current spot price. Customers may expect their services at lower cost with ...
详细信息
ISBN:
(纸本)9780769549965;9781467364652
Amazon's spot instances allow customers to bid on unused Amazon EC2 capacity and run those instances for as long as their bid exceeds the current spot price. Customers may expect their services at lower cost with spot instances compared to on-demand or reserved. However, the reliability is compromised since the instances providing the service may become unavailable at any time. In this paper, we study various checkpointing schemes that can be used with spot instances. Also we devise some algorithms for checkpointing scheme on top of application-centric resource provisioning framework that increase the reliability while reducing the cost significantly.
Multi-agent based simulation (MABS) is a discrete event simulation technique used to study complex systems with entities having social and autonomous behavior. MABS applications are characterized by unpredictable exec...
详细信息
ISBN:
(纸本)9781424442379
Multi-agent based simulation (MABS) is a discrete event simulation technique used to study complex systems with entities having social and autonomous behavior. MABS applications are characterized by unpredictable execution behavior and high communication-to-computation ratio. In this paper, we propose an adaptation strategy to support efficient execution of large-scale MABS applications on typical grid infrastructures. To achieve this objective, the behavior of MABS applications and the execution environment is investigated, in order to constantly obtain performance prediction models. These models will then be used to realize dynamic load balancing and resource allocation schemes. We discuss our basic approach, initial experimental results, the planned future research and an application of our research in the transportation and logistics simulation domain.
The introduction of economic principles in grid resource management provides an interesting avenue for efficiently addressing the problem of conflicting user requirements. In shared computing infrastructures such as g...
详细信息
ISBN:
(纸本)9781424442379
The introduction of economic principles in grid resource management provides an interesting avenue for efficiently addressing the problem of conflicting user requirements. In shared computing infrastructures such as grids, such conflicting requirements are prevalent and stem from the selfish actions users follow when formulating their service requests. We develop and analyze both a centralized and a decentralized algorithm for economic resource management in the context of consumer requests for CPU bound applications with deadline-based QoS requirements and non-migratable workloads. A comparison with an algorithm recently proposed in the literature is presented with a focus on performance in terms of realized consumer value. We establish that our algorithms perform well and that they compare favorably to existing approaches.
This work focuses on the evaluation of the suitability of Mangoose++, a medical image application for reconstruction of 3D volumes, by means of Cloud computing. Due to the increasing resolution of panel detectors in c...
详细信息
ISBN:
(纸本)9781479927845
This work focuses on the evaluation of the suitability of Mangoose++, a medical image application for reconstruction of 3D volumes, by means of Cloud computing. Due to the increasing resolution of panel detectors in computed tomography and the need of lower execution times, the use of parallel implementations for clusters and accelerators have been generalized. Anyhow, the renewal and maintenance of hardware is expensive which makes Cloud computing a valuable alternative. In our evaluation, we analyze and discuss the costs and efficiency of the Mangosee++ application over Amazon EC2 platform, demonstrating that lower times can be achieved in a reasonable price compared with owned HPC-based hardware. We also provide a comparison between distinct hardware configurations so that we can infer the advantages and disadvantages of each one.
Workflow Management System is generally utilized to define, manage and execute workflow applications on grid resources. However, the increasing scale complexity, heterogeneity and dynamism of grid environment that inc...
详细信息
ISBN:
(纸本)9781424442379
Workflow Management System is generally utilized to define, manage and execute workflow applications on grid resources. However, the increasing scale complexity, heterogeneity and dynamism of grid environment that includes networks, resources and applications have made such workflow management systems brittle, unmanageable and insecure. Autonomic computing provides a holistic approach for the design and development of systems/applications that can adapt themselves to meet requirements of performance, fault tolerance, reliability, security, etc., without manual intervention. Therefore, this research aims to design and develop mechanisms for building an autonomic workflow management system that will incorporate the properties of autonomic computing and exhibit the ability to reconfigure itself to the changes in the grid environment, discover, diagnose and react to the disruptions of workflow execution, and monitor and tune grid resources automatically.
Preventing the misuse of personally identifiable information and preserving user privacy are key issues in the management of IT services, especially when organizational borders are crossed. In this paper we first pres...
详细信息
ISBN:
(纸本)9781424442379
Preventing the misuse of personally identifiable information and preserving user privacy are key issues in the management of IT services, especially when organizational borders are crossed. In this paper we first present an analysis of the differences between grid environments and previous models of inter-organizational collaboration. Based on requirements derived thereof we demonstrate how existing policy-based privacy management architectures can be extended to provide grid-specific functionality and can be integrated into existing infrastructures. Special emphasis is put on privacy policies which can be configured by users themselves, and distinguishing between the initial data access and the later data usage control phases. We also discuss the application of this approach to a XacmL-based privacy management system.
Resource information systems are a key component of Computational grids. Centralized information systems hamper scalability and reliability, and thus, completely distributed resource information systems, based on Dist...
详细信息
ISBN:
(纸本)9781424442379
Resource information systems are a key component of Computational grids. Centralized information systems hamper scalability and reliability, and thus, completely distributed resource information systems, based on Distributed Hash Tables have been proposed. In some cases resource distribution might be highly uneven, load balancing of data becomes thus a crucial problem. However, current load balancing schemes cannot handle large amounts of data corresponding to a single resource type. In this paper we propose therefore RESERV a distributed information system for grid applications with a novel load balancing approach, able to handle extreme load unbalance.
暂无评论