There is active research around the world to develop. fundamental technologies that would provide pervasive, dependable, consistent and inexpensive access to advanced computational capabilities of the grid system allo...
详细信息
ISBN:
(纸本)1892512459
There is active research around the world to develop. fundamental technologies that would provide pervasive, dependable, consistent and inexpensive access to advanced computational capabilities of the grid system allowing scientists to solve the most challenging computational problems. Achieving large-scale distributedcomputing in a seamless manner on gridcomputing introduces not only the problem of efficient utilization and satisfactory response time but also the problem of fault-tolerance. In particle, the scheduling middleware is the critical infrastructure of gridcomputing as it provides services upon which applications depend heavily. As such the scheduling middleware requires constant, dependable operation in the face of various failures and other disruptive events that might cause a loss of service. In this paper, we present a reconfigurable multi-layered grid scheduling infrastructure that provides fault-tolerance mechanisms to ensure that a grid client can obtain reliable services, even if the scheduler that provides the desired services may suffer from crash failures.
We investigate the problem of efficient computation of a partition of a Random Geometric Graph (RGG) into a limited number of densely packed bipartite grid sub graphs. The study focuses on the collection of subgraphs ...
详细信息
ISBN:
(纸本)9781538639917
We investigate the problem of efficient computation of a partition of a Random Geometric Graph (RGG) into a limited number of densely packed bipartite grid sub graphs. The study focuses on the collection of subgraphs each individually having similar size and structure and the union employing most (e.g. over 85%) of the vertices. The residual vertices we seek to minimize are attributed to the inherent variations in densities of the randomly placed vertices and to any shortcomings of our greedy algorithms. RGG's have been extensively employed in recent times to model the deployment of numerous instances of Wireless Sensor Networks (WSN's) Pl. The properties investigated in our selected bipartite grid backbones are those deemed most relevant for applications to the foundations of this widely growing field. distributed algorithms are primarily used to determine backbones. Our results review what backbone grid partitions exist in the data. This provides a metric to measure the effectiveness of any distributed algorithm against an existing optimal result. The visual display of selected backbone grids suggests local algorithm design strategies. Furthermore, these partitions must be efficiently computable for highly scalable computation, e.g. WSN's with 100's of thousands of vertices and millions of edges in the resulting RGG. We consider distributions over a segment of the plane and over the surface of the sphere to model sensor distributions both in limited planar regions, all around the globe or on distant planets.
grid technologies are emerging as the next generation of distributedcomputing, allowing the aggregation of heterogeneous resources that are geographically distributed. The heterogeneous nature of the grid makes it mo...
详细信息
ISBN:
(纸本)9781467356930
grid technologies are emerging as the next generation of distributedcomputing, allowing the aggregation of heterogeneous resources that are geographically distributed. The heterogeneous nature of the grid makes it more vulnerable to faults which lead to either the failure of the job or delay in completing the execution of the job. Checkpointing is one of the many fault tolerance techniques which are used to make grid more efficient and reliable. In this paper we have developed an application checkpointing based fault tolerance technique for Alchemi based grid environment. In this technique application threads generate their checkpoints and store them in the checkpoint table at the manager node. In case a thread fails checkpoint of the corresponding thread is used to resume the execution from the point of failure. This technique introduces a slight overhead in fault free situations but very effective in case of a node failure. Increased checkpoint frequency improves job's resuming capability but also increases the overhead of generating and storing checkpoints which results in increased processing time of the job.
The grid is the next generation computing infrastructure able to handle the growing requirements for computing power. Portals are anticipated as the user's access point to these resources. The whole grid-portal in...
详细信息
ISBN:
(纸本)1892512491
The grid is the next generation computing infrastructure able to handle the growing requirements for computing power. Portals are anticipated as the user's access point to these resources. The whole grid-portal infrastructure constitutes a distributed environment in which efficient and flexible communication manners play a key role. The emerging web services technology has been chosen as the best solution for organizing communication in grid-portal systems. In this paper we would like to present the PROGRESS grid-portal environment in which we implemented web services communication between distributed modules of the system.
gridcomputing is a new paradigm for distributedcomputing, and service has become building block of grid applications. However, current approaches can not free developers from low-level laborious work when building g...
详细信息
ISBN:
(纸本)9780769527369
gridcomputing is a new paradigm for distributedcomputing, and service has become building block of grid applications. However, current approaches can not free developers from low-level laborious work when building grid applications. We propose a service-oriented virtual machine called Abacus Virtual Machine to simplify the task of grid application development. As a language level virtual machine, it provides a service-oriented instruction set to abstract the operations on the services of a grid application. It also virtualizes services and creates a virtual global system image for grid applications, thus services can be transparently distributed and shared In this way, Abacus Virtual Machine hides the cumbersome underlying details from programmers and reduces the complexity greatly in grid application development.
In this paper we propose a systematic approach to performance analysis of workflow applications on the grid. We introduce an ideal model for the workflow execution time and explain the difference to the real measured ...
详细信息
ISBN:
(纸本)9781424403431
In this paper we propose a systematic approach to performance analysis of workflow applications on the grid. We introduce an ideal model for the workflow execution time and explain the difference to the real measured times based on a hierarchy of performance overheads for gridcomputing. We describe how to systematically measure and compute the overheads from individual activities to entire workflow applications. We adjusted well-known parallel processing metrics to the scope of gridcomputing, comprising speedup and efficiency. We have implemented and largely automatised our analysis approach in the context of the ASKALON grid application development and computing environment. We present experimental results that show detailed overhead analysis of two real-world workflow applications executed in a national grid environment.
Smart grid initiatives aim to overlay the existing power grid infrastructure with a communication and computation infrastructure to enable integration of renewable resources and increased efficiency and reliability of...
详细信息
ISBN:
(纸本)9780769543611
Smart grid initiatives aim to overlay the existing power grid infrastructure with a communication and computation infrastructure to enable integration of renewable resources and increased efficiency and reliability of the electric power grid. This requires substantial changes to the existing centralized control architecture as well as the development of drastically more powerful communication and computation infrastructure. In this paper, we describe the elements required for implementation of a "Prosumer" based distributed control architecture for smart grid. We illustrate the operation of this distributed control architecture through the case study of a Demand Response application. We propose a Hard Real Time enabled, Web Services based computing infrastructure that can support the development as well as the operation of this distributed control architecture. Main components of this computing infrastructure are a Generic Real Time API (Generic RT-API) for platform independence, a Rule based Configurable API Translator, a Hard Real Time Web Services (HRT-WS) Engine and the ability to receive XML based configuration/control instructions remotely. We describe the status of our prototype implementation for this computing infrastructure using LXRT/RTAI based Hard Real Time Linux environment.
In relation to the mathematics of financial applications, the present study deals with the solution of the time dependent obstacle problem defined in a three-dimensional domain;this problem arises in the pricing of Am...
详细信息
ISBN:
(纸本)9781905088423
In relation to the mathematics of financial applications, the present study deals with the solution of the time dependent obstacle problem defined in a three-dimensional domain;this problem arises in the pricing of American options derivatives. In order to solve large scale algebraic systems derived from the discretization of the obstacle problem, the parallelization of the numerical algorithm is necessary. So, we present parallel synchronous, and more generally asynchronous, iterative algorithms to solve this problem. For the considered problem, arguments implying the convergence of parallel synchronous and asynchronous algorithms are given in a general framework. Finally, computational experiments on grid' 5000, the French national grid, are presented and analyzed. They allow us to compare both synchronous and asynchronous versions with local and distributed clusters and to show the interest of such methods in the context of gridcomputing.
Shared temporary storage space is often the constraining resource for clusters that serve as execution nodes in wide-area distributed systems. At least one large national-scale computinggrid has reported a failure ra...
详细信息
ISBN:
(纸本)9781424403431
Shared temporary storage space is often the constraining resource for clusters that serve as execution nodes in wide-area distributed systems. At least one large national-scale computinggrid has reported a failure rate of as high as thirty percent of submitted jobs, often due to accidentally filled shared storage spaces. Previous systems have attacked this problem by adding space allocation to the distributed system interface. However, these allocations are not enforced at the filesystem level, and thus unexpected or unaccounted uses of storage may cause the system to fail. By adding an inexpensive allocation mechanism to the operating system, we may improve the robustness of such systems at minimal cost. In this paper, we describe an abstract model of space allocation in the file system and explore three implementations of the model: a user-level library, a recursive loopback filesystem, and a modified kernel filesystem. We evaluate the performance and completeness of these implementations and demonstrate that kernel support is essential to keeping the overhead low. Finally, we demonstrate empirically that a cluster under heavy filesystem load can be made more robust by adding allocations to the filesystem.
gridcomputing is a method of distributedcomputing to manage and share the networked resources, data domains, storage procedure, processing power across geologically distributed locations. In the computational grid e...
详细信息
ISBN:
(纸本)9781538619599
gridcomputing is a method of distributedcomputing to manage and share the networked resources, data domains, storage procedure, processing power across geologically distributed locations. In the computational grid environment, information services are essential to provide information about various computational resources. This information includes configuration of resources, policies, agreement that are managed by both higher level and lower level schedulers. Resource information aggregation schemes are used to reduce the quantity of information exchanged between the networked resources. Through information aggregation, the characteristics of resources are potted and then sent to the scheduling actions. In this paper we are proposing a separate module named RIM (Resource Information Module) for information aggregation in each domain. The experimental results show that the proposed aggregation schemes achieve large information reduction and better resource selection scheme for enabling improved task scheduling decisions.
暂无评论