Deployment of distributed applications on large systems, and especially on grid infrastructures, becomes a more and more complex task. Grid users spend a lot of time to prepare, install and configure middleware and ap...
详细信息
ISBN:
(纸本)3540482741
Deployment of distributed applications on large systems, and especially on grid infrastructures, becomes a more and more complex task. Grid users spend a lot of time to prepare, install and configure middleware and application binaries on nodes, and eventually start their applications. The problem is that the deployment process is composed of many heterogeneous tasks that have to be orchestrated in a specific correct order. As a consequence, the automatization of the deployment process is currently very difficult to reach. To address this problem, we propose in this paper a generic deployment framework allowing to automatize the execution of heterogeneous tasks composing the whole deployment process. Our approach is based on a reification as software components of all required deployment mechanisms or existing tools. Grid users only have to describe the configuration to deploy in a simple natural language instead of programming or scripting how the deployment process is executed. As a toy example, this framework is used to deploy CORBA component-based applications and OpenCCM middleware on one thousand nodes of the French Grid5000 infrastructure.
This article presents a method for evaluating the CPU power, independently from the system used, in heterogeneous networks of work stations. It is based on the use of Java language in order to ensure application porta...
详细信息
ISBN:
(纸本)0769522106
This article presents a method for evaluating the CPU power, independently from the system used, in heterogeneous networks of work stations. It is based on the use of Java language in order to ensure application portability and more particularly on the mechanism of thread CPU processing time measurement introduced in the version 1.5 of Sun Java. That tool will be integrated into the load balancing mechanism which is totally written in Java and that we developed in the LIFL project ADAJ. We show how to evaluate the potential power of the CPU with a software totally written in Java. Moreover, we will justify the results provided by our approach. We will also analyse the exploitation of the calibration tool in order to improve the execution time of parallel and distributed applications in the context of load balancing in a network of workstations.
This paper proposes a decentralised system for managing Desktop Grid (DG). The idea is to bypass the main drawback of existing systems putting all the control on a single master that can fails. Here, each node can pla...
详细信息
ISBN:
(纸本)9781605583655
This paper proposes a decentralised system for managing Desktop Grid (DG). The idea is to bypass the main drawback of existing systems putting all the control on a single master that can fails. Here, each node can play alternatively the role of client or server. Our main contribution is to design the PastryGrid protocol (based on Pastry) for DG in order to decentralise the execution of a distributed application with precedence between tasks. Comparing to a centralised system, we evaluate our approach over 205 machines executing 2500 tasks. The results show that our decentralised system runs better than the same system configured as a master/slave because it gives less overhead. Copyright 2008 ACM.
This paper proposes a decentralized and fault-tolerant software system for the purpose of managing Desktop Grid resources. Its main design principle is to eliminate the need for a centralized server, therefore to remo...
详细信息
This paper proposes a decentralized and fault-tolerant software system for the purpose of managing Desktop Grid resources. Its main design principle is to eliminate the need for a centralized server, therefore to remove the single point of failure and bottleneck of existing Desktop Grids. Instead, each node can play alternatively the role of client or server. Our main contribution is to design the PastryGrid protocol (based on Pastry) for Desktop Grid in order to support a wider class of applications, especially the distributed application with precedence between tasks. Compared with a centralized system, we evaluate our approach over 205 machines executing 2500 tasks. The results we obtain show that our decentralized system outperforms XtremWeb-CH which is configured as a master/slave, with respect to the turnaround time. Copyright (C) 2009 John Wiley & Sons, Ltd.
This paper presents an event routing service (ERS) that represents a pattern for the establishment and processing of lightweight sessions between a source and a destination of events. It is based on the CORBA notifica...
详细信息
This paper presents an event routing service (ERS) that represents a pattern for the establishment and processing of lightweight sessions between a source and a destination of events. It is based on the CORBA notification service. To illustrate the features of the proposed service, its application in computer telephony is discussed in detail. Most of the positive features of the ERS are inherited from the CORBA notification service, but its implementation requires a novel protocol describing the lightweight session establishment mechanism. The implementation of ERS is described and its application to routing events in a computer telephony system is presented. The paper concludes with a performance measurement study. Copyright (C) 2002 John Wiley Sons, Ltd.
Nowadays, various frameworks are emerging for supporting distributed tracing techniques over microservices-based distributed applications. The objective is to improve observability and management of operational proble...
详细信息
Nowadays, various frameworks are emerging for supporting distributed tracing techniques over microservices-based distributed applications. The objective is to improve observability and management of operational problems of distributed applications, considering bottlenecks in terms of high latencies in the interaction among the deployed microservices. However, such frameworks provide information that is disjoint from the management information that is usually collected by cloud computing orchestration platforms. There is a need to improve observability by combining such information to easily produce insights related to performance issues and to realize root cause analyses to tackle them. In this paper, we provide a modern observability approach and pilot implementation for tackling data fusion aspects in edge and cloud computing orchestration platforms. We consider the integration of signals made available by various open-source monitoring and observability frameworks, including metrics, logs and distributed tracing mechanisms. The approach is validated in an experimental orchestration environment based on the deployment and stress testing of a proof-of-concept microservices-based application. Helpful results are produced regarding the identification of the main causes of latencies in the various application parts and the better understanding of the behavior of the application under different stressing conditions.
A new mathematical solution to the problem of water flow through snow is presented and its implementation in a snowmelt model, SNAP, is discussed. An analytical solution for vertical water flow through homogeneous sno...
详细信息
A new mathematical solution to the problem of water flow through snow is presented and its implementation in a snowmelt model, SNAP, is discussed. An analytical solution for vertical water flow through homogeneous snow is developed and formulae that allow the solution to accommodate time-varying surface input from rain or snowmelt are derived. This facilitates use of the technique in a computer snowmelt model. Because the new technique requires no matrix computation, it is sufficiently computationally efficient to be a candidate for use in watershed-scale, distributed forecasting systems. Because it is a physically based model that takes into account the effect of the snow itself on the timing and magnitude of outflow, the model allows more accurate prediction of the magnitude and timing of snowmelt than in currently employed operational models. Results of the new model agree well with previous theoretical solutions and with field measurements of melt and rain-on-snow events in a seasonal snow pack. (C) 1998 John Wiley & Sons, Ltd.
Cloud provides users with a new model of utilizing the computing infrastructure with the ability to perform parallel and distributed computations using elastic large virtual ***,the multilevel and complex features mak...
详细信息
Cloud provides users with a new model of utilizing the computing infrastructure with the ability to perform parallel and distributed computations using elastic large virtual ***,the multilevel and complex features make cloud computing system more prone to *** this paper we present a multi-level fault-tolerant system for distributed applications in cloud named *** CDMCR system backups the complete state of applications periodically with a snapshot-based distributed checkpointing protocol,including file system ***,we cannot only recover processes but also rollback data.A multi-level recovery strategy is proposed which includes process-level recovery,virtual machine (VM) recreation and host rescheduling,enabling comprehensive and efficient fault tolerance for different components in *** deploy CDMCR as PaaS,so that users can be liberated from node management and system configuration,and get access to fault-tolerant service *** have implemented this system based on the Xen virtualization platform and the OpenNebula cloud *** on the prototype demonstrate the correctness of our *** shows that CDMCR does not cause message loss or data loss,and the backup time remains nearly constant as the number of nodes increases on virtual cluster.
distributed applications are very hard to write and verify. Even with extensive testing and debugging, errors may persist. A distributed application can be viewed as a collection of processes that execute a number of ...
详细信息
ISBN:
(纸本)9781424497881;9781424497898
distributed applications are very hard to write and verify. Even with extensive testing and debugging, errors may persist. A distributed application can be viewed as a collection of processes that execute a number of atomic actions. The notion of atomicity can be employed to significantly reduce the state space to be considered in verification. Moreover, atomicity violations in a run typically indicate the presence of program bugs. In this paper, we exploit the notion of atomicity of a code block to simplify the debugging and verification of distributed applications. The notion of an atomic action has been formally defined and an algorithm to detect atomicity violations has been developed.
Highly distributed networks have now become a common infrastructure for wide-area distributed applications whose key design principle is network awareness, namely the ability to deal with dynamic changes of the networ...
详细信息
Highly distributed networks have now become a common infrastructure for wide-area distributed applications whose key design principle is network awareness, namely the ability to deal with dynamic changes of the network environment. Network-aware computing has called for new programming languages that exploit the mobility paradigm as a basic interaction mechanism. In this paper we present the architecture of KLANA, an experimental Java package for distributed applications and code mobility. We describe how KLAVA permits code mobility by relying on Java and present a fend distributed applications that exploit mobile code programmed in KLAVA. Copyright (C) 2002 John Wiley Sons, Ltd.
暂无评论