The use of logs to provide recovery from failures in transaction systems.is well known. Checkpointing is also a familiar technique for speeding restart from failures. However, most work on logs and checkpointing has c...
详细信息
ISBN:
(纸本)0818605014
The use of logs to provide recovery from failures in transaction systems.is well known. Checkpointing is also a familiar technique for speeding restart from failures. However, most work on logs and checkpointing has considered only centralized systems. In this paper a logging, checkpointing, and restart mechanism is described for distributedsystems. Moreover, nested transactions are used to enhance the performance and flexibility of the design. The result is that actions occurring at different sites can be significantly decoupled while avoiding any domino effect. Further, unreliability of one site has a limited impact on performance elsewhere.
Two design rules which aid the construction of distributed computing systems.and the provision of fault tolerance are described, namely that: (i) a distributed computing system should be functionally equivalent to the...
详细信息
ISBN:
(纸本)0818605014
Two design rules which aid the construction of distributed computing systems.and the provision of fault tolerance are described, namely that: (i) a distributed computing system should be functionally equivalent to the individual computing systems.of which it is composed, and (ii) fault tolerant systems.should be constructed from generalized fault tolerant components. The reasoning behind these two 'recursive structuring principles', and the consequences of attempting to adhere to them, are discussed. Where appropriate this discussion is illustrated by reference to a distributed system based on UNIX that is now operational at Newcastle and several other locations. This system has been implemented by adding a software subsystem, known as the Newcastle Connection, to each of a set of UNIX systems. By this means the authors has constructed a distributed system which is functionally equivalent at both the user and the program level to a conventional uniprocessor UNIX system.
Many proposals have been made for structuring distributed programs. This paper looks at one such proposal, the one embedded in the Argus programming language and system. The paper provides a discussion of decisions ma...
详细信息
ISBN:
(纸本)0818605014
Many proposals have been made for structuring distributed programs. This paper looks at one such proposal, the one embedded in the Argus programming language and system. The paper provides a discussion of decisions made in the two major areas of process structure and communication, and compares the chosen structures with alternatives. The paper emphasizes the rationale for decisions and the issues that must be considered in making such decisions.
In this paper the authors present fault-tolerant softwarereliability models based on component reliabilities. Two methods for estimating component reliabilities and the associated variances are given along with an ap...
详细信息
ISBN:
(纸本)0818605014
In this paper the authors present fault-tolerant softwarereliability models based on component reliabilities. Two methods for estimating component reliabilities and the associated variances are given along with an approach for calculating the system reliability estimate variance. The derived models are used as a basis for discussing trade-offs between the two dominate fault-tolerant software strategies, Recovery Blocks and N-Version Programming.
This paper describes an object-oriented design model for structuring reliable distributedsystems. A system is viewed as a collection of objects that are accessed and modified by transactions. Recovery techniques are ...
详细信息
ISBN:
(纸本)0818605014
This paper describes an object-oriented design model for structuring reliable distributedsystems. A system is viewed as a collection of objects that are accessed and modified by transactions. Recovery techniques are incorporated to make transactions atomic in the presence of component crashes and concurrent operations. Atomicity of transactions is based on constructing recoverable objects using multiple versions and commit protocols. These concepts are extended to nested transactions. The operations on distributed objects are performed as remote procedure calls. This requires implementation of remote procedure calls in a reliable fashion. The facilities of reliable nested transactions and remote procedure calls are used to synthesize distributed objects that are highly reliable.
An analysis is made of a number of interesting actual or potential parallels between the problems and techniques associated with achieving high reliability, and those associated with the provision of security, in dist...
详细信息
ISBN:
(纸本)0818606908
An analysis is made of a number of interesting actual or potential parallels between the problems and techniques associated with achieving high reliability, and those associated with the provision of security, in distributed computing systems.
This paper describes the requirements and attributes of the Experimental distributed Processing Facility (EDPF). The EDPF will provide a flexible and dynamically reconfigurable distributed environment that will be use...
详细信息
ISBN:
(纸本)0818605014
This paper describes the requirements and attributes of the Experimental distributed Processing Facility (EDPF). The EDPF will provide a flexible and dynamically reconfigurable distributed environment that will be used to address generic distributed processing issues as applied to the needs of the future battlefield. Survivability and reliability are the key issues under investigation. The EDPF will integrate the requirements for standard hardware and software. This facility will focus on loosely coupled nodes over newly developed (e. g. , fiber optics) or transitional communications media. All designs and implementations performed on the EDPF will use the Ada programming language. The EDPF will be used in a laboratory environment where system performance analysis can be performed on various distributed processing models. The EDPF will be capable of being deployed in a field environment for future tactical demonstrations.
作者:
Bhargava, BharatUniv of Pittsburgh
Dep of Computer Science Pittsburgh PA USA Univ of Pittsburgh Dep of Computer Science Pittsburgh PA USA
It is shown that the optimistic approach to concurrency control lends itself naturally towards a design of a reliable distributeddatabase system. In addition, the functions of concurrency control, integrity control, ...
详细信息
It is shown that the optimistic approach to concurrency control lends itself naturally towards a design of a reliable distributeddatabase system. In addition, the functions of concurrency control, integrity control, and atomicity control can be integrated. This integration can provide a higher degree of concurrency and continuity of operations.
Evidence of distribution, such as work distribution, distributed file and data base management, etc. , creates a number of reliability issues that must be addressed by the system software. Many of these are similar to...
详细信息
Evidence of distribution, such as work distribution, distributed file and data base management, etc. , creates a number of reliability issues that must be addressed by the system software. Many of these are similar to problems arising in centralized systems.however, there are a number of new issues as well as complications of common ones. Some of the specific reliability issues raised by the organizational and operational characteristics of fully distributed processing systems.(FDPS's) are outlined. It is important to bear in mind that decision making is decentralized and that there are no permanent master-slave relationships in an FDPS.
Independently updating copies of a database during a network partition may lead to data inconsistencies and anomalous behavior. To better cope with these problems, we propose principles for the design of the database ...
详细信息
ISBN:
(纸本)0818605014
Independently updating copies of a database during a network partition may lead to data inconsistencies and anomalous behavior. To better cope with these problems, we propose principles for the design of the database schema and transactions. The authors also propose a tool (Data-patch) that aids the database administrator in the development of an integration program for database copies that diverged.
暂无评论