OMG CORBA applications require a distributed naming service in order to install and to retrieve object references. high availability of the naming service is important since most CORBA applications need to access it a...
详细信息
OMG CORBA applications require a distributed naming service in order to install and to retrieve object references. high availability of the naming service is important since most CORBA applications need to access it at least once during their lifetime. Unfortunately, the OMG standards do not deal with availability issues;the naming services of many of the commercially available CORBA object request brokers introduce single points of failure. In this paper we describe the design and implementation of a replicated, highly-available CORBA name server that adheres to the OMG Common Object Services Specification. Our naming service can be replicated at run-time, while many applications are installing and retrieving object references. We compare our approach withthe approaches taken by the ILU, NEO, Orbix, and DOME object request brokers. the performance of our name server is measured for various degrees of replication.
this paper introduces a new architecture for a fault-tolerant computer system which connects high-end PCs or workstations by a high-speed network. To achieve platform independence, coupling is based on the widely used...
详细信息
this paper introduces a new architecture for a fault-tolerant computer system which connects high-end PCs or workstations by a high-speed network. To achieve platform independence, coupling is based on the widely used PCI-bus. In contrast to commercially available fault-tolerant systems we strongly emphasize mechanisms for tolerating transient and intermittent faults. To keep hardware costs low the system is built with off-the-shelf computers and their extensions are kept as small as possible. To reduce the operational costs the system can be dynamically adapted to different demands on fault tolerance on a program-by-program basis. Adaptation is done transparently to the application software by the operating system. We use a commercially available real-time operating system with a POSIX-compliant UNIX-interface. the bandwidth of fault tolerance reaches from a non-redundant system of stand-alone computers, a master/checker configuration to a TMR-system. the high-performance network allows the system to operate as a parallel multicomputer, too.
In this paper, we discuss the runtime support required for the parallelization of unstructured data-parallel applications on nonuniform and adaptive environments. the approach presented is reasonably general and is ap...
详细信息
In this paper, we discuss the runtime support required for the parallelization of unstructured data-parallel applications on nonuniform and adaptive environments. the approach presented is reasonably general and is applicable to a wide variety of regular as well as irregular applications. We present performance results for the solution of an unstructured mesh on a cluster of heterogeneous workstations.
OMG CORBA applications require a distributed naming service in order to install and to retrieve object references. high availability of the naming service is important since most CORBA applications need to access it a...
详细信息
OMG CORBA applications require a distributed naming service in order to install and to retrieve object references. high availability of the naming service is important since most CORBA applications need to access it at least once during their lifetime. Unfortunately, the OMG standards do not deal with availability issues; the naming services of many of the commercially available CORBA object request brokers introduce single points of failure. In this paper we describe the design and implementation of a replicated, highly-available CORBA name server that adheres to the OMG Common Object Services Specification. Our naming service can be replicated at run-time, while many applications are installing and retrieving object references. We compare our approach withthe approaches taken by the ILU, NEO, Orbix, and DOME object request brokers. the performance of our name server is measured for various degrees of replication.
Discusses the emergent World Wide Web-based distributed environments for high-performancecomputing and communications (HPCC) on the National Information Infrastructure (NII) withthe focus on Java as an enabling tech...
详细信息
ISBN:
(纸本)9780818675829
Discusses the emergent World Wide Web-based distributed environments for high-performancecomputing and communications (HPCC) on the National Information Infrastructure (NII) withthe focus on Java as an enabling technology. We start with a review of the past, present and near-term future of the "Java phenomenon", exposed in the background of some related previous approaches towards a distributed interpretative virtual machine architecture. Next, we discuss the anticipated role of Java in building distributed Web-based computing environments. We outline an evolutionary path from the current Web technology "soup" towards "all-Java" systems and we illustrate this process in terms of the Northeast Parallel architectures Center (NPAC) Web technology prototypes (WebVM, WebFlow, Bridge-based Collaboratory) and selected applications (CareWeb, 3D Visible Human).
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will be a common occurrence. Unfortunately,...
详细信息
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will be a common occurrence. Unfortunately, most parallel processing systems have not been designed with fault-tolerance in mind. Mentat is a high-performance object-oriented parallel processing system that is based on an extension of the data-flow model. the functional nature of data-flow enables both parallelism and fault-tolerance. In this paper, we exploit the data-flow underpinning of Mentat to provide easy-to-use and transparent fault-tolerance. We present results on both a small-scale network and a wide-area heterogeneous environment that consists of three sites: the National Center for Supercomputing Applications, the University of Virginia and the NASA Langley Research Center.
In order to use networks of workstations in parallel processing applications, several schemes have been devised to allow processes on different, possibly heterogeneous, platforms to communicate with one another. the M...
详细信息
In order to use networks of workstations in parallel processing applications, several schemes have been devised to allow processes on different, possibly heterogeneous, platforms to communicate with one another. the Message-Passing Interface (MPI) is one such scheme that allows for message-passing across different architectures. the MPI specification does not make provisions for the migration of a process between machines. this paper describes the work required to modify, an MPI implementation to allow for task migration. It also describes "Hector", our heterogeneous computing task allocator that is used to migrate tasks automatically and improve the overall performance of a parallel program.
Networks of workstations provide an economic solution for scalable computing because they do not require specialized components. Even though recent advances have shown that it is possible to obtain high bandwidth betw...
详细信息
ISBN:
(纸本)9780818675829
Networks of workstations provide an economic solution for scalable computing because they do not require specialized components. Even though recent advances have shown that it is possible to obtain high bandwidth between applications, interconnect latency remains a serious concern. We present CNI, a cluster network interface that not only provides both low latency and high bandwidth but also efficiently supports multiple programming paradigms. this is done by functionally coupling the network adaptor board more closely to the CPU without changing the standard workstation architecture. CNI results in performance gains for applications, substantially reducing communication overhead and delay.
Two facts that suggest the desirability of a hierarchical approach to cost-effective high-performancecomputing are empirically established in this paper. the first fact is the temporal locality of programs with respe...
详细信息
Two facts that suggest the desirability of a hierarchical approach to cost-effective high-performancecomputing are empirically established in this paper. the first fact is the temporal locality of programs with respect to the degree of parallelism. Two temporal (instruction and data) locality principles are identified and empirically established for a set of programs. the impact of this behavior is discussed with respect to the proposed heterogeneous multilevel architecture. the second fact that supports the hierarchical architecture is the cost-efficiency advantage of heterogeneous over homogeneous multiprocessor systems. An initial performance analysis is presented which quantifies this fact for the proposed heterogeneous hierarchical organization. the proposed multilevel processor configuration uses fast and costly resources sparingly to reduce sequential and low parallelism bottlenecks. the resulting organization tries to balance cost, speed and parallelism granularity.
Rapidly expanding cellular communication technology, wireless LANs and satellite services have made it possible for mobile users to access information anywhere and at any time. In a mobile computing environment replic...
详细信息
Rapidly expanding cellular communication technology, wireless LANs and satellite services have made it possible for mobile users to access information anywhere and at any time. In a mobile computing environment replication might be considered as an essential technique providing reliability, throughput increase and data availability. this paper addresses the replica control protocols with an emphasis on workstation mobility issues. the modifications that have to be made to the primary copy method for replicated database management strategies in order to address the effect of mobility on the existing replica control protocols are analysed and proposed. A variation of the primary copy algorithm, called virtual primary copy method is proposed and it is shown that this method is well suited for the distributed mobile computing environment. the performance of virtual primary copy method comparative to traditional primary copy method using computer simulation is analysed.
暂无评论