Large-scale scientific applications present great challenges to computational scientists in terms of obtaining high performance and in managing large datasets. These applications (most of which are simulations) may em...
详细信息
Request dispatching mechanism is at the heart of any cluster-computing environment. The goal is to support a single service provider address image for a given cluster of workstations. That is to say a given cluster pu...
详细信息
Request dispatching mechanism is at the heart of any cluster-computing environment. The goal is to support a single service provider address image for a given cluster of workstations. That is to say a given cluster publicises a single IP address and client requests are addressed to that IP address. This means, from the client's viewpoint, there are no nodes in the cluster, only services and the network names through which they are accessible. As a result, the cluster becomes a single virtual node. We propose a simple and elegant scheme for making the cluster transparent to clients outside the cluster such that the clients can interact with the cluster as if it were a single high-performance and highly available resource.
Networks of workstations (NOWs) are attractive for parallel processing due to their cost advantage. This paper investigates the performance issues in processing join operations and the inherent tradeoff in the network...
详细信息
Networks of workstations (NOWs) are attractive for parallel processing due to their cost advantage. This paper investigates the performance issues in processing join operations and the inherent tradeoff in the networked workstation environment. Specifically, we look at the performance of the nested-loop join algorithm. Since NOWs are heterogeneous in nature, load sharing is important for their performance. We evaluated the performance of three load sharing methods: static equal, static proportional and dynamic scheduling with fixed-chunk size. The three scheduling methods are evaluated on an experimental heterogeneous network of workstations with non-query background loads. Our experimental result suggest that, when there is no background load, dynamic scheduling outperforms static equal scheduling (up to 40%) and marginally better (about 10% better speedup) than the static proportional scheduling. When there is dynamic background load on nodes, dynamic scheduling provides substantial performance improvement over the static proportional scheduling (up to 50%) and static equal scheduling (up to about 100%). In all cases, selection of an appropriate chunk size is important in dynamic scheduling.
This paper presents an algorithm for scheduling communication-intensive parallel applications in workstation clusters environment. The proposed scheduling policy combines the best attributes of both space-sharing and ...
详细信息
This paper presents an algorithm for scheduling communication-intensive parallel applications in workstation clusters environment. The proposed scheduling policy combines the best attributes of both space-sharing and time-sharing principles and coexists with local schedulers (e.g., the Windows NT scheduler), which both provides coordinated scheduling and can generalise to provide a wide range of resource abstractions.
When a set of geographically distributed autonomous clusters of workstations are combined into a single large-scale virtual distributed system, a control mechanism for coordinating the activities of the combined syste...
详细信息
When a set of geographically distributed autonomous clusters of workstations are combined into a single large-scale virtual distributed system, a control mechanism for coordinating the activities of the combined system for effective utilisation of the resources is indispensable. This paper presents a co-ordination mechanism suitable for scheduling and distributing services across the host of such large-scale virtual distributed systems. The proposed scheme is scalable and reliable. Also, it combines both centralised and decentralised co-ordination mechanisms while eliminating/minimising their drawbacks.
With the increasing number of scientific applications manipulating huge amounts of data, effective high-level data management is an increasingly important problem. Unfortunately, so far the solutions to the high‐leve...
With the increasing number of scientific applications manipulating huge amounts of data, effective high-level data management is an increasingly important problem. Unfortunately, so far the solutions to the high‐level data management problem either require deep understanding of specific storage architectures and file layouts (as in high-performance file storage systems) or produce unsatisfactory I/O performance in exchange for ease-of-use and portability (as in relational DBMSs). In this paper we present a novel application development environment which is built around an active meta-data management system (MDMS) to handle high-level data in an effective manner. The key components of our three-tiered architecture are user application, the MDMS, and a hierarchical storage system (HSS). Our environment overcomes the performance problems of pure database-oriented solutions, while maintaining their advantages in terms of ease-of-use and portability. The high levels of performance are achieved by the MDMS, with the aid of user-specified, performance-oriented directives. Our environment supports a simple, easy-to-use yet powerful user interface, leaving the task of choosing appropriate I/O techniques for the application at hand to the MDMS. We discuss the importance of an active MDMS and show how the three components of our environment, namely the application, the MDMS, and the HSS, fit together. We also report performance numbers from our ongoing implementation and illustrate that significant improvements are made possible without undue programming effort.
Our results demonstrate that our novel application development environment provides both ease-of-use and high performance for large-scale, I/O-intensive scientific applications.
ISBN:
(纸本)9781581132700
Our results demonstrate that our novel application development environment provides both ease-of-use and high performance for large-scale, I/O-intensive scientific applications.
This paper focuses on buffer management issues in wormhole-routed torus multicomputer networks. The commonly used buffer organizations are the centralized and dedicated buffer organizations. The results presented in t...
详细信息
Network of workstations (NOW) is a cost-effective alternative to a multiprocessor system. Here we propose a centralized architecture for parallel query processing on network of workstations. We describe a three-level ...
详细信息
With the increasing number of scientific applications manipulating huge amounts of data, effective data management is an increasingly important problem. Unfortunately, so far the solutions to this data management prob...
详细信息
With the increasing number of scientific applications manipulating huge amounts of data, effective data management is an increasingly important problem. Unfortunately, so far the solutions to this data management problem either require deep understanding of specific storage architectures and file layouts (as in high-performance file systems) or produce unsatisfactory I/O performance in exchange for ease-of-use and portability (as in relational DBMSs). In this paper we present a new environment which is built around an active meta-data management system (MDMS). The key components of our three-tiered architecture are user application, the MDMS, and a hierarchical storage system (HSS). Our environment overcomes the performance problems of pure database-oriented solutions, while maintaining their advantages in terms of ease-of-use and portability. The high levels of performance are achieved by the MDMS, with the aid of user-specified directives. Our environment supports a simple, easy-to-use yet powerful user interface, leaving the task of choosing appropriate I/O techniques to the MDMS. We discuss the importance of an active MDMS and show how the three components, namely application, the MDMS, and the HSS, fit together. We also report performance numbers from our initial implementation and illustrate that significant improvements are made possible without undue programming effort.
暂无评论