This paper develops a parallel computing system based on open standards, such as Extensible Markup Language (XML), Simple Object Access Protocol (SOAP), and the Common Language Runtime (CLR). To date, parallel systems based on clustered computers have been primarily restricted to the Unix and Linux family of operating systems. However, the popularity of the Microsoft Windows platform and the richness of reusable modules provided by the *** Framework make it an attractive platform. Motivated by this, a new parallel computing system based *** (***) is described. *** provides higher-level abstractions for message passing than the widely used Message Passing Interface (MPI).
We study the behavior of a new load balancing scheme applied to adaptive task partitioning for multivariate integration. Parallel performance and scalability are analyzed. Performance results are given for test families of functions with significant irregular behavior. The effects of certain integrand characteristics can be accounted for in the strategy. The scheme is incorporated in the PARINT package for parallel multivariate integration.
High-performance computing systems and cluster computers are becoming so cost-effective that even small research groups can afford them. Hence, efforts to take advantage of these widely distributed resources are becoming popular. Although recent projects provide resource management and job scheduling to support groups of computational resources across the country working together on massive problems, they have not yet fully addressed how distributed parallel programs will communicate. Therefore, we propose a new paradigm to support cluster-to-cluster (C2C) communications, which handles run-time communications between parallel programs running on distributed clusters.
This paper presents experimental research on the efficiency of a coarse-grain parallel method for training artificial neural networks with static mapping onto the processors of a parallel computer. The features of the parallel algorithm's implementation using the C language and the MPI library are described. The parallel method is evaluated by analyzing its speed-up and efficiency.
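For readers unfamiliar with the two metrics named above, the following minimal sketch uses the standard definitions: speed-up is serial time divided by parallel time, and efficiency is speed-up divided by the processor count. The function names and the timings are hypothetical illustrations, not figures from the paper.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Standard speed-up: serial run time divided by parallel run time."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("run times must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, processors: int) -> float:
    """Standard efficiency: speed-up divided by the number of processors."""
    if processors < 1:
        raise ValueError("processors must be at least 1")
    return speedup(t_serial, t_parallel) / processors


# Hypothetical timings (not from the paper): 120 s serially, 20 s on 8 processors.
print(speedup(120.0, 20.0))        # 6.0
print(efficiency(120.0, 20.0, 8))  # 0.75
```

An efficiency below 1.0, as in this made-up case, means part of the added processor time goes to communication or idle waiting rather than useful work.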
Execution times of dynamic distributed real-time systems are affected by variables that originate in external environments, and this leads to a new class of task allocation problems. A taxonomy was introduced to accommodate these external variables and help systematically understand and characterize the set of problems. Existing work can be classified with the taxonomy, and the classification also reveals many open problems.
Fault tolerance with active replication and load balancing are two complementary techniques. Combining them not only enhances a distributed system's robustness but also improves its efficiency. This paper analyzes the pros and cons of both techniques and presents a novel load balancing framework for fault-tolerant systems with active replication. A hierarchical architecture is described in detail. Further, three potential task scheduler group selection methods are presented, and their advantages and disadvantages are compared.
This paper deals with service management in large distributed systems. A framework called Remote Maintenance Shell (RMS) is presented. It enables distributed service management operations (deployment, configuration, control, monitoring, upgrading, and versioning), as well as advanced features related to verification on the actual target system. RMS is based on operations performed by mobile agents that act within an agent team. The case study elaborates a scenario in which the RMS prototype is used for managing a service employed in a distributed environment.
The Message Passing Interface (MPI) is a commonly used application programming interface for the development of portable parallel programs. It is easy, however, to create MPI programs that are prone to deadlock. It is desirable to be able to detect these deadlocks in running programs. It is further desirable to perform this deadlock detection in a distributed manner, without assuming the existence of shared memory for communication. A distributed deadlock detector has been developed that can find deadlocks with very low overhead and minimal additional communication among nodes. The detector makes use of the MPI profiling layer, allowing it to be added to a program at link time, requiring no change to or recompilation of the user's code. The detector has also been tested on widely varying MPI implementations, demonstrating its portability.
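The abstract does not say how the detector decides that a deadlock exists. As a generic illustration only, deadlock among blocked processes is often modeled as a cycle in a wait-for graph. The sketch below assumes each blocked rank waits on exactly one peer (for example, inside a blocking receive) and runs in a single process; the function name `find_wait_cycle` is hypothetical, and this is not the paper's distributed algorithm.

```python
from typing import Dict, List, Optional


def find_wait_cycle(waits_for: Dict[int, int]) -> Optional[List[int]]:
    """Return the ranks forming a wait-for cycle, or None if there is none.

    waits_for maps a blocked rank to the single rank it is waiting on
    (an assumption made for this sketch). Ranks absent from the mapping
    are not blocked.
    """
    for start in waits_for:
        path: List[int] = []
        seen_at: Dict[int, int] = {}
        rank = start
        # Follow the chain of waits until it ends or revisits a rank.
        while rank in waits_for and rank not in seen_at:
            seen_at[rank] = len(path)
            path.append(rank)
            rank = waits_for[rank]
        if rank in seen_at:
            # The chain looped back: everything from that point is the cycle.
            return path[seen_at[rank]:]
    return None


# Ranks 0 and 1 each wait on the other, so neither can progress.
print(find_wait_cycle({0: 1, 1: 0}))  # [0, 1]
# Rank 2 is not blocked, so the chain 0 -> 1 -> 2 can eventually drain.
print(find_wait_cycle({0: 1, 1: 2}))  # None
```

A distributed detector would have to gather or propagate this wait information through messages rather than a shared dictionary, which is the harder problem the paper addresses.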
This paper presents a concept called virtual clusters (VCs) to allocate resources for an application from a computing utility with a geographically distributed resource base. The VC creation process is modeled as a facility location problem, and an efficient heuristic is devised to solve it. We extend the model to include an "overload partition" in a VC so that demand surges can be handled efficiently. Extensive simulations have been conducted to examine the performance of VCs under different scenarios and to compare it with that of a fully dynamic scheme called the Service Grid. The results indicate that a VC is more cost-effective and robust than the Service Grid.
Program analysis is an important activity for evaluating and subsequently improving the quality of software. Many different visualization tools offer more or less sophisticated functionality for this task. However, the visual capabilities of a tool are usually predefined by the tool developers' intentions or are only marginally adaptable to the user's needs. In contrast, the VisWiz tool offers a means of providing user-defined visualization for the analysis of parallel and distributed programs. By configuring the mapping of observed events and their relations in an XML configuration file, users can develop specialized graphical displays that better suit their expectations and improve program comprehension. Examples of VisWiz are given for debugging, performance tuning, and runtime monitoring of parallel and distributed programs.