ISBN: 0818684275 (print)
Recent advances in Internet connectivity and implementations of safer distributed computing through languages such as Java provide the foundation for transforming computing resources into tradable commodities. We have developed Javelin, a Java-based prototype of a globally distributed, heterogeneous, high-performance computational infrastructure that conveniently enables rapid execution of massively parallel applications. Our infrastructure consists of three entities: hosts, clients, and brokers. Our goal is to allow users to buy and sell computational power, using supply and demand and market mechanisms to marshal computational power far beyond what can be achieved via conventional techniques. Several research issues must be worked out to make this vision a reality: allocating resources between computational objects via market mechanisms; expressing and enforcing scheduling and quality of service constraints; modeling programming in a global computing ecosystem; supporting heterogeneous execution without sacrificing computational speed; ensuring host security; global naming and communication; and client privacy.
With the current advances in computing and network technology and software, the gap between parallel and distributed computing environments is gradually becoming narrower. Consequently, parallel programs run on paralle...
ISBN: 0819425885 (print)
This paper describes two different parallel computing approaches for image processing problems on a Pentium-based multiprocessor system. These multiprocessor computers are often used as network servers. We demonstrate the use of one of these machines, equipped with four Intel Pentium processors, for a parallel image processing task. A parallel computation of motion vector fields based on correlation techniques is discussed to show the possible acceleration. The computational results show that high efficiency can be reached; even linear speedup is possible under certain conditions. Besides the correlation technique mentioned, there are various image processing problems that can easily be evaluated in parallel. Although massively parallel systems and special-purpose systems are much faster, off-line image processing can be accelerated by using these broadly available low-cost machines.
ISBN: 3540636978 (print)
The proceedings contain 62 papers. The special focus in this conference is on Evaluation, Performance, Extensions and Improvements. The topics include: comparison of PVM and MPI performance in short-range molecular dynamics simulation; analyzing the performance of MPI in a cluster of workstations based on Fast Ethernet; a comparison of MPI performance on different MPPs; predicting the performance of injection communication patterns on PVM; evaluation of the communication performance on a parallel processing system; evaluating personal high performance computing with PVM on Windows and Linux environments; experimental evaluation of PVM group communication; ARCH, an object oriented MPI-based library for asynchronous and loosely synchronous parallel system programming; an extension to MPI for distributed computing on MPPs; heterogeneous MPI application interoperation and process management under PVMPI; advanced capabilities in PVM 3.4; using PVM to build an interface to support cooperative work in a distributed systems environment; towards portable message passing in Java; control activities in message passing environment; global semaphores in a parallel programming environment; a synchronizing shared memory; communication in multi-physics applications; embedding SCI into PVM; breaking the curse of dynamics by task migration; PVM on the RHODOS distributed operating system; a function to dynamic workload allocation in distributed applications; integrating PVaniM into WAMM for monitoring meta-applications; evaluation of parallel application's behavior in message passing environment; message-passing program development by ensemble and interaction patterns detection in PVM programs to support simulation.
ISBN: 0818677430 (print)
Although quite a few large-scale applications have been ported to multiprocessors in recent years, no framework has yet been set up for designing parallel and distributed applications from sequential programs that takes into account the specific needs of large-scale scientific applications, where programs have complex, irregular control flow and data structures. This paper proposes such a framework and reports on its application in the portable parallelization of the industrial Computational Fluid Dynamics (CFD) software package TfC (TASCflow for CAD) within the scope of an interdisciplinary research project.
ISBN: 0780341473 (print)
This paper proposes that a parallel implementation of the genetic algorithm (GA) on the Internet will improve the algorithm's performance. It is motivated by the possibility of aiding research into complex search and optimization problems that use the GA. Requirements and constraints regarding parallelization of the GA are identified. A parallel GA is developed for an ideal PRAM architecture and is shown to have an asymptotic running time of O(log n), an improvement over the sequential GA. A parallel GA is also designed for a Unix network and has an asymptotic running time comparable to the ideal system. The algorithm is a decentralized, asynchronous, and fault-tolerant design that matches characteristics of the network. The GA population is divided into colonies that are distributed among processors. Trade policies are executed for the exchange of genes.
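The colony-and-trade structure described in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract gives no implementation details, so everything below is an assumption, including the toy OneMax objective, the ring-shaped trade policy, and the names `fitness`, `evolve_colony`, `trade`, and `run`. The colonies are also stepped in a single process and in lockstep, whereas the paper describes a decentralized, asynchronous design spread over networked processors.

```python
import random


def fitness(genome):
    # Hypothetical toy objective (OneMax): the number of 1-bits.
    return sum(genome)


def evolve_colony(colony, rng, mutation_rate=0.02):
    """Advance one colony by one generation.

    Uses binary tournament selection, one-point crossover, bit-flip
    mutation, and elitism (the current best genome is always kept).
    """
    def pick():
        a, b = rng.sample(colony, 2)
        return a if fitness(a) >= fitness(b) else b

    children = [max(colony, key=fitness)[:]]  # elitism
    while len(children) < len(colony):
        p1, p2 = pick(), pick()
        cut = rng.randrange(1, len(p1))
        child = p1[:cut] + p2[cut:]
        child = [bit ^ 1 if rng.random() < mutation_rate else bit
                 for bit in child]
        children.append(child)
    return children


def trade(colonies):
    """Assumed ring trade policy: each colony exports a copy of its best
    genome to the next colony, where it replaces the worst genome."""
    exports = [max(c, key=fitness)[:] for c in colonies]
    for i, genome in enumerate(exports):
        target = colonies[(i + 1) % len(colonies)]
        worst = min(range(len(target)), key=lambda j: fitness(target[j]))
        target[worst] = genome


def run(num_colonies=4, colony_size=10, genome_len=32,
        generations=40, trade_every=5, seed=0):
    """Return (initial best fitness, final best fitness)."""
    rng = random.Random(seed)
    colonies = [[[rng.randint(0, 1) for _ in range(genome_len)]
                 for _ in range(colony_size)]
                for _ in range(num_colonies)]
    initial_best = max(fitness(g) for c in colonies for g in c)
    for gen in range(1, generations + 1):
        colonies = [evolve_colony(c, rng) for c in colonies]
        if gen % trade_every == 0:
            trade(colonies)
    final_best = max(fitness(g) for c in colonies for g in c)
    return initial_best, final_best
```

Calling `run()` returns the best fitness before and after evolution. Because each colony keeps its elite and a trade only overwrites a colony's worst member, the best fitness never decreases. In a networked version, each colony would live on its own processor and `trade` would become a message exchange.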
An algorithm for parallel computing with the combined boundary-element and finite-element method is presented in this paper. By decomposing the entire domain into boundary-element and finite-element subdomains, each analysis is performed independently and in parallel. The renewal (iterative update) scheme for the parallel computation is the Schwarz method, which was adopted for the domain decomposition parallel scheme in the boundary-element analysis. A cluster parallel computing system of workstations connected by a LAN is constructed and employed for efficient analysis. Convergence and accuracy of solutions on internal virtual boundaries are shown through some numerical examples.
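To make the idea of convergence on internal virtual boundaries concrete, here is a deliberately tiny sketch of the alternating Schwarz iteration. It is an assumed 1D model problem, not the paper's coupled boundary-element/finite-element scheme: it solves u'' = 0 on [0, 1] with u(0) = 0 and u(1) = 1, using two overlapping subdomains [0, b] and [a, 1]. Each subdomain solution is a straight line, so the subdomain solves can be written in closed form. The function name `schwarz_1d` and the parameters `a`, `b`, and `iterations` are hypothetical.

```python
def schwarz_1d(a=0.4, b=0.6, iterations=30):
    """Alternating Schwarz for u'' = 0 on [0, 1], u(0) = 0, u(1) = 1.

    Subdomains [0, b] and [a, 1] overlap on [a, b] (requires 0 < a < b < 1).
    The exact solution is u(x) = x, so the values on the virtual
    boundaries should converge to u(a) = a and u(b) = b.
    """
    g_a = 0.0  # current value of u on the virtual boundary x = a
    g_b = 0.0  # initial guess for u on the virtual boundary x = b
    errors = []
    for _ in range(iterations):
        # Solve on [0, b] with u(0) = 0, u(b) = g_b; the solution is the
        # line u(x) = g_b * x / b. Its value at x = a feeds the other side.
        g_a = g_b * a / b
        # Solve on [a, 1] with u(a) = g_a, u(1) = 1 (again a line), then
        # read off its value at x = b as the updated boundary value.
        g_b = g_a + (1.0 - g_a) * (b - a) / (1.0 - a)
        errors.append(abs(g_b - b))
    return g_a, g_b, errors
```

In this model problem the error at x = b shrinks by the constant factor a(1 - b) / (b(1 - a)) on every sweep, so a wider overlap gives faster convergence. The sketch updates the subdomains one after the other; a parallel variant would solve both subdomains simultaneously from the previous iteration's boundary values.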
ISBN: 9789604740994 (print)
The clock synchronization problem in distributed systems addresses two issues: generating events at exact moments in time, and distributing computation for parallel and distributed algorithms. Today the largest distributed system, the Internet, synchronizes its clocks using the Network Time Protocol (NTP). This paper presents the implementation of an NTP simulator coded in Java. The simulator gives the user the following advantages: direct access to the protocol logic; a view of the guiding steps of the algorithm, so that the user understands how it works in the background on the Internet or elsewhere; and total control of the simulation and of the required computing steps. The simulator requirements include, among others: extensibility (the simulator can easily be extended to simulate other distributed algorithms); different configurations; an undo facility implemented using the memento design pattern; saving and loading of configurations; step-by-step running of simulations (the user can run an event-guided or time-based simulation and then analyze the log files); and platform independence (because it is based on Java, it can run on different operating systems).
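The core of the protocol logic that such a simulator steps through is the standard NTP offset and round-trip delay calculation from four timestamps. The sketch below shows that calculation only; it is not taken from the paper's Java simulator, and the names `NtpExchange`, `clock_offset`, and `round_trip_delay` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NtpExchange:
    """Timestamps of one request/response exchange, in seconds."""
    t0: float  # request sent     (client clock)
    t1: float  # request received (server clock)
    t2: float  # response sent    (server clock)
    t3: float  # response received (client clock)


def clock_offset(x: NtpExchange) -> float:
    """Estimated offset of the server clock relative to the client clock.

    The estimate is exact when the network delay is the same in both
    directions; asymmetric delay shows up as an offset error.
    """
    return ((x.t1 - x.t0) + (x.t2 - x.t3)) / 2.0


def round_trip_delay(x: NtpExchange) -> float:
    """Network round-trip time, excluding the server's processing time."""
    return (x.t3 - x.t0) - (x.t2 - x.t1)


# Illustrative numbers: the client clock is 5 s behind the server, each
# network leg takes 0.1 s, and the server spends 0.02 s on the request.
example = NtpExchange(t0=100.0, t1=105.1, t2=105.12, t3=100.22)
```

For `example`, `clock_offset` gives approximately 5.0 s and `round_trip_delay` approximately 0.2 s, up to floating-point rounding. A client would typically add the estimated offset to its own clock, often preferring the estimate from the exchange with the smallest delay.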
ISBN: 9783540723592 (print)
XtremWeb-CH (XWCH) is a software system that makes it easy for scientists and industrial users to deploy and execute their parallel and distributed applications on a public-resource computing infrastructure. The objective of XWCH is to develop a true high-performance peer-to-peer platform with a distributed scheduling and communication system. The main idea is to build a completely symmetric model in which nodes can be providers and consumers at the same time. This paper describes the different components of an XWCH infrastructure and the new features this platform offers compared with similar global computing projects. It also describes the porting, deployment, and execution of a CPU-intensive phylogenetic application on an experimental XWCH platform.
Long-lived parallel applications running on workstation clusters are vulnerable to single-node or multiple-node failures. Fault recovery is therefore required to prevent premature program termination. However, much of the runtime overhead imposed by fault tolerance schemes is generally due to the cost of transferring the checkpoint states of applications through disk I/O operations. In this paper, we propose a fault-tolerant model in which checkpoint states are transferred between replicated parallel applications. We also describe how the resource consumption of the replicated applications can be minimized. The fault-tolerant model has been implemented and tested on a workstation cluster and a Fujitsu AP3000 multiprocessor machine. Our experimental measurements show that efficient fault tolerance can be achieved by replicating parallel applications on clusters of computers.