ISBN: 0769520693 (print)
The use of application-specific co-processors created using reconfigurable hardware (FPGAs) has been shown to realize significant speed increases for many computationally intensive applications. The addition of reconfigurable hardware to clusters composed of commodity machines in order to improve the execution times of parallel applications would, therefore, appear to be a logical step. However, the extra complications introduced by this technique may make the real-world application of such technology appear to be prohibitively difficult. In this paper the design and implementation of a metacomputer designed to simplify the development of applications for clusters containing reconfigurable hardware are presented. The operation of the metacomputer is also discussed in some detail, including the process of implementing applications for execution on the metacomputer.
In this paper, we study a high-performance Heterogeneous Distributed System (HDS) that is employed as a computing platform or grid. Specifically, we study the problem of scheduling a large number of CPU-intensive tasks on such systems. The time a task spends in the system is the main quantity to be minimized. The proposed techniques for scheduling dynamic tasks consist of two heuristic algorithms: Recursive Neighbor Search (RNS) and Augmented Tabu-Search (ATS). Our technique does not directly address the load-balancing problem, since perfect load balancing is completely unrealistic in such large environments, but we show that even an imperfectly load-balanced system can behave reasonably well when the tasks' time demands are taken into account. These algorithms are compared with a well-known scheduling algorithm in order to evaluate and clarify their performance.
Molecular dynamics (MD) simulations are performed on systems of various sizes up to 32 million atoms. Performance is compared between an MD-GRAPE 2 board, specialized hardware optimized for MD, and a general-purpose cluster computer utilizing varied numbers of processors for both short- and long-range potentials. While it is possible to outperform a single MD-GRAPE 2 board with a sufficient number of processors, tests versus a local cluster and a comparison of original purchase prices show that the MD-GRAPE 2 board is often cost-effective. Potential-specific considerations are discussed.
Recovering from node failures is a critical issue in distributed database systems. In conventional log-based recovery protocols, the nodes providing recovery service may be overburdened, especially when the recovery is resource-consuming. In this paper, an agent-based dynamic recovery protocol is presented. It divides the whole recovery process into three major steps: log-recovery, agent-recovery, and synchronization. The key idea of this protocol is to cache new database operations initiated during recovery in agents. All these cached operations are then replayed independently for further recovery. The analysis indicates that the new protocol can minimize inter-node dependency and improve recovery speed. As a result, the system failure rate is reduced and overall performance improves.
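The three steps named in this abstract can be illustrated with a minimal sketch. It assumes a toy single-node key-value store; the class `RecoveringNode`, its method names, and the list used as the "agent" cache are all hypothetical and are not taken from the paper.

```python
class RecoveringNode:
    """Toy key-value node recovering in three steps (hypothetical sketch)."""

    def __init__(self):
        self.data = {}
        self.recovering = False
        self.agent_cache = []  # stands in for the agent: operations that arrive during recovery

    def _apply(self, op):
        key, value = op
        self.data[key] = value

    def submit(self, op):
        # New operations are cached while recovery runs, applied directly otherwise.
        if self.recovering:
            self.agent_cache.append(op)
        else:
            self._apply(op)

    def begin_recovery(self):
        self.data = {}
        self.recovering = True

    def log_recovery(self, log):
        # Step 1: rebuild state from the operation log.
        for op in log:
            self._apply(op)

    def agent_recovery(self):
        # Step 2: replay the cached operations in arrival order.
        while self.agent_cache:
            self._apply(self.agent_cache.pop(0))

    def synchronize(self):
        # Step 3: resume normal processing.
        self.recovering = False
```

In this sketch, an operation submitted between `begin_recovery()` and `agent_recovery()` takes effect only after the logged state has been restored, so replaying the log does not overwrite it.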
In this paper, a parallel dynamic load balancing algorithm that offers a good trade-off between the degree of balancing and communication costs is proposed. The proposed algorithm is analyzed, investigated, and implemented. A new mathematical solution for the algorithm is also presented. The analytical solution is tested on standard task graphs (STG) as common test-bed benchmarks. The parameters concerning processors, memory, interconnection networks, and task programs are also presented. The proposed algorithm is compared with a traditional dynamic load balancing algorithm. Experimental results are presented against a set of criteria for evaluating overall performance. Simulation and analytical results show that the proposed algorithm outperforms the traditional one.
As software Distributed Shared Memory (DSM) systems become attractive on larger clusters, the focus of attention moves toward improving the reliability of these systems. In this paper, we propose a lightweight logging scheme, called remote logging, and a recovery protocol for home-based DSM. Remote logging stores coherence-related data in the volatile memory of a remote node. The logging overhead can be moderated with a high-speed system area network and user-level DMA operations supported by modern communication protocols. Remote logging tolerates multiple failures as long as the backup nodes of the failed nodes are alive, which substantially improves the reliability of the DSM. Experimental results show that our fault-tolerant DSM has low overhead compared to conventional stable logging and that it can be effectively recovered from some concurrent failures.
Distributed Database Management Systems offer advantages and new possibilities over centralized database systems. However, some aspects and issues need further consideration. Concurrency control is one aspect that has to be improved in order to increase system performance. Strict Two Phase Locking (s2pl) is the concurrency control mechanism used in most commercial database systems. The performance of s2pl can be greatly improved using speculative techniques, namely speculative Two Phase Locking (sp2pl) algorithms. In this paper we present extensions of the sp2pl algorithm based on Wait Depth Limited techniques (wdl-sp2pl). A comprehensive simulation model is developed, from which the performance of sp2pl and the wdl-sp2pl variants is evaluated. Our analysis shows that the new algorithms perform better than sp2pl while maintaining its robustness.
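For readers unfamiliar with the baseline, the following minimal sketch shows the defining rule of strict two-phase locking: a transaction keeps every lock it acquires until it commits. It covers exclusive locks only and does not model the speculative (sp2pl) or Wait Depth Limited (wdl-sp2pl) variants; the class and method names are hypothetical.

```python
class StrictTwoPhaseLockManager:
    """Exclusive-lock-only sketch of strict 2PL (hypothetical names)."""

    def __init__(self):
        self.owner = {}  # item -> transaction currently holding its lock
        self.held = {}   # transaction -> set of items it has locked

    def acquire(self, txn, item):
        # Grant the lock if the item is free or already held by this transaction.
        holder = self.owner.get(item)
        if holder is not None and holder != txn:
            return False  # conflict: the caller has to wait
        self.owner[item] = txn
        self.held.setdefault(txn, set()).add(item)
        return True

    def commit(self, txn):
        # Strictness: locks are released only here, all at once.
        for item in self.held.pop(txn, set()):
            del self.owner[item]
```

A second transaction requesting an item held by the first is refused until the first calls `commit`, which is the blocking behaviour that speculative schemes aim to reduce.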
In this paper we take a look at what the 2nd generation Intel Core processor brings to high performance computing. We compare three generations of Intel CPU based systems and present a performance assessment of these ...
Message-oriented middleware (MOM) has become a key component of enterprise application servers, which are usually deployed to process business logic. It has to cope with hundreds or even thousands of concurrent connections at the same time. To this end, efficient and scalable connection management is required for large numbers of client requests, and various network transport protocols must be taken into consideration when designing the network module of a MOM. To fully support the PTP (Point-to-Point) and publish/subscribe messaging domains, as well as the group communication service featured by MoIM-Message, which is our MOM, this paper proposes three types of primitive messaging operations, "synchronous send", "synchronous receive", and "asynchronous receive", and describes the architecture of a transport protocol adapter that can be extended to employ various transport protocols. These functions are implemented in the Message Transport Layer of our MOM, which is designed to have three hierarchical layers. Moreover, the "polling with multiple service thread model" applied in the socket adapter is described; it copes efficiently with large numbers of concurrent client connections. To support mobile client platforms, we ported the Client Runtime modules to an IPaq PDA using PersonalJava.
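The three primitive operations named above can be sketched in-process as follows. This is an illustration only, assuming a single in-memory destination backed by Python's standard `queue.Queue`; the class `Destination` and its method names are hypothetical and do not reflect the MoIM-Message API or its transport layer.

```python
import queue
import threading


class Destination:
    """In-memory stand-in for a message destination (hypothetical sketch)."""

    def __init__(self):
        self._queue = queue.Queue()

    def send_sync(self, message):
        # "Synchronous send": returns only after the message has been stored.
        self._queue.put(message)

    def receive_sync(self, timeout=None):
        # "Synchronous receive": blocks the caller until a message arrives.
        return self._queue.get(timeout=timeout)

    def receive_async(self, listener):
        # "Asynchronous receive": a worker thread waits for one message
        # and hands it to the listener, so the caller does not block.
        def deliver():
            listener(self._queue.get())

        worker = threading.Thread(target=deliver, daemon=True)
        worker.start()
        return worker
```

The sketch delivers a single message per `receive_async` call and uses one thread per listener; a server handling thousands of connections would instead share a small pool of service threads, as the polling model in the abstract suggests.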
Automatic Term Recognition is used to extract domain-specific terms that belong to a given domain. In order to be accurate, these corpus and language-dependent methods require large volumes of textual data that need t...