Parallel computations of unsteady, free-surface flow applications are performed using a stabilized finite element method. The finite element formulations are written for fixed meshes and are based on the Navier-Stokes equa...
We introduce the Stupid Barrier Tricks (SBT) library for on-line debugging and performance monitoring of shared-memory parallel programs. Single-program-multiple-data (SPMD) programs often use barriers to synchronize ...
We present an initial performance evaluation of the Quadrics interconnection network (QsNET). We describe the main hardware and software features of QsNET of relevance to the system designer and to the end user. Actua...
The gap between the speed of logic and DRAM memory access is widening. Traditional processors hide some of the mismatch in memory latency using techniques such as multi-level caches, instruction prefetching and me...
The matrix transpose (MT) operation is used frequently in many multimedia and high-performance applications. Therefore, a faster MT operation shortens the execution time of these applications. In this paper ...
The input-buffered switch architecture has become attractive for implementing high-performance switches for workstation clusters, whose expanding use brings an increasing need for quality of service. It is challenging to pro...
Distributed data mining algorithms executing on a shared network of workstations often suffer from unpredictable performance problems due to limited network resources that are being shared. We show that data mining al...
A large class of scientific applications consists of irregular reductions on large data sets. On shared-memory multiprocessors, these reductions are typically parallelized by computing partial results into replica...
Previous studies in speculative prefetching focus on building and evaluating access models for the purpose of access prediction. This paper, on the other hand, investigates the performance of speculative prefetching. Wh...
We consider the problem of designing rollback error recovery algorithms for dynamic, wide-area distributed systems like the Internet. The characteristics and the scale of such a system complicate the design and perfor...