PopSPY is an instrumentation tool for the PowerPC microprocessor family that modifies code at the executable level, providing the ability to instrument a wide range of applications including commercial programs. PopSP...
详细信息
ISBN:
(纸本)3540664432
PopSPY is an instrumentation tool for the PowerPC microprocessor family that modifies code at the executable level, providing the ability to instrument a wide range of applications including commercial programs. PopSPY instruments sequential, multithreaded and multiprocess applications including their associated dynamic libraries. PopSPY uses a dynamic trace library that either generates execution traces of applications or communicates with an execution driven simulator for accurate multiprocessor simulation.
We present and analyze a general algorithm which computes efficient static schedulings of block computations for parallel sparse linear factorization. Our solver, based on a supernodal fan-in approach, is fully driven...
详细信息
ISBN:
(纸本)3540664432
We present and analyze a general algorithm which computes efficient static schedulings of block computations for parallel sparse linear factorization. Our solver, based on a supernodal fan-in approach, is fully driven by this scheduling. We give an overview of the algorithms and present performance results on a 16-node IBM-SP2 with 66 MHz Power2 thin nodes for a collection of grid and irregular problems.
this paper introduces MorphoSys, a parallel system-on-chip which combines a RISC processor with an array of coarse-grain reconfigurable cells. MorphoSys integrates the flexibility of general-purpose systems and high p...
详细信息
We describe a new dynamic software scheduling technique for VLIW architectures, which compiles into VLIW code the program paths that are actually executed. Unlike trace processors, or DIF, the technique executes opera...
详细信息
We investigate here the programming effort and performance of a programming model which is a hybrid between shared memory and message passing. this model permits an easy implementation in shared memory, while still be...
详细信息
ISBN:
(纸本)3540664432
We investigate here the programming effort and performance of a programming model which is a hybrid between shared memory and message passing. this model permits an easy implementation in shared memory, while still being able to benefit from performance advantages of message-passing for performance critical tasks. We have integrated message-passing with a software DSM system, and evaluated the programming effort and performance withthree different applications and various degree of message-passing in the applications. In two of the applications we found that only a small fraction of the source code lines responsible for interprocess communication were performance critical and it was therefore easy to convert only those to message-passing primitives and still approach the performance of pure message-passing.
We continue the comparison of parallelalgorithms for solving diagonally dominant and general narrow-banded linear systems of equations that we started in [2]. the solvers compared are the banded system solvers of Sca...
详细信息
ISBN:
(纸本)3540664432
We continue the comparison of parallelalgorithms for solving diagonally dominant and general narrow-banded linear systems of equations that we started in [2]. the solvers compared are the banded system solvers of ScaLAPACK [6] and those investigated by Arbenz and Hegland [1, 5]. We present the numerical experiments that we conducted on the IBM SP/2.
this paper presents a parallel algorithm for solving grounded range search in associative-function mode using the BSP-like Coarse Grained Multicomputer (CGM). Given a set S of n weighted points in the plane, the algor...
详细信息
Many association rule algorithms operate in a parallel environment where the database is divided up among a number of processors, a procedure which is usually carried out indiscriminately. the nature of the database p...
详细信息
暂无评论