The authors present an object-oriented framework for constructing parallel implementations of stencil algorithms. This framework simplifies the development process by encapsulating the common aspects of stencil algori...
The primary purpose of Cray Research computer systems is the timely solution of complex problems in science and engineering. A few examples illustrate that the CRAY C90 is currently the world's most powerful tool ...
By restricting weight functions to satisfy the quadrangle inequality or the inverse quadrangle inequality, significant progress has been made in developing efficient sequential algorithms for the least-weight subsequence problem [10], [9], [12], [16]. However, little is known about improving the naive parallel algorithm for the problem, which is fast but demands too many processors (i.e., it takes $O(\log^2 n)$ time on a CREW PRAM with $n^3/\log n$ processors). In this paper we show that if the weight function satisfies the inverse quadrangle inequality, the problem can be solved on a CREW PRAM in $O(\log^2 n \log\log n)$ time with $n/\log\log n$ processors, or in $O(\log^2 n)$ time with $n \log n$ processors. Notice that the processor-time complexity of our algorithm is much closer to the almost linear-time complexity of the best-known sequential algorithm [12].
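For orientation, the least-weight subsequence problem can be sketched with the textbook quadratic-time dynamic program. This is not the parallel algorithm of the abstract above, nor any of the faster sequential algorithms it cites. It assumes the usual formulation, which the abstract does not spell out: given n and a weight function w(i, j) for 0 <= i < j <= n, find indices 0 = i_0 < i_1 < ... < i_k = n that minimize the sum of w(i_{t-1}, i_t). The names `least_weight_subsequence`, `cost`, and `pred` are illustrative only.

```python
from typing import Callable, List, Tuple


def least_weight_subsequence(
    n: int, w: Callable[[int, int], float]
) -> Tuple[float, List[int]]:
    """Naive O(n^2) dynamic program (baseline sketch, assumed formulation).

    cost[j] is the least total weight of any index sequence from 0 to j,
    using the recurrence cost[j] = min over i < j of cost[i] + w(i, j).
    """
    cost = [0.0] + [float("inf")] * n
    pred = [-1] * (n + 1)  # pred[j]: index chosen immediately before j

    for j in range(1, n + 1):
        for i in range(j):
            candidate = cost[i] + w(i, j)
            if candidate < cost[j]:
                cost[j] = candidate
                pred[j] = i

    # Walk the predecessor links back from n to recover the sequence.
    path = [n]
    while path[-1] != 0:
        path.append(pred[path[-1]])
    path.reverse()
    return cost[n], path
```

The sketch places no restriction on w. The algorithms discussed in the abstract rely on the quadrangle or inverse quadrangle inequality to improve on this baseline, which evaluates every pair (i, j).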
The RELACS language is a systolic programming language, which simplifies the programmer's task by making explicit the data-flow of systolic algorithms, and by exposing the data delivery mechanism. The underlying a...
The authors present a block data decomposition algorithm for two-dimensional grid problems. Their method includes local balancing to accommodate heterogeneous processors, and they characterize the conditions that must...
The overall efficiency of parallel algorithms is most decisively affected by the strategy applied for the mapping of workload. Strategies for balancing dynamically generated workload on a processor network which are a...
The approach taken in the Triton project is to let a high-level machine-independent parallel programming language drive the design of parallel hardware. This approach permits machine-independent parallel programs to b...
ISBN: (print) 0819412767
Many applications of the eigenvalue decomposition of dense matrices are well known. This work was prompted by research in the numerical simulation of disordered electronic systems, in which one of the most common approaches is to diagonalize random Hamiltonian matrices in order to study the eigenvalues and eigenfunctions of a single electron in the presence of a random potential. In this paper, we describe an effort to implement a matrix diagonalization routine for real symmetric dense matrices on massively parallel SIMD computers, the Maspar MP-1 and MP-2 systems. Results of numerical tests and timings are also presented.
In the field of parallel processing, there is a great diversity of languages and architectures which become obsolete at a rapid pace. In this environment, portability is an important issue. Unfortunately, most paralle...
The systolic screen is a very natural parallel architecture for image processing. A $\sqrt{n} \times \sqrt{n}$ systolic screen consists of a $\sqrt{n} \times \sqrt{n}$ mesh-of-processors with each processor repres...