Cenju-4 is a parallel computer designed and manufactured by NEC Corp. Cenju-4 supports two memory architectures: distributed memory with user-level message passing communication and distributed shared memory with cach...
详细信息
A highly parallel strategy is described for the well known banker's problem. Efficiency is reached by distributing the tasks to be performed between a set of parallel activities and by avoiding whenever possible t...
详细信息
作者:
Krill, B.Amira, A.
Faculty of Computing and Engineering University of Ulster Newtownabbey Co. Antrim BT37 0QB Belfast United Kingdom
This paper presents the design and implementation of a generic cyclic convolution architecture for imaging applications on field programmable gate array (FPGA). Two main architectures are implemented. A parallel archi...
详细信息
Two recent curriculum studies, the ACM/IEEE Curricula 2013 Report and the NSF/IEEE-TCPP Curriculum Initiative on parallel and distributedcomputing, argue that every undergraduate computer science program should inclu...
详细信息
ISBN:
(纸本)9781479941155
Two recent curriculum studies, the ACM/IEEE Curricula 2013 Report and the NSF/IEEE-TCPP Curriculum Initiative on parallel and distributedcomputing, argue that every undergraduate computer science program should include topics in parallel and distributedcomputing (PDC). Although not within the scope of these reports, there is also a need for students in computing related general education courses to be aware of the role that parallel and distributedcomputing technologies play in the computing landscape. One approach to integrating these topics into existing curricula is to spread them across several courses. However, this approach requires development of multiple instructional modules targeted to introduce PDC concepts at specific points in the curriculum. Such modules need to mesh with the goals of the courses for which they are designed in such a way that minimal material has to be removed from existing topics. At the same time the modules should provide students with an understanding of and experience employing fundamental PDC concepts. In this paper we report on our experience developing and deploying such modules.
The block Fourier decomposition method recently proposed by the first author is a special method for decoupling any block tridiagonal matrix of the form K = block-tridiag [B, A,B], where A and B are square submatrices...
详细信息
Throughput-oriented computing via co-running multiple applications in the same machine has been widely adopted to achieve high hardware utilization and energy saving on modern supercomputers and data centers. However,...
详细信息
ISBN:
(纸本)9798350364613;9798350364606
Throughput-oriented computing via co-running multiple applications in the same machine has been widely adopted to achieve high hardware utilization and energy saving on modern supercomputers and data centers. However, efficiently co-running applications raises new design challenges, mainly because applications with diverse requirements can stress out shared hardware resources (IO, Network and Cache) at various levels. The disparities in resource usage can result in interference, which in turn can lead to unpredictable co-running behaviors. To better understand application interference, prior work provided detailed execution characterization. However, these characterization approaches either emphasize on traditional benchmarks or fall into a single application domain. To address this issue, we study 25 up-to-date applications and benchmarks from various application domains and form 625 consolidation pairs to thoroughly analyze the execution interference caused by application co-running. Moreover, we leverage mini-benchmarks and real applications to pinpoint the provenance of co-running interference in both hardware and software aspects.
The paper presents the mathematical models, the architecture, the microprogram design and the symbolic programming system of a distributed digital integrating machine, based on fast 16-bit bipolar microprogrammed micr...
详细信息
The paper presents the mathematical models, the architecture, the microprogram design and the symbolic programming system of a distributed digital integrating machine, based on fast 16-bit bipolar microprogrammed microprocessors, which emulate homogeneous parallel integrating structures. After a short introduction in the mathematical theory the working power of the basic computing elements (DDA, servo-adder, comparator), the internal structure of the basic integrating modules, their microprogram design and the internal and external programmable commutation mechanism are explained. An expert system for programming of the machine, implemented in Prolog on the IBM-PC, which determines by symbolic equation transformation the connections in the structure and the initial integrating values, is discussed.
The performance of airplane in commercial airline environment is determined by, and therefore an indicator of performance measure of, the thermodynamic properties of airplane. The aim of this study was to establish th...
详细信息
In recent years, the rapid development and widespread applications of non-overlapping community discovery have led to a growing issue of privacy leakage. The core of this problem lies in the community discovery algori...
详细信息
暂无评论