Most of the actual approaches to address the problem of serial code executing on parallel multi-core processors try to exploit the parallelism of the code during compile time. Such a solution is very efficient but pla...
详细信息
ISBN:
(纸本)9780769534435
Most of the actual approaches to address the problem of serial code executing on parallel multi-core processors try to exploit the parallelism of the code during compile time. Such a solution is very efficient but places the burden on the programmer to know the code will be running on a parallel architecture. In addition, the programmer needs to know the details of the architecture in order to generate efficient parallel code. In this work we present an alternative approach, the layer disruption model, that improves the performance of serial code executing on parallel architectures. We present a small test case that shows the benefits of the model.
Visualization is one of the most important applications of computer graphics. To have a parallel infrastructure for visualization, some technologies would be needed. We identify the state-of-the-art technologiesthat ...
详细信息
ISBN:
(纸本)9780769533599
Visualization is one of the most important applications of computer graphics. To have a parallel infrastructure for visualization, some technologies would be needed. We identify the state-of-the-art technologiesthat have prepared for building such an infrastructure and examine a collection of applicationsthat would benefit from it. We consider a broad range of scientific and technological advances in visualization, which are relevant to visual supercomputing. Mainly, we present the original abstracts from the cited papers.
Advancement in parallel computers technology has greatly influenced the numerical methods used for solving partial differential equations (pdes). A lot of attention has been devoted to the development of numerical sch...
详细信息
ISBN:
(纸本)9780769534435
Advancement in parallel computers technology has greatly influenced the numerical methods used for solving partial differential equations (pdes). A lot of attention has been devoted to the development of numerical schemes which are suitable for the parallel environment. In this work, we investigate the parallel implementation of the four-point Modified Explicit Decoupled Group (MEDG) method which was introduced by Ali and Ng (2007) as a fast solver for the two dimensional Poisson pde. the method was shown to be more superior than all the methods belonging to the four-points explicit group family namely the Explicit Group (EG) [8], Explicit Decoupled Group (EDG) [1] and Modified Explicit Group (MEG) [7]. this paper presents the preliminary results of the parallel algorithms implemented on a distributed memory PC cluster. Two parallelizing strategies comprising of the two-color zebra and the four-color chessboard orderings in solving a two dimensional Poisson model problem will be discussed.
Pattern recognition applications such as natural phenomena detection and structural health monitoring have been widely applied using wireless sensor networks. these applications involve large amount of data to be anal...
详细信息
ISBN:
(纸本)9780769534435
Pattern recognition applications such as natural phenomena detection and structural health monitoring have been widely applied using wireless sensor networks. these applications involve large amount of data to be analysed, and thus incur high computational time and complexity In this paper we present a parallel associative memory-based pattern recognition algorithm known as distributed Hierarchical Graph Neuron (DHGN). It is a single-cycle learning algorithm with in-network processing capability;able to reduce computational loads by efficiently disseminates recognition processes throughout the network. Hence, suitable to be deployed in wireless sensor networks. the results of the accuracy and scalability tests show that our system performs with high accuracy and remains scalable for increases in pattern size and the number of stored patterns. the response time for pattern recognition remains within milliseconds irrespective of the size of the network.
Withthe continued progress in VLSI technologies, we can integrate numerous cores in a single billion-transistor chip to build a multi-core system-on-a-chip (SoC). this also brings great challenges to traditional para...
详细信息
ISBN:
(纸本)9781424416936
Withthe continued progress in VLSI technologies, we can integrate numerous cores in a single billion-transistor chip to build a multi-core system-on-a-chip (SoC). this also brings great challenges to traditional parallel programming as to how we can increase the performance of applications with increased number of cores. In this paper;we meet the challenges using a novel approach. Specifically, we propose a reconfigurable heterogeneous multi-core system. Under our proposed system, in addition to conventional processor cores, we introduce dynamically reconfigurable accelerator cores to boost the performance of applications. We have built a prototype of the system using FPGAs. Experimental evaluation demonstrates significant system efficiency of the proposed heterogeneous multi-core system in terms of computation and power consumption.
A parallel algorithm, namely parallel block diagonal dominant (PBDD) algorithm, is proposed to solve block tridiagonal linear systems on multi-computers. this algorithm is based on divided-and-conquer idea of the PDD ...
详细信息
ISBN:
(纸本)9780769534435
A parallel algorithm, namely parallel block diagonal dominant (PBDD) algorithm, is proposed to solve block tridiagonal linear systems on multi-computers. this algorithm is based on divided-and-conquer idea of the PDD method. When the systems is strictly block diagonal dominant, the PBDD is highly parallel and provides approximate solutions that equals to the exact solutions within machine accuracy. the PBDD method has been implemented on a 64-node multi-computer. the analytic results match closely withthe results measured from the numerical experiments.
How to share the data in databases of autonomous organizations quickly and flexibly is an important problem. Traditional data integration systems are supposed to be a solution to the problem. Unfortunately, they requi...
详细信息
ISBN:
(纸本)9780769534435
How to share the data in databases of autonomous organizations quickly and flexibly is an important problem. Traditional data integration systems are supposed to be a solution to the problem. Unfortunately, they require a global schema before they can be used to share data and they are centralized system. they can not support data sharing without a common schema and on-demand data sharing in applications. In addition, security is also the important requirements. this paper presents a community-based system for distributed data integration and sharing, Honeycomb. Honeycomb supports many virtual data integration and sharing systems on a single machine, and the system utilizes Web Services to access data on other nodes to avoid changing security configuration of autonomous organizations. In addition, the system adopts new cache techniques to improve performance greatly.
Metacube is an attractive, hypercube-like interconnection network that can connect an extremely large number of nodes with a small node degree while keeping a relatively short diamemter A Metacube MC(k, m) connects 2(...
详细信息
ISBN:
(纸本)9780769534435
Metacube is an attractive, hypercube-like interconnection network that can connect an extremely large number of nodes with a small node degree while keeping a relatively short diamemter A Metacube MC(k, m) connects 2(2km+k) nodes with only k + m links per node. Metacube can be used to build parallelcomputing systems of very large scale with a small number of links per node. In this paper we propose a new presentation of Metacube for algorithmic design on Metacube. Based on the new presentation, we give an efficient algorithm for parallel prefix computation on Metacubes that runs in 2(k)m(k + 1) + k communication steps and 2(k+1) m + 2k computation steps.
Improvements in computer and networking technologies over the past decades produced new type of collaborative computing environment called Gild Networks. Grid is a parallel and distributedcomputing network system tha...
详细信息
ISBN:
(纸本)9781424419678
Improvements in computer and networking technologies over the past decades produced new type of collaborative computing environment called Gild Networks. Grid is a parallel and distributedcomputing network system that provides the ability to perform higher throughput computing by taking advantage of many computing resources available in the network. therefore, to achieve a scalable and reliable Grid network system, we need to efficiently distribute the load among the resources accessible on the network. In this paper, we present a distributed and scalable load-balancing framework for Gild Networks. the generated network system is self-organized and depends only on local information for load distribution and resource discovery. Simulation results show that the generated network system provides an effective, scalable, and reliable load-balancing scheme for the distributed resources accessible on Grid networks.
Numerical methods based on interval arithmetic ore efficient means to reliably solve nonlinear systems of equations. Algorithm bc3revise is an interval method that tightens variables' domains by enforcing a proper...
详细信息
ISBN:
(纸本)9780769534435
Numerical methods based on interval arithmetic ore efficient means to reliably solve nonlinear systems of equations. Algorithm bc3revise is an interval method that tightens variables' domains by enforcing a property called box consistency. It has been successfully used on difficult problems whose solving eluded traditional numerical methods. We present a new algorithm to enforce box consistency that is simpler than bc3revise, faster and easily data parallelizable. A parallel implementation with Intel SSE2 SIMD instructions shows that an increase in performance of up to an order of magnitude and more is achievable.
暂无评论