Recent developments in the international arena has meant the technology is now mature enough to bring together those required for the implementation of a grid computing facility. this paper examines the requirements a...
详细信息
ISBN:
(纸本)0769517722
Recent developments in the international arena has meant the technology is now mature enough to bring together those required for the implementation of a grid computing facility. this paper examines the requirements and applications for an eScience infrastructure with particular reference to developments in Europe.
Beowulf clusters, on face value, offer the potential of a viable cost effective alternative for the provision of highperformancecomputing. In this paper we compare the performance of Beowulf clusters built from comm...
详细信息
ISBN:
(纸本)0769516262
Beowulf clusters, on face value, offer the potential of a viable cost effective alternative for the provision of highperformancecomputing. In this paper we compare the performance of Beowulf clusters built from commodity "off the shelf" components in the support of major research and production codes, with current high-end hardware such as the IBM SP, Compaq AlphaServer SC and SGI Origin 3800. the results concentrate on the application area of computational chemistry. Benchmark data on six commodity-based systems (CS1-CS6) featuring Intel, AMD Athlon and Alpha CPU architectures coupled to traditional Beowulf interconnect, such as Myrinet and Ethernet, are presented. Furthermore, we provide performance data on systems utilising the Quadrics QSNet interconnect technology, and initial results from a prototype of the Cray Supercluster.
Tightly coupled parallel applications are increasingly run in Grid environments. Unfortunately, on many Grid sites the ability of machines to create or accept network connections is severely limited by ?rewalls, netwo...
详细信息
ISBN:
(纸本)1595936734
Tightly coupled parallel applications are increasingly run in Grid environments. Unfortunately, on many Grid sites the ability of machines to create or accept network connections is severely limited by ?rewalls, network address translation (NAT)or non-routed networks. Multi homing further complicates connection setup and machine identi?cation. Although ad-hoc solutions exist for some of these problems, it is usually up to the application's user to discover the cause of the connectivity problems and ?nd a solution. In this paper we describe SmartSockets1 a communication library that lifts this burden by automatically discovering the connectivity problems and solving them with as little support from the user as possible. Copyright 2007 ACM.
this paper presents implementation of a very fast parallel complex FFT on M2, the second generation of MorphoSys Reconfigurable computation platform, which is targeting on streamed applications such as multimedia and ...
详细信息
ISBN:
(纸本)0769520464
this paper presents implementation of a very fast parallel complex FFT on M2, the second generation of MorphoSys Reconfigurable computation platform, which is targeting on streamed applications such as multimedia and DSP. the proposed mapping comprises fast presorting, cascaded radix-2 stages, and post-reordering. Data and twiddle factors are 16-bit real and 16-bit imaginary in 2's complement format and scaling is performed to avoid overflow. the mapping is tested on our cycle-accurate simulator, "Mulate", and the performance is encouragingly better than other architectures such as Imagine and VIRAM. Moreover, the performance is scalable according to FFT sizes. Since there is no functionality specifically tailored to FFT, the results demonstrate the capability of MorphoSys architecture to extract parallelism from streamed applications. Further rationales are given based on the concepts of scalar operand networks and memory hierarchy.
Welcome to the 16th Workshop on Interaction between Compilers and computerarchitectures (INTERACT-16), held on February 25, 2012, in New Orleans, Louisiana. this year, the workshop was held in conjunction withthe 18...
详细信息
Due to mobility, energy limitations, and unreliable wireless channels, applications running on mobile devices suffer from faults such as temporary disconnection and data loss. We, therefore, need a fault tolerance mec...
详细信息
ISBN:
(纸本)1595936734
Due to mobility, energy limitations, and unreliable wireless channels, applications running on mobile devices suffer from faults such as temporary disconnection and data loss. We, therefore, need a fault tolerance mechanism to guarantee their smooth working and performance. In this paper, we present a novel proxy-based uncoordinated checkpointing scheme with pessimistic message logging for efficient fault recovery in mobile Grid system. Simulation results show that this scheme is reliable, efficient and, at the sametime, consumes less network traffic.
A bipartite graph G = (V, W, E) is convex if there exists an ordering of the vertices of W such that, for each v is an element of V, the neighbors of v are consecutive in W. In this work we describe a BSP/CGM algorith...
详细信息
ISBN:
(纸本)0769520464
A bipartite graph G = (V, W, E) is convex if there exists an ordering of the vertices of W such that, for each v is an element of V, the neighbors of v are consecutive in W. In this work we describe a BSP/CGM algorithm for finding a maximum matching in a convex bipartite graph. For p processors, the algorithm runs in time O((\V\/p) lg(\V\/p) lgp) and it uses O(lgp) communication rounds.
For practical use of microwave simulations in industry applications such as high frequency product design, this paper presents a conceptual design of 3-D finite difference time domain (FDTD) dedicated computer with da...
详细信息
For practical use of microwave simulations in industry applications such as high frequency product design, this paper presents a conceptual design of 3-D finite difference time domain (FDTD) dedicated computer with dataflow architecture as one of the portable highperformancecomputing technologies. A basic concept of the dataflow architecture for the FDTD dedicated computer itself was presented already in 2003 for 2-D microwave simulations. Detail design of 3-D FDTD dataflow machine is considered in this paper.
this paper presents a highperformance communication system based on generic programming. the system adapts itself according to the protocol being used on communication, simplifying the development of libraries. In or...
详细信息
In this paper we describe QsNet(III), an adaptively routed network for highperformancecomputing (HPC) applications. We detail the structure of the network, the evolution of our adaptive routing algorithms from previ...
ISBN:
(纸本)9780769533803
In this paper we describe QsNet(III), an adaptively routed network for highperformancecomputing (HPC) applications. We detail the structure of the network, the evolution of our adaptive routing algorithms from previous generations of network and new applications of these techniques. We describe other HPC specific features including hardware support for barrier and broadcast and large numbers small packets. We also describe the implementation of the network.
暂无评论