To make microwave simulation practical for industrial applications such as high-frequency product design, this paper presents a conceptual design of a dedicated 3-D finite-difference time-domain (FDTD) computer with a dataflow architecture, as one of the portable high-performance computing technologies. The basic concept of the dataflow architecture for a dedicated FDTD computer was already presented in 2003 for 2-D microwave simulations. This paper considers the detailed design of a 3-D FDTD dataflow machine.
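For readers unfamiliar with FDTD, the sketch below shows the leapfrog field update that such a machine would have to evaluate at every grid cell. It is reduced to one dimension and normalized units for brevity; the function name, grid size, Courant number, and Gaussian source are illustrative assumptions and are not taken from the paper.

```python
import math


def fdtd_1d(n_cells=201, n_steps=150, courant=0.5, src=100):
    """Leapfrog Yee updates for Ez and Hy on a 1-D grid (normalized units).

    All parameters are hypothetical defaults chosen for illustration.
    """
    ez = [0.0] * n_cells
    hy = [0.0] * (n_cells - 1)  # hy[i] sits halfway between ez[i] and ez[i + 1]
    for t in range(n_steps):
        # Update H from the spatial difference of E.
        for i in range(n_cells - 1):
            hy[i] += courant * (ez[i + 1] - ez[i])
        # Update E from the spatial difference of H.
        # The two end cells are never updated, so they act as perfect conductors.
        for i in range(1, n_cells - 1):
            ez[i] += courant * (hy[i] - hy[i - 1])
        # Additive ("soft") Gaussian pulse injected at the source cell.
        ez[src] += math.exp(-((t - 30) / 10.0) ** 2)
    return ez


if __name__ == "__main__":
    field = fdtd_1d()
    print(max(abs(v) for v in field))
```

Each cell update reads only its nearest neighbours, which is the locality a dataflow design can exploit; the 3-D case applies the same pattern to six field components.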
ISBN: 0769516262 (print)
We propose a hybrid parallelism-independent scheduling method, performed predominantly at compile time, which generates machine code that executes efficiently on any number of workstations or PCs in a cluster computing environment. Our new scheduling algorithm, called the Dynamical Level Parallelism-Independent Scheduling algorithm (DLPIS), is applicable to distributed computer systems because, in addition to task scheduling, it performs message communication scheduling. It provides an explicit task synchronization mechanism that guides task allocation and data dependency resolution at run time with reduced overhead. Furthermore, we provide a mechanism that allows the machine code to adapt itself at run time to the degree of parallelism of the system. Our scheduling method therefore supports a variable number of processors in users' computing systems, as well as the adaptive parallelism that may occur in distributed computing systems due to computer or link failure.
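The abstract does not spell out DLPIS, so the sketch below is not that algorithm. It is a generic level-based list scheduler, shown only to illustrate how one priority order computed ahead of time can be mapped onto any processor count at run time. Message communication costs, which the paper does schedule, are ignored here, and every name is hypothetical.

```python
def task_levels(succ, cost):
    """Level of a task = longest cost path from it to an exit task (inclusive)."""
    memo = {}

    def level(task):
        if task not in memo:
            below = max((level(s) for s in succ.get(task, [])), default=0)
            memo[task] = cost[task] + below
        return memo[task]

    return {task: level(task) for task in cost}


def list_schedule(succ, cost, n_procs):
    """Greedy list scheduling; returns the makespan on n_procs processors."""
    levels = task_levels(succ, cost)  # priorities do not depend on n_procs
    preds = {task: set() for task in cost}
    for task, successors in succ.items():
        for s in successors:
            preds[s].add(task)
    finish = {}
    proc_free = [0] * n_procs
    remaining = set(cost)
    while remaining:
        # A task is ready once all of its predecessors have been scheduled.
        ready = [t for t in remaining if all(p in finish for p in preds[t])]
        task = max(sorted(ready), key=levels.get)  # highest level first
        proc = min(range(n_procs), key=proc_free.__getitem__)
        start = max([proc_free[proc]] + [finish[p] for p in preds[task]])
        finish[task] = start + cost[task]
        proc_free[proc] = finish[task]
        remaining.remove(task)
    return max(finish.values())


if __name__ == "__main__":
    succ = {"a": ["b", "c"], "b": ["d"], "c": ["d"]}
    cost = {"a": 1, "b": 2, "c": 2, "d": 1}
    for p in (1, 2, 3):
        print(p, list_schedule(succ, cost, p))
```

On this diamond-shaped graph the same level table serves every processor count; only the mapping step changes.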
ISBN: 1595936734 (print)
In this work we discuss a range of approaches to full-system simulation of distributed memory parallel computers, with emphasis on the interconnection network. We present our environment, based on Simics, and discuss how unforeseen interactions and fine tuning of components can affect results.
ISBN: 0769516262 (print)
Grid or mesh techniques are frequently used to approximate continuous entities that behave in a wave-like or fluid-like fashion. Partial differential equations (PDEs) are usually involved in describing such entities or processes. Distributed parallel computation was used on various computer cluster configurations to calculate PDE solutions for the electrostatic field. The aim was to study the efficacy of the selected architecture using mesh techniques. The match between the algorithm and the architecture in achieving maximum computational performance was also investigated. The developed architectures, algorithms, and findings are presented in the paper.
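As an illustration of the kind of mesh computation described, the sketch below solves Laplace's equation for an electrostatic potential on a square grid with Jacobi relaxation. The abstract does not name the numerical method, so the choice of Jacobi iteration, the boundary voltages, and the grid size are all assumptions made for this example.

```python
def jacobi_laplace(n=16, top_voltage=1.0, tol=1e-6, max_iters=10_000):
    """Relax the potential on an n x n grid until the largest change is below tol.

    Boundary conditions (assumed): top edge held at top_voltage, other edges at 0.
    """
    phi = [[0.0] * n for _ in range(n)]
    phi[0] = [top_voltage] * n
    for _ in range(max_iters):
        new = [row[:] for row in phi]
        delta = 0.0
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                # Each interior point becomes the mean of its four neighbours.
                new[i][j] = 0.25 * (
                    phi[i - 1][j] + phi[i + 1][j] + phi[i][j - 1] + phi[i][j + 1]
                )
                delta = max(delta, abs(new[i][j] - phi[i][j]))
        phi = new
        if delta < tol:
            break
    return phi


if __name__ == "__main__":
    potential = jacobi_laplace()
    print(potential[8][8])
```

Because every update reads only the previous iterate, the grid can be split into strips, one per cluster node, with nodes exchanging only their boundary rows after each sweep. That communication-to-computation ratio is one plausible way to frame the algorithm-architecture match the abstract mentions.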
ISBN: 0769516262 (print)
Adequate occupation of computing resources can decisively influence the global performance of a system. To achieve high performance, it is therefore necessary to know all the computing resources involved and their occupation levels at a given moment. With the objective of improving system performance, this paper presents the OpenTella model, which updates information about resource occupation and analyzes that occupation so that processes can be migrated among the computers of a cluster. To increase the scalability of the system and decrease the number of messages exchanged among computers, this peer-to-peer protocol defines sub-nets, which are clusters that together make up a more comprehensive cluster. Groups are thus defined to exchange information and update resource occupation, minimizing communication while computing a load balance that meets the system's needs and results in the migration of processes.
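The abstract does not give OpenTella's balancing rule, so the following is only a hypothetical sketch of the decision a sub-net could make once its members have exchanged occupation data: repeatedly move one process from the busiest node to the idlest until their counts differ by at most one. The function name and the data layout are assumptions.

```python
def plan_migrations(procs_per_node):
    """Return (moves, final_counts) for a single sub-net.

    procs_per_node maps a node name to its current process count.
    Each move is a (source_node, destination_node) pair.
    """
    counts = dict(procs_per_node)
    moves = []
    while True:
        names = sorted(counts)  # fixed order gives deterministic tie-breaking
        busiest = max(names, key=counts.get)
        idlest = min(names, key=counts.get)
        if counts[busiest] - counts[idlest] <= 1:
            return moves, counts
        counts[busiest] -= 1
        counts[idlest] += 1
        moves.append((busiest, idlest))


if __name__ == "__main__":
    moves, final = plan_migrations({"a": 6, "b": 1, "c": 2})
    print(moves, final)
```

Restricting the exchange to one sub-net keeps the number of occupation messages proportional to the group size rather than to the whole cluster, which is the scaling argument the abstract makes.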
ISBN: 1595936734 (print)
The proceedings contain 31 papers. The topics discussed include: transparent network services via a virtual traffic layer for virtual machines; failure-aware checkpointing in fine-grained cycle sharing systems; using queue structures to improve job reliability; cooperative secondary authorization recycling; a statistical approach to risk mitigation in computational markets; feedback-directed thread scheduling with memory considerations; precise and realistic utility functions for user-centric performance analysis of schedulers; a provisioning model and its comparison with best-effort for performance-cost optimization in grids; data driven workflow planning in cluster management systems; partial content distribution on high performance networks; using content-addressable networks for load balancing in desktop grids; peer-to-peer checkpointing arrangement for mobile grid computing systems; and evaluating the impacts of network information models on applications and network service providers.
ISBN: 1595936734 (print)
We present the Lightweight Information Validation Environment (LIVE) as a solution to the high complexity and large data sizes of modern computational science applications. LIVE is a data workspace that facilitates the creation of dynamic data processing overlays that we call I/O graphs. We use LIVE as a platform for dynamically extending scientific applications through lightweight data extraction, runtime discovery, and flexible data selection.
ISBN: 9781538637906 (print)
Matrix multiplication is a widely used routine in science and engineering applications. Accelerating this routine is important because applications with large-scale matrix multiplication are increasingly common, especially in high-performance computing (HPC). However, existing computing platforms, including CPUs, GPGPUs, and FPGAs, suffer from unsatisfactory performance or efficiency for this routine. In this paper, we propose a high-performance accelerator for double-precision floating-point matrix multiplication and build a performance model for design space exploration based on memory access scheduling. The impact of architecture parameters on accelerator performance and efficiency is evaluated and analyzed. Experimental results show that our proposed accelerator with 256 processing elements (PEs) achieves a maximum performance of 767.99 GFLOPS and an efficiency of 99.99% for large-scale matrix multiplication, making it well suited to the requirements of HPC applications.
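The reported figures can be cross-checked with a little arithmetic. The snippet below assumes that efficiency means achieved performance divided by theoretical peak, which the abstract does not state explicitly. The function name is hypothetical.

```python
def implied_peak(achieved_gflops, efficiency, n_pes):
    """Back out the theoretical peak and the per-PE share.

    Assumes efficiency = achieved / peak (not confirmed by the abstract).
    """
    peak = achieved_gflops / efficiency
    return peak, peak / n_pes


if __name__ == "__main__":
    peak, per_pe = implied_peak(767.99, 0.9999, 256)
    print(f"implied peak: {peak:.2f} GFLOPS, per PE: {per_pe:.3f} GFLOPS")
```

Under that assumption the implied peak is about 768 GFLOPS, or roughly 3 GFLOPS per processing element.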
ISBN: 1595936734 (print)
This paper deals with a novel, distributed, QoS-aware, peer-to-peer checkpointing arrangement component for Mobile Grid (MoG) computing systems middleware. Checkpointing is more crucial in MoG systems than in their wired counterparts because node mobility and less reliable wireless links result in frequent and dynamic connections and disconnections. Having determined that finding the globally optimal checkpoint arrangement is NP-complete, we consider ReD, our Reliability Driven protocol, which employs QoS-aware heuristics to construct superior peer-to-peer checkpointing arrangements efficiently.
ISBN: 1595936734 (print)
Tightly coupled parallel applications are increasingly run in Grid environments. Unfortunately, on many Grid sites the ability of machines to create or accept network connections is severely limited by firewalls, network address translation (NAT), or non-routed networks. Multi-homing further complicates connection setup and machine identification. Although ad hoc solutions exist for some of these problems, it is usually up to the application's user to discover the cause of the connectivity problems and find a solution. In this paper we describe SmartSockets, a communication library that lifts this burden by automatically discovering connectivity problems and solving them with as little support from the user as possible. Copyright 2007 ACM.