The proceedings contain 14 papers on Concurrency: Practice and Experience. Topics discussed include: array remapping; parallel supercomputing; parallel programming; high performance computing; Nexus communication systems; Web-based parallel programming environment; Java-based global computing; and coarse-grain distributed computing.
ISBN: 0818681187 (print)
We exploited the recent advances in Internet connectivity and Web technologies for building Web-based parallel programming environments (WPPEs) that facilitate the development and execution of parallel programs on remote high-performance computers. A Web browser running on the user's machine provides a user-friendly interface to server-site user accounts and allows the use of parallel computing platforms and software in a convenient manner. The user may create, edit, and execute files through this Web browser interface. This new Web-based client-server architecture has the potential of being used as a future front-end to high-performance computer systems. We discuss the design and implementation of several prototype WPPEs that are currently in use at the Northeast Parallel Architectures Center and the Cornell Theory Center. These initial prototypes support high-level parallel programming with Fortran 90 and High Performance Fortran (HPF), as well as explicit low-level programming with Message Passing Interface (MPI). We detail the lessons learned during the development process and outline the tradeoffs of various design choices in the realization of the design. We especially concentrate on providing server-site user accounts, mechanisms to access those accounts through the Web, and the Web-related system security issues.
SRAM (static random access memory)-based pipelined algorithmic solutions have become competitive alternatives to TCAMs (ternary content addressable memories) for high-throughput IP lookup. Multiple pipelines can be utilized in parallel to improve the throughput further. However, several challenges must be addressed to make such solutions feasible. First, the memory distribution over different pipelines, as well as across different stages of each pipeline, must be balanced. Second, the traffic among these pipelines should be balanced. Third, the intra-flow packet order (i.e. the sequence) must be preserved. In this paper, we propose a parallel SRAM-based multi-pipeline architecture for IP lookup. A two-level mapping scheme is developed to balance the memory requirement among the pipelines as well as across the stages in each pipeline. To balance the traffic, we propose an early caching scheme to exploit the data locality inherent in the architecture. Our technique uses neither a large reorder buffer nor complex reorder logic. Instead, a flow-aware queuing scheme exploiting the flow information is used to maintain the intra-flow sequence. Extensive simulation using real-life traffic traces shows that the proposed architecture with 8 pipelines can achieve a throughput of up to 10 billion packets per second, i.e. 3.2 Tbps for minimum size (40 bytes) packets, while preserving intra-flow packet order. (c) 2009 Elsevier Inc. All rights reserved.
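The conversion between the packet rate and the bit rate quoted in the abstract above can be checked with a few lines of arithmetic. This is only a sanity check of the stated figures; the function name `line_rate_bps` is a hypothetical helper, not something from the paper.

```python
def line_rate_bps(packets_per_second: float, packet_bytes: int) -> float:
    """Bit rate produced by a stream of fixed-size packets."""
    return packets_per_second * packet_bytes * 8  # 8 bits per byte


# Figures taken from the abstract: 10 billion packets/s, 40-byte packets.
rate_bps = line_rate_bps(10e9, 40)
rate_tbps = rate_bps / 1e12  # terabits per second
print(f"{rate_tbps:.1f} Tbps")
```

The result agrees with the 3.2 Tbps figure stated for minimum-size packets.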
ISBN: 0769517455 (print)
With the tremendous advances in processor and memory technology, I/O has risen to become the bottleneck in high-performance computing for many applications. The development of parallel file systems has helped to narrow the performance gap, but I/O still remains an area needing significant performance improvement. Research has found that noncontiguous I/O access patterns in scientific applications, combined with current file system methods to perform these accesses, lead to unacceptable performance for large data sets. To enhance performance of noncontiguous I/O, we have created list I/O, a native version of noncontiguous I/O. We have used the Parallel Virtual File System (PVFS) to implement our ideas. Our research and experimentation show that list I/O outperforms current noncontiguous I/O access methods in most I/O situations and can substantially enhance the performance of real-world scientific applications.
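The general idea behind describing a noncontiguous access as a list of regions can be illustrated with a toy sketch. It assumes an in-memory "file" and counts requests; the class `CountingFile` and its methods `read_at` and `read_list` are hypothetical names for illustration and are not the PVFS interface described in the paper.

```python
from typing import Sequence


class CountingFile:
    """Toy in-memory file that counts the requests it serves (illustrative only)."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.requests = 0

    def read_at(self, offset: int, length: int) -> bytes:
        """Contiguous read: one request per region."""
        self.requests += 1
        return self.data[offset:offset + length]

    def read_list(self, offsets: Sequence[int], lengths: Sequence[int]) -> bytes:
        """List-style read: all regions are described in a single request."""
        if len(offsets) != len(lengths):
            raise ValueError("offsets and lengths must have the same size")
        self.requests += 1
        return b"".join(self.data[o:o + n] for o, n in zip(offsets, lengths))


offsets, lengths = [0, 8, 16], [2, 2, 2]

# One request per noncontiguous region.
per_region = CountingFile(bytes(range(32)))
chunks = b"".join(per_region.read_at(o, n) for o, n in zip(offsets, lengths))

# One request carrying the whole offset/length list.
listed = CountingFile(bytes(range(32)))
combined = listed.read_list(offsets, lengths)

print(per_region.requests, listed.requests, chunks == combined)
```

Both paths return the same bytes; the difference in this toy model is only the number of requests issued, which is where a real file system would save round trips.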
Advanced general-purpose parallel systems should be able to support diverse applications with different resource requirements without compromising effectiveness and efficiency. We present a new resource management model for cluster computing that allows multiple scheduling policies to co-exist dynamically. In particular, we have built Octopus, an extensible and distributed hierarchical scheduler that implements new space-sharing, gang-scheduling and load-sharing strategies. A series of experiments performed on an IBM SP2 suggest that Octopus can effectively match application requirements to available resources, and improve the performance of a variety of parallel applications within a cluster.
ISBN: 0818680679 (print)
The performance of a distributed system depends upon the efficiency of job distribution among processing nodes, as well as that of its system architecture and operating system. This paper presents an extended C language, ParaC, that supports efficient parallel programming on distributed systems. ParaC is designed to reduce the effort of job distribution in distributed programming environments. Our design includes the description of design goals for the parallel language, the definition of a programming model, and the design of ParaC constructs. This paper also addresses the detailed design issues related to translation and finally presents our prototype.
PVM-based parallel/distributed computation tools have been designed, implemented, and applied to two important mathematical algorithms. The tools make PVM easier to use and applicable to a wider class of computations. The application shows how advanced algebraic algorithms can take advantage of modern parallel/distributed computing with the aid of such tools. The tools and interfaces described include PVM-ET (a set of enhancement tools for PVM), PvmJobs (a general bag-of-jobs library that works with any user-created job structure in a master/slave paradigm), and SaclibPvm (a simple software package interfacing SACLIB to PVM). The ability to interface symbolic computing to PVM allows us to tackle the parallelization of the Gröbner bases algorithm and the Characteristic Sets method, two very compute-intensive algorithms important in algebraic computations. These algorithms, their parallelization, and experimental results are presented.
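The bag-of-jobs pattern mentioned in the abstract above can be sketched generically as follows. This is not the PvmJobs API: it assumes Python threads in place of PVM tasks, and the function `run_bag_of_jobs` and its parameters are hypothetical names chosen for illustration.

```python
import queue
import threading
from typing import Callable, List, Sequence, TypeVar

J = TypeVar("J")
R = TypeVar("R")


def run_bag_of_jobs(jobs: Sequence[J], work: Callable[[J], R],
                    num_workers: int = 4) -> List[R]:
    """The master fills a shared bag; workers pull jobs until the bag is empty."""
    bag: "queue.Queue[tuple[int, J]]" = queue.Queue()
    for index, job in enumerate(jobs):
        bag.put((index, job))

    results: List[R] = [None] * len(jobs)  # type: ignore[list-item]

    def worker() -> None:
        while True:
            try:
                index, job = bag.get_nowait()
            except queue.Empty:
                return  # no jobs left, worker exits
            results[index] = work(job)  # each index is written by one worker only

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


squares = run_bag_of_jobs([1, 2, 3, 4, 5], lambda x: x * x, num_workers=3)
print(squares)
```

Because workers take the next job whenever they become free, uneven job sizes are balanced automatically, which is the usual motivation for this pattern.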
The proceedings contain 73 papers from the 1997 IEEE International Performance, Computing, and Communications Conference. Topics discussed include: real-time system performance; parallel computing; distributed systems; fault-tolerant networks; distributed algorithms; asynchronous transfer mode networks; asynchronous transfer mode services; asynchronous transfer mode switching; PowerPC microprocessors; chip simulators; computer hardware performance analysis; transport protocols; end-to-end protocols; multicasting models; routing algorithms; wireless networks; data security management; and communication network models.
ISBN: 0769524052 (print)
Despite the recent advances in access control approaches applicable to grid computing, there remain issues that impede the development of effective access control for grid applications. Among them are the lack of context-based models for access control and the reliance on identity-based or capability-based access control schemes. In this paper, we propose the RCBAC model, which extends RBAC with context constraints. The RCBAC mechanisms dynamically grant and adapt permissions to users based on a set of contextual information collected from the grid environments, while retaining the advantages of the RBAC model.
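A minimal sketch of role-based access control extended with context constraints is given below. It illustrates the general idea only and is not the RCBAC model as specified in the paper; the class `ContextRbac`, its methods, and the example context key `site_load` are all hypothetical.

```python
from typing import Callable, Dict, Optional, Set, Tuple

Context = Dict[str, object]
Constraint = Callable[[Context], bool]


class ContextRbac:
    """RBAC where each role-permission grant may carry a context predicate."""

    def __init__(self) -> None:
        self.user_roles: Dict[str, Set[str]] = {}
        self.role_perms: Dict[str, Set[str]] = {}
        self.constraints: Dict[Tuple[str, str], Constraint] = {}

    def assign(self, user: str, role: str) -> None:
        self.user_roles.setdefault(user, set()).add(role)

    def grant(self, role: str, perm: str,
              constraint: Optional[Constraint] = None) -> None:
        self.role_perms.setdefault(role, set()).add(perm)
        if constraint is not None:
            self.constraints[(role, perm)] = constraint

    def is_allowed(self, user: str, perm: str, context: Context) -> bool:
        """Allow if any of the user's roles holds perm and its constraint passes."""
        for role in self.user_roles.get(user, set()):
            if perm in self.role_perms.get(role, set()):
                check = self.constraints.get((role, perm))
                if check is None or check(context):
                    return True
        return False


policy = ContextRbac()
policy.assign("alice", "researcher")
# Hypothetical constraint: job submission is allowed only while site load is low.
policy.grant("researcher", "submit_job", lambda ctx: ctx.get("site_load", 1.0) < 0.8)

print(policy.is_allowed("alice", "submit_job", {"site_load": 0.5}))
print(policy.is_allowed("alice", "submit_job", {"site_load": 0.9}))
```

The same user with the same role is granted or denied the permission depending on the context supplied at decision time, while the role assignments themselves stay unchanged.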
ISBN: 9780769534435 (print)
A large-scale simulation in e-Science experiments can be modeled by using a workflow. The ProGenGrid Workflow Management System has been under development at the University of Salento in Lecce since 2004 and consists of an editor for designing the experiment and an engine for scheduling the jobs in a Computational Grid. The initial version was based on wrapping the bioinformatics tools as Web Services and scheduling job execution on the Grid by using a suitable engine. The engine was then optimized to support batch, parameter, and MPI jobs by using the Globus Toolkit. This component was developed as part of the Grid Resource Broker project. In this paper, we present the latest advances regarding the editor's new features, such as online monitoring, and the engine's support for scheduling jobs on other grid middleware such as gLite and UNICORE.