ISBN: (print) 9783540754152
The proceedings contain 48 papers. The topics discussed include: new and old tools and programming models for high-performance computing; verifying parallel programs with MPI-Spin; advanced MPI programming; full bandwidth broadcast, reduction and scan with only two trees; process cooperation in multiple message broadcast; a case for standard non-blocking collective operations; optimization of collective communications in HeteroMPI; investigations on InfiniBand: efficient network buffer utilization at scale; improving MPI support for applications on hierarchically distributed resources; extending the MPI-2 generalized request interface; transparent log-based data storage in MPI-IO applications; analysis of implementation options for MPI-2 one-sided; revealing the performance of MPI RMA implementations; and distributed real-time computing with Harness.
ISBN: (print) 1402052626
The proceedings contain 66 papers. The topics discussed include: an elastic display method for visualizing and navigating a large quantity of alarms in a control room of a nuclear power plant; parallel construction of Huffman codes; semantic description of multimedia content adaptation web services; grid computing communication strategies for cross cluster job execution; glue code synthesis for distributed software programming; an improved configuration similarity retrieval model; content based image retrieval using quadrant motif scan; method of key vectors extraction using R-cloud classifiers; semantic web knowledge management; augmented color recognition by applying erasure capability of Reed-Solomon algorithm; reversers - a programming language construct for reversing out of code; and a mobile location algorithm using clustering technique for NLoS environments.
The application of Wireless Sensor Networks (WSNs) in many fields has made significant advances; much research can be seen in military, industrial control surveillance, and bush fire and wildlife habitat monitoring. How...
ISBN: (print) 9780769537474
As human culture advances, problems in science and engineering become more complicated and need more computing power to tackle and analyze. A supercomputer is no longer the only choice for complex problems, thanks to the speed-up of personal computers and networks. Grid technology, which connects a number of personal computers with high-speed networks, can achieve the same computing power as a supercomputer at a lower cost. However, a grid is a heterogeneous system, so scheduling independent tasks on it is more complicated. To fully utilize the power of a grid, we need an efficient job scheduling algorithm to assign jobs to grid resources. In this paper, we propose an Adaptive Scoring Job Scheduling algorithm (ASJS) for the grid environment. Compared to other methods, it can decrease the completion time of all submitted jobs, which may comprise computing-intensive jobs and data-intensive jobs.
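The abstract does not give the ASJS scoring formula, so the following is only an illustrative sketch of score-based scheduling in general, not the authors' algorithm. The `Resource` fields, the weights, and the load penalty are all hypothetical choices made for the example: each job goes to the resource with the highest score, and the score favors compute power for computing-intensive jobs and bandwidth for data-intensive jobs.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    cpu_speed: float   # hypothetical relative compute power
    bandwidth: float   # hypothetical relative network bandwidth
    load: float = 0.0  # work already assigned to this resource


def score(res, job_kind):
    """Score a resource for a job kind ("compute" or "data").

    The 0.8/0.2 weights and the load penalty are assumptions made for
    illustration; they are not taken from the ASJS paper.
    """
    w_cpu, w_bw = (0.8, 0.2) if job_kind == "compute" else (0.2, 0.8)
    raw = w_cpu * res.cpu_speed + w_bw * res.bandwidth
    return raw / (1.0 + res.load)  # busier resources score lower


def schedule(jobs, resources):
    """Assign each (job_id, kind, size) to the best-scoring resource."""
    assignment = {}
    for job_id, kind, size in jobs:
        best = max(resources, key=lambda r: score(r, kind))
        assignment[job_id] = best.name
        best.load += size  # the score adapts as load accumulates
    return assignment
```

Because the load term is updated after every assignment, later jobs see lower scores for busy resources, which is one simple way to make a scoring scheme adaptive.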
ZSA, an industrial image processing system, is characterized by a modern parallel architecture for digital signal processing. Besides the use of standard video cameras the system is strongly intended to be used with on...
ISBN: (print) 0769519199
I/O performance remains a weakness of parallel computing systems today. While this weakness is partly attributed to rapid advances in other system components, I/O interfaces available to programmers and the I/O methods supported by file systems have traditionally not matched efficiently with the types of I/O operations that scientific applications perform, particularly noncontiguous accesses. The MPI-IO interface allows for rich descriptions of the I/O patterns desired for scientific applications, and implementations such as ROMIO have taken advantage of this ability while remaining limited by underlying file system methods. A method of noncontiguous data access, list I/O, was recently implemented in the Parallel Virtual File System (PVFS). We implement support for this interface in the ROMIO MPI-IO implementation. Through a suite of noncontiguous I/O tests we compared ROMIO list I/O to current methods of ROMIO noncontiguous access and found that the list I/O interface provides performance benefits in many noncontiguous cases.
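To make the idea of a noncontiguous access concrete, here is a minimal sketch that describes one column of a row-major matrix as a list of (offset, length) regions and hands the whole list to a single read routine. This is not the PVFS or ROMIO interface: `column_regions` and `read_list` are hypothetical helper names, and the routine runs against an ordinary file-like object rather than a parallel file system.

```python
import io
import struct


def column_regions(rows, cols, col, elem_size):
    """Return (offset, length) pairs covering one column of a
    row-major rows x cols matrix with elem_size-byte elements."""
    return [((r * cols + col) * elem_size, elem_size) for r in range(rows)]


def read_list(f, regions):
    """Serve a whole region list as one logical request.

    A real list I/O implementation would ship the list to the file
    system in one call; this sketch only emulates that with seeks.
    """
    out = bytearray()
    for offset, length in regions:
        f.seek(offset)
        out += f.read(length)
    return bytes(out)


# Usage: read column 1 of a 2 x 3 matrix of little-endian 32-bit ints.
data = struct.pack("<6i", 0, 1, 2, 3, 4, 5)
column = struct.unpack("<2i", read_list(io.BytesIO(data), column_regions(2, 3, 1, 4)))
```

The point of the design is that the caller states the full access pattern up front, so the layer below can service it in fewer requests than one call per region.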
Distributed Shared Memory (DSM) offers programmers a shared memory abstraction on top of an underlying network of distributed memory machines. Advances in network technology and price/performance of workstations suggest that DSM will be the dominant paradigm for future high-performance computing. However, as long-running DSM applications scale to hundreds or even thousands of machines, the probability of a node or network link failing increases. Fault tolerance is typically achieved via 'checkpointing' techniques that allow applications to 'roll back' to a recent checkpoint rather than restarting. High-performance DSM systems using relaxed memory consistency are significantly more difficult to checkpoint than uniprocessor or message passing architectures. This paper describes previous approaches to checkpointing message passing parallel programs along with extensions to DSM systems.
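The basic checkpoint and rollback cycle can be sketched for a single process as follows. This is a deliberately simplified illustration, not a DSM checkpointing protocol: it ignores the consistency issues the abstract refers to, and the function name, the `fail_at` parameter, and the toy computation are all assumptions made for the example.

```python
import copy


def run_with_checkpoints(steps, interval, fail_at=None):
    """Sum 1..steps, checkpointing every `interval` steps.

    If `fail_at` is given, a single failure is simulated once that many
    steps have completed; the run rolls back to the last checkpoint
    instead of restarting from step 0. Returns (total, steps_redone).
    """
    state = {"step": 0, "total": 0}
    checkpoint = copy.deepcopy(state)  # initial checkpoint
    failed = False
    redone = 0
    while state["step"] < steps:
        if fail_at is not None and not failed and state["step"] == fail_at:
            failed = True
            redone = state["step"] - checkpoint["step"]  # lost work
            state = copy.deepcopy(checkpoint)            # roll back
            continue
        state["step"] += 1
        state["total"] += state["step"]
        if state["step"] % interval == 0:
            checkpoint = copy.deepcopy(state)  # save a consistent snapshot
    return state["total"], redone
```

With a checkpoint every 4 steps and a failure after 7 completed steps, only the 3 steps since the last checkpoint are repeated, and the final result is unchanged.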
Management of resource and application scheduling in a highly distributed heterogeneous Grid environment is a complex and challenging task. Processing jobs at the grid resources in a fine-grained form results in a low...
ISBN: (print) 9780819484093
Advances in the image processing field have brought new methods which are able to perform complex tasks robustly. However, in order to meet constraints on functionality and reliability, imaging application developers often design complex algorithms with many parameters which must be finely tuned for each particular environment. The best approach for tuning these algorithms is to use an automatic training method, but the computational cost of this kind of training is prohibitive, making it infeasible even on powerful machines. The same problem arises when designing testing procedures. This work presents methods to train and test complex image processing algorithms in parallel execution environments. The approach proposed in this work is to use existing resources in offices or laboratories, rather than expensive clusters. These resources are typically non-dedicated, heterogeneous and unreliable. The proposed methods have been designed to deal with all these issues. Two methods are proposed: intelligent training based on genetic algorithms and PVM, and a full factorial design based on grid computing which can be used for training or testing. These methods are capable of harnessing the available computational resources, giving more work to more powerful machines, while taking their unreliable nature into account. Both methods have been tested using real applications.
ISBN: (print) 9783642016707
This paper deals with fault-tolerant recovery mechanisms and probabilistic result certification issues on large-scale architectures. Related work in the result certification domain is based on total or partial duplication of the application. However, it is limited to the execution of independent tasks. In the present work, we extend these mechanisms to applications with dependent tasks. First, we propose an approach based on an abstract representation of a parallel execution called a macro-dataflow graph. Second, we introduce probabilistic certification algorithms that avoid the re-execution of the program, allowing for recovery on different platforms with different numbers of processors. We also sketch how to simulate our framework using state-of-the-art workload modeling and fault injection tools.
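The arithmetic behind certification by partial re-execution can be illustrated with a simple independence model. This is not the paper's algorithm, which handles dependent tasks through the macro-dataflow graph; it assumes instead that a fraction `q` of task results is forged and that tasks to re-check are drawn uniformly at random with replacement. Under that assumption, n checks all miss the forgeries with probability (1 - q)^n, so the number of checks needed for a miss probability of at most `eps` follows directly. The function names are hypothetical.

```python
import math


def miss_probability(q, n):
    """Probability that n independent uniform checks all hit correct
    results when a fraction q of results is forged (assumed model)."""
    return (1.0 - q) ** n


def checks_needed(q, eps):
    """Smallest n with (1 - q) ** n <= eps, for 0 < q < 1, 0 < eps < 1."""
    return math.ceil(math.log(eps) / math.log(1.0 - q))
```

Note that under this model the number of checks depends only on `q` and `eps`, not on the total number of tasks, which is why sampling can be far cheaper than full duplication.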