the Nile system is a distributed environment for running very large, data-intensive applications across a network of commodity workstations. these applications process data from elementary particle collisions, generat...
详细信息
ISBN:
(纸本)0780350049
the Nile system is a distributed environment for running very large, data-intensive applications across a network of commodity workstations. these applications process data from elementary particle collisions, generated by the Cornell Electron Storage Ring, and are used by physicists of the CLEO experiment. the applications have a simple data-parallel structure, and so Nile executes them using as much parallelism as is available. Nile currently runs at any single site. It is being used by alpha testers and is scheduled for beta release in March 1998. this paper describes how we are adapting this local-area Nile system to allow for wide-area, multiple site interactions. In particular, we consider the two problems of scaling and of fault-tolerance.
A compiler transformation to minimize the number of messages in PDE parallel iterative codes is presented. For the n-dimensional high-order PDE or second-order PDE using high-order finite differences, this transformat...
详细信息
the POPCORN project provides an infrastructure for globally distributed computation over the whole Internet. it provides any programmer connected to the Internet with a single huge virtual parallel computer composed o...
详细信息
ISBN:
(纸本)0780350049
the POPCORN project provides an infrastructure for globally distributed computation over the whole Internet. it provides any programmer connected to the Internet with a single huge virtual parallel computer composed of all processors On the Internet which care to participate at any given moment. the system provides a market-based mechanism of trade it? CPU time to motivate processors To provide their CPU cycles for other peoples computations. "Selling" CPU time is as easy as visiting a certain web site with a Java-enabled browser. "Buying" CPU time is done by writing a parallel program, using our programming paradigm (and libraries). this paradigm was designed as to fit the situation of global computation. A third entity in our system is a "market" for CPU time, which is where buyers and sellers meet and trade. the system has been implemented and may be visited and used on our web-site: http://***/-popcorn.
Cloud computing enables diverse new application areas for distributedcomputing. Many upcoming cloud applications do not fit to simple programming models such as "embarrassing parallelism" but have complex d...
详细信息
ISBN:
(纸本)9781467345651;9780769549033
Cloud computing enables diverse new application areas for distributedcomputing. Many upcoming cloud applications do not fit to simple programming models such as "embarrassing parallelism" but have complex data dependencies and require atomic operations spanning multiple objects. Some large-scale storage systems already implement atomic multi-object operations, but they do not address the complementary problem of efficiently propagating replica updates. In this paper, we present the design and implementation of a smart replication protocol in the ECRAM in-memory storage, which supports atomic multi-object operations. the performance analysis shows that the adaptive mechanism requires much less bandwidth, less memory, and results in improved application performance and responsiveness.
the problem of cache coherence in large-scale shared memory multiprocessor systems has been addressed using directory schemes. the memory overhead for various directory schemes expand as the number of processors becom...
详细信息
We introduce a new performance metric, called Land Balancing Factor (LDF), to assist programmers with evaluating different tuning alternatives. the LBF metric differs from traditional performance metrics since it is i...
详细信息
ISBN:
(纸本)0780350049
We introduce a new performance metric, called Land Balancing Factor (LDF), to assist programmers with evaluating different tuning alternatives. the LBF metric differs from traditional performance metrics since it is intended To measure the performance implications of a specific toning alternative rather than quantifying where rime is spent in the current version of the program. A second unique aspect of the metric is that it provides guidance about moving work within a distributed or parallel program rather than reducing it. A variation of the LBF metric can also be used to predict the performance impact of changing the underlying network, the LBF metric can be computed incrementally and online during the execution of the program to De tuned We also present a case study that shows that our metric can pr-edict the actual performance gains accurately for a test suite of six programs.
parallel transactions in distributed DBs incur high overhead for concurrency control and aborts. We propose an alternative approach by pre-serializing possibly conflicting transactions, and parallelizing non-conflicti...
详细信息
ISBN:
(纸本)9781467345651;9780769549033
parallel transactions in distributed DBs incur high overhead for concurrency control and aborts. We propose an alternative approach by pre-serializing possibly conflicting transactions, and parallelizing non-conflicting update transactions to different replicas. Our system provides strong transactional guarantees. In effect, Gargamel partitions the database dynamically according to the update workload. Each database replica runs sequentially, at full bandwidth;mutual synchronisation between replicas remains minimal. Our simulations show that Gargamel improves both response time and load by an order of magnitude when contention is high (highly loaded system with bounded resources), and that otherwise slow-down is negligible.
this paper defines a new network called the Extended Butterfly. the extended butterfly of degree n (XBn) has n22n nodes, diameter equal to ⌊3n/2⌋ and a constant node degree of 8. XBn is symmetric and contains n distin...
详细信息
An attractive way to improve the performance of a thin client system is to reduce the redundant presentation data sent across networks. While various optimization efforts have been made along this direction, they focu...
详细信息
Molecular docking is a widely used tool in Computer-aided Drug Design and Discovery. Due to the complexity of simulating the chemical events when two molecules interact, highly accelerated molecular docking programs a...
详细信息
ISBN:
(纸本)9781467345651
Molecular docking is a widely used tool in Computer-aided Drug Design and Discovery. Due to the complexity of simulating the chemical events when two molecules interact, highly accelerated molecular docking programs are of great interest and importance for practical use. In this paper, we present a GPU accelerated docking program implemented with CUDA. the hardware-enabled texture interpolation is employed for fast energy evaluation. Two types of parallel genetic algorithms are mapped to the CUDA computing architecture and used for the search of optimal docking result. Comparing to the CPU implementation, the GPU accelerated docking program achieved significant speedup while producing comparable results to the CPU version. the source code is made public at http://***/p/cudock/.
暂无评论