We present a numerical study of several parallel algebraic preconditioners, which speed up the convergence of Krylov iterative methods when solving large-scale linear systems. The studied algebraic preconditioners are...
详细信息
Computational Grids are emerging as a new infrastructure for Internet-based parallel and distributed computing. They enable the sharing, exchange, discovery, and aggregation of resources distributed across multiple ad...
详细信息
This paper proposes a general framework for searching large distributed repositories. Examples of such repositories include sites with music/video content, distributed digital libraries, distributed caching systems, e...
In this work we describe two sequential algorithms and their parallel counterparts for solving nonlinear systems, when the Jacobian matrix is symmetric and positive definite. This case appears frequently in unconstrai...
详细信息
Advances across many fields of study are driving changes in the basic nature of scientific computing applications. Scientists have recognized a growing need to study phenomena by explicitly modeling interactions among...
详细信息
Scalability of applications on distributed shared-memory (DSM) multiprocessors is limited by communication overheads. At some point, using more processors to increase parallelism yields diminishing returns or even deg...
详细信息
ISBN:
(纸本)0769518710
Scalability of applications on distributed shared-memory (DSM) multiprocessors is limited by communication overheads. At some point, using more processors to increase parallelism yields diminishing returns or even degrades performance. When increasing concurrency is futile, we propose an additional mode of execution, called slipstream mode, that instead enlists extra processors to assist parallel tasks by reducing perceived overheads. We consider DSM multiprocessors built from dual-processor chip multiprocessor (CMP) nodes with shared L2 cache. A task is allocated on one processor of each CMP node. The other processor of each node executes a reduced version of the same task. The reduced version skips shared-memory stores and synchronization, running ahead of the true task. Even with the skipped operations, the reduced task makes accurate forward progress and generates an accurate reference stream, because branches and addresses depend primarily on private data. Slipstream execution mode yields two benefits. first, the reduced task prefetches data on behalf of the true task. Second, reduced tasks provide a detailed picture of future reference behavior, enabling a number of optimizations aimed at accelerating coherence events, e.g., self-invalidation. For multiprocessor systems with up to 16 CMP nodes, slipstream, mode outperforms running one or two conventional tasks per CMP in 7 out of 9 parallel scientific benchmarks. Slipstream mode is 12-19% faster with prefetching only and up to 29% faster with self-invalidation enabled.
Since the advent of distributed computer systems an active field of research has been the investigation of scheduling strategies for parallel applications. The common approach is to employ scheduling heuristics that a...
详细信息
ISBN:
(纸本)0769519199
Since the advent of distributed computer systems an active field of research has been the investigation of scheduling strategies for parallel applications. The common approach is to employ scheduling heuristics that approximate an optimal schedule. Unfortunately, it is often impossible to obtain analytical results to compare the efficacy of these heuristics. One possibility is to conducts large numbers of back-to-back experiments on real platforms. While this is possible on tightly-coupled platforms, it is infeasible on modern distributed platforms (i.e. Grids) as it is labor-intensive and does not enable repeatable results. The solution is to resort to simulations. Simulations not only enables repeatable results but also make it possible to explore wide ranges of platform and application scenarios. In this paper we present the SimGrid framework which enables the simulation of distributed applications in distributed computing environments for the specific purpose of developing and evaluating scheduling algorithms. This paper focuses on SimGrid v2, which greatly improves on the first version of the software with more realistic network models and topologies. SimGrid v2 also enables the simulation of distributed scheduling agents, which has become critical for current scheduling research in large-scale platforms. After describing and validating these features, we present a case study by which we demonstrate the usefulness of SimGrid for conducting scheduling research.
The proceedings contain 29 papers. The special focus in this conference is on Web Algorithms, Bit-parallel Algorithms, Compression and Music Retrieval. The topics include: Link information as a similarity measure in w...
ISBN:
(纸本)3540201777
The proceedings contain 29 papers. The special focus in this conference is on Web Algorithms, Bit-parallel Algorithms, Compression and Music Retrieval. The topics include: Link information as a similarity measure in web classification;a three level search engine index based in query log distribution;alternative algorithms for bit-parallel string matching;bit-parallel approximate string matching algorithms with transposition;processing of Huffman compressed texts with a super-alphabet;an optimized compression code for natural language text databases;linear-time off-line text compression by longest-first substitution;structural contexts model for improving compression in semistructured text databases;ranking structured documents using utility theory in the Bayesian network retrieval model;an empirical comparison of text categorization methods;improving text retrieval in medical collections through automatic categorization;flexible and efficient bit-parallel techniques for transposition invariant approximate matching in music retrieval;analysis and evaluation of a Turkish stemming algorithm;non-adjacent digrams improve matching of cross-lingual spelling variants;the implementation and evaluation of a lexicon-based stemmer;new refinement techniques for longest common subsequence algorithms;distributed query processing using suffix arrays;bit filtration technique for approximate string join in biological databases;using wordnet for word sense disambiguation to support concept map construction;memory-adaptative dynamic spatial approximation trees and large edit distance with multiple block operations.
Continued research into distributed agent-based systems and evolving Web based technologies are opening up tremendous possibilities for the deployment of large scale and highly extensible and flexible simulation syste...
详细信息
暂无评论