检索结果-内蒙古大学图书馆

A class of multilevel recursive incomplete LU preconditioning techniques

Journal of Applied Mathematics and computing 2001年第2期8卷 213-234页

作者： Zhang, Jun Laboratory for High Performance Scientific Computing and Computer Simulation Department of Computer Science University of Kentucky 773 Anderson Hall Lexington KY 40506-0046 United States

We introduce a class of multilevel recursive incomplete LU preconditioning techniques (RILUM) for solving general sparse matrices. This technique is based on a recursive two by two block incomplete LU factorization on the coefficient matrix. The coarse level system is constructed as an (approximate) Schur complement. A dynamic preconditioner is obtained by solving the Schur complement matrix approximately. The novelty of the proposed techniques is to solve the Schur complement matrix by a preconditioned Krylov subspace method. Such a reduction process is repeated to yield a multilevel recursive preconditioner. © 2001 Korean Society for Computational & Applied Mathematics and Korean SIGCAM.

关键词： Computational methods

来源：评论

学校读者我要写书评

暂无评论

Adaptive sampling for network management

引用

Journal of Network and Systems Management 2001年第4期9卷 409-434页

作者： Hernandez, Edwin A. High-performance Computing and Simulation (HCS) Research Laboratory Department of Electrical and Computer Engineering University of Florida Gainesville FL 32611-6200 P.O. Box 116200 United States

high-performance networks require sophisticated management systems to identify sources of bottlenecks and detect faults. At the same time, the impact of network queries on the latency and bandwidth available to the applications must be minimized. Adaptive techniques can be used to control and reduce the rate of sampling of network information, reducing the amount of processed data and lessening the overhead on the network. Two adaptive sampling methods are proposed in this paper based on linear prediction and fuzzy logic. The performance of these techniques is compared with conventional sampling methods by conducting simulative experiments using Internet and videoconference traffic patterns. The adaptive techniques are significantly more flexible in their ability to dynamically adjust with fluctuations in network behavior, and in some cases they are able to reduce the sample count by as much as a factor of two while maintaining the same accuracy as the best conventional sampling interval. The results illustrate that adaptive sampling provides the potential for better monitoring, control, and management of high-performance networks with higher accuracy, lower overhead, or both. © 2001 Plenum Publishing Corporation.

关键词： Adaptive sampling Fuzzy logic Linear prediction Network management SNMP

来源：评论

学校读者我要写书评

暂无评论

Gossip-Style Failure Detection and Distributed Consensus for Scalable Heterogeneous Clusters

引用

Cluster computing 2001年第3期4卷 197-209页

作者： Ranganathan, Sridharan George, Alan D. Todd, Robert W. Chidester, Matthew C. High-performance Computing and Simulation (HCS) Research Laboratory Department of Electrical and Computer Engineering University of Florida Gainesville USA

Gossip protocols provide a means by which failures can be detected in large, distributed systems in an asynchronous manner without the limits associated with reliable multicasting for group communications. However, in order to be effective with application recovery and reconfiguration, these protocols require mechanisms by which failures can be detected with system-wide consensus in a scalable fashion. This paper presents three new gossip-style protocols supported by a novel algorithm to achieve consensus in scalable, heterogeneous clusters. The round-robin protocol improves on basic randomized gossiping by distributing gossip messages in a deterministic order that optimizes bandwidth consumption. Redundant gossiping is completely eliminated in the binary round-robin protocol, and the round-robin with sequence check protocol is a useful extension that yields efficient detection times without the need for system-specific optimization. The distributed consensus algorithm works with these gossip protocols to achieve agreement among the operable nodes in the cluster on the state of the system featuring either a flat or a layered design. The various protocols are simulated and evaluated in terms of consensus time and scalability using a high-fidelity, fault-injection model for distributed systems comprised of clusters of workstations connected by high-performance networks.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Exploiting hierarchy in parallel computer networks to optimize collective operation performance

Exploiting hierarchy in parallel computer networks to optimi...

引用

International Symposium on Parallel and Distributed Processing (IPDPS)

作者： N.T. Karonis B.R. de Supinski I. Foster W. Gropp E. Lusk J. Bresnahan High Performance Computing L ab or atory Department of Computer Science Northern Illinois University DeKalb IL USA Center for Applied Scientific Computing Lawrence Livemore National Laboratory Livermore CA USA University of Chicago Chicago IL USA Argonne National Laboratory Argonne IL USA Mathematics and Computer Science Division Argonne National Laboratory Argonne IL USA

The efficient implementation of collective communication operations has received much attention. Initial efforts modeled network communication and produced "optimal" trees based on those models. However, the models used by these initial efforts assumed equal point-to-point latencies between any two processes. This assumption is violated in heterogeneous systems such as clusters of SMPs and wide-area "computational grids", and as a result, collective operations that utilize the trees generated by these models perform suboptimally. In response, more recent work has focused on creating topology-aware trees for collective operations that minimize communication across slower channels (e.g., a wide-area network). While these efforts have significant communication benefits, they all limit their view of the network to only two layers. We present a strategy based upon a multilayer view of the network. By creating multilevel topology trees we take advantage of communication cost differences at every level in the network. We used this strategy to implement topology-aware versions of several MPI collective operations in MPICH-G, the Globus-enabled version of the popular MPICH implementation of the MPI standard. Using information about topology discovered by Globus, we construct these topology-aware trees automatically during execution, thus freeing the MPI application programmer from having to write special files or functions to describe the topology to the MPICH library. We present results demonstrating the advantages of our multilevel approach by comparing it to the default (topology-unaware) implementation provided by MPICH and a topology-aware two-layer implementation.

关键词： Intelligent networks computer networks Chromium

来源：评论

学校读者我要写书评

暂无评论

Simulative performance analysis of gossip failure detection for scalable distributed systems

引用

Cluster computing 1999年第3期2卷 207-217页

作者： Burns, Mark W. George, Alan D. Wallace, Bradley A. High-performance Computing and Simulation (HCS) Research Laboratory Department of Electrical and Computer Engineering University of Florida Gainesville USA

Three protocols for gossip-based failure detection services in large-scale heterogeneous clusters are analyzed and compared. The basic gossip protocol provides a means by which failures can be detected in large distributed systems in an asynchronous manner without the limits associated with reliable multicasting for group communications. The hierarchical protocol leverages the underlying network topology to achieve faster failure detection. In addition to studying the effectiveness and efficiency of these two agreement protocols, we propose a third protocol that extends the hierarchical approach by piggybacking gossip information on application-generated messages. The protocols are simulated and evaluated with a fault-injection model for scalable distributed systems comprised of clusters of workstations connected by high-performance networks, such as the CPlant system at Sandia National Laboratories. The model supports permanent and transient node and link failures, with rates specified at simulation time, for processors functioning in a fail-silent fashion. Through high-fidelity, CAD-based modeling and simulation, we demonstrate the strengths and weaknesses of each approach in terms of agreement time, number of gossips, and overall scalability.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Accurately measuring MPI broadcasts in a computational grid

Accurately measuring MPI broadcasts in a computational grid

引用

International Symposium on high performance Distributed computing

作者： B.R. de Supinski N.T. Karonis Lawrence Livermore National Laboratory Center for Applied Scientific Computing Livermore CA USA High-Performance Computing Laboratory Department of Computer Science Northern Illinois University DeKalb IL USA

An MPI library's implementation of broadcast communication can significantly affect the performance of applications built with that library. In order to choose between similar implementations or to evaluate available libraries, accurate measurements of broadcast performance are required. As we demonstrate, existing methods for measuring broadcast performance are either inaccurate or inadequate. Fortunately, we have designed an accurate method for measuring broadcast performance, even in a challenging grid environment. Measuring broadcast performance is not easy. Simply sending one broadcast after another allows them to proceed through the network concurrently, thus resulting in inaccurate per broadcast timings. Existing methods either fail to eliminate this pipelining effect or eliminate it by introducing overheads that are as difficult to measure as the performance of the broadcast itself. This problem becomes even more challenging in grid environments. Latencies along different links can vary significantly. Thus, an algorithm's performance is difficult to predict from it's communication pattern. Even when accurate prediction is possible, the pattern is often unknown. Our method introduces a measurable overhead to eliminate the pipelining effect, regardless of variations in link latencies.

关键词： Broadcasting Grid computing Libraries Timing Delay Laboratories scientific computing Design methodology computer science Application software

来源：评论

学校读者我要写书评

暂无评论

Modeling, Mesh Generation, and Adaptive Numerical Methods for Partial Differential Equations 1

引用

丛书名： The IMA Volumes in Mathematics and its Applications

1995年

作者： Ivo Babuska William D. Henshaw Joseph E. Oliger Joseph E. Flaherty John E. Hopcroft Tayfun Tezduyar

来源：评论

学校读者我要写书评

暂无评论

Evolving OpenMP in an Age of Extreme Parallelism 1

引用

丛书名： Lecture Notes in computer Science

1000年

作者： Matthias S. Müller Bronis R. Supinski Barbara M. Chapman

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：