Search Results - Inner Mongolia University Library

Annual ACM Symposium on Parallel Algorithms and Architectures, 1999, pp. 106-114

Authors: Liao, Cheng; Martonosi, Margaret; Clark, Douglas W. Princeton Univ., Princeton, NJ, United States

Fast commodity network-connected PC or workstation clusters are becoming more and more popular. This popularity can be attributed to their ability to provide high-performance parallel computing on a relatively inexpensive platform. An accurate global clock is invaluable for these systems, both for measuring network performance and coordinating distributed applications. Typically, however, these systems do not include dedicated clock synchronization support. Previous clock synchronization methods are not suitable here in general, either because of extra, non-commodity hardware requirements or insufficient synchronized clock accuracy. In this paper we present and evaluate an adaptive clock synchronization algorithm. We have implemented and tested the algorithm on our Myrinet-based PC cluster. It is regularly used as part of a parallel performance tool running on the cluster. The algorithm has several important features. First, it does not require any extra hardware support. Second, we show that this algorithm imposes very low overhead on the system and has microsecond-level accuracy. Finally, our results indicate that adding the ability to adaptively adjust the clock's re-synchronization period causes almost no extra overhead while achieving a much better global clock accuracy.
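The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a textbook round-trip offset estimate (symmetric network delay) and a simple halve-or-double rule for the re-synchronization period. The function names, thresholds, and period bounds are all hypothetical.

```python
def estimate_offset(t_send, t_remote, t_recv):
    """Estimate (remote clock - local clock) from one round trip.

    Assumption: request and reply take equally long, so the remote
    timestamp was taken at the midpoint of the local round trip.
    """
    midpoint = (t_send + t_recv) / 2.0
    return t_remote - midpoint


def next_period(period, observed_error, target_error,
                min_period=0.1, max_period=60.0):
    """Adapt the re-synchronization period (seconds) to the observed error.

    Hypothetical rule: halve the period when the error exceeds the target,
    double it when the error is below half the target, otherwise keep it.
    """
    if observed_error > target_error:
        period /= 2.0
    elif observed_error < target_error / 2.0:
        period *= 2.0
    # Clamp to the allowed range.
    return max(min_period, min(max_period, period))
```

With a rule of this shape, a node whose clock drifts slowly re-synchronizes rarely, which is one plausible way adaptivity could keep overhead low while bounding error.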

Keywords: parallel processing systems


A framework for simple sorting algorithms on parallel disk systems (extended abstract) 98


Proceedings of the Tenth Annual ACM Symposium on Parallel Algorithms and Architectures

Author: Sanguthevar Rajasekaran, Dept. of CISE, University of Florida, Gainesville, FL


Supporting the hypercube programming model on mesh architectures: (a fast sorter for iWarp tori) 92


Proceedings of the Fourth Annual ACM Symposium on Parallel Algorithms and Architectures

Author: Thomas M. Stricker


Tradeoffs between parallelism and fill in nested dissection


Annual ACM Symposium on Parallel Algorithms and Architectures, 1999, pp. 191-200

Authors: Bornstein, Claudson F.; Maggs, Bruce M.; Miller, Gary L. Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil

In this paper we demonstrate that parallelism and fill can be traded off in orders for Gaussian elimination. While the well-known nested dissection algorithm produces very parallel elimination orders, we show that by reducing the parallelism it is possible to reduce the fill that the orders generate. In particular, we present a new 'less parallel nested dissection' algorithm (LPND). We prove that, unlike standard nested dissection, when applied to a chordal graph LPND finds a zero-fill elimination order. Our implementation of LPND generates less fill than state-of-the-art implementations of the nested dissection (METIS), minimum-degree (AMD), and hybrid (BEND) algorithms on a large body of test matrices, at the cost of a small reduction in the parallelism in the orders that it produces. We have also implemented a nested dissection algorithm that is different from METIS and that uses the same separator algorithm used by our implementation of LPND. This algorithm, like LPND, generates less fill than METIS, and on large graphs generates significantly less fill than AMD. The latter comparison is notable, because although it is known that, for certain classes of graphs, minimum-degree produces asymptotically more fill than nested dissection, minimum-degree is believed to produce low-fill orderings in practice. Our experiments contradict this belief.
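The notion of fill used in this abstract can be made concrete with the standard elimination game on a graph: eliminating a vertex makes its remaining neighbors pairwise adjacent, and every edge added this way is one fill edge. The sketch below is not LPND or any of the implementations named above. The helper `count_fill` and the example path graph are illustrative assumptions. It only shows that the same chordal graph can have zero fill under one order and nonzero fill under another.

```python
from itertools import combinations


def count_fill(adjacency, order):
    """Play the elimination game and return the number of fill edges."""
    # Work on a copy so the caller's graph is left untouched.
    graph = {v: set(nbrs) for v, nbrs in adjacency.items()}
    fill = 0
    for v in order:
        neighbors = graph.pop(v)
        for u in neighbors:
            graph[u].discard(v)
        # Eliminating v makes its remaining neighbors pairwise adjacent.
        for a, b in combinations(sorted(neighbors), 2):
            if b not in graph[a]:
                graph[a].add(b)
                graph[b].add(a)
                fill += 1
    return fill


# The path 0-1-2-3-4 is a chordal graph.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}

leaf_first = [0, 1, 2, 3, 4]    # always eliminates a current leaf
middle_first = [2, 1, 3, 0, 4]  # eliminates interior vertices first

print(count_fill(path, leaf_first))    # 0
print(count_fill(path, middle_first))  # 3
```

The leaf-first order is a perfect elimination order for the path, which is why it produces no fill.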

Keywords: parallel processing systems


Session details: parallel algorithms 11


Proceedings of the Twenty-Third Annual ACM Symposium on Parallelism in Algorithms and Architectures

Author: Rajmohan Rajaraman, Northeastern University

No abstract available.

ISBN: (print) 9781450307437


Efficient optical communication in parallel computers 92


Proceedings of the Fourth Annual ACM Symposium on Parallel Algorithms and Architectures

Authors: Mihály Geréb-Graus, Thanasis Tsantilas


Session details: parallel algorithms 12


Proceedings of the Twenty-Fourth Annual ACM Symposium on Parallelism in Algorithms and Architectures

Author: Phil Gibbons, Intel Research

No abstract available.

ISBN: (print) 9781450312134


Polynomial root-finding: analysis and computational investigation of a parallel algorithm 92


Proceedings of the Fourth Annual ACM Symposium on Parallel Algorithms and Architectures

Authors: B. Narendran, Prasoon Tiwari


System-level specification framework for I/O architectures


Annual ACM Symposium on Parallel Algorithms and Architectures, 1999, pp. 138-147

Authors: Hill, Mark D.; Condon, Anne E.; Plakal, Manoj; Sorin, Daniel J. Univ. of Wisconsin-Madison, Madison, United States

A computer system is useless unless it can interact with the outside world through input/output (I/O) devices. I/O systems are complex, including aspects such as memory-mapped operations, interrupts, and bus bridges. Often, I/O behavior is described for isolated devices without a formal description of how the complete I/O system behaves. The lack of an end-to-end system description makes the tasks of system programmers and hardware implementors more difficult to do correctly. This paper proposes a framework for formally describing I/O architectures called Wisconsin I/O (WIO). WIO extends work on memory consistency models (that formally specify the behavior of normal memory) to handle considerations such as memory-mapped operations, device operations, interrupts, and operations with side effects. Specifically, WIO asks each processor or device that can issue k operation types to specify ordering requirements in a k×k table. A system obeys WIO if there always exists a total order of all operations that respects processor and device ordering requirements and has the value of each 'read' equal to the value of the most recent 'write' to that address. This paper then presents examples of WIO specifications for systems with various memory consistency models including sequential consistency (SC), SPARC TSO, an approximation of Intel IA-32, and Compaq Alpha. Finally, we present a directory-based implementation of an SC system, and we sketch a proof which shows that the implementation conforms to its WIO specification.
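The k×k table and total-order condition described in the abstract can be illustrated with a small brute-force checker. This is a sketch under stated assumptions, not the paper's formalism. It assumes a table entry `(t1, t2)` set to `True` means that an earlier operation of type `t1` must stay before a later operation of type `t2` from the same issuer. It uses only two operation types, `"read"` and `"write"`, and assumes every address initially holds 0. The `Op` class and all function names are hypothetical.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass(frozen=True)
class Op:
    issuer: str  # processor or device that issued the operation
    seq: int     # position in that issuer's program order
    kind: str    # operation type: "read" or "write"
    addr: str
    value: int   # value written, or value the read returned


def respects_tables(total_order, tables):
    """Check the per-issuer ordering requirements against a total order."""
    position = {op: i for i, op in enumerate(total_order)}
    for first in total_order:
        for second in total_order:
            if first.issuer != second.issuer or first.seq >= second.seq:
                continue
            required = tables[first.issuer].get((first.kind, second.kind), False)
            if required and position[first] > position[second]:
                return False
    return True


def reads_see_latest_write(total_order, initial=0):
    """Each read must return the most recent write to its address."""
    memory = {}
    for op in total_order:
        if op.kind == "write":
            memory[op.addr] = op.value
        elif op.value != memory.get(op.addr, initial):
            return False
    return True


def witness_exists(ops, tables):
    """Brute force: is there any total order satisfying both conditions?"""
    return any(
        respects_tables(order, tables) and reads_see_latest_write(order)
        for order in permutations(ops)
    )


# Store-buffering outcome: both processors write, then read 0 from the other.
ops = [
    Op("P0", 0, "write", "x", 1), Op("P0", 1, "read", "y", 0),
    Op("P1", 0, "write", "y", 1), Op("P1", 1, "read", "x", 0),
]
kinds = ("read", "write")
strict = {(a, b): True for a in kinds for b in kinds}  # SC-like table
relaxed = dict(strict)
relaxed[("write", "read")] = False  # reads may pass earlier writes

print(witness_exists(ops, {"P0": strict, "P1": strict}))    # False
print(witness_exists(ops, {"P0": relaxed, "P1": relaxed}))  # True
```

The brute-force search is exponential in the number of operations, so it is only useful for tiny litmus tests like this one.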

Keywords: parallel processing systems


Session details: parallel algorithms and data structures 12


Proceedings of the Twenty-Fourth Annual ACM Symposium on Parallelism in Algorithms and Architectures

Author: Yossi Lev, Oracle Labs

No abstract available.

ISBN: (print) 9781450312134
