检索结果-内蒙古大学图书馆

Proceedings of the twenty-seventh annual acm symposium on Theory of computing

作者： Ran Raz Weizmann Institute

来源：评论

学校读者我要写书评

暂无评论

Deriving optimal checkpoint protocols for distributed shared memory architectures 95

Deriving optimal checkpoint protocols for distributed shared...

引用

Proceedings of the fourteenth annual acm symposium on Principles of distributed computing

作者： Lorenzo Alvisi Keith Marzullo Cornell University Department of Computer Science Ithaca NY University of California San Diego Department of Computer Science and Engineering La Jolla CA

No abstract available.

ISBN: (纸本)9780897917100

No abstract available.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Optimal (up to polylog factors) sequential and parallel algorithms for approximating complex polynomial zeros 95

Optimal (up to polylog factors) sequential and parallel algo...

引用

Proceedings of the twenty-seventh annual acm symposium on Theory of computing

作者： Victor Y. Pan Math. & Computer Science Dept. Lehman College CUNY Bronx NY

来源：评论

学校读者我要写书评

暂无评论

Work efficient parallel solution of Toeplitz systems and polynomial GCD 95

Work efficient parallel solution of Toeplitz systems and pol...

引用

Proceedings of the twenty-seventh annual acm symposium on Theory of computing

作者： John H. Reif Department of Computer Science Duke University

来源：评论

学校读者我要写书评

暂无评论

Partitioned register file for TTAs

Partitioned register file for TTAs

引用

IEEE/acm International symposium on Microarchitecture (MICRO)

作者： J. Janssen H. Corporaal Department of Electrical Engineering Delft University of Technnology Delft Netherlands

A practical implementation of high performance instruction level parallel architectures is constrained by the difficulty to build a large monolithic multi-ported register file (RF). A solution is to partition the RF into smaller RFs while keeping the total number of registers and ports equal. This paper applies RF partitioning to transport triggered architectures (TTAs); these architectures are of the VLIW type. One may expect that partitioning increases the number of executed cycles because it constrains the number of ports per RF. It is shown that these performance losses are small; e.g. partitioning an RF with 24 registers and four read and four write ports into four RFs with 6 registers and one read and one write port gives a performance loss of only 5.8%. Partitioned RFs consume less area than monolithic RFs with the same number of ports and registers. Experiments show that, if the area saved by partitioning is spent on extra registers, partitioning does, on average, not reduce the performance; it may even result in a small performance gain.

关键词： Radio frequency Registers Performance loss VLIW Vector processors parallel architectures Performance gain Delay Energy consumption Hardware

来源：评论

学校读者我要写书评

暂无评论

Efficient parallel computations for singular band matrices 95

Efficient parallel computations for singular band matrices

引用

Proceedings of the sixth annual acm-SIAM symposium on Discrete algorithms

作者： Wayne Eberly Department of Computer Science University of Calgary T2N 1N4

来源：评论

学校读者我要写书评

暂无评论

parallel threads: parallel computation labs for CS 3 and CS 4

SIGCSE Bulletin (Association for Computing Machinery, Specia...

引用

SIGCSE Bulletin (Association for Computing Machinery, Special Interest Group on Computer Science Education) 1995年第1期27卷 141-141页

作者： Harlan, Robert M. Akulis, Joseph G. St. Bonaventure University United States

One objective in establishing our NSF ILI funded parallel computation laboratory was to use closed, formal laboratory assignments to introduce parallelism throughout the core computer science curriculum. We discuss laboratory assignments developed for the Computer Organization 1995 and algorithms (CS 4) courses. The CS 3 lab introduces parallelism based upon processor replication and two-performance indices for evaluating performance of parallel algorithms, speedup and efficiency. One factor that effects performance on MIMD message passage architectures, the ratio of computation to communication, is also introduced. The CS 4 lab guides students in developing a parallel version of Dijkstra's single source shortest path algorithm. A case study using parallel addition assists students in identifying potential parallelism by examining the data dependency of computations. Students working in teams of two develop a pseudo-code version of the single source shortest path algorithm for an abstract parallel machine. They also analyze the speedup and efficiency of an implementation of the algorithm for one, four and eight processors. © 1995, acm. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Work-time-optimal parallel algorithms for string problems 95

Work-time-optimal parallel algorithms for string problems

引用

Proceedings of the twenty-seventh annual acm symposium on Theory of computing

作者： Artur Czumaj Zvi Galil Leszek Gąsieniec Kunsoo Park Wojciech Plandowski Heinz Nixdorf Institute University of Paderborn D-33095 Paderborn Germany Department of Computer Science Columbia University and Tel Aviv University Warsaw University Department of Computer Engineering Seoul National University Seoul 151-742 Korea Instytut Informatyki Uniwersytet Warszawski 02-097 Warszawa Poland

来源：评论

学校读者我要写书评

暂无评论

Petri net versus module scheduling for software pipelining

Petri net versus module scheduling for software pipelining

引用

IEEE/acm International symposium on Microarchitecture (MICRO)

作者： V.H. Allan U.R. Shah K.M. Reddy Department of Computer Science Utah State University Logan UT USA

Software pipelining is a technique that reforms the loop to improve execution time. Iterations are executed in overlapped fashion to increase parallelism. Modulo scheduling places each operation so that the schedule is legal when replicated and offset by a target initiation interval. This process is repeated with larger initiation intervals until success is achieved. Kernel recognition methods schedule operations as rapidly as possible until a pattern is recognized. These two distinctly different methods have various strengths and weaknesses. This paper explores the benefits and draw-backs of each.

关键词： Pipeline processing Kernel Processor scheduling Law Legal factors Pattern recognition parallel processing Computer science Computer architecture Software algorithms

来源：评论

学校读者我要写书评

暂无评论

AN OPTIMAL parallel ALGORITHM FOR DETECTING WEAK VISIBILITY OF A SIMPLE POLYGON

引用

INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS 1995年第1-2期5卷 93-124页

作者： CHEN, DZ UNIV NOTRE DAME DEPT COMP SCI & ENGNNOTRE DAMEIN 46556

The problem of detecting the weak visibility of an n-vertex simple polygon P is that of finding whether P is weakly visible from one of its edges and (if it is) identifying every edge from which P is weakly visible. In this paper, we present an optimal parallel algorithm for solving this problem. Our algorithm runs in O (log n) time using O (n/log n) processors in the CREW PRAM computational model, and is very different from the sequential algorithms for this problem. Based on this algorithm, several other problems related to weak visibility can be optimally solved in parallel.

关键词： COMPUTATIONAL GEOMETRY CREW PRAM parallel algorithms SIMPLE POLYGONS WEAK VISIBILITY

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：