Search Results - Inner Mongolia University Library

International Symposium on Parallel and Distributed Processing (IPDPS)

Authors: P.B. Gibbons, E. Korach (AT&T Bell Laboratories Inc., Murray Hill, NJ, USA; Technion-Israel Institute of Technology, Haifa, Israel)

The authors explore the complexity of deciding whether an execution of a shared-memory multiprocessor is sequentially consistent. They present the first results showing the NP-completeness of this problem, even for short programs or small machines. They also explore possible augmentations to the memory system; a fast decision algorithm is presented for such an augmented shared memory. The results obtained demonstrate the difficulty of detecting when an execution of a memory system fails to be sequentially consistent, and of supporting all possible sequentially consistent executions in hardware.

Keywords: Read-write memory; NP-complete problem; Programming profession; Detection algorithms; System performance; Hardware; Databases; History
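
To make the decision problem in this abstract concrete, here is a minimal brute-force sketch in Python. It is not the authors' algorithm. It assumes a simplified model in which each processor's history is a list of `("w", address, value)` or `("r", address, observed_value)` operations and every address initially holds 0. The function name `is_sequentially_consistent` and the tuple encoding are illustrative assumptions. The search tries every interleaving that preserves program order, so its worst-case running time is exponential, which is in keeping with the NP-completeness result the abstract reports.

```python
from functools import lru_cache


def is_sequentially_consistent(histories, initial=0):
    """Return True if some interleaving of the per-processor histories,
    preserving each processor's program order, makes every read return
    the most recently written value at its address."""
    histories = tuple(tuple(h) for h in histories)

    @lru_cache(maxsize=None)
    def search(positions, memory):
        # positions[i] counts the operations of processor i already scheduled;
        # memory is a sorted tuple of (address, value) pairs.
        if all(p == len(h) for p, h in zip(positions, histories)):
            return True
        current = dict(memory)
        for i, history in enumerate(histories):
            p = positions[i]
            if p == len(history):
                continue
            kind, addr, value = history[p]
            if kind == "r":
                if current.get(addr, initial) != value:
                    continue  # this read cannot be the next operation
                next_memory = memory
            else:
                updated = dict(current)
                updated[addr] = value
                next_memory = tuple(sorted(updated.items()))
            next_positions = positions[:i] + (p + 1,) + positions[i + 1:]
            if search(next_positions, next_memory):
                return True
        return False

    return search((0,) * len(histories), ())
```

For example, the two-processor execution in which processor 0 writes `x = 1` and then reads `y = 0`, while processor 1 writes `y = 1` and then reads `x = 0`, has no valid interleaving, so the function returns `False` for it.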

Architectures and building blocks for CMOS VLSI analog 'neural' programmable optimizers

IEEE International Symposium on Circuits and Systems (ISCAS)

Authors: R. Dominguez-Castro, A. Rodriguez-Vazquez, J.L. Huertas, E. Sanchez-Sinencio (Department of Design of Analog Circuits, Centro Nacional de Microelectrónica, Seville, Spain)

A modular reconfigurable serial architecture is presented for the analog/digital implementation of constrained optimization algorithms with digital programmability of the problem weights. Area overhead due to programmability is reduced by using a time-multiplexing methodology. This allows all the weights of each multiple-input processing unit to be digitally controlled using just one weighted component array. The proposed architecture is very well suited for MOS VLSI realization using switched-capacitor (SC) techniques. SC schematics for the different building blocks are presented and demonstrated via empirical results.

Keywords: Very large scale integration; Cost function; Lagrangian functions; Analog circuits; Neurofeedback; Satellites; Navigation; Vehicles; Robot programming; Transconductance

A divide and conquer approach to shortest paths in planar layered digraphs

International Symposium on Parallel and Distributed Processing (IPDPS)

Authors: S. Sairam, R. Tamassia, J.S. Vitter (Department of Computer Science, Brown University, Providence, RI, USA)

The authors give efficient parallel algorithms to compute shortest paths in planar layered digraphs. They show that these digraphs admit special kinds of separators, called one-way separators, which allow paths in the graph to cross them only once. They use these separators to give divide-and-conquer solutions to the problem of finding the shortest paths. They first give a simple algorithm that works on the CREW (concurrent-read exclusive-write) PRAM (parallel random-access machine) model and computes the shortest path between any two vertices of an n-node planar layered digraph in time O(log^3 n) using n/log n processors. A CRCW (concurrent-read concurrent-write) version of this algorithm runs in O(log^2 n log log n) time and uses O(n/log log n) processors. The authors then improve the time bound to O(log^2 n) on the CREW model and O(log n log log n) on the CRCW model. The processor bounds remain n/log n for the CREW model and n/log log n for the CRCW model.

Keywords: Computer science; Parallel algorithms; Concurrent computing; Particle separators; Shortest path problem; Sparse matrices; Transmission line matrix methods; Phase change random access memory; Application software; Dynamic programming

OREGAMI - Tools for Mapping Parallel Computations to Parallel Architectures

International Journal of Parallel Programming, 1991, vol. 20, no. 3, pp. 237-270

Authors: Lo, V.M.; Rajopadhye, S.; Gupta, S.; Keldsen, D.; Mohamed, M.A.; Nitzberg, B.; Telle, J.A.; Zhong, X.X. (Univ. Oregon, Dept. Comp. & Informat. Sci., Eugene, OR 97403)

The OREGAMI project involves the design, implementation, and testing of algorithms for mapping parallel computations to message-passing parallel architectures. OREGAMI addresses the mapping problem by exploiting regularity and by allowing the user to guide and evaluate mapping decisions made by OREGAMI's efficient combinatorial mapping algorithms. OREGAMI's approach to mapping is based on a new graph theoretic model of parallel computation called the Temporal Communication Graph. The OREGAMI software tools include three components: (1) LaRCS is a graph description language which allows the user to describe regularity in the communication topology as well as the temporal communication behavior (the pattern of message-passing over time). (2) MAPPER is our library of mapping algorithms which utilize information provided by LaRCS to perform contraction, embedding, and routing. (3) METRICS is an interactive graphics tool for display and analysis of mappings. This paper gives an overview of the OREGAMI project, the software tools, and OREGAMI's mapping algorithms.

Keywords: Mapping; routing; embedding; task assignment; regular parallel computations; parallel programming environments

Connected components with split and merge

5th International Parallel Processing Symposium, IPPS 1991

Authors: Kistler, James J.; Webb, Jon A. (School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, United States)

ISBN: (print) 0818691670

The split and merge model is a reasonable method for architecture-independent programming of global image processing operations on parallel architectures. We consider image connected components from the point of view of this programming model, and develop split and merge algorithms that implement various connected components algorithms that have appeared in the literature. The algorithms are implemented in two architecture-independent languages we have developed, namely Apply and Adapt. Performance of the algorithms on the Sun, the Carnegie Mellon Warp, and the Carnegie Mellon Nectar architectures is compared. © 1991 IEEE.

Keywords: Parallel architectures
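
The abstract does not spell out the split and merge algorithms, and the Apply and Adapt languages are not reproduced here. The Python sketch below shows only one plausible reading of the idea, under stated assumptions: a binary image is split into horizontal bands, 4-connected foreground pixels are joined independently inside each band, and a merge step then joins components that touch across each seam. All function names (`split_phase`, `merge_phase`, `count_components`) and the union-find representation are illustrative choices, not taken from the paper.

```python
def find(parent, x):
    """Find the representative of x, halving paths along the way."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def union(parent, a, b):
    """Merge the sets containing a and b."""
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[rb] = ra


def split_phase(image, r0, r1, parent):
    """Join 4-neighbours that both lie inside rows r0..r1-1 (one band)."""
    width = len(image[0])
    for r in range(r0, r1):
        for c in range(width):
            if not image[r][c]:
                continue
            if c + 1 < width and image[r][c + 1]:
                union(parent, r * width + c, r * width + c + 1)
            if r + 1 < r1 and image[r + 1][c]:
                union(parent, r * width + c, (r + 1) * width + c)


def merge_phase(image, seam, parent):
    """Join components that touch across the seam between rows seam-1 and seam."""
    width = len(image[0])
    for c in range(width):
        if image[seam - 1][c] and image[seam][c]:
            union(parent, (seam - 1) * width + c, seam * width + c)


def count_components(image, bands=2):
    """Count 4-connected foreground components via band-wise split and merge."""
    height, width = len(image), len(image[0])
    parent = list(range(height * width))
    bounds = [height * i // bands for i in range(bands + 1)]
    for r0, r1 in zip(bounds, bounds[1:]):
        split_phase(image, r0, r1, parent)  # bands do not depend on each other
    for seam in bounds[1:-1]:
        if 0 < seam < height:
            merge_phase(image, seam, parent)
    roots = {find(parent, r * width + c)
             for r in range(height) for c in range(width) if image[r][c]}
    return len(roots)
```

Because the split phase never looks outside its own band, the bands could in principle be processed on separate processors, with only the seam rows exchanged during the merge. This sketch runs them sequentially.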

Multidimensional Binary Partitions - Distributed Data-Structures for Spatial Partitioning

International Journal of Control, 1991, vol. 54, no. 6, pp. 1335-1352

Authors: Cybenko, G.; Allen, T.G. (Alphatech Inc., Burlington, MA 01803)

A multidimensional binary partition (MBP) is a data structure determined by a set of points in n-dimensional space. On certain parallel architectures, this data structure can be easily distributed across the processing nodes of the machine and can provide a natural technique for load balancing and partitioning of application problems that depend on a distribution of dynamically changing points in multidimensional space. This paper describes parallel algorithms for generating and using MBPs on a hypercube parallel machine. It is also shown how these distributed data structures allow efficient parallel searches of the data set. The performance of an implementation of these algorithms on an NCUBE hypercube is presented.

Keywords: Computer programming

From algorithms to parallel architectures: A formal approach

5th International Parallel Processing Symposium, IPPS 1991

Authors: Elleithy, Khaled M.; Bayoumi, Magdy A. (Computer Engineering Department, King Fahd University, Dhahran 31261, Saudi Arabia; Center for Advanced Computer Studies, University of Southwestern Louisiana, Lafayette, LA 70504, United States)

ISBN: (print) 0818691670

In this paper, we introduce a formal approach for synthesis of parallel architectures. Four different forms are used to express the given algorithms: simultaneous recursion, recursion with respect to different variables, fixed nesting and variable nesting. Four different architectures for the same algorithm are obtained. As an example, a matrix-matrix multiplication algorithm is used to obtain four different optimal architectures. The different architectures of this example are compared in terms of area, time, broadcasting and required hardware. The approach provides two main features: completeness and correctness. © 1991 IEEE.

Keywords: Parallel architectures

Object-oriented Fortran for development of portable parallel programs

Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing

Authors: Reese, D.S.; Luke, E. (Mississippi State University, NSF ERC for Computational Field Simulation, United States)

ISBN: (print) 0818623101

Parallel programming has to date remained inaccessible to the average scientific programmer. Parallel programming languages are generally foreign to most scientific applications programmers, who only speak Fortran. Automatic parallelization techniques have so far proved unsuccessful in extracting large amounts of parallelism from sequential codes and do not encourage development of new, inherently parallel algorithms. In addition, there is a lack of consistency of programmer interface across architectures, which requires programmers to invest a lot of effort in porting code from one parallel machine to another. This paper discusses the object-oriented Fortran language and support routines developed at Mississippi State in support of parallelizing complex field simulations. This interface is based on Fortran to ease its acceptance by scientific programmers and is implemented on top of the Unix operating system for portability. © 1991 IEEE.

Keywords: Parallel programming

Parallel construction of trees with optimal weighted path length

3rd Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA 1991

Authors: Larmore, Lawrence L.; Przytycka, Teresa M. (Department of Computer Science, University of California, Riverside, CA 92521, United States; Instytut Informatyki, Uniwersytet Warszawski, Poland)

ISBN: (print) 0897914384

This paper deals with the problem of parallel construction of trees with optimal weighted path length. We study both the unordered case, known as the Huffman coding problem, and the ordered case, known as the optimal alphabetic binary tree problem. The methods used in the two cases are different. We reduce the Huffman coding problem to the Concave Least Weight Subsequence problem and give a parallel algorithm that solves the latter problem in O(√n log n) time with n processors on a CREW PRAM. This leads to the first sublinear-time, o(n^2)-total-work parallel algorithm for the Huffman coding problem. The alphabetic binary tree problem is a special case of the Optimum Binary Search Tree problem and can be solved in O(log^2 n) time with n^4 processors using the dynamic programming technique. We show that an optimal height-restricted alphabetic tree can be constructed in O(L log n) time on a CREW PRAM using only linearly many processors, where L is an upper bound on the height of the tree. © 1991 ACM.

Keywords: Parallel algorithms
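
For orientation, the quantity this abstract optimizes in the unordered case can be computed with the classical sequential greedy Huffman procedure. The Python sketch below is that textbook baseline, not the parallel algorithm of the paper. The function name `huffman_cost` is an illustrative assumption. It repeatedly merges the two lightest subtrees and sums the merge weights, which equals the weighted path length of the resulting tree.

```python
import heapq


def huffman_cost(weights):
    """Return the minimum weighted path length of a binary tree whose
    leaves carry the given weights (sequential greedy Huffman)."""
    if len(weights) < 2:
        return 0  # a single leaf sits at depth 0
    heap = list(weights)
    heapq.heapify(heap)
    total = 0
    while len(heap) > 1:
        a = heapq.heappop(heap)
        b = heapq.heappop(heap)
        total += a + b  # every leaf under this merge gains one level of depth
        heapq.heappush(heap, a + b)
    return total
```

With a binary heap this baseline takes O(n log n) sequential time, which is the kind of cost the sublinear-time parallel result is measured against.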

Large-scale sorting in parallel memories

Third Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA'91

Authors: Nodine, M.H.; Vitter, J.S. (Dept. of Computer Science, Brown University, Providence, R.I.)

ISBN: (print) 0897914384

We present several algorithms for sorting efficiently with parallel two-level and multilevel memories. Our main result is an elegant, easy-to-implement, optimal, deterministic algorithm for external sorting with P disk drives. This result answers the open problem posed by Vitter and Shriver. Our measure of performance is the number of parallel input/output (I/O) operations, in which each of the P disks can simultaneously transfer a block of B contiguous records. Our optimal algorithm is deterministic, and thus it improves upon the optimal randomized algorithm of [ViS] as well as the well-known deterministic but nonoptimal technique of disk striping. The second part of the paper broadens our coverage from two-level memories to more general multilevel memories. In particular we consider the blocked uniform memory hierarchy (UMH) introduced by Alpern, Carter, and Feig, and its parallelization P-UMH, along with new variants. We give optimal and nearly-optimal algorithms for a wide range of bandwidth degradations, including a parsimonious algorithm for constant bandwidth. We also develop optimal sorting algorithms for all bandwidths for other versions of UMH and P-UMH, including natural restrictions we introduce called RUMH and P-RUMH, which more closely correspond to current programming languages. © 1991 ACM.

Keywords: Bandwidth
