Search results - Inner Mongolia University Library

13th Annual Symposium on Parallel Algorithms and Architectures (SPAA 2001)

Authors: Lorenz, U. Dept. of Mathematics and Comp. Sci. University of Paderborn Paderborn Germany

ISBN: (print) 9781581134094

Tree search algorithms play an important role in many applications in the field of artificial intelligence. When playing board games such as chess, computers use game tree search algorithms to evaluate a position. In this paper, we present a procedure that we call parallel Controlled Conspiracy Number Search (parallel CCNS). We briefly describe the principles of the sequential CCNS algorithm, which bases its approximation results on irregular subtrees of the entire game tree. We have parallelized CCNS and implemented it in our chess program ***, which is now the first in the world that could win a high-ranked Grandmaster chess tournament. We add experiments that show a speedup of about 50 on 159 processors running on an SCI workstation cluster.

Keywords: Game theory

Asynchronous shared memory search structures

Theory of Computing Systems, 1998, vol. 31, no. 4, pp. 377-401

Authors: Adler, M Univ Calif Berkeley Div Comp Sci Berkeley CA 94720 USA Int Comp Sci Inst Berkeley CA 94704 USA

We study the problem of storing an ordered set on an asynchronous shared memory parallel computer. We examine the case where we want to perform successor (least upper bound) queries efficiently on the set members that are stored. We also examine the case where processors insert and delete members of the set. Due to asynchrony, we require processors to perform queries and to maintain the structure independently. Although several such structures have been proposed, the analysis of these structures has been very limited. We here use the recently proposed QRQW PRAM model to provide upper and lower bounds on the performance of such data structures. In the asynchronous QRQW PRAM, the problem of processors concurrently and independently searching a shared data structure is very similar to the problem of routing packets through a network. Using this as a guide, we introduce the Search-Butterfly, a search structure that combines the efficient packet routing properties of the butterfly graph with the efficient search structure properties of the B-Tree. We analyze the behavior of the Search-Butterfly when the following operations are performed: arbitrary searches; random searches; and random searches, insertions, and deletions. We also provide lower bounds that show that the results are within a factor of O(log n) of optimal, where n is the number of keys in the structure. When the searches are random, the results are within a constant factor of optimal. Many of the proofs are derived from closely related results for packet routing. Others are of independent interest, most notably a method of adding queues to any network belonging to a large class of queuing networks with non-Markovian routing in a manner that allows us to bound the delay experienced by packets in the augmented network.

Keywords: Computer storage devices; Parallel computers; Asynchronous circuits
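
Annotation: the successor (least upper bound) query named in the abstract can be illustrated on a plain sorted list. This is only a sequential sketch of what the query returns, assuming integer keys; it is not the Search-Butterfly structure, and the function name `successor` is illustrative.

```python
from bisect import bisect_left
from typing import Optional, Sequence


def successor(sorted_keys: Sequence[int], query: int) -> Optional[int]:
    """Return the smallest stored key >= query, or None if there is none.

    Assumes sorted_keys is already in ascending order.
    """
    i = bisect_left(sorted_keys, query)  # first position whose key is >= query
    return sorted_keys[i] if i < len(sorted_keys) else None
```

For example, with the keys 2, 5 and 9, a query for 6 returns 9, and a query for 10 returns `None` because no stored key is large enough.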

On the parallel implementation of Goldberg's maximum flow algorithm

4th Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '92

Authors: Anderson, Richard J. Setubal, Joao C. Univ of Washington Seattle WA United States

ISBN: (print) 089791483X

We describe an efficient parallel implementation of Goldberg's maximum flow algorithm for a shared-memory multiprocessor. Our main technical innovation is a method that allows a 'global relabeling' heuristic to be executed concurrently with the main algorithm. This heuristic is essential for good performance in practice. We present performance results from a Sequent Symmetry for a variety of input distributions. We achieve speed-ups of up to 8.8 with 16 processors, relative to the parallel program with 1 processor (5.8 when compared to our best sequential program). We consider these speed-ups very good and we provide evidence that hardware effects and insufficient parallelism in certain inputs are the main obstacles to achieving better performance.

Keywords: Computer operating systems
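
Annotation: the speed-up figures quoted in the abstract can be read as parallel efficiency, that is, speed-up divided by processor count. A minimal sketch of that arithmetic follows; the function name `parallel_efficiency` is illustrative and does not come from the paper.

```python
def parallel_efficiency(speedup: float, processors: int) -> float:
    """Return speed-up per processor (1.0 would be perfect linear scaling)."""
    if processors <= 0:
        raise ValueError("processors must be positive")
    return speedup / processors


# Figures quoted in the abstract: 16 processors, speed-up 8.8 relative to the
# 1-processor parallel program and 5.8 relative to the best sequential program.
relative_to_parallel = parallel_efficiency(8.8, 16)
relative_to_sequential = parallel_efficiency(5.8, 16)
```

Under this definition the two quoted speed-ups correspond to efficiencies of roughly 0.55 and 0.36.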

Parallel graph decompositions using random shifts

25th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2013

Authors: Miller, Gary L. Peng, Richard Xu, Shen Chen CMU United States

ISBN: (print) 9781450315722

We show an improved parallel algorithm for decomposing an undirected unweighted graph into small diameter pieces with a small fraction of the edges in between. These decompositions form critical subroutines in a number of graph algorithms. Our algorithm builds upon the shifted shortest path approach introduced in [Blelloch, Gupta, Koutis, Miller, Peng, Tangwongsan. SPAA 2011]. By combining various stages of the previous algorithm, we obtain a significantly simpler algorithm with the same asymptotic guarantees as the best sequential algorithm.

Keywords: Parallel algorithms
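
Annotation: a sequential sketch of the random-shift idea, under assumptions not stated in the abstract. It assumes each vertex draws an independent exponential shift with rate `beta`, and that every vertex joins the center minimizing (distance to center minus that center's shift). The function name, the integer vertex labels and the adjacency-dict input are illustrative. The paper's contribution is a parallel algorithm, which this sketch does not reproduce.

```python
import heapq
import random
from typing import Dict, List


def shifted_decomposition(adj: Dict[int, List[int]], beta: float,
                          seed: int = 0) -> Dict[int, int]:
    """Map each vertex to the center of its piece.

    adj must be a symmetric adjacency dict of an unweighted graph in which
    every neighbour also appears as a key. beta must be positive.
    """
    if not adj:
        return {}
    rng = random.Random(seed)
    shift = {v: rng.expovariate(beta) for v in adj}  # assumed Exp(beta) shifts
    max_shift = max(shift.values())
    # Vertex v starts growing its own piece at time max_shift - shift[v],
    # so larger shifts start earlier. Heap entries: (time, center, vertex).
    heap = [(max_shift - shift[v], v, v) for v in adj]
    heapq.heapify(heap)
    center_of: Dict[int, int] = {}
    while heap:
        t, center, v = heapq.heappop(heap)
        if v in center_of:
            continue  # already claimed by an earlier arrival
        center_of[v] = center
        for w in adj[v]:
            if w not in center_of:
                heapq.heappush(heap, (t + 1.0, center, w))  # unit edge length
    return center_of
```

Under these assumptions, larger values of `beta` produce smaller shifts and therefore tend to give more, smaller pieces. Each piece is connected because a vertex is only claimed through a neighbour that already belongs to the same piece.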

Computational bounds for fundamental problems on general-purpose parallel models

Proceedings of the 1998 10th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA

Authors: MacKenzie, P. D. Ramachandran, V. Boise State Univ Boise ID United States

ISBN: (print) 9780897919890

We present lower bounds for the time needed to solve basic problems on three general-purpose models of parallel computation: the shared-memory models QSM and s-QSM, and the distributed-memory model, the BSP. For each of these models, we also obtain lower bounds for the number of rounds needed to solve these problems using a randomized algorithm on a p-processor machine. Our results on 'rounds' are of special interest in the context of designing work-efficient algorithms on a machine where latency and synchronization costs are high. Many of our lower bound results are complemented by upper bounds that match the lower bound or are close to it.

Keywords: Parallel processing systems

Parallel algorithms for gray-scale image component labeling on a mesh-connected computer

4th Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '92

Authors: Hambrusch, Susanne He, Xin Miller, Russ Purdue Univ West Lafayette IN United States

ISBN: (print) 089791483X

We present two asymptotically optimal Θ(n) time algorithms for labeling the connected components of a gray-scale image on a mesh-connected computer. We assume that the input is an n × n gray-scale image mapped one pixel per processor onto an n × n mesh-connected computer. Our algorithms label the components so that every component is connected, the maximum difference in the gray-scale values of the pixels within any component does not exceed a given value, and no component can be merged with a neighboring component. The first algorithm is based on a divide-and-conquer approach. Although it is simple, this algorithm has the potential drawback of possibly assigning two adjacent pixels with the same gray-scale value to different components. The second algorithm avoids this potential drawback, and exploits the ability of a mesh-connected computer to efficiently determine a maximal independent set of a planar graph.

Keywords: Computer vision

Explicit multi-threading (XMT) bridging models for instruction parallelism (extended abstract)

Proceedings of the 1998 10th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA

Authors: Vishkin, U. Dascal, Sh. Berkovich, E. Nuzman, J. Univ of Maryland and Tel-Aviv Univ

ISBN: (print) 9780897919890

This paper envisions an extension to a standard instruction set which efficiently implements PRAM-style algorithms using explicit multi-threaded instruction-level parallelism (ILP); that is, Explicit Multi-threading (XMT), a fine-grained computational paradigm covering the spectrum from algorithms through architecture to implementation, is introduced; new elements are added where needed.

Keywords: Parallel algorithms

A work-optimal CGM algorithm for the LIS problem

13th Annual Symposium on Parallel Algorithms and Architectures (SPAA 2001)

Authors: Thierry, G. Jean-Frédéric, M. David, S. Lab. de Recherche en Info. d'Amiens Univ. de Picardie Jules Verne CURI 5 rue du Moulin Neuf 80000 Amiens France

This paper presents a work-optimal CGM algorithm that solves the Longest Increasing Subsequence Problem. It can be implemented in the CGM with P processors in O(N^2/P) time and O(P) communication steps. It is the first...

Keywords: Parallel processing systems
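
Annotation: for orientation, the Longest Increasing Subsequence problem can be solved sequentially by a quadratic dynamic program, whose O(N^2) total work matches the O(N^2/P) time on P processors quoted in the abstract. The sketch below is that sequential baseline only, assuming a strictly increasing subsequence is wanted; it is not the CGM algorithm, and the function name `lis_length` is illustrative.

```python
from typing import List, Sequence


def lis_length(seq: Sequence[int]) -> int:
    """Length of the longest strictly increasing subsequence, in O(N^2) time."""
    # best[i] = length of the longest increasing subsequence ending at seq[i]
    best: List[int] = [1] * len(seq)
    for i in range(len(seq)):
        for j in range(i):
            if seq[j] < seq[i] and best[j] + 1 > best[i]:
                best[i] = best[j] + 1
    return max(best, default=0)
```

For the sequence 3, 1, 4, 1, 5, 9, 2, 6 one longest increasing subsequence is 1, 4, 5, 9, so the function returns 4.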

Dynamic estimation of Task Level Parallelism with operating system support

8th International Symposium on Parallel Architectures, Algorithms and Networks

Authors: Hung, LD Sakai, S Univ Tokyo Grad Sch Informat Sci & Technol Tokyo Japan

ISBN: (print) 0769525091

The amount of Task Level Parallelism (TLP) in a runtime workload is useful information for determining the efficient usage of multiprocessors. This paper presents mechanisms to dynamically estimate the amount of TLP in runtime workloads. Modifications are added to the operating system (OS) to collect information about processor utilization and task activities, from which TLP can be calculated. By effectively utilizing the Time Stamp Counter (TSC) hardware, the task activities can be monitored at fine time resolution, making it possible to estimate TLP at fine granularity. We implement the mechanisms on a recent version of Linux OS. Evaluation results indicate that the mechanisms can estimate TLP accurately for various kinds of workloads with small overheads.

Keywords: Parallel processing systems

Subset barrier synchronization on a private-memory parallel system

4th Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '92

Authors: Feldmann, Anja Gross, Thomas O'Hallaron, David Stricker, Thomas M. Carnegie Mellon Univ Pittsburgh PA United States

ISBN: (print) 089791483X

A global barrier synchronizes all processors in a parallel system. This paper investigates algorithms that allow disjoint subsets of processors to synchronize independently and in parallel. The user model of a subset barrier is straightforward; a processor that participates in a subset barrier needs to know only the name of the barrier and the number of participating processors. This paper identifies two general communication models for private-memory parallel systems, the bounded buffer broadcast model and the anonymous destination message passing model, and presents algorithms for barrier synchronization in terms of these models. The models are detailed enough to allow meaningful cost estimates for their primitives, yet are independent of a specific architecture and can be supported efficiently by a modern private-memory parallel system. The anonymous destination message passing model is the most attractive. The time complexity to synchronize over a uni-directional ring of N processors is O(log N) for common cases, and O(√N) in the worst case. The algorithms have been implemented on iWarp, a private-memory parallel system, and are now in daily use. The paper concludes with timing measurements obtained on a 64-node system.

Keywords: Parallel processing systems
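
Annotation: the user model described in the abstract, where a participant supplies only a barrier name and a participant count, can be mimicked with threads in shared memory. This is only an illustration of that interface built on Python's `threading.Barrier`; the class `SubsetBarriers` is hypothetical, and the paper's message-passing algorithms for private-memory systems are not reproduced here.

```python
import threading
from typing import Dict, Optional


class SubsetBarriers:
    """Named barriers: callers pass only a name and the number of parties."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._barriers: Dict[str, threading.Barrier] = {}

    def wait(self, name: str, parties: int,
             timeout: Optional[float] = None) -> int:
        # Create the barrier on first use; later callers must agree on parties.
        with self._lock:
            barrier = self._barriers.get(name)
            if barrier is None:
                barrier = threading.Barrier(parties)
                self._barriers[name] = barrier
            elif barrier.parties != parties:
                raise ValueError("participant count differs for barrier " + name)
        # Block until all parties of this named subset have arrived.
        return barrier.wait(timeout)
```

Threads that wait on different names synchronize independently, so two disjoint subsets never block each other.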
