Search Results - Inner Mongolia University Library

34th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA)

Authors: Gu, Yan; Napier, Zachary; Sun, Yihan; Wang, Letong. Affiliation: UC Riverside, Riverside, CA 92521, USA

ISBN: (print) 9781450391467

The cover tree is the canonical data structure that efficiently maintains a dynamic set of points on a metric space and supports nearest and k-nearest neighbor searches. For most real-world datasets with reasonable distributions (constant expansion rate and bounded aspect ratio mathematically), single-point insertion, single-point deletion, and nearest neighbor search (NNS) only cost logarithmically to the size of the point set. Unfortunately, due to the complication and the use of depth-first traversal order in the cover tree algorithms, we were unaware of any parallel approaches for these cover tree algorithms. This paper shows highly parallel and work-efficient cover tree algorithms that can handle batch insertions (and thus construction) and batch deletions. Assuming constant expansion rate and bounded aspect ratio, inserting or deleting m points into a cover tree with n points takes O(m log n) expected work and polylogarithmic span with high probability. Our algorithms rely on some novel algorithmic insights. We model the insertion and deletion process as a graph and use a maximal independent set (MIS) to generate tree nodes without conflicts. We use three key ideas to guarantee work-efficiency: the prefix-doubling scheme, a careful design to limit the graph size on which we apply MIS, and a strategy to propagate information among different levels in the cover tree. We also use path-copying to make our parallel cover tree a persistent data structure, which is useful in several applications. Using our parallel cover trees, we show work-efficient (or near-work-efficient) and highly parallel solutions for a list of problems in computational geometry and machine learning, including Euclidean minimum spanning tree (EMST), single-linkage clustering, bichromatic closest pair (BCP), density-based clustering and its hierarchical version, and others. To the best of our knowledge, many of them are the first solutions to achieve work-efficiency and polylogarithmic span ass

Keywords: cover tree; parallel algorithms; parallel data structures; nearest neighbor search; Euclidean minimum spanning tree; single-linkage clustering
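As a rough illustration of the idea in the abstract above of using a maximal independent set (MIS) to pick new tree nodes without conflicts, the following Python sketch builds a conflict graph over a small batch of points and selects an MIS greedily. It is a sequential toy under stated assumptions, not the paper's parallel algorithm: the names `conflict_graph` and `greedy_mis`, the radius, the sample points, and the rule that two points conflict when they lie within `radius` of each other are all hypothetical choices made for this example.

```python
import math
from itertools import combinations


def conflict_graph(points, radius):
    """Adjacency sets: two points conflict when they lie within `radius` (assumed rule)."""
    adj = {i: set() for i in range(len(points))}
    for i, j in combinations(range(len(points)), 2):
        if math.dist(points[i], points[j]) <= radius:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def greedy_mis(adj):
    """Sequential greedy maximal independent set, scanning vertices in index order."""
    chosen, blocked = [], set()
    for v in sorted(adj):
        if v not in blocked:
            chosen.append(v)      # v joins the independent set
            blocked.add(v)
            blocked |= adj[v]     # its neighbors can no longer be chosen
    return chosen


# Hypothetical batch of 2D points to insert at one level.
points = [(0.0, 0.0), (0.5, 0.0), (3.0, 0.0), (3.2, 0.1), (6.0, 0.0)]
adj = conflict_graph(points, radius=1.0)
centers = greedy_mis(adj)
print(centers)
```

In this sketch the selected points are pairwise more than `radius` apart, and every unselected point lies within `radius` of a selected one, which loosely mirrors the separation and covering conditions that a cover tree level is expected to satisfy.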

PaC-Trees: Supporting Parallel and Compressed Purely-Functional Collections 2022

43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI)

Authors: Dhulipala, Laxman; Blelloch, Guy E.; Gu, Yan; Sun, Yihan. Affiliations: Univ Maryland, College Pk, MD 20742, USA; Carnegie Mellon Univ, Pittsburgh, PA 15213, USA; UC Riverside, Riverside, CA, USA

ISBN: (print) 9781450392655

Many modern programming languages are shifting toward a functional style for collection interfaces such as sets, maps, and sequences. Functional interfaces offer many advantages, including being safe for parallelism and providing simple and lightweight snapshots. However, existing high-performance functional interfaces such as PAM, which are based on balanced purely-functional trees, incur large space overheads for large-scale data analysis due to storing every element in a separate node in a tree. This paper presents PaC-trees, a purely-functional data structure supporting functional interfaces for sets, maps, and sequences that provides a significant reduction in space over existing approaches. A PaC-tree is a balanced binary search tree which blocks the leaves and compresses the blocks using arrays. We provide novel techniques for compressing and uncompressing the blocks which yield practical parallel functional algorithms for a broad set of operations on PaC-trees such as union, intersection, filter, reduction, and range queries which are both theoretically and practically efficient. Using PaC-trees we designed CPAM, a C++ library that implements the full functionality of PAM, while offering significant extra functionality for compression. CPAM consistently matches or outperforms PAM on a set of microbenchmarks on sets, maps, and sequences while using about a quarter of the space. On applications including inverted indices, 2D range queries, and 1D interval queries, CPAM is competitive with or faster than PAM, while using 2.1-7.8x less space. For static and streaming graph processing, CPAM offers 1.6x faster batch updates while using 1.3-2.6x less space than the state-of-the-art graph processing system Aspen.

Keywords: purely-functional data structures; parallel data structures; space-efficient data structures
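To make the notion of blocking the leaves and compressing the blocks into arrays more concrete, here is a minimal Python sketch. It is not CPAM's API and not the paper's encoding: the abstract does not say which compression scheme is used, so difference encoding is an assumption, the tree is flattened into a plain list of blocks for brevity, and the names `encode_block`, `decode_block`, `build_blocks`, and `contains` are hypothetical.

```python
import bisect
from itertools import accumulate


def encode_block(sorted_keys):
    """Store the first key followed by the gaps between consecutive keys (assumed scheme)."""
    if not sorted_keys:
        return []
    return [sorted_keys[0]] + [b - a for a, b in zip(sorted_keys, sorted_keys[1:])]


def decode_block(encoded):
    """Recover the original sorted keys with a running sum over the gaps."""
    return list(accumulate(encoded))


def build_blocks(sorted_keys, block_size):
    """Cut a sorted key sequence into fixed-size blocks and encode each one."""
    return [encode_block(sorted_keys[i:i + block_size])
            for i in range(0, len(sorted_keys), block_size)]


def contains(blocks, key):
    """Locate the candidate block by its first key, then decode only that block."""
    heads = [block[0] for block in blocks]
    idx = bisect.bisect_right(heads, key) - 1
    return idx >= 0 and key in decode_block(blocks[idx])


keys = [3, 7, 8, 15, 16, 23, 42, 50]
blocks = build_blocks(keys, block_size=4)
print(blocks)
print(contains(blocks, 23), contains(blocks, 9))
```

A lookup touches one block rather than one tree node per element, which is the intuition behind the space savings described in the abstract. A real implementation would keep the blocks at the leaves of a balanced search tree rather than in a flat list.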

Theoretically and Practically Efficient Parallel Nucleus Decomposition (Abstract)

2023 ACM Workshop on Highlights of Parallel Computing, HOPC 2023

Authors: Shi, Jessica; Dhulipala, Laxman; Shun, Julian. Affiliations: Massachusetts Institute of Technology, Cambridge, MA, United States; University of Maryland, College Park, MD, United States

PARALLEL ALGORITHMS FOR EVALUATING SEQUENCES OF SET-MANIPULATION OPERATIONS

JOURNAL OF THE ASSOCIATION FOR COMPUTING MACHINERY, 1994, Vol. 41, No. 6, pp. 1049-1088

Authors: ATALLAH, MJ; GOODRICH, MT; KOSARAJU, SR. Affiliation: JOHNS HOPKINS UNIV, BALTIMORE, MD 21218

Given an off-line sequence S of n set-manipulation operations, we investigate the parallel complexity of evaluating S (i.e., finding the response to every operation in S and returning the resulting set). We show that the problem of evaluating S is in NC for various combinations of common set-manipulation operations. Once we establish membership in NC (or, if membership in NC is obvious), we develop techniques for improving the time and/or processor complexity.

Keywords: DIVIDE-AND-CONQUER; OFF-LINE EVALUATION; PARALLEL COMPUTATION; parallel data structures
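The problem statement in the abstract above (find the response to every operation in an off-line sequence S and return the resulting set) can be pinned down with a sequential baseline. The Python sketch below is only that baseline, not the parallel techniques the paper develops. The abstract does not list the operations, so `insert`, `delete`, and `member`, along with the function name `evaluate`, are assumptions made for illustration.

```python
def evaluate(ops):
    """Evaluate an off-line sequence of (operation, element) pairs one at a time.

    Returns the list of responses (None for updates, a bool for each query)
    together with the set that remains after the last operation.
    """
    current = set()
    responses = []
    for name, x in ops:
        if name == "insert":
            current.add(x)
            responses.append(None)
        elif name == "delete":
            current.discard(x)   # deleting an absent element is a no-op here
            responses.append(None)
        elif name == "member":
            responses.append(x in current)
        else:
            raise ValueError(f"unknown operation: {name}")
    return responses, current


# Hypothetical off-line sequence S.
S = [("insert", 5), ("insert", 2), ("member", 5),
     ("delete", 5), ("member", 5), ("member", 2)]
responses, final_set = evaluate(S)
print(responses, final_set)
```

Because the whole sequence is known in advance, a parallel algorithm is free to process it out of order as long as it reproduces these responses, which is what makes the off-line setting different from answering each operation as it arrives.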

Goal-Oriented Self-Adaptive hp Finite Element Simulation of 3D DC Borehole Resistivity Simulations

Procedia Computer Science, 2011, Vol. 4, pp. 1485-1495

Authors: Victor M. Calo; David Pardo; Maciej R. Paszyński. Affiliations: Department of Applied Mathematics, Computational Science, Earth Engineering, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia; Department of Applied Mathematics, Statistics and Operational Research, University of the Basque Country and Ikerbasque, Bilbao, Spain; Department of Computer Science, AGH University of Science and Technology, Al. Mickiewicza 30, 30-059 Kraków, Poland

In this paper we present a goal-oriented self-adaptive hp Finite Element Method (hp-FEM) with shared data structures and a parallel multi-frontal direct solver. The algorithm automatically generates (without any user interaction) a sequence of meshes delivering exponential convergence of a prescribed quantity of interest with respect to the number of degrees of freedom. The sequence of meshes is generated from a given initial mesh, by performing h (breaking elements into smaller elements), p (adjusting polynomial orders of approximation) or hp (both) refinements on the finite elements. The new parallel implementation utilizes a computational mesh shared between multiple processors. All computational algorithms, including automatic hp goal-oriented adaptivity and the solver, work fully in parallel. We describe the parallel self-adaptive hp-FEM algorithm with shared computational domain, as well as its efficiency measurements. We apply the methodology described to the three-dimensional simulation of the borehole resistivity measurement of direct current through casing in the presence of invasion.

Keywords: Hp-FEM; Goal-oriented adaptivity; parallel data structures
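The three refinement types named in the abstract above (h, p, and hp) can be illustrated on a one-dimensional toy mesh. The paper works in 3D with goal-oriented error estimates and a parallel solver, none of which appears here. The `Element` class, the function names, and the sample mesh are hypothetical, and the degree-of-freedom count assumes a continuous 1D finite element space.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    left: float
    right: float
    order: int  # polynomial order of approximation on this element


def h_refine(mesh, i):
    """h-refinement: break element i into two halves of the same order."""
    e = mesh[i]
    mid = 0.5 * (e.left + e.right)
    halves = [Element(e.left, mid, e.order), Element(mid, e.right, e.order)]
    return mesh[:i] + halves + mesh[i + 1:]


def p_refine(mesh, i):
    """p-refinement: raise the polynomial order of element i by one."""
    e = mesh[i]
    return mesh[:i] + [Element(e.left, e.right, e.order + 1)] + mesh[i + 1:]


def dof_count(mesh):
    """Degrees of freedom of a continuous 1D space: sum of element orders plus one."""
    return sum(e.order for e in mesh) + 1


mesh = [Element(0.0, 1.0, 1), Element(1.0, 2.0, 1)]
mesh_h = h_refine(mesh, 0)                 # split the first element
mesh_p = p_refine(mesh, 1)                 # raise the order of the second element
mesh_hp = p_refine(h_refine(mesh, 0), 0)   # split, then raise the order of one half
print(dof_count(mesh), dof_count(mesh_h), dof_count(mesh_p), dof_count(mesh_hp))
```

An adaptive loop would choose among these refinements per element so that the error in the quantity of interest falls as fast as possible relative to the growth in `dof_count`; how that choice is made is outside the scope of this sketch.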

A Language for Array and Vector Processors

ACM Transactions on Programming Languages and Systems (TOPLAS), 1979, Vol. 1, No. 2, pp. 177-195

Author: Perrott, R.H. Affiliation: Department of Computer Science, Queen's University, Belfast BT7 1NN, United Kingdom

The scientific community has consistently demanded from computing machines an increase in the number of instructions executed per second. The latest increase has been achieved by duplication of arithmetic units for an array processor and the pipelining of functional units for vector processors. The high level programming languages for such machines have not benefited from the advances which have been made in programming language design and implementation techniques. A high level language is described in this paper which is appropriate for both array and vector processors and is defined without reference to the hardware of either type of machine. The syntax enables the parallel nature of a problem to be expressed in a form which can be readily exploited by these machines. This is achieved by using the data declarations to indicate the maximum extent of parallel processing and then to manipulate this, or a lesser extent, in the course of program execution. It was found to be possible to modify many of the structured programming and data structuring concepts for this type of parallel environment and to maintain the benefits of compile time and run time checking. Several special constructs and operators are also *** language offers to the large scale scientific computing community many of the advances which have been made in software engineering techniques while it exploits the architectural advances which have been made. © 1979, ACM. All rights reserved.

Keywords: array processing; parallel control; parallel data structures; vector processing
