ISBN (print): 9781611974782
Many randomized algorithms can be derandomized efficiently using either the method of conditional expectations or probability spaces with low (almost-) independence. A series of papers, beginning with work by Luby (1988) and continuing with Berger & Rompel (1991) and Chari et al. (1994), showed that these techniques can be combined to give deterministic parallel algorithms for combinatorial optimization problems involving sums of w-juntas. We improve these algorithms through derandomized variable partitioning. This makes the processor complexity essentially independent of w, while the running time is reduced from exponential in w to linear in w. For example, we improve the time complexity of an algorithm of Berger & Rompel (1991) for rainbow hypergraph coloring by a factor of approximately log^2 n and the processor complexity by a factor of approximately m^(ln 2). As a major application of this, we give an NC algorithm for the Lovasz Local Lemma. Previous NC algorithms, including the seminal algorithm of Moser & Tardos (2010) and the work of Chandrasekaran et al. (2013), required that (essentially) the bad events could span only O(log n) variables; we relax this to polylog(n) variables. As two applications of our new algorithm, we give algorithms for defective vertex coloring and domatic graph partition. One main sub-problem encountered in these algorithms is to generate a probability space which can "fool" a given list of GF(2) Fourier characters. Schulman (1992) gave an NC algorithm for this; we dramatically improve its efficiency to near-optimal time and processor complexity and code dimension. This leads to a new algorithm for the heavy-codeword problem, introduced by Naor & Naor (1993), with a near-linear processor complexity of (mn)^(1+o(1)).
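To illustrate what it means for a small probability space to "fool" GF(2) Fourier characters, here is a minimal Python sketch. It is not the paper's near-optimal construction: it simply derives each variable from a short common seed via a GF(2) inner product, so that every character whose column vectors XOR to a nonzero value has bias exactly zero over the tiny seed space. The parameters n and SEED_BITS and the random choice of columns are illustrative only.

import itertools
import random

n = 8            # number of variables x_1..x_n
SEED_BITS = 4    # the whole space has 2**SEED_BITS points instead of 2**n

# give each variable a distinct nonzero GF(2) column vector of length SEED_BITS
columns = random.sample(range(1, 2 ** SEED_BITS), n)

def sample_point(seed):
    # x_i = <a_i, seed> over GF(2); the seed ranges over all 2**SEED_BITS values
    return [bin(seed & a).count("1") % 2 for a in columns]

def bias(S):
    # E[(-1)^(XOR of x_i for i in S)] computed over the whole small space
    total = 0
    for seed in range(2 ** SEED_BITS):
        x = sample_point(seed)
        total += (-1) ** (sum(x[i] for i in S) % 2)
    return total / 2 ** SEED_BITS

# every character whose column vectors XOR to a nonzero value is fooled exactly
for S in itertools.combinations(range(n), 2):
    if columns[S[0]] ^ columns[S[1]] != 0:
        assert bias(S) == 0.0
print("all size-2 characters with nonzero column-XOR have bias exactly 0")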
The nearest neighbor search problem in general dimensions finds application in computational geometry, computational statistics, pattern recognition, and machine learning. Although there is a significant body of work on theory and algorithms, surprisingly little work has been done on algorithms for high-end computing platforms, and no open source library exists that can scale efficiently to thousands of cores. In this paper, we present algorithms and a library built on top of the message passing interface (MPI) and OpenMP that enable nearest neighbor searches to scale to hundreds of thousands of cores for arbitrary-dimensional datasets. The library supports both exact and approximate nearest neighbor searches. The latter is based on iterative, randomized, and greedy KD-tree (k-dimensional tree) searches. We describe novel algorithms for the construction of the KD-tree, give complexity analysis, and provide experimental evidence for the scalability of the method. In our largest runs, we were able to perform an all-neighbors query search on a 13 TB synthetic dataset of 0.8 billion points in 2,048 dimensions on 131K cores of Oak Ridge's XK6 "Jaguar" system. These results represent several orders of magnitude improvement over current state-of-the-art methods. Also, we apply our method to nonsynthetic data from machine learning data repositories. For example, we perform an all-nearest-neighbors search on a variant of the "MNIST" handwritten digit dataset with 8 million points in 784 dimensions on 16,384 cores of the "Stampede" system at the Texas Advanced Computing Center, achieving less than one second per randomized KD-tree (RKDT) iteration.
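For intuition about the randomized, greedy KD-tree searches the library builds on, here is a minimal single-node Python sketch. It has no MPI or OpenMP component and does not reproduce the library's actual algorithms: each tree splits on a random coordinate at the median, a query descends greedily to a single leaf, and repeating the search over several independent random trees improves the chance of finding the true nearest neighbor. The leaf size, the number of trees, and the data are illustrative.

import numpy as np

def build(points, idx, leaf_size, rng):
    # split on a random coordinate at the median; stop at small or degenerate nodes
    if len(idx) <= leaf_size:
        return ("leaf", idx)
    dim = rng.integers(points.shape[1])
    vals = points[idx, dim]
    median = np.median(vals)
    left, right = idx[vals <= median], idx[vals > median]
    if len(left) == 0 or len(right) == 0:
        return ("leaf", idx)
    return ("node", dim, median,
            build(points, left, leaf_size, rng),
            build(points, right, leaf_size, rng))

def greedy_query(tree, points, q):
    # descend to a single leaf (no backtracking), then scan that leaf exactly
    while tree[0] == "node":
        _, dim, median, lo, hi = tree
        tree = lo if q[dim] <= median else hi
    idx = tree[1]
    dists = np.linalg.norm(points[idx] - q, axis=1)
    return idx[np.argmin(dists)], dists.min()

rng = np.random.default_rng(0)
pts = rng.normal(size=(10000, 64))
q = rng.normal(size=64)
# several independent random trees; keep the best greedy answer found
best = min((greedy_query(build(pts, np.arange(len(pts)), 16, rng), pts, q)
            for _ in range(8)), key=lambda t: t[1])
print("approximate nearest neighbor (index, distance):", best)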
Parallel parameterized complexity theory studies how fixed-parameter tractable (fpt) problems can be solved in parallel. Previous theoretical work focused on parallel algorithms that are very fast in principle, but di...
ISBN (print): 9781510836358
The Lovasz Local Lemma (LLL) is a cornerstone principle in the probabilistic method of combinatorics, and a seminal algorithm of Moser & Tardos (2010) provides an efficient randomized algorithm to implement it. This algorithm can be parallelized to give an algorithm that uses polynomially many processors and runs in O(log^3 n) time, stemming from O(log n) adaptive computations of a maximal independent set (MIS). Chung et al. (2014) developed faster local and parallel algorithms, potentially running in time O(log^2 n), but these algorithms work under significantly more stringent conditions than the LLL. We give a new parallel algorithm that works under essentially the same conditions as the original algorithm of Moser & Tardos but uses only a single MIS computation, thus running in O(log^2 n) time. This conceptually new algorithm also gives a clean combinatorial description of a satisfying assignment which might be of independent interest. Our techniques extend to the deterministic LLL algorithm given by Chandrasekaran et al. (2013), leading to an NC algorithm running in O(log^2 n) time as well. We also provide improved bounds on the running times of the sequential and parallel resampling-based algorithms originally developed by Moser & Tardos. Our bounds extend to any problem instance in which the tighter Shearer LLL criterion is satisfied. We also improve on the analysis of Kolipaka & Szegedy (2011) to give tighter concentration results.
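For context, the sequential resampling algorithm of Moser & Tardos that this abstract builds on can be stated in a few lines. The sketch below runs it on a tiny, illustrative 3-SAT instance (a bad event is a violated clause); the parallel, MIS-based, and deterministic variants discussed above are not shown.

import random

# clauses as lists of signed literals: +i means x_i, -i means NOT x_i (1-indexed);
# this tiny formula is illustrative only
clauses = [[1, 2, -3], [-1, 3, 4], [2, -4, 5], [-2, -3, -5]]
n = 5

def violated(assign, clause):
    # a clause is a bad event when every one of its literals is false
    return all(assign[abs(lit)] != (lit > 0) for lit in clause)

random.seed(1)
assign = {i: random.random() < 0.5 for i in range(1, n + 1)}
resamplings = 0
while True:
    bad = [c for c in clauses if violated(assign, c)]
    if not bad:
        break
    # pick any currently bad event and resample exactly its variables
    for lit in bad[0]:
        assign[abs(lit)] = random.random() < 0.5
    resamplings += 1
print("satisfying assignment after", resamplings, "resamplings:", assign)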
Designing and implementing cooperation schemes for parallel algorithms has become a very important task recently. The scheme, which defines the cooperation topology, frequency and strategies for handling transferred s...
ISBN (print): 9781450341325
We consider the problem of finding the kth highest element in a totally ordered set of n elements (SELECT), and partitioning a totally ordered set into the top k and bottom n-k elements (PARTITION) using pairwise comparisons. Motivated by settings like peer grading or crowdsourcing, where multiple rounds of interaction are costly and queried comparisons may be inconsistent with the ground truth, we evaluate algorithms based both on their total runtime and the number of interactive rounds in three comparison models: noiseless (where the comparisons are correct), erasure (where comparisons are erased with probability 1-gamma), and noisy (where comparisons are correct with probability 1/2 + gamma/2 and incorrect otherwise). We provide numerous matching upper and lower bounds in all three models. Even our results in the noiseless model, which is quite well-studied in the TCS literature on parallel algorithms, are novel.
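As a concrete illustration of the noisy comparison model, the sketch below boosts a comparison that is correct with probability 1/2 + gamma/2 by repeating it and taking a majority vote, then runs an ordinary quickselect on top of the boosted comparator. This is a textbook-style illustration, not the paper's algorithms, and it does not attain the paper's round or query bounds; GAMMA and REPEATS are illustrative parameters.

import random

GAMMA = 0.4     # comparison is correct with probability 1/2 + GAMMA/2
REPEATS = 61    # votes per boosted comparison; more repetitions, fewer errors

def noisy_less(a, b):
    truth = a < b
    return truth if random.random() < 0.5 + GAMMA / 2 else (not truth)

def boosted_less(a, b):
    # majority vote over repeated noisy comparisons
    votes = sum(noisy_less(a, b) for _ in range(REPEATS))
    return votes > REPEATS // 2

def quickselect(items, k):
    # k-th smallest (1-indexed) using only boosted comparisons
    pivot = random.choice(items)
    rest = [x for x in items if x != pivot]
    lo, hi = [], []
    for x in rest:
        (lo if boosted_less(x, pivot) else hi).append(x)
    if k <= len(lo):
        return quickselect(lo, k)
    if k == len(lo) + 1:
        return pivot
    return quickselect(hi, k - len(lo) - 1)

random.seed(0)
data = random.sample(range(1000), 200)   # distinct values
# the estimate usually matches the truth; a rare boosted-comparison error can
# shift it by a rank or two
print("estimated 50th smallest:", quickselect(data, 50))
print("true 50th smallest     :", sorted(data)[49])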
ISBN (print): 9781450342100
The problem of exactly summing n floating-point numbers is a fundamental problem that has many applications in large-scale simulations and computational geometry. Unfortunately, due to the round-off error in standard floating-point operations, this problem becomes very challenging. Moreover, all existing solutions rely on sequential algorithms which cannot scale to the huge datasets that need to be processed. In this paper, we provide several efficient parallel algorithms for summing n floating-point numbers, so as to produce a faithfully rounded floating-point representation of the sum. We present algorithms in PRAM, external-memory, and MapReduce models, and we also provide an experimental analysis of our MapReduce algorithms, due to their simplicity and practical efficiency.
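The sequential building block behind faithfully rounded summation is the error-free transformation: a + b can be written exactly as a rounded sum plus a rounding error. The Python sketch below (a sequential toy, not the paper's PRAM, external-memory, or MapReduce algorithms) carries all such error terms forward so that no information is lost; the final single-pass rounding is usually, though not always, faithful.

import math
import random

def two_sum(a, b):
    # Knuth's error-free transformation: a + b == s + e exactly in floating point
    s = a + b
    bv = s - a
    av = s - bv
    return s, (a - av) + (b - bv)

def expansion_sum(values):
    partials = []                    # rounding errors carried forward, never discarded
    for x in values:
        kept = []
        for p in partials:
            x, e = two_sum(x, p)
            if e != 0.0:
                kept.append(e)
        kept.append(x)
        partials = kept
    # a single naive pass over the expansion; a fully faithful rounding needs a
    # more careful final step, as in math.fsum
    return sum(partials)

random.seed(0)
data = [random.uniform(-1.0, 1.0) * 10.0 ** random.randint(0, 12) for _ in range(10000)]
print("naive left-to-right sum:", sum(data))
print("expansion sum          :", expansion_sum(data))
print("math.fsum reference    :", math.fsum(data))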
Motivated by the significantly higher cost of writing than reading in emerging memory technologies, we consider parallel algorithm design under such asymmetric read-write costs, with the goal of reducing the number of...
For solving systems of linear algebraic equations with block five-diagonal matrices arising in geoelectrics and diffusion problems, the parallel matrix square root method, the conjugate gradient method with a preconditioner, the conjugate gradient method with regularization, and a parallel matrix sweep algorithm are proposed, and some of them are implemented numerically on multi-core Intel CPUs. The efficiency and optimization of the parallel algorithms are investigated by solving a problem with quasi-model data.
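As a small illustration of one of the iterative methods named above, here is a Jacobi-preconditioned conjugate gradient solver applied to a scalar five-diagonal symmetric positive definite test matrix. It is a sequential sketch standing in for the block five-diagonal, parallel setting of the paper; the matrix, tolerance, and preconditioner choice are illustrative.

import numpy as np

def pcg(A, b, M_inv_diag, tol=1e-10, max_iter=500):
    # conjugate gradients with a diagonal (Jacobi) preconditioner
    x = np.zeros_like(b)
    r = b - A @ x
    z = M_inv_diag * r
    p = z.copy()
    rz = r @ z
    for it in range(max_iter):
        Ap = A @ p
        alpha = rz / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) < tol:
            return x, it + 1
        z = M_inv_diag * r
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x, max_iter

# small symmetric positive definite five-diagonal test matrix (bandwidth 2)
n = 400
A = 5.0 * np.eye(n)
for off, val in [(1, -1.0), (2, -0.5)]:
    A += val * (np.eye(n, k=off) + np.eye(n, k=-off))
b = np.ones(n)
x, iters = pcg(A, b, 1.0 / np.diag(A))
print("iterations:", iters, "residual norm:", np.linalg.norm(b - A @ x))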
The Bethe-Salpeter eigenvalue problem is a dense structured eigenvalue problem arising from the discretized Bethe-Salpeter equation in the context of computing exciton energies and states. A computational challenge is that at least half of the eigenvalues and the associated eigenvectors are desired in practice. We establish the equivalence between Bethe-Salpeter eigenvalue problems and real Hamiltonian eigenvalue problems. Based on theoretical analysis, structure-preserving algorithms for a class of Bethe-Salpeter eigenvalue problems are proposed. We also show that for this class of problems all eigenvalues obtained from the Tamm-Dancoff approximation are overestimated. In order to solve large scale problems of practical interest, we discuss parallel implementations of our algorithms targeting distributed memory systems. Several numerical examples are presented to demonstrate the efficiency and accuracy of our algorithms.
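A small numerical illustration of the Hamiltonian structure underlying the Bethe-Salpeter eigenvalue problem (not the paper's structure-preserving solver): for H = [[A, B], [-conj(B), -conj(A)]] with A Hermitian and B complex symmetric, the spectrum is closed under lambda -> -conj(lambda), so real eigenvalues occur in +/- pairs. The sizes and random matrices below are illustrative.

import numpy as np

rng = np.random.default_rng(0)
n = 6
A = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
A = (A + A.conj().T) / 2 + 4 * np.eye(n)   # Hermitian, shifted diagonally
B = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
B = (B + B.T) / 2                          # complex symmetric

H = np.block([[A, B], [-B.conj(), -A.conj()]])
eigs = np.linalg.eigvals(H)

# the spectrum is closed under lambda -> -conj(lambda): every eigenvalue has a
# reflected partner (up to rounding), hence real eigenvalues come in +/- pairs
reflected = -eigs.conj()
pairing_error = max(min(abs(lam - reflected)) for lam in eigs)
print("eigenvalues:", np.round(np.sort_complex(eigs), 3))
print("max distance to a reflected partner:", pairing_error)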