The Lovász Local Lemma (LLL) shows that, for a collection of "bad" events B in a probability space which are not too likely and not too interdependent, there is a positive probability that no events in ...
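For reference, the symmetric form of the lemma reads as follows (a standard statement supplied here for context, not quoted from the truncated abstract):

```latex
% Symmetric Lovász Local Lemma (standard form): if every bad event B_i
% satisfies Pr[B_i] <= p and is mutually independent of all but at most
% d of the other events, then
\[
  e\,p\,(d+1) \le 1
  \quad\Longrightarrow\quad
  \Pr\Big[\,\bigcap_i \overline{B_i}\,\Big] > 0 .
\]
```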
ISBN (print): 9781510836358
Many randomized algorithms can be derandomized efficiently using either the method of conditional expectations or probability spaces with low (almost-) independence. A series of papers, beginning with work by Luby (1988) and continuing with Berger & Rompel (1991) and Chari et al. (1994), showed that these techniques can be combined to give deterministic parallel algorithms for combinatorial optimization problems involving sums of w-juntas. We improve these algorithms through derandomized variable partitioning. This reduces the processor complexity to essentially independent of w, while the running time is reduced from exponential in w to linear in w. For example, we improve the time complexity of an algorithm of Berger & Rompel (1991) for rainbow hypergraph coloring by a factor of approximately log^2 n and the processor complexity by a factor of approximately m^(ln 2). As a major application of this, we give an NC algorithm for the Lovász Local Lemma. Previous NC algorithms, including the seminal algorithm of Moser & Tardos (2010) and the work of Chandrasekaran et al. (2013), required that (essentially) the bad events could span only O(log n) variables; we relax this to allow polylog(n) variables. As two applications of our new algorithm, we give algorithms for defective vertex coloring and domatic graph partition. One main sub-problem encountered in these algorithms is to generate a probability space which can "fool" a given list of GF(2) Fourier characters. Schulman (1992) gave an NC algorithm for this; we dramatically improve its efficiency to near-optimal time complexity, processor complexity, and code dimension. This leads to a new algorithm for the heavy-codeword problem, introduced by Naor & Naor (1993), with a near-linear processor complexity of (mn)^(1+o(1)).
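As a minimal illustration of the method of conditional expectations mentioned above, here is the standard textbook derandomization of MAX-CUT (an assumed, simpler setting than the paper's sums of w-juntas): variables are fixed one at a time, always choosing the value that keeps the conditional expected cut at least as large as the unconditional expectation of len(edges)/2.

```python
def conditional_expectation_cut(n, edges):
    """Derandomized MAX-CUT via the method of conditional expectations.

    A uniformly random side assignment cuts each edge with probability
    1/2, so the expected cut is len(edges) / 2.  Fixing vertices one at
    a time, always choosing the side with the larger conditional
    expectation, never decreases it, so the final cut has size
    >= len(edges) / 2.
    """
    side = {}
    for v in range(n):
        # Edges with one still-unfixed endpoint contribute 1/2 to the
        # conditional expectation regardless of the choice for v, so it
        # suffices to compare the contribution of fully fixed edges.
        def fixed_gain(s):
            return sum(1 for (a, b) in edges
                       if (a == v and b in side and side[b] != s)
                       or (b == v and a in side and side[a] != s))
        side[v] = 0 if fixed_gain(0) >= fixed_gain(1) else 1
    cut = sum(1 for (a, b) in edges if side[a] != side[b])
    return side, cut

# Usage: the guarantee is cut >= len(edges) / 2.
edges = [(0, 1), (1, 2), (2, 0), (0, 3)]
side, cut = conditional_expectation_cut(4, edges)
assert cut >= len(edges) / 2
```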
We present a new parallel algorithm for solving triangular systems with multiple right-hand sides (TRSM). TRSM is used extensively in numerical linear algebra computations, both to solve triangular linear systems of equations and to compute factorizations with triangular matrices, such as Cholesky, LU, and QR. Our algorithm achieves better theoretical scalability than known alternatives, while maintaining numerical stability, via selective use of triangular matrix inversion. We leverage the fact that triangular inversion and matrix multiplication are more parallelizable than the standard TRSM algorithm. By inverting only the triangular blocks along the diagonal of the initial matrix, we generalize both the usual way of computing TRSM and the full matrix inversion approach. This flexibility leads to an efficient algorithm for any ratio of the number of right-hand sides to the triangular matrix dimension. We provide a detailed communication cost analysis for our algorithm, as well as for recursive triangular matrix inversion. This cost analysis makes it possible to determine optimal block sizes and processor grids a priori. Relative to the best known algorithms for TRSM, our approach can require asymptotically fewer messages, while performing optimal amounts of computation and communication in terms of words sent.
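A serial NumPy sketch of the central idea, with an assumed block size and illustrative names (the paper's algorithm additionally chooses block sizes and processor grids from its communication-cost analysis): invert only the diagonal blocks, so that the remaining work is matrix multiplication.

```python
import numpy as np

def blocked_trsm(L, B, block=64):
    """Solve L @ X = B for X, with L lower triangular, by inverting only
    the diagonal blocks of L.  The block application and the trailing
    update are plain matrix multiplications, which parallelize better
    than a scalar triangular solve."""
    n = L.shape[0]
    X = B.astype(float)
    for i in range(0, n, block):
        j = min(i + block, n)
        Lii_inv = np.linalg.inv(L[i:j, i:j])   # small diagonal block only
        X[i:j] = Lii_inv @ X[i:j]              # apply via matmul
        if j < n:
            X[j:] -= L[j:, i:j] @ X[i:j]       # trailing update via matmul
    return X

# Usage: compare against a dense solve on a well-conditioned example.
rng = np.random.default_rng(0)
L = np.tril(rng.standard_normal((200, 200))) + 200 * np.eye(200)
B = rng.standard_normal((200, 8))
assert np.allclose(blocked_trsm(L, B), np.linalg.solve(L, B))
```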
Many randomized algorithms can be derandomized efficiently using either the method of conditional expectations or probability spaces with low independence. A series of papers, beginning with work by Luby (1988), showe...
ISBN (print): 9781509028238
The k-center problem is a classic NP-hard clustering question. For contemporary massive data sets, RAM-based algorithms become impractical. Although there exist good algorithms for k-center, they are all inherently sequential. In this paper, we design and implement parallel approximation algorithms for k-center. We observe that Gonzalez's greedy algorithm can be efficiently parallelized in several MapReduce rounds; in practice, we find that two rounds are sufficient, leading to a 4-approximation. This parallel scheme is about 100 times faster than the sequential Gonzalez algorithm and barely compromises solution quality. We contrast this with an existing parallel algorithm for k-center that offers a 10-approximation. Our analysis reveals that this scheme is often slow, and that its sampling procedure only runs if k is sufficiently small relative to the input size. In practice it is slightly more effective than Gonzalez's approach, but slow. To trade off runtime for approximation guarantee, we parameterize this sampling algorithm. We prove a lower bound on the parameter for effectiveness, and find experimentally that with values even lower than the bound, the algorithm is not only faster, but sometimes more effective.
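A sketch of both ingredients in Python, with illustrative names and an arbitrary partitioning policy (an actual implementation would run the map step as a MapReduce round over the partitions):

```python
import numpy as np

def gonzalez(points, k):
    """Gonzalez's greedy 2-approximation for k-center: repeatedly add
    the point farthest from the centers chosen so far."""
    centers = [points[0]]
    dist = np.linalg.norm(points - centers[0], axis=1)
    for _ in range(k - 1):
        far = int(np.argmax(dist))
        centers.append(points[far])
        dist = np.minimum(dist, np.linalg.norm(points - points[far], axis=1))
    return np.array(centers)

def two_round_kcenter(points, k, n_parts=8):
    """Two-round scheme as described above: round 1 runs Gonzalez on
    each partition (the parallelizable map step); round 2 runs Gonzalez
    on the union of the local centers.  Composing the two
    2-approximations yields the 4-approximation."""
    parts = np.array_split(points, n_parts)
    local = np.vstack([gonzalez(p, k) for p in parts])
    return gonzalez(local, k)

# Usage on synthetic data.
rng = np.random.default_rng(0)
centers = two_round_kcenter(rng.random((10000, 2)), k=10)
```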
We investigate the problem of choosing blocks of operations and threads for a parallel algorithm so as to reduce the number of accesses to global memory and to make efficient use of the caches and shared memory of a graphics processor. We formulate and prove statements that assess the volume of communication transactions generated by alternative block sizes, and that minimize the number of cache misses by exploiting the temporal and spatial locality of data. The approach is constructive and admits a software implementation for practical use.
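As an illustrative back-of-the-envelope instance of this kind of analysis (an assumption for concreteness, not a statement from the paper): for tiled n x n matrix multiplication with T x T tiles staged in shared memory, each input element is loaded about n/T times, so global traffic shrinks as the tile grows until two tiles no longer fit in shared memory.

```python
def pick_tile(n, shared_mem_bytes=48 * 1024, elem_bytes=4):
    """Illustrative locality model (hypothetical parameters): tiled
    n x n matrix multiply incurs roughly 2 * n**3 / T global loads with
    T x T tiles, so choose the largest T whose two input tiles fit in
    the assumed shared-memory budget."""
    best = None
    for T in (8, 16, 32, 64, 128):           # candidate tile widths
        if 2 * T * T * elem_bytes <= shared_mem_bytes and n % T == 0:
            loads = 2 * n**3 // T            # fewer loads as T grows
            best = (T, loads)
    return best

print(pick_tile(1024))  # (64, 33554432) under the 48 KiB assumption
```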
ISBN (print): 9781450350648
We present two new parallel algorithms which compute the GCD of n integers of O(n) bits in O(n / log n) time with O(n^(2+epsilon)) processors in the worst case, for any epsilon > 0, in the CRCW PRAM model. More generally, we prove that computing the GCD of m integers of O(n) bits can be achieved in O(n / log n) parallel time with O(m n^(1+epsilon)) processors, for any 2 <= m <= n^(3/2) / log n; i.e., the parallel time does not depend on the number m of integers considered in this range. We suggest an extended GCD version for many integers, as well as an algorithm to solve linear Diophantine equations.
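The parallel structure of the many-integer problem can be conveyed by a much simpler balanced-tree reduction (a sketch only; the PRAM algorithms above are far more refined): all GCDs at one level of the tree are independent and could run simultaneously.

```python
import math

def tree_gcd(xs):
    """GCD of many integers by balanced-tree reduction.  Each level's
    pairwise GCDs are mutually independent, so one level corresponds to
    one parallel round."""
    xs = list(xs)
    while len(xs) > 1:
        pairs = [math.gcd(xs[i], xs[i + 1]) for i in range(0, len(xs) - 1, 2)]
        if len(xs) % 2:              # carry an unpaired element forward
            pairs.append(xs[-1])
        xs = pairs
    return xs[0]

assert tree_gcd([84, 105, 210, 63]) == 21
```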
Tensor factorization has proven useful in a wide range of applications, from sensor array processing to communications, speech and audio signal processing, and machine learning. With few recent exceptions, all tensor factorization algorithms were originally developed for centralized, in-memory computation on a single machine; and the few that break away from this mold do not easily incorporate practically important constraints, such as non-negativity. A new constrained tensor factorization framework is proposed in this paper, building upon the Alternating Direction Method of Multipliers (ADMoM). It is shown that this simplifies computations, bypassing the need to solve constrained optimization problems in each iteration; and it naturally leads to distributed algorithms suitable for parallel implementation. This opens the door for many emerging big data-enabled applications. The methodology is exemplified using non-negativity as a baseline constraint, but the proposed framework can incorporate many other types of constraints. Numerical experiments are encouraging, indicating that ADMoM-based non-negative tensor factorization (NTF) has high potential as an alternative to state-of-the-art approaches.
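A minimal sketch of the ADMM building block, assuming a single non-negative least-squares subproblem of the kind an NTF algorithm alternates over factor matrices (names and parameters here are illustrative, not the paper's): the x-update is an unconstrained linear solve, and non-negativity is enforced by a cheap projection rather than a constrained solver.

```python
import numpy as np

def admm_nnls(A, b, rho=1.0, iters=200):
    """ADMM for min 0.5 * ||A x - b||^2  s.t.  x >= 0."""
    n = A.shape[1]
    AtA, Atb = A.T @ A, A.T @ b
    M = AtA + rho * np.eye(n)        # formed once, reused every iteration
    x = z = u = np.zeros(n)
    for _ in range(iters):
        x = np.linalg.solve(M, Atb + rho * (z - u))  # unconstrained solve
        z = np.maximum(0.0, x + u)                   # projection onto x >= 0
        u = u + x - z                                # dual update
    return z

# Usage: approximately recover a non-negative solution.
rng = np.random.default_rng(1)
A, x_true = rng.random((30, 5)), rng.random(5)
print(admm_nnls(A, A @ x_true))
```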
ISBN (print): 9781467386838
This paper proposes a parallel framework for solving robust convex programs over networks in a distributed fashion when the constraints are affected by uncertainty. To this end, we adopt a probabilistic approach based on randomly sampling the uncertainty to obtain a standard convex optimization problem, known as the scenario problem. However, the number of samples needed to attain a high level of probabilistic guarantee of robustness may be large, which results in a large number of constraints in the scenario problem. Instead of using a single processor, we resort to multiple processors distributed among the nodes of a network. We study recursive algorithms which parallelize the computational task across the nodes and collaboratively solve the problem effectively. Under local communication links, we show that each node asymptotically provides a solution to the scenario optimization problem.
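A centralized sketch of the scenario approach itself (the paper instead spreads the sampled constraints over network nodes; the data and parameters here are invented for illustration): sample the uncertainty, impose one constraint per sample, and solve the resulting ordinary convex program.

```python
import numpy as np
from scipy.optimize import linprog

# Robustify a^T x <= b against an uncertain row a by sampling N
# realizations and imposing each sampled row as its own constraint.
rng = np.random.default_rng(2)
a_nom, b, N = np.array([1.0, 2.0]), 4.0, 500
A_ub = a_nom + 0.3 * rng.standard_normal((N, 2))  # sampled uncertain rows
res = linprog(c=[-1.0, -1.0],                     # maximize x1 + x2
              A_ub=A_ub, b_ub=np.full(N, b),
              bounds=[(0, None), (0, None)])
print(res.x)  # feasible for all N sampled scenarios
```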