检索结果-内蒙古大学图书馆

27th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)

作者： Manning, Lawton Ballard, Grey Kannan, Ramakrishnan Park, Haesun Wake Forest Univ Winston Salem NC 27101 USA Oak Ridge Natl Lab Oak Ridge TN USA Georgia Inst Technol Atlanta GA 30332 USA

ISBN: (纸本)9781665422925

Nonnegative Matrix Factorization (NMF) is an effective tool for clustering nonnegative data, either for computing a flat partitioning of a dataset or for determining a hierarchy of similarity. In this paper, we propose a parallel algorithm for hierarchical clustering that uses a divide-and-conquer approach based on rank-two NMF to split a data set into two cohesive parts. Not only does this approach uncover more structure in the data than a flat NMF clustering, but also rank-two NMF can be computed more quickly than for general ranks, providing comparable overall time to solution. Our data distribution and parallelization strategies are designed to maintain computational load balance throughout the data-dependent hierarchy of computation while limiting interprocess communication, allowing the algorithm to scale to large dense and sparse data sets. We demonstrate the scalability of our parallel algorithm in terms of data size (up to 800 GB) and number of processors (up to 80 nodes of the Summit supercomputer), applying the hierarchical clustering approach to hyperspectral imaging and image classification data. Our algorithm for Rank-2 NMF scales perfectly on up to 1000s of cores and the entire hierarchical clustering method achieves 5.9x speedup scaling from 10 to 80 nodes on the 800 GB dataset.

关键词： low-rank approximation distributed-memory parallel algorithms scalable clustering

来源：评论

学校读者我要写书评

暂无评论

distributed-memory parallel algorithms FOR DISTANCE-2 COLORING AND RELATED PROBLEMS IN DERIVATIVE COMPUTATION

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 2010年第4期32卷 2418-2446页

作者： Bozdag, Doruk Catalyurek, Uemit V. Gebremedhin, Assefaw H. Manne, Fredrik Boman, Erik G. Ozguner, Fuesun Ohio State Univ Dept Biomed Informat Columbus OH 43210 USA Ohio State Univ Dept Elect & Comp Engn Columbus OH 43210 USA Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA Univ Bergen Dept Informat N-5008 Bergen Norway Sandia Natl Labs Scalable Algorithms Dept Albuquerque NM 87185 USA

The distance-2 graph coloring problem aims at partitioning the vertex set of a graph into the fewest sets consisting of vertices pairwise at distance greater than 2 from each other. Its applications include derivative computation in numerical optimization and channel assignment in radio networks. We present efficient, distributed-memory, parallel heuristic algorithms for this NP-hard problem as well as for two related problems used in the computation of Jacobians and Hessians. parallel speedup is achieved through graph partitioning, speculative (iterative) coloring, and a bulk synchronous parallel-like organization of parallel computation. Results from experiments conducted on a PC cluster employing up to 96 processors and using large-size real-world as well as synthetically generated test graphs show that the algorithms are scalable. In terms of quality of solution, the algorithms perform remarkably well-the numbers of colors used by the parallel algorithms are observed to be very close to the numbers used by their sequential counterparts, which in turn are quite often near optimal. Moreover, the experimental results show that the parallel distance-2 coloring algorithm compares favorably with the alternative approach of solving the distance-2 coloring problem on a graph G by first constructing the square graph G(2) and then applying a parallel distance-1 coloring algorithm on G(2). Implementations of the algorithms are made available via the Zoltan toolkit.

关键词： distance-2 graph coloring distributed-memory parallel algorithms Jacobian computation Hessian computation sparsity exploitation automatic differentiation combinatorial scientific computing

来源：评论

学校读者我要写书评

暂无评论

distributed-memory algorithms for Maximal Cardinality Matching using Matrix Algebra

Distributed-Memory Algorithms for Maximal Cardinality Matchi...

引用

IEEE International Conference on Cluster Computing (CLUSTER)

作者： Azad, Ariful Buluc, Aydin Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA 94720 USA

ISBN: (纸本)9781467365987

We design and implement distributed-memory parallel algorithms for computing maximal cardinality matching in a bipartite graph. Relying on matrix algebra building blocks, our algorithms expose a higher degree of parallelism on distributed memory platforms than existing graph-based algorithms. In contrast to existing parallel algorithms, empirical approximation ratios of the new algorithms are insensitive to concurrency and stay relatively constant with increasing processor counts. On real instances, our algorithms achieve up to 300 x speedup on 1024 cores of a Cray XC30 supercomputer. Even higher speedups are obtained on larger synthetically generated graphs where our algorithms show good scaling on up to 16,384 processors.

关键词： distributed memory systems graph theory matrix algebra parallel algorithms pattern matching Cray XC30 supercomputer bipartite graph distributed-memory parallel algorithms empirical approximation ratios graph-based algorithms maximal cardinality matching processor counts Algorithm design and analysis Approximation algorithms Bipartite graph Heuristic algorithms Matrices Partitioning algorithms Sparse matrices cardinality matching matching maximal matching

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：