咨询与建议

限定检索结果

文献类型

  • 19 篇 会议
  • 12 篇 期刊文献

馆藏范围

  • 31 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 25 篇 工学
    • 25 篇 计算机科学与技术...
    • 6 篇 软件工程
    • 4 篇 电气工程
    • 1 篇 信息与通信工程
  • 10 篇 理学
    • 9 篇 数学
    • 1 篇 物理学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 31 篇 communication-av...
  • 5 篇 parallel algorit...
  • 3 篇 i/o-complexity
  • 3 篇 particle methods
  • 3 篇 fast matrix mult...
  • 3 篇 linear algebra
  • 2 篇 roundoff error a...
  • 2 篇 computational fl...
  • 2 篇 partial differen...
  • 2 篇 pipelined krylov...
  • 2 篇 s-step iterative...
  • 2 篇 qr decomposition
  • 2 篇 sparse matrix co...
  • 2 篇 asynchronous ite...
  • 2 篇 matrix multiplic...
  • 2 篇 domain decomposi...
  • 1 篇 nonnegative leas...
  • 1 篇 cholesky
  • 1 篇 op2
  • 1 篇 performance

机构

  • 6 篇 univ calif berke...
  • 4 篇 lawrence berkele...
  • 2 篇 georgia inst tec...
  • 2 篇 oak ridge natl l...
  • 2 篇 inria paris rocq...
  • 2 篇 univ calif berke...
  • 2 篇 univ calif berke...
  • 2 篇 univ electrocomm...
  • 2 篇 oregon state uni...
  • 1 篇 arup 3 piccadill...
  • 1 篇 wake forest univ...
  • 1 篇 nyu ny usa
  • 1 篇 syracuse univ de...
  • 1 篇 university of ma...
  • 1 篇 lawrence berkele...
  • 1 篇 sandia natl labs...
  • 1 篇 univ calif berke...
  • 1 篇 centralesupélec ...
  • 1 篇 pazmany peter ca...
  • 1 篇 devito codes eng...

作者

  • 8 篇 demmel james
  • 6 篇 schwartz oded
  • 5 篇 ballard grey
  • 4 篇 yelick katherine
  • 3 篇 holtz olga
  • 3 篇 buluc aydin
  • 3 篇 koanantakool pen...
  • 3 篇 kannan ramakrish...
  • 2 篇 sao piyush
  • 2 篇 magee daniel j.
  • 2 篇 nakatsukasa yuji
  • 2 篇 azad ariful
  • 2 篇 fukaya takeshi
  • 2 篇 yanagisawa yuka
  • 2 篇 solomonik edgar
  • 2 篇 yamamoto yusaku
  • 2 篇 lipshitz benjami...
  • 2 篇 vuduc richard
  • 2 篇 niemeyer kyle e.
  • 1 篇 jin peter

语言

  • 31 篇 英文
检索条件"主题词=communication-avoiding algorithms"
31 条 记 录,以下是1-10 订阅
排序:
A Class of communication-avoiding algorithms for Solving General Dense Linear Systems on CPU/GPU Parallel Machines
收藏 引用
Procedia Computer Science 2012年 9卷 17-26页
作者: Marc Baboulin Simplice Donfack Jack Dongarra Laura Grigori Adrien Rémy Stanimire Tomov Inria and University Paris-Sud France University of Tennessee USA Oak Ridge National Laboratory USA University of Manchester United Kingdom
We study several solvers for the solution of general linear systems where the main objective is to reduce the communication overhead due to pivoting. We first describe two existing algorithms for the LU factorization ... 详细信息
来源: 评论
communication-avoiding SYMMETRIC-INDEFINITE FACTORIZATION
收藏 引用
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS 2014年 第4期35卷 1364-1406页
作者: Ballard, Grey Becker, Dulceneia Demmel, James Dongarra, Jack Druinsky, Alex Peled, Inon Schwartz, Oded Toledo, Sivan Yamazaki, Ichitaro Sandia Natl Labs Livermore CA 94550 USA Univ Tennessee Knoxville TN 37996 USA Univ Calif Berkeley Berkeley CA 94720 USA Oak Ridge Natl Lab Oak Ridge TN 37831 USA Univ Manchester Manchester M13 9PL Lancs England Univ Calif Berkeley Lawrence Berkeley Natl Lab Berkeley CA 94720 USA Tel Aviv Univ IL-69978 Tel Aviv Israel
We describe and analyze a novel symmetric triangular factorization algorithm. The algorithm is essentially a block version of Aasen's triangular tridiagonalization. It factors a dense symmetric matrix A as the pro... 详细信息
来源: 评论
avoiding communication IN PRIMAL AND DUAL BLOCK COORDINATE DESCENT METHODS
收藏 引用
SIAM JOURNAL ON SCIENTIFIC COMPUTING 2019年 第1期41卷 C1-C27页
作者: Devarakonda, Aditya Fountoulakis, Kimon Demmel, James Mahoney, Michael W. Univ Calif Berkeley Dept EECS Berkeley CA 94709 USA Univ Calif Berkeley ICSI Berkeley CA 94709 USA Univ Calif Berkeley Dept Stat Berkeley CA 94709 USA Univ Calif Berkeley Math & EECS Dept Berkeley CA 94709 USA
Primal and dual block coordinate descent methods are iterative methods for solving regularized and unregularized optimization problems. Distributed-memory parallel implementations of these methods have become popular ... 详细信息
来源: 评论
communication-avoiding Optimizations for Large-Scale Unstructured-Mesh Applications with OP2  23
Communication-Avoiding Optimizations for Large-Scale Unstruc...
收藏 引用
52nd International Conference on Parallel Processing (ICPP)
作者: Ekanayake, Suneth D. Reguly, Istvan Z. Luporini, Fabio Mudalige, Gihan R. Univ Warwick Coventry W Midlands England Pazmany Peter Catholic Univ Budapest Hungary Devito Codes London England
In this paper, we investigate data movement-reducing and communication-avoiding optimizations and their practicable implementation for large-scale unstructured-mesh applications. Utilizing the high-level abstraction o... 详细信息
来源: 评论
A communication-avoiding 3D sparse triangular solver  19
A communication-avoiding 3D sparse triangular solver
收藏 引用
33rd ACM International Conference on Supercomputing (ICS)
作者: Sao, Piyush Kannan, Ramakrishnan Li, Xiaoye Sherry Vuduc, Richard Oak Ridge Natl Lab POB 2009 Oak Ridge TN 37830 USA Lawrence Berkeley Natl Lab Berkeley CA USA Georgia Inst Technol Atlanta GA 30332 USA
We present a novel distributed memory algorithm to improve the strong scalability of the solution of a sparse triangular system. This operation appears in the solve phase of direct methods for solving general sparse l... 详细信息
来源: 评论
communication-avoiding Parallel Sparse-Dense Matrix-Matrix Multiplication  30
Communication-Avoiding Parallel Sparse-Dense Matrix-Matrix M...
收藏 引用
30th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Koanantakool, Penporn Azad, Ariful Buluc, Aydin Morozov, Dmitriy Oh, Sang-Yun Oliker, Leonid Yelick, Katherine Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA 94720 USA Univ Calif Berkeley Dept EECS Berkeley CA 94720 USA Univ Calif Santa Barbara Dept Stat & Appl Probabil Santa Barbara CA 93106 USA
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of applications in many areas such as machine learning and graph algorithms. However, most previous work on parallel ma... 详细信息
来源: 评论
communication-avoiding Recursive Aggregation  25
Communication-Avoiding Recursive Aggregation
收藏 引用
25th IEEE International Conference on Cluster Computing (CLUSTER)
作者: Sun, Yihao Kumar, Sidharth Gilray, Thomas Micinski, Kristopher Syracuse Univ Dept Elect Engn & Comp Sci Syracuse NY 13210 USA Univ Illinois Dept Comp Sci Chicago IL USA Univ Alabama Birmingham Dept Comp Sci Birmingham AL 35233 USA
Recursive aggregation has been of considerable interest due to its unifying a wide range of deductive-analytic workloads, including social-media mining and graph analytics. For example, Single-Source Shortest Paths (S... 详细信息
来源: 评论
Reconstructing Householder vectors from Tall-Skinny QR
收藏 引用
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 2015年 85卷 3-31页
作者: Ballard, G. Demmel, J. Grigori, L. Jacquelin, M. Knight, N. Nguyen, H. D. Sandia Natl Labs Livermore CA USA Univ Calif Berkeley Berkeley CA 94720 USA INRIA Paris Rocquencourt Paris France Lawrence Berkeley Natl Lab Berkeley CA USA
The Tall-Skinny QR (TSQR) algorithm is more communication efficient than the standard Householder algorithm for QR decomposition of matrices with many more rows than columns. However, TSQR produces a different represe... 详细信息
来源: 评论
A massively parallel tensor contraction framework for coupled-cluster computations
收藏 引用
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 2014年 第12期74卷 3176-3190页
作者: Solomonik, Edgar Matthews, Devin Hammond, Jeff R. Stanton, John F. Demmel, James Univ Calif Berkeley Dept Elect Engn & Comp Sci Berkeley CA 94720 USA Univ Texas Austin Dept Chem & Biochem Austin TX 78712 USA Argonne Natl Lab Leadership Comp Facil Argonne IL 60439 USA
Precise calculation of molecular electronic wavefunctions by methods such as coupled-cluster requires the computation of tensor contractions, the cost of which has polynomial computational scaling with respect to the ... 详细信息
来源: 评论
SHIFTED CHOLESKY QR FOR COMPUTING THE QR FACTORIZATION OF ILL-CONDITIONED MATRICES
收藏 引用
SIAM JOURNAL ON SCIENTIFIC COMPUTING 2020年 第1期42卷 A477-A503页
作者: Fukaya, Takeshi Kannan, Ramaseshan Nakatsukasa, Yuji Yamamoto, Yusaku Yanagisawa, Yuka Hokkaido Univ Sapporo Hokkaido Japan Arup 3 Piccadilly Pl Manchester M1 3BN Lancs England Univ Oxford Math Inst Oxford OX2 6GG England Univ Electrocommun Tokyo Japan Waseda Univ Waseda Res Inst Sci & Engn Tokyo Japan
The Cholesky QR algorithm is an efficient communication-minimizing algorithm for computing the QR factorization of a tall-skinny matrix X epsilon R-mxn, where m >> n. Unfortunately it is inherently unstable and ... 详细信息
来源: 评论