咨询与建议

限定检索结果

文献类型

  • 19 篇 会议
  • 12 篇 期刊文献

馆藏范围

  • 31 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 25 篇 工学
    • 25 篇 计算机科学与技术...
    • 6 篇 软件工程
    • 4 篇 电气工程
    • 1 篇 信息与通信工程
  • 10 篇 理学
    • 9 篇 数学
    • 1 篇 物理学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 31 篇 communication-av...
  • 5 篇 parallel algorit...
  • 3 篇 i/o-complexity
  • 3 篇 particle methods
  • 3 篇 fast matrix mult...
  • 3 篇 linear algebra
  • 2 篇 roundoff error a...
  • 2 篇 computational fl...
  • 2 篇 partial differen...
  • 2 篇 pipelined krylov...
  • 2 篇 s-step iterative...
  • 2 篇 qr decomposition
  • 2 篇 sparse matrix co...
  • 2 篇 asynchronous ite...
  • 2 篇 matrix multiplic...
  • 2 篇 domain decomposi...
  • 1 篇 nonnegative leas...
  • 1 篇 cholesky
  • 1 篇 op2
  • 1 篇 performance

机构

  • 6 篇 univ calif berke...
  • 4 篇 lawrence berkele...
  • 2 篇 georgia inst tec...
  • 2 篇 oak ridge natl l...
  • 2 篇 inria paris rocq...
  • 2 篇 univ calif berke...
  • 2 篇 univ calif berke...
  • 2 篇 univ electrocomm...
  • 2 篇 oregon state uni...
  • 1 篇 arup 3 piccadill...
  • 1 篇 wake forest univ...
  • 1 篇 nyu ny usa
  • 1 篇 syracuse univ de...
  • 1 篇 university of ma...
  • 1 篇 lawrence berkele...
  • 1 篇 sandia natl labs...
  • 1 篇 univ calif berke...
  • 1 篇 centralesupélec ...
  • 1 篇 pazmany peter ca...
  • 1 篇 devito codes eng...

作者

  • 8 篇 demmel james
  • 6 篇 schwartz oded
  • 5 篇 ballard grey
  • 4 篇 yelick katherine
  • 3 篇 holtz olga
  • 3 篇 buluc aydin
  • 3 篇 koanantakool pen...
  • 3 篇 kannan ramakrish...
  • 2 篇 sao piyush
  • 2 篇 magee daniel j.
  • 2 篇 nakatsukasa yuji
  • 2 篇 azad ariful
  • 2 篇 fukaya takeshi
  • 2 篇 yanagisawa yuka
  • 2 篇 solomonik edgar
  • 2 篇 yamamoto yusaku
  • 2 篇 lipshitz benjami...
  • 2 篇 vuduc richard
  • 2 篇 niemeyer kyle e.
  • 1 篇 jin peter

语言

  • 31 篇 英文
检索条件"主题词=Communication-Avoiding Algorithms"
31 条 记 录,以下是11-20 订阅
排序:
ROUNDOFF ERROR ANALYSIS OF THE CHOLESKYQR2 ALGORITHM
收藏 引用
ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS 2015年 44卷 306-326页
作者: Yamamoto, Yusaku Nakatsukasa, Yuji Yanagisawa, Yuka Fukaya, Takeshi Univ Electrocommun Tokyo Japan JST CREST Tokyo Japan Univ Tokyo Tokyo 1138656 Japan Waseda Univ Tokyo Japan RIKEN Adv Inst Computat Sci Kobe Hyogo Japan Hokkaido Univ Sapporo Hokkaido 060 Japan JST CREST Tokyo Japan
We consider the QR decomposition of an m x n matrix X with full column rank, where m >= n. Among the many algorithms available, the Cholesky QR algorithm is ideal from the viewpoint of high performance computing si... 详细信息
来源: 评论
Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems
收藏 引用
JOURNAL OF SUPERCOMPUTING 2021年 第2期77卷 1976-1997页
作者: Magee, Daniel J. Walker, Anthony S. Niemeyer, Kyle E. Oregon State Univ Sch Mech Ind & Mfg Engn Corvallis OR 97331 USA Los Alamos Natl Lab Los Alamos NM 87545 USA
Applications that exploit the architectural details of high-performance computing (HPC) systems have become increasingly invaluable in academia and industry over the past two decades. The most important hardware devel... 详细信息
来源: 评论
Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time-space decomposition
收藏 引用
JOURNAL OF COMPUTATIONAL PHYSICS 2018年 357卷 338-352页
作者: Magee, Daniel J. Niemeyer, Kyle E. Oregon State Univ Sch Mech Ind & Mfg Engn Corvallis OR 97331 USA
The expedient design of precision components in aerospace and other high-tech industries requires simulations of physical phenomena often described by partial differential equations (PDEs) without exact solutions. Mod... 详细信息
来源: 评论
Orthogonal Layers of Parallelism in Large-Scale Eigenvalue Computations
收藏 引用
ACM TRANSACTIONS ON PARALLEL COMPUTING 2023年 第3期10卷 1-31页
作者: Alvermann, Andreas Hager, Georg Fehske, Holger Univ Greifswald Inst Phys Felix Hausdorff Str 6 D-17489 Greifswald Germany Friedrich Alexander Univ Erlangen Nurnberg Erlangen Natl High Performance Comp Ctr Martensstr 1 D-91058 Erlangen Germany
We address the communication overhead of distributed sparse matrix-(multiple)-vector multiplication in the context of large-scale eigensolvers, using filter diagonalization as an example. The basis of our study is a p... 详细信息
来源: 评论
A Computation- and communication-Optimal Parallel Direct 3-Body Algorithm  14
A Computation- and Communication-Optimal Parallel Direct 3-B...
收藏 引用
International Conference on High Performance Computing, Networking, Storage and Analysis
作者: Koanantakool, Penporn Yelick, Katherine Univ Calif Berkeley Div Comp Sci Berkeley CA 94720 USA Lawrence Berkeley Natl Lab Berkeley CA USA
Traditional particle simulation methods are used to calculate pairwise potentials, but some problems require 3-body potentials that calculate over triplets of particles. A direct calculation of 3-body interactions inv... 详细信息
来源: 评论
A Supernodal All-Pairs Shortest Path Algorithm  20
A Supernodal All-Pairs Shortest Path Algorithm
收藏 引用
25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
作者: Sao, Piyush Kannan, Ramakrishnan Gera, Prasun Vuduc, Richard Oak Ridge Natl Lab Oak Ridge TN 37830 USA Georgia Inst Technol Atlanta GA 30332 USA
We show how to exploit graphs parsity in the Floyd-Warshall algorithm for the all-pairs shortest path (Apsp) problem. FLOYD-WARSHALL is an attractive choice for Apsp on high-performing systems due to its structural si... 详细信息
来源: 评论
Matrix Multiplication I/O-Complexity by Path Routing  15
Matrix Multiplication I/O-Complexity by Path Routing
收藏 引用
27th ACM symposium on Parallelism in algorithms and Architectures (SPAA)
作者: Scott, Jacob Holtz, Olga Schwartz, Oded Univ Calif Berkeley Berkeley CA 94720 USA Hebrew Univ Jerusalem Jerusalem Israel
We apply a novel technique based on path routings to obtain optimal I/O-complexity lower bounds for all Strassen-like fast matrix multiplication algorithms computed in serial or in parallel, assuming no reuse of nontr... 详细信息
来源: 评论
communication-Optimal Parallel Recursive Rectangular Matrix Multiplication
Communication-Optimal Parallel Recursive Rectangular Matrix ...
收藏 引用
IEEE 27th International Parallel and Distributed Processing Symposium (IPDPS)
作者: Demmel, James Eliahu, David Fox, Armando Kamil, Shoaib Lipshitz, Benjamin Schwartz, Oded Spillinger, Omer Univ Calif Berkeley Dept Math Berkeley CA 94720 USA MIT Cambridge MA 02139 USA Univ Calif Berkeley EECS Dept Berkeley CA 94720 USA
communication-optimal algorithms are known for square matrix multiplication. Here, we obtain the first communication-optimal algorithm for all dimensions of rectangular matrices. Combining the dimension-splitting tech... 详细信息
来源: 评论
A 3D Parallel Algorithm for QR Decomposition  18
A 3D Parallel Algorithm for QR Decomposition
收藏 引用
30th ACM Symposium on Parallelism in algorithms and Architectures (SPAA)
作者: Ballard, Grey Demmel, James Grigori, Laura Jacquelin, Mathias Knight, Nicholas Wake Forest Univ Winston Salem NC 27101 USA Univ Calif Berkeley Berkeley CA 94720 USA INRIA Paris Rocquencourt Paris France Lawrence Berkeley Natl Lab Berkeley CA USA NYU New York NY USA
Interprocessor communication often dominates the runtime of large matrix computations. We present a parallel algorithm for computing QR decompositions whose bandwidth cost (communication volume) can be decreased at th... 详细信息
来源: 评论
I/O-Optimal algorithms for Symmetric Linear Algebra Kernels  22
I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels
收藏 引用
34th ACM Symposium on Parallelism in algorithms and Architectures (SPAA)
作者: Beaumont, Olivier Eyraud-Dubois, Lionel Langou, Julien Verite, Mathieu Univ Bordeaux Inria Ctr Bordeaux France Univ Colorado Denver Denver Denver CO USA
In this paper, we consider two fundamental symmetric kernels in linear algebra: the Cholesky factorization and the symmetric rank-k update (SYRK), with the classical three nested loops algorithms for these kernels. In... 详细信息
来源: 评论