咨询与建议

限定检索结果

文献类型

  • 8 篇 会议
  • 4 篇 期刊文献

馆藏范围

  • 12 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 9 篇 工学
    • 8 篇 计算机科学与技术...
    • 2 篇 电气工程
    • 2 篇 软件工程
  • 2 篇 理学
    • 1 篇 数学
    • 1 篇 物理学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 12 篇 communication av...
  • 2 篇 symmetric eigenv...
  • 2 篇 band reduction
  • 2 篇 lu factorization
  • 2 篇 linear algebra
  • 1 篇 hls
  • 1 篇 sparse tiling
  • 1 篇 hardware acceler...
  • 1 篇 performance
  • 1 篇 unstructured mes...
  • 1 篇 non-volatile mem...
  • 1 篇 low-rank approxi...
  • 1 篇 vivado hls
  • 1 篇 cfd
  • 1 篇 spmm
  • 1 篇 sddmm
  • 1 篇 implicit timeste...
  • 1 篇 holo
  • 1 篇 algorithms
  • 1 篇 high-order/low-o...

机构

  • 2 篇 lawrence berkele...
  • 1 篇 univ paris 11 la...
  • 1 篇 los alamos natl ...
  • 1 篇 univ oxford oxfo...
  • 1 篇 univ paris panth...
  • 1 篇 univ calif berke...
  • 1 篇 univ paris 06 up...
  • 1 篇 swiss fed inst t...
  • 1 篇 univ manchester ...
  • 1 篇 inst def anal al...
  • 1 篇 nyu courant inst...
  • 1 篇 cnrs irit 2 rue ...
  • 1 篇 univ calif berke...
  • 1 篇 hebrew univ jeru...
  • 1 篇 illinois institu...
  • 1 篇 sorbonne univ cn...
  • 1 篇 dhirubhai ambani...
  • 1 篇 uc berkeley berk...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ calif berke...

作者

  • 2 篇 grigori laura
  • 2 篇 knight nicholas
  • 2 篇 demmel james
  • 1 篇 ballard grey
  • 1 篇 jezequel fabienn...
  • 1 篇 dongarra jack
  • 1 篇 hoefler torsten
  • 1 篇 reguly i.
  • 1 篇 gates mark
  • 1 篇 chacon l.
  • 1 篇 giles m. b.
  • 1 篇 tomov stanimire
  • 1 篇 taitano w.
  • 1 篇 mudalige g. r.
  • 1 篇 demme james
  • 1 篇 park h.
  • 1 篇 kwasniewski grze...
  • 1 篇 mary theo
  • 1 篇 chaudhury bhaska...
  • 1 篇 aparna sasidhara...

语言

  • 12 篇 英文
检索条件"主题词=communication avoiding algorithms"
12 条 记 录,以下是1-10 订阅
排序:
communication avoiding BLOCK LOW-RANK PARALLEL MULTIFRONTAL TRIANGULAR SOLVE WITH MANY RIGHT-HAND SIDES
收藏 引用
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS 2024年 第1期45卷 148-166页
作者: Amestoy, Patrick Boiteau, Olivier Buttari, Alfredo Gerest, Matthieu Jezequel, Fabienne L'excellent, Jean-yves Mary, Theo ENS Lyon Mumps Technol 46 Allee Italie F-69007 Lyon France EDF R&D F-91120 Palaiseau France CNRS IRIT 2 Rue Charles Camichel F-31071 Toulouse France Sorbonne Univ CNRS LIP6 F-75005 Paris France Univ Paris Pantheon Assas F-75005 Paris France Sorbonne Univ CNRS LIP6 F-75005 Paris France
Block low-rank (BLR) compression can significantly reduce the memory and time costs of parallel sparse direct solvers. In this paper, we investigate the performance of the BLR triangular solve phase, which we observe ... 详细信息
来源: 评论
Distributed-Memory Sparse Kernels for Machine Learning  36
Distributed-Memory Sparse Kernels for Machine Learning
收藏 引用
36th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS)
作者: Bharadwaj, Vivek Buluc, Aydin Demmel, James Univ Calif Berkeley EECS Dept Berkeley CA 94720 USA Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA USA
Sampled Dense Times Dense Matrix Multiplication (SDDMM) and Sparse Times Dense Matrix Multiplication (SpMM) appear in diverse settings, such as collaborative filtering, document clustering, and graph embedding. Freque... 详细信息
来源: 评论
Flexible communication avoiding Matrix Multiplication on FPGA with High-Level Synthesis  20
Flexible Communication Avoiding Matrix Multiplication on FPG...
收藏 引用
ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA)
作者: Licht, Johannes de Fine Kwasniewski, Grzegorz Hoefler, Torsten Swiss Fed Inst Technol Zurich Switzerland
Data movement is the dominating factor affecting performance and energy in modern computing systems. Consequently, many algorithms have been developed to minimize the number of I/O operations for common computing patt... 详细信息
来源: 评论
Parallel algorithms for Intersection Computation  24
Parallel Algorithms for Intersection Computation
收藏 引用
Proceedings of the Platform for Advanced Scientific Computing Conference
作者: Aparna Sasidharan Illinois Institute of Technology Chicago United States of America
This paper discusses parallel algorithms for computing intersections between pairs of meshes. We used parallel intersection algorithms to compute interpolation weights in coupled solvers which are part of multi-physic... 详细信息
来源: 评论
Parallel Fast Multipole Method accelerated FFT on HPC clusters
收藏 引用
PARALLEL COMPUTING 2021年 104卷 102783-102783页
作者: Mehta, Chahak Karthi, Amarnath Jetly, Vishrut Chaudhury, Bhaskar Dhirubhai Ambani Inst Informat & Commun Technol Grp Computat Sci & HPC Gandhinagar 382007 India
With increasing sizes of distributed systems, there comes an increased risk of communication bottlenecks. In the past decade there has been a growing interest in communication-avoiding algorithms. The distributed memo... 详细信息
来源: 评论
Translational process: Mathematical software perspective
收藏 引用
JOURNAL OF COMPUTATIONAL SCIENCE 2021年 52卷 101216-101216页
作者: Dongarra, Jack Gates, Mark Luszczek, Piotr Tomov, Stanimire Univ Tennessee Knoxville TN 37996 USA Oak Ridge Natl Lab Oak Ridge TN USA Univ Manchester Manchester Lancs England
Each successive generation of computer architecture has brought new challenges to achieving high performance mathematical solvers, necessitating development and analysis of new algorithms, which are then embodied in s... 详细信息
来源: 评论
Multiscale high-order/low-order (HOLO) algorithms and applications
收藏 引用
JOURNAL OF COMPUTATIONAL PHYSICS 2017年 330卷 21-45页
作者: Chacon, L. Chen, G. Knoll, D. A. Newman, C. Park, H. Taitano, W. Willert, J. A. Womeldorff, G. Los Alamos Natl Lab Los Alamos NM 87545 USA Inst Def Anal Alexandria VA 22311 USA
We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or seve... 详细信息
来源: 评论
Write-avoiding algorithms  30
Write-Avoiding Algorithms
收藏 引用
30th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Carson, Erin Demme, James Grigori, Laura Knight, Nicholas Koanantakool, Penporn Schwartz, Oded Simhadri, Harsha Vardhan NYU Courant Inst Math Sci New York NY 10003 USA Univ Calif Berkeley Dept Math Berkeley CA 94720 USA Univ Calif Berkeley Comp Sci Div Berkeley CA 94720 USA Univ Paris 06 UPMC CNRS UMR 7598Lab Jacques Louis Lions France Alpines INRIA Paris Rocquencourt Paris France Univ Calif Berkeley Div Comp Sci Berkeley CA 94720 USA Hebrew Univ Jerusalem Sch Engn & Comp Sci IL-91905 Jerusalem Israel Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA USA
communication, i.e., moving data between levels of a memory hierarchy or between processors over a network, is much more expensive (in time or energy) than arithmetic. There has thus been a recent focus on designing a... 详细信息
来源: 评论
avoiding communication through a Multilevel LU Factorization
收藏 引用
18th International Conference on Euro-Par Parallel Processing
作者: Donfack, Simplice Grigori, Laura Khabou, Amal Univ Paris 11 Lab Rech Informat INRIA Saclay Ile France Paris France
Due to the evolution of massively parallel computers towards deeper levels of parallelism and memory hierarchy, and due to the exponentially increasing ratio of the time required to transfer data, either through the m... 详细信息
来源: 评论
communication avoiding Symmetric Band Reduction
Communication Avoiding Symmetric Band Reduction
收藏 引用
17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
作者: Ballard, Grey Demmel, James Knight, Nicholas Univ Calif Berkeley Berkeley CA 94720 USA
The running time of an algorithm depends on both arithmetic and communication (i.e., data movement) costs, and the relative costs of communication are growing over time. In this work, we present both theoretical and p... 详细信息
来源: 评论