咨询与建议

限定检索结果

文献类型

  • 3 篇 期刊文献
  • 1 篇 会议

馆藏范围

  • 4 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 4 篇 工学
    • 4 篇 计算机科学与技术...
    • 1 篇 电气工程
    • 1 篇 信息与通信工程

主题

  • 4 篇 overlapping comm...
  • 2 篇 mpi
  • 1 篇 parallel algorit...
  • 1 篇 multicomputer
  • 1 篇 volumetric decom...
  • 1 篇 single-relaxatio...
  • 1 篇 hypercube
  • 1 篇 3d discrete four...
  • 1 篇 parallel model
  • 1 篇 perfect load bal...
  • 1 篇 asynchronous com...
  • 1 篇 block tensor mat...
  • 1 篇 lattice moltzman...
  • 1 篇 fft
  • 1 篇 shared-memory pa...
  • 1 篇 roofline model
  • 1 篇 distributed-memo...
  • 1 篇 performance anal...
  • 1 篇 cannon's algorit...
  • 1 篇 parallel computi...

机构

  • 1 篇 forschungszentru...
  • 1 篇 rensselaer polyt...
  • 1 篇 univ bonn comp s...
  • 1 篇 department of co...
  • 1 篇 forschungszentru...
  • 1 篇 shanghai univ sc...
  • 1 篇 shanghai univ ct...
  • 1 篇 forschungszentru...
  • 1 篇 univ colorado bo...
  • 1 篇 goethe univ fran...
  • 1 篇 shanghai univ sc...
  • 1 篇 rensselaer polyt...

作者

  • 1 篇 malapally nitin
  • 1 篇 dervis argun
  • 1 篇 lippert thomas
  • 1 篇 shephard mark s.
  • 1 篇 ovcharenko aleks...
  • 1 篇 jansen kenneth e...
  • 1 篇 zhou liping
  • 1 篇 sahni onkar
  • 1 篇 carothers christ...
  • 1 篇 xu lei
  • 1 篇 liu zhixiang
  • 1 篇 zhang wu
  • 1 篇 ibanez daniel
  • 1 篇 carloni paolo
  • 1 篇 aykanat cevdet
  • 1 篇 bolnykh viachesl...
  • 1 篇 suarez estela
  • 1 篇 mandelli davide
  • 1 篇 wang xiaowei
  • 1 篇 fang yong

语言

  • 4 篇 英文
检索条件"主题词=Overlapping communication and computation"
4 条 记 录,以下是1-10 订阅
排序:
Parallel overlapping Mechanism Between communication and computation of the Lattice Boltzmann Method  3rd
Parallel Overlapping Mechanism Between Communication and Com...
收藏 引用
3rd International Conference on High-Performance Computing and Applications (HPCA)
作者: Liu, Zhixiang Fang, Yong Song, Anping Xu, Lei Wang, Xiaowei Zhou, Liping Zhang, Wu Shanghai Univ Sch Commun & Informat Engn Shanghai 200444 Peoples R China Shanghai Univ Ctr High Performance Comp Shanghai 200444 Peoples R China Shanghai Univ Sch Comp Engn & Sci Shanghai 200444 Peoples R China
The lattice Boltzmann Method (LBM), different from classical numerical methods of continuum mechanics, is derived from molecular dynamics. The LBM has the following main advantages: including a simple algorithm, the d... 详细信息
来源: 评论
Neighborhood communication paradigm to increase scalability in large-scale dynamic scientific applications
收藏 引用
PARALLEL COMPUTING 2012年 第3期38卷 140-156页
作者: Ovcharenko, Aleksandr Ibanez, Daniel Delalondre, Fabien Sahni, Onkar Jansen, Kenneth E. Carothers, Christopher D. Shephard, Mark S. Rensselaer Polytech Inst Sci Computat Res Ctr SCOREC Troy NY 12180 USA Univ Colorado Boulder Dept Aerosp Engn Sci Boulder CO 80309 USA Rensselaer Polytech Inst Dept Comp Sci Troy NY 12180 USA
This paper introduces a general-purpose communication package built on top of MPI which is aimed at improving inter-processor communications independently of the supercomputer architecture being considered. The packag... 详细信息
来源: 评论
3D DFT by block tensor-matrix multiplication via a modified Cannon's algorithm: Implementation and scaling on distributed-memory clusters with fat tree networks
收藏 引用
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 2024年 193卷
作者: Malapally, Nitin Bolnykh, Viacheslav Suarez, Estela Carloni, Paolo Lippert, Thomas Mandelli, Davide Forschungszentrum Julich Computat Biomed IAS 5 INM 9 Wilhelm Johnen Str D-52428 Julich Germany Forschungszentrum Julich Julich Supercomp Ctr JSC Wilhelm Johnen Str D-52428 Julich Germany Univ Bonn Comp Sci Dept Bonn Germany Forschungszentrum Julich Mol Neurosci & Neuroimaging IN -1 1 Wilhelm Johnen Str D-52428 Julich Germany Goethe Univ Frankfurt Inst Adv Studies Frankfurt Germany
A known scalability bottleneck of the parallel 3D FFT is its use of all -to -all communications. Here, we present S3DFT, a library that circumvents this by using point-to-point communication - albeit at a higher arith... 详细信息
来源: 评论
Efficient overlapped FFT algorithms for hypercube-connected multicomputers
收藏 引用
Parallel Algorithms and Applications 1994年 第1-2期4卷 91-110页
作者: Aykanat, Cevdet Dervis, Argun Department of Computer Engineering Bilkent University 06533 Bilkent Ankara Turkey
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercube-connected multicomputers, which are more elegant and efficient than the existing ones. The proposed algorithms achieve perfect loa... 详细信息
来源: 评论