咨询与建议

限定检索结果

文献类型

  • 13 篇 会议
  • 7 篇 期刊文献

馆藏范围

  • 20 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 19 篇 工学
    • 19 篇 计算机科学与技术...
    • 5 篇 软件工程
    • 2 篇 电气工程
    • 1 篇 仪器科学与技术
    • 1 篇 信息与通信工程
  • 3 篇 理学
    • 2 篇 化学
    • 1 篇 物理学

主题

  • 20 篇 communication-co...
  • 4 篇 mpi
  • 2 篇 coarse grain pip...
  • 2 篇 macro-systolic a...
  • 2 篇 performance inst...
  • 2 篇 supernode partit...
  • 2 篇 latency hiding
  • 2 篇 spmd programs
  • 2 篇 tianhe-2
  • 2 篇 asynchronous cg
  • 2 篇 parallel applica...
  • 2 篇 non-blocking col...
  • 2 篇 pipelined cg
  • 2 篇 nonlinear optimi...
  • 2 篇 mpi collectives
  • 2 篇 parallel algorit...
  • 2 篇 automatic parall...
  • 2 篇 hpcg
  • 1 篇 compilers
  • 1 篇 scalability

机构

  • 2 篇 pacific nw natl ...
  • 2 篇 chinese acad sci...
  • 2 篇 ohio state univ ...
  • 2 篇 oak ridge natl l...
  • 2 篇 natl univ def te...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ strasbourg ...
  • 1 篇 peking univ peop...
  • 1 篇 huawei co toga n...
  • 1 篇 tsinghua univ de...
  • 1 篇 univ chinese aca...
  • 1 篇 univ politecn ca...
  • 1 篇 univ lille 1 lif...
  • 1 篇 state key labora...
  • 1 篇 huawei technol c...
  • 1 篇 chinese acad sci...
  • 1 篇 huawei noahs ark...
  • 1 篇 univ houston dep...
  • 1 篇 istv valencienne...
  • 1 篇 tsinghua univ ct...

作者

  • 2 篇 tipparaju vinod
  • 2 篇 yang chao
  • 2 篇 nieplocha jarek
  • 2 篇 lu yutong
  • 2 篇 bernholdt david ...
  • 2 篇 sadayappan p.
  • 2 篇 rajopadhye s
  • 2 篇 shet aniruddha g...
  • 2 篇 andonov r
  • 1 篇 carlos sancho jo...
  • 1 篇 peng shaoliang
  • 1 篇 yao jun
  • 1 篇 ren xiaoli
  • 1 篇 vadhiyar sathish
  • 1 篇 rogowski marcin
  • 1 篇 yutong lu
  • 1 篇 xu yangtong
  • 1 篇 eyraud-dubois li...
  • 1 篇 al-zawawi ahmed
  • 1 篇 pericherla suren...

语言

  • 20 篇 英文
检索条件"主题词=communication-computation overlap"
20 条 记 录,以下是1-10 订阅
排序:
Maximizing communication-computation overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
收藏 引用
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 2017年 第6期45卷 1390-1416页
作者: Barigou, Youcef Gabriel, Edgar Univ Houston Dept Comp Sci Houston TX 77204 USA
Non-blocking collective communication operations extend the concept of collective operations by offering the additional benefit of being able to overlap communication and computation. They are often considered key bui... 详细信息
来源: 评论
MPI-aware Compiler Optimizations for Improving communication-computation overlap  09
MPI-aware Compiler Optimizations for Improving Communication...
收藏 引用
ACM SIGARCH International Conference on Supercomputing
作者: Danalis, Anthony Pollock, Lori Swany, Martin Cavazos, John Univ Tennessee Knoxville TN 37996 USA
Several existing compiler transformations can help improve communication-computation overlap in MPI applications. However, traditional compilers treat calls to the MPI library as a black box with unknown side effects ... 详细信息
来源: 评论
The Impact of Application's micro-Imbalance on the communication-computation overlap
The Impact of Application's micro-Imbalance on the Communica...
收藏 引用
19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP)
作者: Subotic, Vladimir Carlos Sancho, Jose Labarta, Jesus Valero, Mateo Barcelona Supercomp Ctr Barcelona Spain Univ Politecn Cataluna Barcelona Supercomp Ctr Barcelona Spain
Although the community sees overlapping communication and computation as a perspective avenue for advancing parallel execution, it remains unclear what type of applications, under which conditions, and to which extent... 详细信息
来源: 评论
A framework for characterizing overlap of communication and computation in parallel applications
收藏 引用
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS 2008年 第1期11卷 75-90页
作者: Shet, Aniruddha G. Sadayappan, P. Bernholdt, David E. Nieplocha, Jarek Tipparaju, Vinod Ohio State Univ Dept Comp Sci & Engn Columbus OH 43210 USA Oak Ridge Natl Lab Comp Sci & Math Div Oak Ridge TN 37831 USA Pacific NW Natl Lab Appl Comp Sci Grp Richland WA 99352 USA
Effective overlap of computation and communication is a well understood technique for latency hiding and can yield significant performance gains for applications on high-end computers. In this paper, we propose an ins... 详细信息
来源: 评论
A Compiler Transformation to overlap communication with Dependent computation  9
A Compiler Transformation to Overlap Communication with Depe...
收藏 引用
2015 9th International Conference on Partitioned Global Address Space Programming Models (PGAS)
作者: Murthy, Karthik Mellor-Crummey, John Rice Univ Houston TX 77251 USA
Hiding communication latency is essential to achieve scalable performance on current and future parallel systems. In this extended abstract, we present a novel compiler transformation that overlaps communication with ... 详细信息
来源: 评论
A framework for characterizing overlap of communication and computation in parallel applications
A framework for characterizing overlap of communication and ...
收藏 引用
IEEE International Conference on Cluster Computing
作者: Shet, Aniruddha G. Sadayappan, P. Bernholdt, David E. Nieplocha, Jarek Tipparaju, Vinod Ohio State Univ Dept Comp Sci & Engn Columbus OH 43210 USA Oak Ridge Natl Lab Comp Sci & Math Div Oak Ridge TN 37831 USA Pacific NW Natl Lab Appl Comp Sci Grp Richland WA 99352 USA
Effective overlap of computation and communication is a well understood technique for latency hiding and can yield significant performance gains for applications on high-end computers. In this paper, we propose an ins... 详细信息
来源: 评论
IMB-ASYNC: a revised method and benchmark to estimate MPI-3 asynchronous progress efficiency
收藏 引用
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS 2022年 第4期25卷 2683-2697页
作者: Medvedev, Alexey, V Lomonosov Moscow State Univ Inst Mech Moscow Russia
The article presents design and methodology of a novel benchmark suite named IMB-ASYNC. The presented suite and method are aimed at measuring and comparing practical communication-computation overlap levels for Messag... 详细信息
来源: 评论
Optimal orthogonal tiling of 2-D iterations
收藏 引用
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 1997年 第2期45卷 159-165页
作者: Andonov, R Rajopadhye, S LIMAV Valenciennes France IRISA Rennes France
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tuning of parallel codes. We address the problem of determining the tile size that minimizes the total execution time. We ... 详细信息
来源: 评论
Static tiling for heterogeneous computing platforms
收藏 引用
PARALLEL COMPUTING 1999年 第5期25卷 547-568页
作者: Boulet, P Dongarra, J Vivien, F Ecole Normale Super Lyon LIP F-69364 Lyon 07 France Univ Lille 1 LIFL F-59655 Villeneuve Dascq France Univ Tennessee Dept Comp Sci Knoxville TN 37996 USA Oak Ridge Natl Lab Math Sci Sect Oak Ridge TN 37831 USA Univ Strasbourg ICPS F-67400 Illkirch Graffenstaden France
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-source program transformation. However, little work has been devoted to the mapping and scheduling of the tiles on physica... 详细信息
来源: 评论
Reducing communication Overhead in the High Performance Conjugate Gradient Benchmark on Tianhe-2  13
Reducing Communication Overhead in the High Performance Conj...
收藏 引用
13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)
作者: Liu, Fangfang Yang, Chao Liu, Yiqun Zhang, Xianyi Lu, Yutong Chinese Acad Sci Inst Software Beijing 100190 Peoples R China Chinese Acad Sci State Key Lab Comp Sci Beijing 100190 Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China Natl Univ Def Technol Dept Comp Sci & Technol Changsha 410073 Hunan Peoples R China
The High Performance Conjugate Gradient (HPCG) benchmark, proposed recently in 2013, has drawn increasingly more attention from both academia and industry. Unlike the High Performance Linpack (HPL) benchmark, which ha... 详细信息
来源: 评论