咨询与建议

限定检索结果

文献类型

  • 29 篇 会议
  • 7 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 37 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 34 篇 工学
    • 33 篇 计算机科学与技术...
    • 14 篇 软件工程
    • 9 篇 电气工程
    • 2 篇 机械工程
    • 2 篇 信息与通信工程
    • 1 篇 控制科学与工程
    • 1 篇 网络空间安全
  • 5 篇 理学
    • 4 篇 数学
    • 1 篇 物理学
  • 3 篇 管理学
    • 2 篇 管理科学与工程(可...
    • 1 篇 图书情报与档案管...

主题

  • 37 篇 task-based progr...
  • 4 篇 hpx
  • 4 篇 multicore
  • 4 篇 hpc
  • 4 篇 openmp
  • 3 篇 plasma
  • 3 篇 xeon phi
  • 3 篇 eigensolver
  • 3 篇 scheduling
  • 2 篇 parallelization
  • 2 篇 runtime system
  • 2 篇 dataflow
  • 2 篇 cuda
  • 2 篇 coordination lan...
  • 2 篇 performance port...
  • 2 篇 tile algorithms
  • 2 篇 mapping
  • 2 篇 exascale computi...
  • 2 篇 parsec
  • 2 篇 fpga

机构

  • 2 篇 univ politecn ca...
  • 2 篇 univ bayreuth de...
  • 2 篇 tech univ chemni...
  • 2 篇 oak ridge natl l...
  • 2 篇 univ tennessee d...
  • 1 篇 erasmus mc dept ...
  • 1 篇 univ leeds inst ...
  • 1 篇 univ durham inst...
  • 1 篇 louisiana state ...
  • 1 篇 univ neuchatel i...
  • 1 篇 univ durham larg...
  • 1 篇 inria le chesnay
  • 1 篇 university of te...
  • 1 篇 barcelona superc...
  • 1 篇 inpt toulouse
  • 1 篇 louisiana state ...
  • 1 篇 slac natl accele...
  • 1 篇 sandia natl labs...
  • 1 篇 technical univer...
  • 1 篇 univ bordeaux bo...

作者

  • 3 篇 kaiser hartmut
  • 3 篇 kurzak jakub
  • 3 篇 dongarra jack
  • 3 篇 haidar azzam
  • 3 篇 rauber thomas
  • 3 篇 ruenger gudula
  • 3 篇 thibault samuel
  • 3 篇 bosilca george
  • 2 篇 schuchart joseph
  • 2 篇 kalkhof torben
  • 2 篇 calandra henri
  • 2 篇 koch andreas
  • 2 篇 guermouche abdou
  • 2 篇 yarkhan asim
  • 2 篇 luszczek piotr
  • 2 篇 agullo emmanuel
  • 2 篇 faverge mathieu
  • 2 篇 herault thomas
  • 2 篇 diehl patrick
  • 2 篇 weinzierl tobias

语言

  • 36 篇 英文
  • 1 篇 其他
检索条件"主题词=Task-based Programming"
37 条 记 录,以下是11-20 订阅
排序:
nOS-V: Co-Executing HPC Applications Using System-Wide task Scheduling  38
nOS-V: Co-Executing HPC Applications Using System-Wide Task ...
收藏 引用
International Parallel and Distributed Processing Symposium (IPDPS)
作者: Alvarez, David Sala, Kevin Beltran, Vicenc Barcelona Supercomp Ctr Barcelona Spain
Future Exascale systems will feature massive parallelism, many-core processors and heterogeneous architectures. In this scenario, it is increasingly difficult for HPC applications to fully and efficiently utilize the ... 详细信息
来源: 评论
Speeding-Up LULESH on HPX: Useful Tricks and Lessons Learned using a Many-task-based Approach
Speeding-Up LULESH on HPX: Useful Tricks and Lessons Learned...
收藏 引用
2024 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC Workshops 2024
作者: Kalkhof, Torben Koch, Andreas Technical University of Darmstadt Embedded Systems and Applications Group Darmstadt Germany
Current programming models face challenges in dealing with modern supercomputers' growing parallelism and heterogeneity. Emerging programming models, like the task-based programming model found in the asynchronous... 详细信息
来源: 评论
On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM)  37
On the Arithmetic Intensity of Distributed-Memory Dense Matr...
收藏 引用
37th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Agullo, Emmanuel Buttari, Alfredo Coulaud, Olivier Eyraud-Dubois, Lionel Faverge, Mathieu Franc, Alain Guermouche, Abdou Jego, Antoine Peressoni, Romain Pruvost, Florent INRIA Le Chesnay France Univ Bordeaux Bordeaux France Bordeaux INP Bordeaux France LaBRI Saanichton BC Canada CNRS Toulouse France INPT Toulouse France IRIT Sunnyvale CA USA
Dense matrix multiplication involving a symmetric input matrix (SYMM) is implemented in reference distributed-memory codes with the same data distribution as its general analogue (GEMM). We show that, when the symmetr... 详细信息
来源: 评论
From task-based GPU Work Aggregation to Stellar Mergers: Turning Fine-Grained CPU tasks into Portable GPU Kernels  5
From Task-Based GPU Work Aggregation to Stellar Mergers: Tur...
收藏 引用
5th Annual IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)
作者: Daiss, Gregor Diehl, Patrick Marcello, Dominic Kheirkhahan, Alireza Kaiser, Hartmut Pflueger, Dirk Louisiana State Univ LSU Ctr Computat & Technol Baton Rouge LA 70803 USA Univ Stuttgart IPVS Stuttgart Germany Louisiana State Univ Dept Phys & Astron Baton Rouge LA USA
Meeting both scalability and performance portability requirements is a challenge for any HPC application, especially for adaptively refined ones. In Octo-Tiger, an astrophysics application for the simulation of stella... 详细信息
来源: 评论
Dynamic task Fusion for a Block-Structured Finite Volume Solver over a Dynamically Adaptive Mesh with Local Time Stepping  37th
Dynamic Task Fusion for a Block-Structured Finite Volume Sol...
收藏 引用
37th International Supercomputing Conference on High Performance Computing (ISC High Performance Computing)
作者: Li, Baojiu Schulz, Holger Weinzierl, Tobias Zhang, Han Univ Durham Inst Computat Cosmol Durham DH1 3FE England Univ Durham Dept Comp Sci Durham DH1 3FE England Univ Durham Large Scale Comp Inst Data Sci Durham DH1 3FE England
Load balancing of generic wave equation solvers over dynamically adaptive meshes with local time stepping is difficult, as the load changes with every time step. task-based programming promises to mitigate the load ba... 详细信息
来源: 评论
RosneT: A Block Tensor Algebra Library for Out-of-Core Quantum Computing Simulation  2
RosneT: A Block Tensor Algebra Library for Out-of-Core Quant...
收藏 引用
2nd International Workshop on Quantum Computing Software (QCS)
作者: Sanchez-Ramirez, Sergio Conejero, Javier Lordan, Francesc Queralt, Anna Cortes, Toni Badia, Rosa M. Garcia-Saez, Artur Barcelona Supercomp Ctr QUANTIC Barcelona Spain Barcelona Supercomp Ctr Workflows & Distributed Comp Barcelona Spain Univ Politecn Cataluna Barcelona Spain
With the advent of more powerful Quantum Computers, the need for larger Quantum Simulations has boosted. As the amount of resources grows exponentially with size of the target system Tensor Networks emerge as an optim... 详细信息
来源: 评论
DEISA: Dask-Enabled In Situ Analytics  28
DEISA: Dask-Enabled In Situ Analytics
收藏 引用
28th Annual IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)
作者: Gueroudji, Amal Bigot, Julien Raffin, Bruno Univ Paris Saclay UVSQ CNRS CEAMaison Simulat F-91191 Gif Sur Yvette France Univ Grenoble Alpes Inria CNRS Grenoble INPLIG F-38000 Grenoble France
A widening performance gap is separating CPU performance and IO bandwidth on large scale systems. In some fields such as weather forecast and nuclear fusion, numerical models generate such amounts of data that classic... 详细信息
来源: 评论
Beyond Fork-Join: Integration of Performance Portable Kokkos Kernels with HPX
Beyond Fork-Join: Integration of Performance Portable Kokkos...
收藏 引用
35th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Daiss, Gregor Simberg, Mikael Reverdell, Auriane Biddiscombe, John Pollinger, Theresa Kaiser, Hartmut Pfluger, Dirk Univ Stuttgart Inst Parallel & Distributed Syst Sci Comp Stuttgart Germany Swiss Natl Supercomp Ctr Porza Switzerland Louisiana State Univ CCT Baton Rouge LA 70803 USA
Between a widening range of GPU vendors and the trend of having more GPUs per compute node in supercomputers such as Summit, Perlmutter, Frontier and Aurora, developing performant yet portable distributed HPC applicat... 详细信息
来源: 评论
Parallel and Distributed task-based Kirchhoff Seismic Pre-Stack Depth Migration Application  20
Parallel and Distributed Task-Based Kirchhoff Seismic Pre-St...
收藏 引用
20th International Symposium on Parallel and Distributed Computing (ISPDC)
作者: Gurhem, Jerome Calandra, Henri Petiton, Serge G. Univ Lille UMR 9189 CRIStAL CNRS Lille France CNRS USR 3441 Maison Simulat Saclay France Total SA Pau France
Since the middle of the 1990s, message passing libraries are the most used technology to implement parallel and distributed scientific applications. However, they may not be a solution efficient enough on exascale mac... 详细信息
来源: 评论
Optimizing Distributed Load Balancing for Workloads with Time-Varying Imbalance
Optimizing Distributed Load Balancing for Workloads with Tim...
收藏 引用
IEEE International Conference on Cluster Computing (Cluster)
作者: Lifflander, Jonathan Slattengren, Nicole Lemaster Pebay, Philippe P. Miller, Phil Rizzi, Francesco Bettencourt, Matthew T. Sandia Natl Labs Livermore CA 94550 USA NexGen Analyt Sheridan WY USA Intense Comp New York NY USA
This paper explores dynamic load balancing algorithms used by asynchronous many-task (AMT), or 'task-based', programming models to optimize task placement for scientific applications with dynamic workload imba... 详细信息
来源: 评论