Search criteria: "Subject = Task-based programming"
37 records; showing 11-20
DEISA: Dask-Enabled In Situ Analytics
28th Annual IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)
Authors: Gueroudji, Amal; Bigot, Julien; Raffin, Bruno
Affiliations: Univ Paris Saclay UVSQ CNRS CEA Maison Simulat F-91191 Gif Sur Yvette France; Univ Grenoble Alpes Inria CNRS Grenoble INP LIG F-38000 Grenoble France
A widening performance gap is separating CPU performance and IO bandwidth on large scale systems. In some fields such as weather forecast and nuclear fusion, numerical models generate such amounts of data that classic...
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
19th European Dependable Computing Conference (EDCC)
Authors: Rocha, Isabelly; Felber, Pascal; Martorell, Xavier; Pasin, Marcelo; Schiavoni, Valerio; Unsal, Osman
Affiliations: Univ Neuchatel Inst Comp Sci IIUN Neuchatel Switzerland; Barcelona Supercomputing Ctr Barcelona Spain; Univ Politecn Cataluna Barcelona Spain
A common way of improving performance of applications for multi-core processors is to exploit parallelism. In deep learning (DL), training or tuning parameters use user's sensitive data, and thus preserving privac...
From Task-Based GPU Work Aggregation to Stellar Mergers: Turning Fine-Grained CPU Tasks into Portable GPU Kernels
5th Annual IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)
Authors: Daiss, Gregor; Diehl, Patrick; Marcello, Dominic; Kheirkhahan, Alireza; Kaiser, Hartmut; Pflueger, Dirk
Affiliations: Louisiana State Univ LSU Ctr Computat & Technol Baton Rouge LA 70803 USA; Univ Stuttgart IPVS Stuttgart Germany; Louisiana State Univ Dept Phys & Astron Baton Rouge LA USA
Meeting both scalability and performance portability requirements is a challenge for any HPC application, especially for adaptively refined ones. In Octo-Tiger, an astrophysics application for the simulation of stella...
Modeling the Energy Consumption for Concurrent Executions of Parallel Tasks
14th Communications and Networking Symposium (CNS 2011) / Spring Simulation Multiconference (SpringSim '11)
Authors: Rauber, Thomas; Ruenger, Gudula
Affiliations: Univ Bayreuth Bayreuth Germany; Tech Univ Chemnitz Chemnitz Germany
Programming models using parallel tasks provide portable performance and scalability for modular applications on many high-performance systems. This is achieved by the flexibility of a two-level programming structure ...
Performance Analysis and Optimisation of Two-Sided Factorization Algorithms for Heterogeneous Platform
15th Annual International Conference on Computational Science (ICCS)
Authors: Kabir, Khairul; Haidar, Azzam; Tomov, Stanimire; Dongarra, Jack
Affiliations: Univ Tennessee Knoxville TN USA; Oak Ridge Natl Lab Oak Ridge TN USA; Univ Manchester Manchester Lancs England
Many applications, ranging from big data analytics to nanostructure designs, require the solution of large dense singular value decomposition (SVD) or eigenvalue problems. A first step in the solution methodology for ...
Beyond Fork-Join: Integration of Performance Portable Kokkos Kernels with HPX
35th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Authors: Daiss, Gregor; Simberg, Mikael; Reverdell, Auriane; Biddiscombe, John; Pollinger, Theresa; Kaiser, Hartmut; Pfluger, Dirk
Affiliations: Univ Stuttgart Inst Parallel & Distributed Syst Sci Comp Stuttgart Germany; Swiss Natl Supercomp Ctr Porza Switzerland; Louisiana State Univ CCT Baton Rouge LA 70803 USA
Between a widening range of GPU vendors and the trend of having more GPUs per compute node in supercomputers such as Summit, Perlmutter, Frontier and Aurora, developing performant yet portable distributed HPC applicat...
Divide and Conquer Symmetric Tridiagonal Eigensolver for Multicore Architectures
29th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Authors: Pichon, Gregoire; Haidar, Azzam; Faverge, Mathieu; Kurzak, Jakub
Affiliations: Inria Bordeaux Sud Ouest Bordeaux INP Talence France; Univ Tennessee Innovat Comp Lab Knoxville TN USA
Computing eigenpairs of a symmetric matrix is a problem arising in many industrial applications, including quantum physics and finite-elements computation for automobiles. A classical approach is to reduce the matrix ...
On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM)
37th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Authors: Agullo, Emmanuel; Buttari, Alfredo; Coulaud, Olivier; Eyraud-Dubois, Lionel; Faverge, Mathieu; Franc, Alain; Guermouche, Abdou; Jego, Antoine; Peressoni, Romain; Pruvost, Florent
Affiliations: INRIA Le Chesnay France; Univ Bordeaux Bordeaux France; Bordeaux INP Bordeaux France; LaBRI Saanichton BC Canada; CNRS Toulouse France; INPT Toulouse France; IRIT Sunnyvale CA USA
Dense matrix multiplication involving a symmetric input matrix (SYMM) is implemented in reference distributed-memory codes with the same data distribution as its general analogue (GEMM). We show that, when the symmetr...
Dynamic Task Fusion for a Block-Structured Finite Volume Solver over a Dynamically Adaptive Mesh with Local Time Stepping
37th International Supercomputing Conference on High Performance Computing (ISC High Performance Computing)
Authors: Li, Baojiu; Schulz, Holger; Weinzierl, Tobias; Zhang, Han
Affiliations: Univ Durham Inst Computat Cosmol Durham DH1 3FE England; Univ Durham Dept Comp Sci Durham DH1 3FE England; Univ Durham Large Scale Comp Inst Data Sci Durham DH1 3FE England
Load balancing of generic wave equation solvers over dynamically adaptive meshes with local time stepping is difficult, as the load changes with every time step. Task-based programming promises to mitigate the load ba...
Speaking Pygion: Experiences Writing an Exascale Single Particle Imaging Code
2nd International Workshop on Asynchronous Many-Task Systems and Applications (WAMTA)
Authors: Mirchandaney, Seema; Aiken, Alex; Slaughter, Elliott
Affiliations: SLAC Natl Accelerator Lab Menlo Pk CA 94025 USA; Stanford Univ Stanford CA 94305 USA
The goal of the SpiniFEL project was to write, from scratch, a single particle imaging code for exascale supercomputers. The original vision was to have two versions of the code, one in MPI and one in Pygion, a Python...