咨询与建议

限定检索结果

文献类型

  • 48 篇 期刊文献
  • 44 篇 会议
  • 1 篇 学位论文

馆藏范围

  • 93 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 89 篇 工学
    • 84 篇 计算机科学与技术...
    • 24 篇 电气工程
    • 18 篇 软件工程
    • 3 篇 信息与通信工程
    • 1 篇 控制科学与工程
  • 5 篇 理学
    • 4 篇 数学
    • 1 篇 物理学
  • 2 篇 管理学
    • 2 篇 管理科学与工程(可...

主题

  • 93 篇 parallel program...
  • 7 篇 mpi
  • 7 篇 parallel program...
  • 7 篇 high performance...
  • 5 篇 message passing
  • 5 篇 programming
  • 5 篇 runtime systems
  • 5 篇 hpc
  • 5 篇 openmp
  • 4 篇 openacc
  • 4 篇 heterogeneous co...
  • 4 篇 cloud computing
  • 4 篇 kokkos
  • 3 篇 component models
  • 3 篇 productivity
  • 3 篇 c plus plus meta...
  • 3 篇 active messages
  • 3 篇 nested paralleli...
  • 3 篇 embedded systems
  • 3 篇 cuda

机构

  • 5 篇 barcelona superc...
  • 3 篇 evidence srl pis...
  • 3 篇 univ politecn ca...
  • 2 篇 barcelona superc...
  • 2 篇 chapman univ ora...
  • 2 篇 csic iiia artifi...
  • 2 篇 thales res & tec...
  • 2 篇 queens univ belf...
  • 2 篇 swiss fed inst t...
  • 2 篇 nvidia corp sant...
  • 2 篇 univ tennessee c...
  • 2 篇 oak ridge natl l...
  • 2 篇 bsc barcelona
  • 1 篇 univ pisa dipart...
  • 1 篇 riphah int univ ...
  • 1 篇 univ bologna iis...
  • 1 篇 univ politecn ca...
  • 1 篇 barcelona superc...
  • 1 篇 seoul natl univ ...
  • 1 篇 univ calif davis...

作者

  • 9 篇 badia rosa m.
  • 6 篇 ayguade eduard
  • 5 篇 labarta jesus
  • 4 篇 gai paolo
  • 4 篇 marongiu andrea
  • 4 篇 martorell xavier
  • 3 篇 marozzo fabrizio
  • 3 篇 denny joel
  • 3 篇 gonzalez-tallada...
  • 3 篇 quinones eduardo
  • 3 篇 benini luca
  • 3 篇 talia domenico
  • 3 篇 vetter jeffrey s...
  • 3 篇 nikolopoulos dim...
  • 3 篇 jimenez-gonzalez...
  • 3 篇 scordino claudio
  • 3 篇 tejedor enric
  • 3 篇 lee seyong
  • 3 篇 filgueras antoni...
  • 3 篇 valero-lara pedr...

语言

  • 91 篇 英文
  • 1 篇 德文
  • 1 篇 中文
检索条件"主题词=parallel programming models"
93 条 记 录,以下是81-90 订阅
排序:
Scalable task parallel programming in the partitioned global address space
Scalable task parallel programming in the partitioned global...
收藏 引用
作者: Dinan, James Scott The Ohio State University
学位级别:Ph.D.
Applications that exhibit irregular, dynamic, and unbalanced parallelism are growing in number and importance in the computational science and engineering communities. These applications span many domains including co... 详细信息
来源: 评论
UCX: An Open Source Framework for HPC Network APIs and Beyond  23
UCX: An Open Source Framework for HPC Network APIs and Beyon...
收藏 引用
IEEE 23rd Annual Symposium on High-Performance Interconnects
作者: Shamis, Pavel Venkata, Manjunath Gorentla Lopez, M. Graham Baker, Matthew B. Hernandez, Oscar Itigin, Yossi Dubman, Mike Shainer, Gilad Graham, Richard L. Liss, Liran Shahar, Yiftah Potluri, Sreeram Rossetti, Davide Becker, Donald Poole, Duncan Lamb, Christopher Kumar, Sameer Stunkel, Craig Bosilca, George Bouteiller, Aurelien Oak Ridge Natl Lab Oak Ridge TN 37831 USA Mellanox Technol Yokneam Illit Israel NVIDIA Corp Santa Clara CA USA IBM Corp Armonk NY USA Univ Tennessee Knoxville TN USA
This paper presents Unified Communication X (UCX), a set of network APIs and their implementations for high throughput computing. UCX comes from the combined effort of national laboratories, industry, and academia to ... 详细信息
来源: 评论
Communication Avoiding 2D Stencil Implementations over PaRSEC Task-Based Runtime  34
Communication Avoiding 2D Stencil Implementations over PaRSE...
收藏 引用
34th IEEE International parallel and Distributed Processing Symposium (IPDPS)
作者: Pei, Yu Cao, Qinglei Bosilca, George Luszczek, Piotr Eijkhout, Victor Dongarra, Jack Univ Tennessee Innovat Comp Lab Knoxville TN 37996 USA Univ Texas Austin Texas Adv Comp Ctr Austin TX USA
Stencil computation or general sparse matrix-vector product (SpMV) are key components in many algorithms like geometric multigrid or Krylov solvers. But their low arithmetic intensity means that memory bandwidth and n... 详细信息
来源: 评论
Transparent execution of task-based parallel applications in Docker with COMP Superscalar  25
Transparent execution of task-based parallel applications in...
收藏 引用
25th Euromicro International Conference on parallel, Distributed and Network-Based Processing (PDP)
作者: Anton, Victor Ramon-Cortes, Cristian Ejarque, Jorge Badia, Rosa M. BSC Barcelona Spain CSIC IIIA Artificial Intelligence Res Inst Spanish Natl Res Council Barcelona Spain
This paper presents a framework to easily build and execute parallel applications in container-based distributed computing platforms in a user transparent way. The proposed framework is a combination of the COMP Super... 详细信息
来源: 评论
programming bare-metal accelerators with heterogeneous threading models:a case study of Matrix-3000
收藏 引用
Frontiers of Information Technology & Electronic Engineering 2023年 第4期24卷 509-520页
作者: Jianbin FANG Peng ZHANG Chun HUANG Tao TANG Kai LU Ruibo WANG Zheng WANG College of Computer Science and Technology National University of Defense TechnologyChangsha 410073China School of Computing University of LeedsLeeds LS29JTUK
As the hardware industry moves toward using specialized heterogeneous many-core processors to avoid the effects of the power wall,software developers are finding it hard to deal with the complexity of these *** this p... 详细信息
来源: 评论
A high-productivity task-based programming model for clusters
收藏 引用
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2012年 第18期24卷 2421-2448页
作者: Tejedor, Enric Farreras, Montse Grove, David Badia, Rosa M. Almasi, Gheorghe Labarta, Jesus Barcelona Supercomp Ctr BSC CNS Barcelona 08034 Spain Univ Politecn Cataluna UPC Barcelona Spain IBM Corp Thomas J Watson Res Ctr Yorktown Hts NY 10598 USA CSIC Artificial Intelligence Res Inst IIIA Barcelona Spain
programming for large-scale, multicore-based architectures requires adequate tools that offer ease of programming and do not hinder application performance. StarSs is a family of parallel programming models based on a... 详细信息
来源: 评论
PGAS-FMM: Implementing a distributed fast multipole method using the X10 programming language
收藏 引用
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2014年 第3期26卷 712-727页
作者: Milthorpe, Josh Rendell, Alistair P. Huber, Thomas Australian Natl Univ Res Sch Comp Sci Canberra ACT 0200 Australia Australian Natl Univ Res Sch Chem Canberra ACT 0200 Australia
The fast multipole method (FMM) is a complex, multi-stage algorithm over a distributed tree data structure, with multiple levels of parallelism and inherent data locality. X10 is a modern partitioned global address sp... 详细信息
来源: 评论
Enhancing Kokkos with OpenACC
收藏 引用
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 2024年 第5期38卷 409-426页
作者: Valero-Lara, Pedro Lee, Seyong Gonzalez-Tallada, Marc Denny, Joel Teranishi, Keita Vetter, Jeffrey S. Oak Ridge Natl Lab 1 Bethel Valley Rd Oak Ridge TN 37830 USA Univ Politecn Cataluna Barcelona Spain
C++ template metaprogramming has emerged as a prominent approach for achieving performance portability in heterogeneous computing. Kokkos represents a notable paradigm in this domain, offering programmers a suite of h... 详细信息
来源: 评论
EVALUATING COMPUTATIONAL COSTS WHILE HANDLING DATA AND CONTROL parallelISM
收藏 引用
parallel PROCESSING LETTERS 2008年 第1期18卷 165-174页
作者: Campa, Sonia Univ Pisa Dept Comp Sci I-56123 Pisa Italy
The aim of this work is to introduce a computational costs system associated to a semantic framework for orthogonal data and control parallelism handling. In such a framework a parallel application is described by a s... 详细信息
来源: 评论
parallel signal processing with S-Net
收藏 引用
Procedia Computer Science 2010年 第1期1卷 2085-2094页
作者: Frank Penczek Stephan Herhut Clemens Grelck Sven-Bodo Scholz Alex Shafarenko Rémi Barrère Eric Lenormand University of Hertfordshire School of Computer Science Hatfield UK University of Amsterdam Institute of Informatics Amsterdam The Netherlands Thales Research & Technologies Palaiseau France
We argue that programming high-end stream-processing applications requires a form of coordination language that enables the designer to represent interactions between stream-processing functions asynchronously. We fur... 详细信息
来源: 评论