咨询与建议

限定检索结果

文献类型

  • 3 篇 会议
  • 1 篇 期刊文献

馆藏范围

  • 4 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 4 篇 工学
    • 2 篇 计算机科学与技术...
    • 2 篇 软件工程

主题

  • 4 篇 nested loop para...
  • 3 篇 gpu
  • 2 篇 code generation
  • 1 篇 index distributi...
  • 1 篇 cost model
  • 1 篇 optimization
  • 1 篇 python
  • 1 篇 parallel algorit...

机构

  • 2 篇 univ glasgow sch...
  • 1 篇 doctoral school ...
  • 1 篇 faculty of econo...
  • 1 篇 john von neumann...
  • 1 篇 univ glasgow gla...

作者

  • 3 篇 singer jeremy
  • 3 篇 jacob dejice
  • 2 篇 trinder phil
  • 1 篇 szénási sándor
  • 1 篇 pintér ádám

语言

  • 4 篇 英文
检索条件"主题词=Nested loop parallelization"
4 条 记 录,以下是1-10 订阅
排序:
ALPyNA: Acceleration of loops in Python for Novel Architectures  6
ALPyNA: Acceleration of Loops in Python for Novel Architectu...
收藏 引用
6th ACM SIGPLAN International Workshop on Libraries, Languages and Compilers for Array Programming (ARRAY)
作者: Jacob, Dejice Singer, Jeremy Univ Glasgow Sch Comp Sci Glasgow Lanark Scotland
We present ALPyNA, an automatic loop parallelization framework for Python, which analyzes data dependences within nested loops and dynamically generates CUDA kernels for GPU execution. The ALPyNA system applies classi... 详细信息
来源: 评论
Python Programmers Have GPUs too Automatic Python loop parallelization with Staged Dependence Analysis  15
Python Programmers Have GPUs too Automatic Python Loop Paral...
收藏 引用
15th ACM SIGPLAN International Symposium on Dynamic Languages (DLS)
作者: Jacob, Dejice Trinder, Phil Singer, Jeremy Univ Glasgow Sch Comp Sci Glasgow Lanark Scotland
Python is a popular language for end-user software development in many application domains. End-users want to harness parallel compute resources effectively, by exploiting commodity manycore technology including GPUs.... 详细信息
来源: 评论
Pricing Python Parallelism: A Dynamic Language Cost Model for Heterogeneous Platforms  16
Pricing Python Parallelism: A Dynamic Language Cost Model fo...
收藏 引用
16th ACM SIGPLAN International Symposium on Dynamic Languages (DLS)
作者: Jacob, Dejice Trinder, Phil Singer, Jeremy Univ Glasgow Glasgow Lanark Scotland
Execution times may be reduced by offloading parallel loop nests to a GPU. Auto-parallelizing compilers are common for static languages, often using a cost model to determine when the GPU execution speed will outweigh... 详细信息
来源: 评论
Index Dependent nested loops parallelization with an Even Distributed Number of Steps
Informatica (Slovenia)
收藏 引用
Informatica (Slovenia) 2021年 第4期45卷 493-506页
作者: Pintér, Ádám Szénási, Sándor John von Neumann Faculty of Informatics Óbuda University Hungary Doctoral School of Applied Informatics and Applied Mathematics Óbuda University Hungary Faculty of Economics J. Selye University Slovakia
Parallel processing of algorithms is an effective way to achieve higher performance on multiprocessor systems rather. During parallelization, it is critical to minimize the difference between the processing time for t... 详细信息
来源: 评论