Search criteria: "Any field = 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016"
21 records; showing 11-20
Declarative Coordination of Graph-based Parallel Programs
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Cruz, Flavio; Rocha, Ricardo; Goldstein, Seth Copen
Affiliations: Univ Porto CRACS Rua Campo Alegre 1021 P-4169007 Oporto Portugal; Univ Porto INESC TEC Rua Campo Alegre 1021 P-4169007 Oporto Portugal; Univ Porto Fac Sci Rua Campo Alegre 1021 P-4169007 Oporto Portugal; Carnegie Mellon Univ Pittsburgh PA 15213 USA
Declarative programming has been hailed as a promising approach to parallel programming since it makes it easier to reason about programs while hiding the implementation details of parallelism from the programmer. How...

High Performance Model Based Image Reconstruction
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Wang, Xiao; Sabne, Amit; Kisner, Sherman; Raghunathan, Anand; Bouman, Charles; Midkiff, Samuel
Affiliations: Purdue Univ Sch Elect & Comp Engn W Lafayette IN 47907 USA; High Performance Imaging LLC W Lafayette IN USA
Computed Tomography (CT) Image Reconstruction is an important technique used in a wide range of applications, ranging from explosive detection, medical imaging to scientific imaging. Among available reconstruction met...

NUMA-aware Scheduling and Memory Allocation for data-flow task-parallel Applications
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Drebes, Andi; Pop, Antoniu; Heydemann, Karine; Drach, Nathalie; Cohen, Albert
Affiliations: Univ Manchester Sch Comp Sci Manchester M13 9PL Lancs England; UPMC Paris 06 Sorbonne Univ CNRS LIP6 UMR 7606 Paris France; Inria Ecole Normale Super Rocquencourt France
Dynamic task parallelism is a popular programming model on shared-memory systems. Compared to data parallel loop-based concurrency, it promises enhanced scalability, load balancing and locality. These promises, howeve...

AUTOGEN: Automatic Discovery of Cache-Oblivious Parallel Recursive Algorithms for Solving Dynamic Programs
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Chowdhury, Rezaul; Ganapathi, Pramod; Tithi, Jesmin Jahan; Bachmeier, Charles; Kuszmaul, Bradley C.; Leiserson, Charles E.; Solar-Lezama, Armando; Tang, Yuan
Affiliations: SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA; MIT Comp Sci & Artificial Intelligence Lab Cambridge MA 02139 USA; Fudan Univ Shanghai Key Lab Intelligent Informat Proc Sch Software Shanghai Peoples R China
We present AUTOGEN, an algorithm that for a wide class of dynamic programming (DP) problems automatically discovers highly efficient cache-oblivious parallel recursive divide-and-conquer algorithms from inefficient ite...

Multi-Core On-The-Fly SCC Decomposition
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Bloemen, Vincent; Laarman, Alfons; van de Pol, Jaco
Affiliations: Univ Twente Formal Methods & Tools POB 217 NL-7500 AE Enschede Netherlands; Vienna Univ Technol FORSYTE Vienna Austria
The main advantages of Tarjan's strongly connected component (SCC) algorithm are its linear time complexity and ability to return SCCs on-the-fly, while traversing or even generating the graph. Until now, most par...

A High-Performance Parallel Algorithm for Nonnegative Matrix Factorization
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
Authors: Kannan, Ramakrishnan; Ballard, Grey; Park, Haesun
Affiliations: Georgia Tech Atlanta GA 30332 USA; Sandia Natl Labs Livermore CA 94550 USA
Non-negative matrix factorization (NMF) is the problem of determining two non-negative low rank factors W and H, for the given input matrix A, such that A ≈ WH. NMF is a useful tool for many applications ...

Scalable adaptive NUMA-aware Lock combining local locking and remote locking for efficient concurrency
21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2016
Authors: Zhang, Mingzhe; Lau, Francis C.M.; Wang, Cho-Li; Cheng, Luwei; Chen, Haibo
Affiliations: Dept. Computer Science University of Hong Kong Hong Kong; Facebook United States; Institute of Parallel and Distributed Systems Shanghai Jiao Tong University China
Scalable locking is a key building block for scalable multi-threaded software. Its performance is especially critical in multi-socket, multi-core machines with non-uniform memory access (NUMA). Previous schemes such a...

Assessing the performance portability of modern parallel programming models using TeaLeaf
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, vol. 29, no. 15, pp. 1-15
Authors: Martineau, Matthew; McIntosh-Smith, Simon; Gaudin, Wayne
Affiliations: Univ Bristol HPC Grp Bristol Avon England; UK Atom Weap Estab AWE Aldermaston England
In this work, we evaluate several emerging parallel programming models: Kokkos, RAJA, OpenACC, and OpenMP 4.0, against the mature CUDA and OpenCL APIs. Each model has been used to port TeaLeaf, a miniature proxy appli...

Enabling semantics to improve detection of data races and misuses of lock-free data structures
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, vol. 29, no. 15
Authors: Dolz, Manuel F.; Astorga, David Del Rio; Fernandez, Javier; Torquati, Massimo; Garcia, Jose Daniel; Garcia-Carballeira, Felix; Danelutto, Marco
Affiliations: Univ Carlos III Madrid Dept Comp Sci Madrid 28911 Spain; Univ Pisa Dept Comp Sci I-56127 Pisa Italy
The rapid progress of multi/many-core architectures has caused data-intensive parallel applications not yet fully optimized to deliver the best performance. In the advent of concurrent programming, frameworks offering...

Guided installation of basic linear algebra routines in a cluster with manycore components
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, vol. 29, no. 15, pp. 1-14
Authors: Cuenca, J.; Garcia, L. P.; Gimenez, D.; Herrera, F. J.
Affiliations: Univ Murcia Dept Engn & Technol Comp Murcia Spain; Tech Univ Cartagena Serv Support Technol Res Murcia Spain; Univ Murcia Dept Comp & Syst Murcia Spain
Computational systems are nowadays composed of basic computational components that share multiprocessors and coprocessors of different types, typically several graphics processing units (GPUs) or many integrated cores...