Search query: "Any field = Proceedings of the 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming"
340 records; showing 251-260

Ownership Passing: Efficient Distributed Memory Programming on Multi-core Systems
18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '13)
Authors: Friedley, Andrew; Hoefler, Torsten; Bronevetsky, Greg; Lumsdaine, Andrew; Ma, Ching-Chen
Affiliations: Indiana Univ, Bloomington, IN 47405, USA; ETH Zurich, Switzerland; Lawrence Livermore Natl Lab, Livermore, CA, USA; Rose Hulman Inst Technol, Terre Haute, IN 47803, USA
The number of cores in multi- and many-core high-performance processors is steadily increasing. MPI, the de-facto standard for programming high-performance computing systems, offers a distributed memory programming mod...

TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs
27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '22)
Authors: Niu, Yuyao; Lu, Zhengyang; Ji, Haonan; Song, Shuhui; Jin, Zhou; Liu, Weifeng
Affiliations: China Univ Petr, Super Sci Software Lab, Beijing, Peoples R China
Sparse general matrix-matrix multiplication (SpGEMM) is one of the most fundamental building blocks in sparse linear solvers, graph processing frameworks and machine learning applications. The existing parallel approa...

DAPPLE: A pipelined data parallel approach for training large models
26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2021
Authors: Fan, Shiqing; Rong, Yi; Meng, Chen; Cao, Zongyan; Wang, Siyu; Zheng, Zhen; Wu, Chuan; Long, Guoping; Yang, Jun; Xia, Lixue; Diao, Lansong; Liu, Xiaoyong; Lin, Wei
Affiliations: Alibaba Group, China
It is a challenging task to train large DNN models on sophisticated GPU platforms with diversified interconnect capabilities. Recently, pipelined training has been proposed as an effective approach for improving devic...

Provably Fast and Space-Efficient Parallel Biconnectivity
28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2023
Authors: Dong, Xiaojun; Wang, Letong; Gu, Yan; Sun, Yihan
Affiliations: UC Riverside
Computing biconnected components (BCC) of a graph is a fundamental graph problem. The canonical parallel BCC algorithm is the Tarjan-Vishkin algorithm, which has O(n + m) optimal work and polylogarithmic span on a gra...

Scheduling Parallel Programs by Work Stealing with Private Deques
18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '13)
Authors: Acar, Umut A.; Chargueraud, Arthur; Rainey, Mike
Affiliations: Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213, USA; Univ Paris 11, CNRS, LRI, Orsay, France
Work stealing has proven to be an effective method for scheduling parallel programs on multicore computers. To achieve high performance, work stealing distributes tasks between concurrent queues, called deques, which ...

An ownership policy and deadlock detector for promises
26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2021
Authors: Voss, Caleb; Sarkar, Vivek
Affiliations: Georgia Institute of Technology, United States
Task-parallel programs often enjoy deadlock freedom under certain restrictions, such as the use of structured join operations, as in Cilk and X10, or the use of asynchronous task futures together with deadlock-avoidin...

Data structures for task-based priority scheduling
19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2014
Authors: Wimmer, Martin; Versaci, Francesco; Träff, Jesper Larsson; Cederman, Daniel; Tsigas, Philippas
Affiliations: Faculty of Informatics, Parallel Computing, Vienna University of Technology, 1040 Vienna/Wien, Austria; Computer Science and Engineering, Chalmers University of Technology, 412 96 Göteborg, Sweden
We present three lock-free data structures for priority task scheduling: a priority work-stealing one, a centralized one with ρ-relaxed semantics, and a hybrid one combining both concepts. With the single-source shor...

Merchandiser: Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications with Load-Balance Awareness
28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2023
Authors: Xie, Zhen; Liu, Jie; Li, Jiajia; Li, Dong
Affiliations: University of California Argonne National Laboratory Merced, United States; University of California, Merced, United States; North Carolina State University, United States
The emergence of heterogeneous memory (HM) provides a cost-effective and high-performance solution to memory-consuming HPC applications. Deciding the placement of data objects on HM is critical for high performance. W...

Data locality and load balancing in COOL
4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 1993
Authors: Chandra, Rohit; Gupta, Anoop; Hennessy, John L.
Affiliations: Center for Integrated Systems, Stanford University, Stanford, CA 94305, United States
Large-scale shared memory multiprocessors typically support a multilevel memory hierarchy consisting of per-processor caches, a local portion of shared memory, and remote shared memory. On such machines, the performan...

Thread to Strand Binding of Parallel Network Applications in Massive Multi-Threaded Systems
15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Authors: Radojkovic, Petar; Cakarevic, Vladimir; Verdu, Javier; Pajuelo, Alex; Cazorla, Francisco J.; Nemirovsky, Mario; Valero, Mateo
Affiliations: Univ Politecn Cataluna, E-08028 Barcelona, Spain; CSIC, Madrid, Spain
In processors with several levels of hardware resource sharing, like CMPs in which each core is an SMT, the scheduling process becomes more complex than in processors with a single level of resource sharing, such as p...