咨询与建议

限定检索结果

文献类型

  • 311 篇 会议
  • 19 篇 期刊文献

馆藏范围

  • 330 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 280 篇 工学
    • 240 篇 软件工程
    • 227 篇 计算机科学与技术...
    • 13 篇 电子科学与技术(可...
    • 8 篇 信息与通信工程
    • 5 篇 控制科学与工程
    • 4 篇 机械工程
    • 4 篇 生物工程
    • 3 篇 生物医学工程(可授...
    • 1 篇 力学(可授工学、理...
    • 1 篇 动力工程及工程热...
    • 1 篇 电气工程
    • 1 篇 核科学与技术
    • 1 篇 农业工程
    • 1 篇 环境科学与工程(可...
    • 1 篇 网络空间安全
  • 53 篇 理学
    • 49 篇 数学
    • 4 篇 生物学
    • 4 篇 系统科学
    • 4 篇 统计学(可授理学、...
    • 2 篇 化学
  • 15 篇 管理学
    • 11 篇 管理科学与工程(可...
    • 8 篇 工商管理
    • 4 篇 图书情报与档案管...
  • 3 篇 经济学
    • 3 篇 应用经济学
  • 2 篇 法学
    • 2 篇 社会学
  • 1 篇 教育学
    • 1 篇 教育学
  • 1 篇 农学
    • 1 篇 作物学

主题

  • 54 篇 performance
  • 48 篇 parallel process...
  • 33 篇 algorithms
  • 32 篇 parallel program...
  • 27 篇 languages
  • 25 篇 design
  • 20 篇 parallel algorit...
  • 20 篇 gpu
  • 9 篇 experimentation
  • 9 篇 measurement
  • 7 篇 graphics process...
  • 7 篇 theory
  • 7 篇 parallel
  • 6 篇 mpi
  • 6 篇 parallel computi...
  • 6 篇 concurrency
  • 5 篇 scalability
  • 5 篇 parallelism
  • 5 篇 graph algorithms
  • 4 篇 gpgpu

机构

  • 7 篇 carnegie mellon ...
  • 5 篇 univ of tokyo
  • 4 篇 indiana univ blo...
  • 3 篇 univ chinese aca...
  • 3 篇 massachusetts in...
  • 3 篇 univ illinois ur...
  • 3 篇 swiss fed inst t...
  • 3 篇 mit csail united...
  • 3 篇 shanghai jiao to...
  • 3 篇 tsinghua univ pe...
  • 3 篇 univ calif berke...
  • 2 篇 ist austria klos...
  • 2 篇 georgetown univ ...
  • 2 篇 univ wisconsin d...
  • 2 篇 shanghai key lab...
  • 2 篇 univ of wisconsi...
  • 2 篇 tsinghua univers...
  • 2 篇 tsinghua univ de...
  • 2 篇 shanghai jiao to...
  • 2 篇 nvidia corporati...

作者

  • 8 篇 blelloch guy e.
  • 6 篇 hoefler torsten
  • 6 篇 garland michael
  • 6 篇 chen haibo
  • 6 篇 shun julian
  • 5 篇 sun yihan
  • 5 篇 zhai jidong
  • 4 篇 dhulipala laxman
  • 4 篇 tsigas philippas
  • 4 篇 tan guangming
  • 4 篇 wang haojie
  • 4 篇 mellor-crummey j...
  • 4 篇 gu yan
  • 4 篇 kennedy ken
  • 3 篇 taura kenjiro
  • 3 篇 li jiajia
  • 3 篇 yonezawa akinori
  • 3 篇 pingali keshav
  • 3 篇 kim jungwon
  • 3 篇 chen wenguang

语言

  • 327 篇 英文
  • 3 篇 其他
检索条件"任意字段=Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming"
330 条 记 录,以下是131-140 订阅
排序:
Extracting logical structure and identifying stragglers in parallel execution traces  14
Extracting logical structure and identifying stragglers in p...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Isaacs, Katherine E. Gamblin, Todd Bhatele, Abhinav Bremer, Peer-Timo Schulz, Martin Hamann, Bernd Department of Computer Science University of California Davis United States Center for Applied Scientific Computing Lawrence Livermore National Laboratory United States
We introduce a new approach to automatically extract an idealized logical structure from a parallel execution trace. We use this structure to define intuitive metrics such as the lateness of a process involved in a pa... 详细信息
来源: 评论
Fine-grain parallel megabase sequence comparison with multiple heterogeneous GPUs  14
Fine-grain parallel megabase sequence comparison with multip...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: De Sandes, Edans F.O. Miranda, Guillermo Melo, Alba C.M.A. Martorell, Xavier Ayguadé, Eduard University of Brasilia Brazil Universitat Politècnica de Catalunya Barcelona Supercomputing Center Spain
this paper proposes and evaluates a parallel strategy to execute the exact Smith-Waterman (SW) algorithm for megabase DNA sequences in heterogeneous multi-GPU platforms. In our strategy, the computation of a single hu... 详细信息
来源: 评论
Triolet: A programming system that unifies algorithmic skeleton interfaces for high-performance cluster computing  14
Triolet: A programming system that unifies algorithmic skele...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Rodrigues, Christopher Jablin, thomas Dakkak, Abdul Hwu, Wen-Mei University of Illinois at Urbana-Champaign United States
Functional algorithmic skeletons promise a high-level programming interface for distributed-memory clusters that free developers from concerns of task decomposition, scheduling, and communication. Unfortunately, prior... 详细信息
来源: 评论
Designing and auto-tuning parallel 3-D FFT for computation-communication overlap  14
Designing and auto-tuning parallel 3-D FFT for computation-c...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Song, Sukhyun Hollingsworth, Jeffrey K. Department of Computer Science University of Maryland College Park United States
this paper presents a method to design and auto-tune a new parallel 3-D FFT code using the non-blocking MPI all-to-all operation. We achieve high performance by optimizing computation-communication overlap. Our code p... 详细信息
来源: 评论
Efficient deterministic multithreading without global barriers  14
Efficient deterministic multithreading without global barrie...
收藏 引用
proceedings of the 19th acm sigplan symposium on principles and practice of parallel programming
作者: Lu, Kai Zhou, Xu Bergan, Tom Wang, Xiaoping Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Changsha China College of Computer National University of Defense Technology Changsha China University of Washington Computer Science and Engineering United States
Multithreaded programs execute nondeterministically on conventional architectures and operating systems. this complicates many tasks, including debugging and testing. Deterministic multithreading (DMT) makes the outpu... 详细信息
来源: 评论
Well-structured futures and cache locality  14
Well-structured futures and cache locality
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Herlihy, Maurice Liu, Zhiyu Computer Science Department Brown University United States
In fork-join parallelism, a sequential program is split into a directed acyclic graph of tasks linked by directed dependency edges, and the tasks are executed, possibly in parallel, in an order consistent with their d... 详细信息
来源: 评论
Data structures for task-based priority scheduling  14
Data structures for task-based priority scheduling
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Wimmer, Martin Versaci, Francesco Träff, Jesper Larsson Cederman, Daniel Tsigas, Philippas Faculty of Informatics Parallel Computing Vienna University of Technology 1040 Vienna/Wien Austria Computer Science and Engineering Chalmers University of Technology 412 96 Göteborg Sweden
We present three lock-free data structures for priority task scheduling: a priority work-stealing one, a centralized one with ρ-relaxed semantics, and a hybrid one combining both concepts. With the single-source shor... 详细信息
来源: 评论
SCCMulti: An improved parallel strongly connected components algorithm  14
SCCMulti: An improved parallel strongly connected components...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Tomkins, Daniel Smith, Timmie Amato, Nancy M. Rauchwerger, Lawrence Parasol Laboratory Department of Computer Science and Engineering Texas A and M University United States
Tarjan's famous linear time, sequential algorithm for finding the strongly connected components (SCCs) of a graph relies on depth first search, which is inherently sequential. Deterministic parallel algorithms sol... 详细信息
来源: 评论
CUDA-NP: Realizing nested thread-level parallelism in GPGPU applications  14
CUDA-NP: Realizing nested thread-level parallelism in GPGPU ...
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Yang, Yi Zhou, Huiyang Department of Computing Systems Architecture NEC Laboratories America Inc. United States Department of Electrical and Computer Engineering North Carolina State University United States
parallel programs consist of series of code sections with different thread-level parallelism (TLP). As a result, it is rather common that a thread in a parallel program, such as a GPU kernel in CUDA programs, still co... 详细信息
来源: 评论
Resilient X10: Efficient failure-aware programming  14
Resilient X10: Efficient failure-aware programming
收藏 引用
2014 19th acm sigplan symposium on principles and practice of parallel programming, PPoPP 2014
作者: Cunningham, David Grove, David Herta, Benjamin Iyengar, Arun Kawachiya, Kiyokuni Murata, Hiroki Saraswat, Vijay Takeuchi, Mikio Tardieu, Olivier IBM T. J. Watson Research Center Japan Google Inc. Japan IBM Research Tokyo Japan
Scale-out programs run on multiple processes in a cluster. In scale-out systems, processes can fail. Computations using traditional libraries such as MPI fail when any component process fails. the advent of Map Reduce... 详细信息
来源: 评论