咨询与建议

限定检索结果

文献类型

  • 15 篇 期刊文献
  • 14 篇 会议
  • 3 篇 学位论文

馆藏范围

  • 32 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 23 篇 工学
    • 20 篇 计算机科学与技术...
    • 9 篇 软件工程
    • 4 篇 电气工程
  • 14 篇 理学
    • 14 篇 数学
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 32 篇 cache-oblivious ...
  • 5 篇 parallel algorit...
  • 4 篇 graph algorithms
  • 3 篇 sorting
  • 3 篇 mpi_alltoall
  • 2 篇 gaussian elimina...
  • 2 篇 minimum path dat...
  • 2 篇 segment intersec...
  • 2 篇 hashing
  • 2 篇 minimum cut
  • 2 篇 schedulers
  • 2 篇 partitioning
  • 2 篇 in-place algorit...
  • 2 篇 voronoi diagrams
  • 2 篇 performance eval...
  • 2 篇 pipelines
  • 2 篇 multicore
  • 2 篇 dynamic programm...
  • 2 篇 multiprocessors
  • 2 篇 scheduling

机构

  • 3 篇 carnegie mellon ...
  • 3 篇 it univ copenhag...
  • 3 篇 charles universi...
  • 2 篇 georgetown univ ...
  • 2 篇 hong kong univ s...
  • 2 篇 univ waterloo sc...
  • 2 篇 swiss fed inst t...
  • 1 篇 caltech ctr math...
  • 1 篇 washington unive...
  • 1 篇 intel labs santa...
  • 1 篇 univ so calif de...
  • 1 篇 cmu pittsburgh p...
  • 1 篇 univ padua dept ...
  • 1 篇 sapienza univ ro...
  • 1 篇 nicta atp nsw 20...
  • 1 篇 georgetown unive...
  • 1 篇 university of aa...
  • 1 篇 mit csail 77 mas...
  • 1 篇 univ padua dept ...
  • 1 篇 swiss fed inst t...

作者

  • 3 篇 blelloch guy e.
  • 3 篇 gibbons phillip ...
  • 2 篇 chan timothy m.
  • 2 篇 li shigang
  • 2 篇 hoefler torsten
  • 2 篇 wei zhewei
  • 2 篇 chen eric y.
  • 2 篇 fusco emanuele g...
  • 2 篇 abu salem fatima...
  • 2 篇 fineman jeremy t...
  • 2 篇 zhang yunquan
  • 2 篇 yi ke
  • 2 篇 bender michael a...
  • 2 篇 gianinazzi lukas
  • 2 篇 simhadri harsha ...
  • 2 篇 silvestri france...
  • 2 篇 pagh rasmus
  • 2 篇 finocchi irene
  • 2 篇 geissmann barbar...
  • 2 篇 caminiti saverio

语言

  • 31 篇 英文
  • 1 篇 其他
检索条件"主题词=cache-oblivious algorithms"
32 条 记 录,以下是11-20 订阅
排序:
Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors
收藏 引用
SIAM REVIEW 2009年 第1期51卷 129-159页
作者: Datta, Kaushik Kamil, Shoaib Williams, Samuel Oliker, Leonid Shalf, John Yelick, Katherine Univ Calif Berkeley Dept Comp Sci Berkeley CA 94720 USA Univ Calif Berkeley Lawrence Berkeley Lab NERSC CRD Berkeley CA 94720 USA
Stencil-based kernels constitute the core of many important scientific applications on block-structured grids. Unfortunately, these codes achieve a low fraction of peak performance, clue primarily to the disparity bet... 详细信息
来源: 评论
cache-oblivious selection in sorted X+Y matrices
收藏 引用
INFORMATION PROCESSING LETTERS 2008年 第2期109卷 87-92页
作者: de Berg, Mark Thite, Shripad Tech Univ Eindhoven Dept Comp Sci Eindhoven Netherlands CALTECH Ctr Math Informat IST Pasadena CA 91125 USA
Let X[0 . . n - 1] and Y[0 . . m - 1] be two sorted arrays, and define the m x n matrix A by A[j][i] = X[i] + Y[j]. Frederickson and Johnson [G.N. Frederickson, D.B. Johnson. Generalized selection and ranking: Sorted ... 详细信息
来源: 评论
Optimizing graph algorithms for improved cache performance
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2004年 第9期15卷 769-782页
作者: Park, JS Penner, M Prasanna, VK Univ Calif Los Angeles Dept Comp Sci Los Angeles CA 90095 USA Univ So Calif Dept Elect Engn Los Angeles CA 90089 USA
In this paper, we develop algorithmic optimizations to improve the cache performance of four fundamental graph algorithms. We present a cache-oblivious implementation of the Floyd-Warshall Algorithm for the fundamenta... 详细信息
来源: 评论
Parallel Minimum Cuts in Near-linear Work and Low Depth
收藏 引用
ACM TRANSACTIONS ON PARALLEL COMPUTING 2021年 第2期8卷 1–20页
作者: Geissmann, Barbara Gianinazzi, Lukas Swiss Fed Inst Technol Dept Comp Sci Univ Str 6 CAB Zurich Switzerland
We present the first near-linear work and poly-logarithmic depth algorithm for computing a minimum cut in an undirected graph. Previous parallel algorithms with poly-logarithmic depth required at least quadratic work ... 详细信息
来源: 评论
Pruning spanners and constructing well-separated pair decompositions in the presence of memory hierarchies
收藏 引用
JOURNAL OF DISCRETE algorithms 2010年 第3期8卷 259-272页
作者: Gieseke, Fabian Gudmundsson, Joachim Vahrenhold, Jan Tech Univ Dortmund Fac Comp Sci LS 11 D-44227 Dortmund Germany NICTA ATP Sydney NSW 2015 Australia
Given a geometric graph G = (S, E) in R-d with constant dilation t, and a positive constant epsilon, we show how to construct a (1 + epsilon)-spanner of G with O(| S|) edges using O(sort(| E|)) memory transfers in the... 详细信息
来源: 评论
cache-oblivious MPI All-to-All Communications on Many-Core Architectures  22
Cache-Oblivious MPI All-to-All Communications on Many-Core A...
收藏 引用
22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
作者: Li, Shigang Zhang, Yunquan Hoefler, Torsten Chinese Acad Sci Inst Comp Technol SKL Comp Architecture Beijing Peoples R China Swiss Fed Inst Technol Dept Comp Sci Zurich Switzerland
In the many-core era, the performance of MPI collectives is more dependent on the intra-node communication component. However, the communication algorithms generally inherit from the inter-node version and ignore the ... 详细信息
来源: 评论
Sorting with Asymmetric Read and Write Costs  15
Sorting with Asymmetric Read and Write Costs
收藏 引用
27th ACM symposium on Parallelism in algorithms and Architectures (SPAA)
作者: Blelloch, Guy E. Fineman, Jeremy T. Gibbons, Phillip B. Gu, Yan Shun, Julian Carnegie Mellon Univ Pittsburgh PA 15213 USA Georgetown Univ Washington DC 20057 USA Intel Labs Santa Clara CA USA CMU Pittsburgh PA USA
Emerging memory technologies have a significant gap between the cost, both in time and in energy, of writing to memory versus reading from memory. In this paper we present models and algorithms that account for this d... 详细信息
来源: 评论
Towards Many-Core Implementation of LU Decomposition using Peano Curves
Towards Many-Core Implementation of LU Decomposition using P...
收藏 引用
6th ACM International Conference on Computing Frontiers and Workshops
作者: Heinecke, Alexander Bader, Michael Tech Univ Munich Dept Informat D-80290 Munich Germany
We present Our recent research oil cache-oblivious algorithms and implementations of parallel LU decomposition oil shared-memory multi- and manycore platforms. Our approach uses a block-recursive matrix storage scheme... 详细信息
来源: 评论
Closing the Gap Between cache-oblivious and cache-adaptive Analysis  20
Closing the Gap Between Cache-oblivious and Cache-adaptive A...
收藏 引用
32nd ACM Symposium on Parallelism in algorithms and Architectures (SPAA)
作者: Bender, Michael A. Chowdhury, Rezaul A. Das, Rathish Johnson, Rob Kuszmaul, William Lincoln, Andrea Liu, Quanquan C. Lynch, Jayson Xu, Helen SUNY Stony Brook Stony Brook NY 11794 USA VMware Res Palo Alto CA USA MIT CSAIL 77 Massachusetts Ave Cambridge MA 02139 USA
cache-adaptive analysis was introduced to analyze the performance of an algorithm when the cache (or internal memory) available to the algorithm dynamically changes size. These memory-size fluctuations are, in fact, t... 详细信息
来源: 评论
Optimal In-Place algorithms for 3-d Convex Hulls and 2-d Segment Intersection
Optimal In-Place Algorithms for 3-d Convex Hulls and 2-d Seg...
收藏 引用
25th Annual Symposium on Computational Geometry
作者: Chan, Timothy M. Chen, Eric Y. Univ Waterloo Sch Comp Sci Waterloo ON N2L 3G1 Canada
We describe the first optimal randomized in-place algorithm for the basic 3-d convex hull problem (and, in particular, for 2-d Voronoi diagrams). The algorithm runs in O(n log n) expected time using only O(1) extra sp... 详细信息
来源: 评论