咨询与建议

限定检索结果

文献类型

  • 5 篇 会议
  • 4 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 10 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 9 篇 工学
    • 8 篇 计算机科学与技术...
    • 3 篇 电气工程
    • 1 篇 控制科学与工程
    • 1 篇 软件工程
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 10 篇 data layout opti...
  • 2 篇 computational sc...
  • 2 篇 ising model
  • 2 篇 performance pred...
  • 1 篇 object-based sem...
  • 1 篇 performance
  • 1 篇 flash memories
  • 1 篇 parallel process...
  • 1 篇 solid state driv...
  • 1 篇 in-memory databa...
  • 1 篇 spin selection s...
  • 1 篇 p-oftl
  • 1 篇 compiler optimiz...
  • 1 篇 storage systems
  • 1 篇 gpu memory optim...
  • 1 篇 disk technology ...
  • 1 篇 garbage collecti...
  • 1 篇 logp parallel mo...
  • 1 篇 semantic-unaware...
  • 1 篇 hot-cold cluster...

机构

  • 1 篇 virginia polytec...
  • 1 篇 ibm corp china r...
  • 1 篇 louisiana state ...
  • 1 篇 tsinghua univ de...
  • 1 篇 louisiana state ...
  • 1 篇 iit dept comp sc...
  • 1 篇 ohio state univ ...
  • 1 篇 renmin univ chin...
  • 1 篇 pacific northwes...
  • 1 篇 microsoft res re...
  • 1 篇 intel corp santa...
  • 1 篇 mit cambridge ma...
  • 1 篇 ibm corp almaden...
  • 1 篇 argonne natl lab...
  • 1 篇 tsinghua natl la...
  • 1 篇 louisiana state ...
  • 1 篇 ens inria lyon
  • 1 篇 virginia tech | ...
  • 1 篇 ohio state univ ...
  • 1 篇 dawning informat...

作者

  • 2 篇 ramanujam j.
  • 2 篇 lu qingda
  • 2 篇 feng shuangtong
  • 2 篇 krishnamoorthy s...
  • 2 篇 sadayappan p.
  • 1 篇 he jun
  • 1 篇 song huaiming
  • 1 篇 chen yongjian
  • 1 篇 hsu ww
  • 1 篇 shu jiwu
  • 1 篇 rountev atanas
  • 1 篇 bondhugula uday
  • 1 篇 lu youyou
  • 1 篇 ngai tin-fook
  • 1 篇 lin haibo
  • 1 篇 wang wei
  • 1 篇 chen liang jeff
  • 1 篇 moscibroda thoma...
  • 1 篇 gao xiaoyang
  • 1 篇 chen yueguo

语言

  • 10 篇 英文
检索条件"主题词=Data layout optimization"
10 条 记 录,以下是1-10 订阅
排序:
Empirical performance model-driven data layout optimization selection for tensor contraction expressions
收藏 引用
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 2012年 第3期72卷 338-352页
作者: Lu, Qingda Gao, Xiaoyang Krishnamoorthy, Sriram Baumgartner, Gerald Ramanujam, J. Sadayappan, P. Louisiana State Univ Dept Comp Sci Baton Rouge LA 70803 USA Ohio State Univ Dept Comp Sci & Engn Columbus OH 43210 USA Louisiana State Univ Dept Elect & Comp Engn Baton Rouge LA 70803 USA
Empirical optimizers like ATLAS have been very effective in optimizing computational kernels in libraries. The best choice of parameters such as tile size and degree of loop unrolling is determined in ATLAS by executi... 详细信息
来源: 评论
The automatic improvement of locality in storage systems
收藏 引用
ACM TRANSACTIONS ON COMPUTER SYSTEMS 2005年 第4期23卷 424-473页
作者: Hsu, WW Smith, AJ Young, HC IBM Corp Almaden Res Ctr Comp Sci Storage Syst Dept San Jose CA 95120 USA Univ Calif Berkeley Dept EECS Div Comp Sci Berkeley CA 94720 USA
Disk I/O is increasingly the performance bottleneck in computer systems despite rapidly increasing disk data transfer rates. In this article, we propose Automatic Locality-Improving Storage (ALIS), an introspective st... 详细信息
来源: 评论
Efficient algorithms for parallelizing Monte Carlo simulations for 2D Ising spin models
收藏 引用
JOURNAL OF SUPERCOMPUTING 2008年 第3期44卷 274-290页
作者: Santos, Eunice E. Rickman, Jeffrey M. Muthukrishnan, Gayathri Feng, Shuangtong Virginia Polytech Inst & State Univ Dept Comp Sci Blacksburg VA 24061 USA Lehigh Univ Dept Mat Sci & Engn Bethlehem PA 18015 USA
In this paper, we design and implement a variety of parallel algorithms for both sweep spin selection and random spin selection. We analyze our parallel algorithms on LogP, a portable and general parallel machine mode... 详细信息
来源: 评论
Wide Table layout optimization based on Column Ordering and Duplication  17
Wide Table Layout Optimization based on Column Ordering and ...
收藏 引用
ACM International Conference on Management of data
作者: Bian, Haoqiong Yan, Ying Tao, Wenbo Chen, Liang Jeff Chen, Yueguo Du, Xiaoyong Moscibroda, Thomas Renmin Univ China DEKE Key Lab MOE Beijing Peoples R China Microsoft Res Redmond WA 98052 USA MIT Cambridge MA 02139 USA
Modern data analytical tasks often witness very wide tables, from a few hundred columns to a few thousand. While it is commonly agreed that column stores are an appropriate data format for wide tables and analytical w... 详细信息
来源: 评论
A Server-Level Adaptive data layout Strategy for Parallel File Systems
A Server-Level Adaptive Data Layout Strategy for Parallel Fi...
收藏 引用
26th IEEE International Parallel and Distributed Processing Symposium (IPDPS) / Workshop on High Performance data Intensive Computing
作者: Song, Huaiming Jin, Hui He, Jun Sun, Xian-He Thakur, Rajeev Dawning Informat Ind Ctr Res & Dev Beijing 100193 Peoples R China IIT Dept Comp Sci Chicago IL 60616 USA Argonne Natl Lab Math & Comp Sci Div 9700 S Cass Ave Argonne IL 60439 USA
Parallel file systems are widely used for providing a high degree of I/O parallelism to mask the gap between I/O and memory speed. However, peak I/O performance is rarely attained due to complex data access patterns o... 详细信息
来源: 评论
p-OFTL: An Object-based Semantic-aware Parallel Flash Translation Layer  14
p-OFTL: An Object-based Semantic-aware Parallel Flash Transl...
收藏 引用
Design, Automation and Test in Europe Conference and Exhibition (DATE)
作者: Wang, Wei Lu, Youyou Shu, Jiwu Tsinghua Univ Dept Comp Sci & Technol Beijing 100084 Peoples R China Tsinghua Natl Lab Informat Sci & Technol Beijing Peoples R China
With increased density and decreased price, flash memory has been widely used in storage systems for its low latency and low power features. However, traditional storage systems are designed and excessively optimized ... 详细信息
来源: 评论
Memory-Efficient Storing of Timestamps for Spatio-Temporal data Management in Columnar In-Memory databases  26th
Memory-Efficient Storing of Timestamps for Spatio-Temporal D...
收藏 引用
26th International Conference on database Systems for Advanced Applications (DASFAA)
作者: Richly, Keven Univ Potsdam Hasso Plattner Inst Potsdam Germany
Vast amounts of spatio-temporal data are continuously accumulated through the wide distribution of location-acquisition technologies. Concerning the increased performance requirements of spatio-temporal data mining ap... 详细信息
来源: 评论
data layout Transformation for Enhancing data Locality on NUCA Chip Multiprocessors  09
Data Layout Transformation for Enhancing Data Locality on NU...
收藏 引用
18th International Conference on Parallel Architectures and Compilation Techniques
作者: Lu, Qingda Alias, Christophe Bondhugula, Uday Henretty, Thomas Krishnamoorthy, Sriram Ramanujam, J. Rountev, Atanas Sadayappan, P. Chen, Yongjian Lin, Haibo Ngai, Tin-Fook Ohio State Univ Columbus OH 43210 USA ENS INRIA Lyon France Pacific NorthWest Natl Lab Richland WA 99352 USA Louisiana State Univ Baton Rouge LA 70803 USA Intel Corp Santa Clara CA 95051 USA IBM Corp China Res Lab Beijing Peoples R China
With increasing numbers of cores, future CMPs (Chip Multi-Processors) are likely to have a tiled architecture with a portion of shared L2 cache on each the and a bank-interleaved distribution of the address space. Alt... 详细信息
来源: 评论
Efficient Parallelization of 2D Ising Spin Systems
Efficient Parallelization of 2D Ising Spin Systems
收藏 引用
作者: Feng, Shuangtong Virginia Tech | University
The problem of efficient parallelization of 2D Ising spin systems requires realistic algorithmic design and implementation based on an understanding of issues from computer science and statistical physics. In this wor... 详细信息
来源: 评论
Optimizing Memory Access Efficiency in CUDA Kernel via data layout Technique
收藏 引用
Journal of Computer and Communications 2024年 第5期12卷 124-139页
作者: Neda Seifi Abdullah Al-Mamun Department of Computer & Cyber Sciences&#8212 SCCS Augusta University Augusta Georgia USA
Over the past decade, Graphics Processing Units (GPUs) have revolutionized high-performance computing, playing pivotal roles in advancing fields like IoT, autonomous vehicles, and exascale computing. Despite these adv... 详细信息
来源: 评论