Search criteria: subject term = "Tile Algorithms"
9 records

High-Performance Bidiagonal Reduction using Tile Algorithms on Homogeneous Multicore Architectures
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2013, Vol. 39, No. 3, pp. 1-22
Authors: Ltaief, Hatem; Luszczek, Piotr; Dongarra, Jack
Affiliations: Kaust Supercomp Lab, Thuwal, Saudi Arabia; Univ Tennessee, Dept Elect Engn & Comp Sci, Innovat Comp Lab, Knoxville, TN 37996, USA; Oak Ridge Natl Lab, Oak Ridge, TN, USA; Univ Manchester, Manchester M13 9PL, Lancs, England
This article presents a new high-performance bidiagonal reduction (BRD) for homogeneous multicore architectures. This article is an extension of the high-performance tridiagonal reduction implemented by the same autho...

Energy Footprint of Advanced Dense Numerical Linear Algebra using Tile Algorithms on Multicore Architectures
2nd International Conference on Cloud and Green Computing / 2nd International Conference on Social Computing and its Applications (CGC/SCA)
Authors: Dongarra, Jack; Ltaief, Hatem; Luszczek, Piotr; Weaver, Vincent M.
Affiliations: Univ Tennessee, Innovat Comp Lab, Knoxville, TN 37996, USA; Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN, USA; Univ Manchester, Sch Math, Sch Comp Sci, Manchester, NH, USA; Natl Sci Fdn & Microsoft Research, Manchester, NH, USA; KAUST Supercomputing Lab, Thuwal, Saudi Arabia
We propose to study the impact on the energy footprint of two advanced algorithmic strategies in the context of high performance dense linear algebra libraries: (1) mixed precision algorithms with iterative refinement...

TOWARD A HIGH PERFORMANCE TILE DIVIDE AND CONQUER ALGORITHM FOR THE DENSE SYMMETRIC EIGENVALUE PROBLEM
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2012, Vol. 34, No. 6, pp. C249-C274
Authors: Haidar, Azzam; Ltaief, Hatem; Dongarra, Jack
Affiliations: Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996, USA; KAUST Supercomp Lab, Thuwal, Saudi Arabia; Oak Ridge Natl Lab, Div Math & Comp Sci, Oak Ridge, TN, USA; Univ Manchester, Sch Math, Manchester, Lancs, England; Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
Classical solvers for the dense symmetric eigenvalue problem suffer from the first step, which involves a reduction to tridiagonal form that is dominated by the cost of accessing memory during the panel factorization....

Parallel Two-Sided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2010, Vol. 21, No. 4, pp. 417-423
Authors: Ltaief, Hatem; Kurzak, Jakub; Dongarra, Jack
Affiliations: Univ Tennessee, Dept Elect Engn & Comp Sci, Innovat Comp Lab, Knoxville, TN 37996, USA
The objective of this paper is to extend, in the context of multicore architectures, the concepts of tile algorithms [Buttari et al., 2007] for Cholesky, LU, and QR factorizations to the family of two-sided factorizat...

Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency
COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2012, Vol. 27, No. 4, pp. 277-287
Authors: Ltaief, Hatem; Luszczek, Piotr; Dongarra, Jack
Affiliations: KAUST Supercomp Lab, Thuwal, Saudi Arabia; Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996, USA
This paper presents the power profile of two high performance dense linear algebra libraries i.e., LAPACK and PLASMA. The former is based on block algorithms that use the fork-join paradigm to achieve parallel perform...

Performance analysis and structured parallelisation of the space-time adaptive processing computational kernel on multi-core architectures
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2014, Vol. 29, No. 5, pp. 460-498
Authors: Buono, Daniele; Mencagli, Gabriele; Pascucci, Alessio; Vanneschi, Marco
Affiliations: Univ Pisa, Dept Comp Sci, Largo B Pontecorvo 3, I-56127 Pisa, Italy
The development of radar systems on general-purpose off-the-shelf parallel hardware represents an effective means of providing efficient implementations with reasonable realisation costs. However, the fulfilment of th...

A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction
26th IEEE International Parallel and Distributed Processing Symposium (IPDPS) / Workshop on High Performance Data Intensive Computing
Authors: Haidar, Azzam; Ltaief, Hatem; Luszczek, Piotr; Dongarra, Jack
Affiliations: Univ Tennessee, Innovat Comp Lab, Knoxville, TN 37996, USA; KAUST Supercomput Lab, Thuwal, Saudi Arabia
We present new high performance numerical kernels combined with advanced optimization techniques that significantly increase the performance of parallel bidiagonal reduction. Our approach is based on developing effici...

Task-Based Cholesky Decomposition on Knights Corner Using OpenMP
International Supercomputing Conference (ISC High Performance)
Authors: Dorris, Joseph; Kurzak, Jakub; Luszczek, Piotr; YarKhan, Asim; Dongarra, Jack
Affiliations: Innovat Comp Lab, Knoxville, TN 37996, USA
The growing popularity of the Intel Xeon Phi coprocessors and the continued development of this new many-core architecture have created the need for an open-source, scalable, and cross-platform task-based dense linear ...

PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2019, Vol. 45, No. 2, pp. 16-16
Authors: Dongarra, Jack; Gates, Mark; Haidar, Azzam; Kurzak, Jakub; Luszczek, Piotr; Wu, Panruo; Yamazaki, Ichitaro; Yarkhan, Asim; Abalenkovs, Maksims; Bagherpour, Negin; Hammarling, Sven; Sistek, Jakub; Stevens, David; Zounon, Mawussi; Relton, Samuel D.
Affiliations: Univ Tennessee, Dept Elect Engn & Comp Sci, 1122 Volunteer Blvd, Suite 203, Knoxville, TN 37996, USA; Univ Houston, Dept Comp Sci, 3551 Cullen Blvd, Houston, TX 77204, USA; Univ Manchester, Sch Math, Manchester M13 9PL, Lancs, England; Czech Acad Sci, Inst Math, Zitna 25, Prague 11567, Czech Republic; Numer Algorithms Grp Manchester, One53 Portland St, Manchester M1 3LD, Lancs, England; Univ Leeds, Inst Hlth Sci, Leeds LS2 9LJ, W Yorkshire, England
The recent version of the Parallel Linear Algebra Software for Multicore Architectures (PLASMA) library is based on tasks with dependencies from the OpenMP standard. The main functionality of the library is presented....