Search query: subject term = "algorithm-based fault tolerance"
79 records; showing results 21-30

Toward fault-tolerant parallel-in-time integration with PFASST
PARALLEL COMPUTING, 2017, vol. 62, pp. 20-37
Authors: Speck, Robert; Ruprecht, Daniel
Affiliations: Forschungszentrum Julich, Julich Supercomp Ctr, D-52425 Julich, Germany; Univ Leeds, Sch Mech Engn, Woodhouse Lane, Leeds LS2 9JT, W Yorkshire, England
We introduce and analyze different strategies for the parallel-in-time integration method PFASST to recover from hard faults and subsequent data loss. Since PFASST stores solutions at multiple time steps on different ...

Correcting Soft Errors Online in Fast Fourier Transform
International Conference for High Performance Computing, Networking, Storage and Analysis (SC '17)
Authors: Liang, Xin; Chen, Jieyang; Tao, Dingwen; Li, Sihuan; Wu, Panruo; Li, Hongbo; Ouyang, Kaiming; Liu, Yuanlai; Song, Fengguang; Chen, Zizhong
Affiliations: Univ Calif Riverside, Riverside, CA 92521, USA; Indiana Univ Purdue Univ, Indianapolis, IN 46202, USA
While many algorithm-based fault tolerance (ABFT) schemes have been proposed to detect soft errors offline in the fast Fourier transform (FFT) after computation finishes, none of the existing ABFT schemes detect soft ...

Exploiting data representation for fault tolerance
JOURNAL OF COMPUTATIONAL SCIENCE, 2016, vol. 14, pp. 51-60
Authors: Elliott, J.; Hoemmen, M.; Mueller, F.
Affiliations: North Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695, USA; Sandia Natl Labs, Ctr Res Comp, POB 5800, Albuquerque, NM 87185, USA
Incorrect computer hardware behavior may corrupt intermediate computations in numerical algorithms, possibly resulting in incorrect answers. Prior work models misbehaving hardware by randomly flipping bits in memory. ...

A backward/forward recovery approach for the preconditioned conjugate gradient method
JOURNAL OF COMPUTATIONAL SCIENCE, 2016, vol. 17, part 3, pp. 522-534
Authors: Fasi, Massimiliano; Langou, Julien; Robert, Yves; Ucar, Bora
Affiliations: Univ Manchester, Manchester M13 9PL, Lancs, England; Univ Colorado Denver, Denver, CO, USA; ENS Lyon, Lyon, France; Univ Tennessee, Knoxville, TN, USA; Univ Lyon, INRIA, CNRS, LIP UMR5668, ENS Lyon, UCBL, Lyon, France
Several recent papers have introduced a periodic verification mechanism to detect silent errors in iterative solvers. Chen (2013, pp. 167-176) has shown how to combine such a verification mechanism (a stability test c...

Design and analysis of two highly scalable sparse grid combination algorithms
JOURNAL OF COMPUTATIONAL SCIENCE, 2016, vol. 17, part 3, pp. 547-561
Authors: Strazdins, Peter E.; Ali, Md. Mohsin; Harding, Brendan
Affiliations: Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT 0200, Australia; Australian Natl Univ, Inst Math Sci, Canberra, ACT 0200, Australia
Many large-scale scientific simulations involve the time evolution of systems modelled as Partial Differential Equations (PDEs). The sparse grid combination technique (SGCT) is a cost-effective method for solving time-e...

Reliable Linear, Sesquilinear, and Bijective Operations on Integer Data Streams Via Numerical Entanglement
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, vol. 64, no. 17, pp. 4606-4617
Authors: Anam, Mohammad Ashraful; Andreopoulos, Yiannis
Affiliations: UCL, Elect & Elect Engn Dept, London WC1E 7JE, England
A new technique is proposed for fault-tolerant linear, sesquilinear and bijective (LSB) operations on M integer data streams (M >= 3), such as: scaling, additions/subtractions, inner or outer vector products, permut...

In-Situ Mitigation of Silent Data Corruption in PDE Solvers
ACM Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)
Authors: Salloum, Maher; Mayo, Jackson R.; Armstrong, Robert C.
Affiliations: Sandia Natl Labs, Livermore, CA 94551, USA
We present algorithmic techniques for parallel PDE solvers that leverage numerical smoothness properties of physics simulation to detect and correct silent data corruption within local computations. We initially model...

Quality Aware Error Detection in 2-D Separable Linear Transformation
25th IEEE Asian Test Symposium (ATS)
Authors: Hu, Shih-Hsin; Abraham, Jacob A.
Affiliations: Univ Texas Austin, Comp Engn Res Ctr, Austin, TX 78712, USA; Qualcomm Technol Inc, San Diego, CA 92121, USA
In this paper, we propose a generic weighted check-sum code based quality aware error detection scheme for 2D separable linear transformation. These key components are widely used in multimedia compression systems, e...

Application Fault Tolerance for Shrinking Resources via the Sparse Grid Combination Technique
30th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Authors: Strazdins, Peter E.; Ali, Md Mohsin; Debusschere, Bert
Affiliations: Australian Natl Univ, Res Sch Comp Sci, Canberra, ACT 0200, Australia; Sandia Natl Labs, Combust Res Facil, Livermore, CA, USA
The need to make large-scale scientific simulations resilient to the shrinking and growing of compute resources arises from exascale computing and adverse operating conditions (fault tolerance). It can also arise from...

FAULT TOLERANT COMPUTATION WITH THE SPARSE GRID COMBINATION TECHNIQUE
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2015, vol. 37, no. 3, pp. C331-C353
Authors: Harding, Brendan; Hegland, Markus; Larson, Jay; Southern, James
Affiliations: Australian Natl Univ, Inst Math Sci, Acton, ACT 2601, Australia; Fujitsu Labs Europe, Hayes UB4 8FE, Middx, England
This paper continues to develop a fault tolerant extension of the sparse grid combination technique recently proposed in [B. Harding and M. Hegland, ANZIAM J. Electron. Suppl., 54 (2013), pp. C394-C411]. This approach...