咨询与建议

限定检索结果

文献类型

  • 51 篇 期刊文献
  • 28 篇 会议

馆藏范围

  • 79 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 78 篇 工学
    • 71 篇 计算机科学与技术...
    • 57 篇 电气工程
    • 6 篇 软件工程
    • 3 篇 电子科学与技术(可...
    • 3 篇 信息与通信工程
    • 2 篇 网络空间安全
    • 1 篇 控制科学与工程
  • 6 篇 理学
    • 5 篇 数学
    • 1 篇 物理学
  • 2 篇 管理学
    • 2 篇 管理科学与工程(可...

主题

  • 79 篇 algorithm-based ...
  • 14 篇 concurrent error...
  • 8 篇 fault tolerance
  • 8 篇 matrix multiplic...
  • 7 篇 error detection
  • 5 篇 fault tolerant s...
  • 4 篇 error correction
  • 4 篇 sparse grid comb...
  • 4 篇 checkpointing
  • 4 篇 checksum encodin...
  • 3 篇 fault diagnosis
  • 3 篇 weighted sum par...
  • 3 篇 simd
  • 3 篇 silent errors
  • 3 篇 silent data corr...
  • 3 篇 avx-512
  • 3 篇 high-performance...
  • 3 篇 parallel computi...
  • 3 篇 high performance...
  • 3 篇 pde solvers

机构

  • 6 篇 univ calif river...
  • 6 篇 princeton univ d...
  • 6 篇 univ calif davis...
  • 2 篇 princeton univ d...
  • 2 篇 univ calif river...
  • 2 篇 chinese acad sci...
  • 2 篇 australian natl ...
  • 2 篇 oak ridge natl l...
  • 1 篇 italian natl agc...
  • 1 篇 penn state univ ...
  • 1 篇 univ calif davis...
  • 1 篇 univ quebec dept...
  • 1 篇 national microel...
  • 1 篇 sungkyunkwan uni...
  • 1 篇 georgia inst tec...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ lyon inria ...
  • 1 篇 politecn milan d...
  • 1 篇 carnegie mellon ...
  • 1 篇 sandia natl labs...

作者

  • 9 篇 chen zizhong
  • 8 篇 jha nk
  • 8 篇 redinbo gr
  • 4 篇 wu panruo
  • 4 篇 zhai yujia
  • 4 篇 chen jieyang
  • 4 篇 banerjee p
  • 4 篇 zhao kai
  • 3 篇 nguyen c
  • 3 篇 ouyang kaiming
  • 3 篇 liang xin
  • 3 篇 strazdins peter ...
  • 3 篇 harding brendan
  • 3 篇 li sihuan
  • 3 篇 vinnakota b
  • 3 篇 abraham ja
  • 2 篇 grover pulkit
  • 2 篇 liu jinyang
  • 2 篇 mayo jackson r.
  • 2 篇 tao dingwen

语言

  • 78 篇 英文
  • 1 篇 其他
检索条件"主题词=algorithm-based fault tolerance"
79 条 记 录,以下是51-60 订阅
排序:
FT-BLAS: A fault Tolerant High Performance BLAS Implementation on x86 CPUs
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2023年 第12期34卷 3207-3223页
作者: Zhai, Yujia Giem, Elisabeth Zhao, Kai Liu, Jinyang Huang, Jiajun Wong, Bryan M. Shelton, Christian R. Chen, Zizhong Univ Calif Riverside Riverside CA 92521 USA Univ Alabama Birmingham Birmingham AL 35294 USA
Basic Linear Algebra Subprograms (BLAS) serve as a foundational library for scientific computing and machine learning. In this article, we present a new BLAS implementation, FT-BLAS, that provides performance comparab... 详细信息
来源: 评论
SYNTHESIS OF algorithm-based fault-TOLERANT SYSTEMS FOR DEPENDENCE GRAPHS
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 1993年 第8期4卷 864-874页
作者: VINNAKOTA, B JHA, NK PRINCETON UNIV DEPT ELECT ENGNPRINCETONNJ 08544
algorithm-based fault tolerance (ABFT) is a scheme to improve the reliability of parallel architectures used for computation-intensive tasks. The exact implementation of an ABFT scheme is algorithm-dependent. ABFT sys... 详细信息
来源: 评论
fault TOLERANT COMPUTATION WITH THE SPARSE GRID COMBINATION TECHNIQUE
收藏 引用
SIAM JOURNAL ON SCIENTIFIC COMPUTING 2015年 第3期37卷 C331-C353页
作者: Harding, Brendan Hegland, Markus Larson, Jay Southern, James Australian Natl Univ Inst Math Sci Acton ACT 2601 Australia Fujitsu Labs Europe Hayes UB4 8FE Middx England
This paper continues to develop a fault tolerant extension of the sparse grid combination technique recently proposed in [B. Harding and M. Hegland, ANZIAM J. Electron. Suppl., 54 (2013), pp. C394-C411]. This approach... 详细信息
来源: 评论
Toward fault-tolerant parallel-in-time integration with PFASST
收藏 引用
PARALLEL COMPUTING 2017年 62卷 20-37页
作者: Speck, Robert Ruprecht, Daniel Forschungszentrum Julich Julich Supercomp Ctr D-52425 Julich Germany Univ Leeds Sch Mech Engn Woodhouse Lane Leeds LS2 9JT W Yorkshire England
We introduce and analyze different strategies for the parallel-in-time integration method PFASST to recover from hard faults and subsequent data loss. Since PFASST stores solutions at multiple time steps on different ... 详细信息
来源: 评论
Reliable Linear, Sesquilinear, and Bijective Operations on Integer Data Streams Via Numerical Entanglement
收藏 引用
IEEE TRANSACTIONS ON SIGNAL PROCESSING 2016年 第17期64卷 4606-4617页
作者: Anam, Mohammad Ashraful Andreopoulos, Yiannis UCL Elect & Elect Engn Dept London WC1E 7JE England
A new technique is proposed for fault-tolerant linear, sesquilinear and bijective (LSB) operations onM integer data streams (M >= 3), such as: scaling, additions/subtractions, inner or outer vector products, permut... 详细信息
来源: 评论
Design and analysis of two highly scalable sparse grid combination algorithms
收藏 引用
JOURNAL OF COMPUTATIONAL SCIENCE 2016年 第Part3期17卷 547-561页
作者: Strazdins, Peter E. Ali, Md. Mohsin Harding, Brendan Australian Natl Univ Res Sch Comp Sci Canberra ACT 0200 Australia Australian Natl Univ Inst Math Sci Canberra ACT 0200 Australia
Many large scale scientific simulations involve the time evolution of systems modelled as Partial Differential Equations (PDEs). The sparse grid combination technique (SGCT) is a cost-effective method for solve time-e... 详细信息
来源: 评论
A mesh check-sum ABFT scheme for stream ciphers
收藏 引用
INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS 2009年 第4期3卷 285-300页
作者: Zhang, Chang N. Liu, Xiao Wei Univ Regina TRLabs Dept Comp Sci Regina SK S4S 0A2 Canada
To enhance the security and reliability of the widely-used stream ciphers, a novel mesh check-sum ABFT scheme for stream ciphers is developed. By utilising the ready-made arithmetic unit in stream ciphers, single and ... 详细信息
来源: 评论
"Short-Dot": Computing Large Linear Transforms Distributedly Using Coded Short Dot Products
收藏 引用
IEEE TRANSACTIONS ON INFORMATION THEORY 2019年 第10期65卷 6171-6193页
作者: Dutta, Sanghamitra Cadambe, Viveck Grover, Pulkit Carnegie Mellon Univ Dept Elect & Comp Engn Pittsburgh PA 15213 USA Penn State Univ Dept Elect Engn University Pk PA 16802 USA
We consider the problem of computing a matrix-vector product Ax using a set of P parallel or distributed processing nodes prone to "straggling," i.e., unpredictable delays. Every processing node can access o... 详细信息
来源: 评论
Correcting DFT Codes with Modified Berlekamp-Massey algorithm and Syndrome Extension
Correcting DFT Codes with Modified Berlekamp-Massey Algorith...
收藏 引用
17th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC)
作者: Redinbo, Robert Univ Calif Davis ECE Dept Davis CA 95616 USA
Real number block codes derived from the discrete Fourier transform (DFT) are corrected by coupling a very modified Berlekamp-Massey algorithm with a syndrome extension process. Enhanced extension recursions based on ... 详细信息
来源: 评论
Physics-based Checksums for Silent-Error Detection in PDE Solvers  25th
Physics-Based Checksums for Silent-Error Detection in PDE So...
收藏 引用
25th International Conference on Parallel and Distributed Computing (Euro-Par)
作者: Salloum, Maher Mayo, Jackson R. Armstrong, Robert C. Sandia Natl Labs POB 969 Livermore CA 94551 USA
We discuss techniques for efficient local detection of silent data corruption in parallel scientific computations, leveraging physical quantities such as momentum and energy that may be conserved by discretized PDEs. ... 详细信息
来源: 评论