咨询与建议

限定检索结果

文献类型

  • 51 篇 期刊文献
  • 28 篇 会议

馆藏范围

  • 79 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 78 篇 工学
    • 71 篇 计算机科学与技术...
    • 57 篇 电气工程
    • 6 篇 软件工程
    • 3 篇 电子科学与技术(可...
    • 3 篇 信息与通信工程
    • 2 篇 网络空间安全
    • 1 篇 控制科学与工程
  • 6 篇 理学
    • 5 篇 数学
    • 1 篇 物理学
  • 2 篇 管理学
    • 2 篇 管理科学与工程(可...

主题

  • 79 篇 algorithm-based ...
  • 14 篇 concurrent error...
  • 8 篇 fault tolerance
  • 8 篇 matrix multiplic...
  • 7 篇 error detection
  • 5 篇 fault tolerant s...
  • 4 篇 error correction
  • 4 篇 sparse grid comb...
  • 4 篇 checkpointing
  • 4 篇 checksum encodin...
  • 3 篇 fault diagnosis
  • 3 篇 weighted sum par...
  • 3 篇 simd
  • 3 篇 silent errors
  • 3 篇 silent data corr...
  • 3 篇 avx-512
  • 3 篇 high-performance...
  • 3 篇 parallel computi...
  • 3 篇 high performance...
  • 3 篇 pde solvers

机构

  • 6 篇 univ calif river...
  • 6 篇 princeton univ d...
  • 6 篇 univ calif davis...
  • 2 篇 princeton univ d...
  • 2 篇 univ calif river...
  • 2 篇 chinese acad sci...
  • 2 篇 australian natl ...
  • 2 篇 oak ridge natl l...
  • 1 篇 italian natl agc...
  • 1 篇 penn state univ ...
  • 1 篇 univ calif davis...
  • 1 篇 univ quebec dept...
  • 1 篇 national microel...
  • 1 篇 sungkyunkwan uni...
  • 1 篇 georgia inst tec...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ lyon inria ...
  • 1 篇 politecn milan d...
  • 1 篇 carnegie mellon ...
  • 1 篇 sandia natl labs...

作者

  • 9 篇 chen zizhong
  • 8 篇 jha nk
  • 8 篇 redinbo gr
  • 4 篇 wu panruo
  • 4 篇 zhai yujia
  • 4 篇 chen jieyang
  • 4 篇 banerjee p
  • 4 篇 zhao kai
  • 3 篇 nguyen c
  • 3 篇 ouyang kaiming
  • 3 篇 liang xin
  • 3 篇 strazdins peter ...
  • 3 篇 harding brendan
  • 3 篇 li sihuan
  • 3 篇 vinnakota b
  • 3 篇 abraham ja
  • 2 篇 grover pulkit
  • 2 篇 liu jinyang
  • 2 篇 mayo jackson r.
  • 2 篇 tao dingwen

语言

  • 78 篇 英文
  • 1 篇 其他
检索条件"主题词=algorithm-based fault tolerance"
79 条 记 录,以下是11-20 订阅
排序:
ERROR-CORRECTING CODES OVER Z(2M) FOR algorithm-based fault-tolerance
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 1994年 第3期43卷 370-374页
作者: FENG, GL RAO, TRN KOLLURU, MS Center for Adv. Comput. Studies Southwestern Louisiana Univ. Lafayette LA USA
algorithm-based fault tolerance is a scheme of low-cost error protection in real-time digital signal processing environments and other computation-intensive tasks. In this paper, a new method for encoding data is prop... 详细信息
来源: 评论
FT-CNN: algorithm-based fault tolerance for Convolutional Neural Networks
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2021年 第7期32卷 1677-1689页
作者: Zhao, Kai Di, Sheng Li, Sihuan Liang, Xin Zhai, Yujia Chen, Jieyang Ouyang, Kaiming Cappello, Franck Chen, Zizhong Univ Calif Riverside Dept Comp Sci & Engn Riverside CA 92521 USA Argonne Natl Lab Math & Comp Sci Div Lemont IL 60439 USA Oak Ridge Natl Lab Comp Sci & Math Div Oak Ridge TN 37831 USA
Convolutional neural networks (CNNs) are becoming more and more important for solving challenging and critical problems in many fields. CNN inference applications have been deployed in safety-critical systems, which m... 详细信息
来源: 评论
Combinatorial analysis of check set construction for algorithm-based fault tolerance systems
收藏 引用
JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS 1998年 第3期12卷 255-260页
作者: Wang, DQ Zhao, LC Dalian Maritime Univ Dept Basic Sci Dalian 116026 Peoples R China
algorithm-based fault tolerance (ABFT) is a low-cost system-level concurrent error detection and fault location scheme. The design problem for an ABFT system is concerned with the construction of a check set for detec... 详细信息
来源: 评论
Generalized algorithm-based fault tolerance: Error correction via Kalman estimation
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 1998年 第6期47卷 639-655页
作者: Redinbo, GR Univ Calif Davis Dept Elect & Comp Engn Davis CA 95616 USA
An extension to algorithm-based fault tolerance (ABFT) methodologies shows how parity values dictated by a real convolutional code can be employed by Kalman estimation techniques to perform real number correction for ... 详细信息
来源: 评论
PARTITIONED ENCODING-SCHEMES FOR algorithm-based fault-tolerance IN MASSIVELY-PARALLEL SYSTEMS
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 1994年 第6期5卷 649-653页
作者: REXFORD, J JHA, NK PRINCETON UNIV DEPT ELECT ENGNPRINCETONNJ 08544
This short note considers the applicability of algorithm-based fault tolerance (ABFT) to massively parallel scientific computation. Existing ABFT schemes can provide effective fault tolerance at a low cost for computa... 详细信息
来源: 评论
Mantissa-preserving operations and robust algorithm-based fault tolerance for matrix computations
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 1996年 第4期45卷 408-424页
作者: Dutt, S Assaad, FT Dept. of Electr. Eng. Minnesota Univ. Minneapolis MN USA
A system-level method for achieving fault tolerance called algorithm-based fault tolerance (ABFT) has been proposed by a number of researchers. Many ABFT schemes use a floating-point checksum test to detect computatio... 详细信息
来源: 评论
Efficient techniques for the analysis of algorithm-based fault tolerance (ABFT) schemes
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 1996年 第4期45卷 499-503页
作者: Nair, VSS Abraham, JA Banerjee, P UNIV TEXAS COMP ENGN RES CTRAUSTINTX 78712 UNIV ILLINOIS CTR RELIABLE & HIGH PERFORMANCE COMPURBANAIL 61801
This paper presents a model which can be used to characterize the diagnosability of algorithm-based fault Tolerant (ABFT) systems. In the model, the relationship between processors computing useful data, the output da... 详细信息
来源: 评论
Parallel Reduction to Hessenberg Form with algorithm-based fault tolerance  13
Parallel Reduction to Hessenberg Form with Algorithm-Based F...
收藏 引用
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
作者: Jia, Yulu Bosilca, George Dongarra, Jack J. Univ Tennessee Knoxville TN 37996 USA
This paper studies the resilience of a two-sided factorization and presents a generic algorithm-based approach capable of making two-sided factorizations resilient. We establish the theoretical proof of the correctnes... 详细信息
来源: 评论
Rethinking algorithm-based fault tolerance with a Cooperative Software-Hardware Approach  13
Rethinking Algorithm-Based Fault Tolerance with a Cooperativ...
收藏 引用
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
作者: Li, Dong Chen, Zizhong Wu, Panruo Vetter, Jeffrey S. Oak Ridge Natl Lab Oak Ridge TN 37831 USA Univ Calif Riverside Riverside CA 92521 USA Georgia Inst Technol Atlanta GA 30332 USA
algorithm-based fault tolerance (ABFT) is a highly efficient resilience solution for many widely-used scientific computing kernels. However, in the context of the resilience ecosystem, ABFT is completely opaque to any... 详细信息
来源: 评论
Numerical Defect Correction as an algorithm-based fault tolerance Technique for Iterative Solvers
Numerical Defect Correction as an Algorithm-Based Fault Tole...
收藏 引用
17th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC)
作者: Oboril, Fabian Tahoori, Mehdi B. Heuveline, Vincent Lukarski, Dimitar Weiss, Jan-Philipp Karlsruhe Inst Technol KIT Chair Dependable Nano Comp CDNC Karlsruhe Germany Karlsruhe Inst Technol KIT Engn Math & Comp Lab EMCL Karlsruhe Germany Karlsruhe Inst Technol KIT Shared Res Grp New Frontiers High Performance Comp Exploit Multicore & Coprocessor Technol Karlsruhe Germany
As hardware devices like processor cores and memory sub-systems based on nano-scale technology nodes become more unreliable, the need for fault tolerant numerical computing engines, as used in many critical applicatio... 详细信息
来源: 评论