咨询与建议

限定检索结果

文献类型

  • 51 篇 期刊文献
  • 28 篇 会议

馆藏范围

  • 79 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 78 篇 工学
    • 71 篇 计算机科学与技术...
    • 57 篇 电气工程
    • 6 篇 软件工程
    • 3 篇 电子科学与技术(可...
    • 3 篇 信息与通信工程
    • 2 篇 网络空间安全
    • 1 篇 控制科学与工程
  • 6 篇 理学
    • 5 篇 数学
    • 1 篇 物理学
  • 2 篇 管理学
    • 2 篇 管理科学与工程(可...

主题

  • 79 篇 algorithm-based ...
  • 14 篇 concurrent error...
  • 8 篇 fault tolerance
  • 8 篇 matrix multiplic...
  • 7 篇 error detection
  • 5 篇 fault tolerant s...
  • 4 篇 error correction
  • 4 篇 sparse grid comb...
  • 4 篇 checkpointing
  • 4 篇 checksum encodin...
  • 3 篇 fault diagnosis
  • 3 篇 weighted sum par...
  • 3 篇 simd
  • 3 篇 silent errors
  • 3 篇 silent data corr...
  • 3 篇 avx-512
  • 3 篇 high-performance...
  • 3 篇 parallel computi...
  • 3 篇 high performance...
  • 3 篇 pde solvers

机构

  • 6 篇 univ calif river...
  • 6 篇 princeton univ d...
  • 6 篇 univ calif davis...
  • 2 篇 princeton univ d...
  • 2 篇 univ calif river...
  • 2 篇 chinese acad sci...
  • 2 篇 australian natl ...
  • 2 篇 oak ridge natl l...
  • 1 篇 italian natl agc...
  • 1 篇 penn state univ ...
  • 1 篇 univ calif davis...
  • 1 篇 univ quebec dept...
  • 1 篇 national microel...
  • 1 篇 sungkyunkwan uni...
  • 1 篇 georgia inst tec...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ lyon inria ...
  • 1 篇 politecn milan d...
  • 1 篇 carnegie mellon ...
  • 1 篇 sandia natl labs...

作者

  • 9 篇 chen zizhong
  • 8 篇 jha nk
  • 8 篇 redinbo gr
  • 4 篇 wu panruo
  • 4 篇 zhai yujia
  • 4 篇 chen jieyang
  • 4 篇 banerjee p
  • 4 篇 zhao kai
  • 3 篇 nguyen c
  • 3 篇 ouyang kaiming
  • 3 篇 liang xin
  • 3 篇 strazdins peter ...
  • 3 篇 harding brendan
  • 3 篇 li sihuan
  • 3 篇 vinnakota b
  • 3 篇 abraham ja
  • 2 篇 grover pulkit
  • 2 篇 liu jinyang
  • 2 篇 mayo jackson r.
  • 2 篇 tao dingwen

语言

  • 78 篇 英文
  • 1 篇 其他
检索条件"主题词=Algorithm-Based Fault Tolerance"
79 条 记 录,以下是31-40 订阅
排序:
Correcting DFT Codes with a Modified Berlekamp-Massey algorithm and Kalman Recursive Syndrome Extension
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2014年 第1期63卷 196-203页
作者: Redinbo, G. Robert Univ Calif Davis Dept Elect & Comp Engn Davis CA 95616 USA
Real number block codes derived from the discrete Fourier transform (DFT) are corrected by coupling a very modified Berlekamp-Massey (BM) algorithm with a syndrome extension process. The modified BM algorithm determin... 详细信息
来源: 评论
Rollback-Free Recovery for a High Performance Dense Linear Solver With Reduced Memory Footprint
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2024年 第7期35卷 1307-1319页
作者: Loreti, Daniela Artioli, Marcello Ciampolini, Anna Univ Bologna Dept Comp Sci & Engn I-40126 Bologna Italy Italian Natl Agcy New Technol Energy & Sustainable I-40129 Bologna Italy
The scale of nowadays High Performance Computing (HPC) systems is the key element that determines the achievement of impressive performance, as well as the reason for their relatively limited reliability. Over the las... 详细信息
来源: 评论
FT-PBLAS: PBLAS-based fault-Tolerant Linear Algebra Computation on High-performance Computing Systems
收藏 引用
IEEE ACCESS 2020年 8卷 42674-42688页
作者: Zhu, Yanchao Liu, Yi Zhang, Guozhen Beihang Univ Sch Comp Sci & Engn Sino German Joint Software Inst Beijing 100191 Peoples R China Beihang Univ Beijing Key Lab Network Technol Beijing 100191 Peoples R China
As high-performance computing (HPC) systems have scaled up, resilience has become a great challenge. To guarantee resilience, various kinds of hardware and software techniques have been proposed. However, among popula... 详细信息
来源: 评论
EVALUATION AND COMPARISON OF fault-TOLERANT SOFTWARE TECHNIQUES
收藏 引用
IEEE TRANSACTIONS ON RELIABILITY 1993年 第2期42卷 190-204页
作者: HUDAK, J SUH, BH SIEWIOREK, D SEGALL, Z CARNEGIE MELLON UNIV DEPT ELECT & COMP ENGNPITTSBURGHPA 15213
Various fault-tolerant software techniques have been proposed in order to meet the reliability requirements of critical systems. This paper evaluates 4 implementations of fault-tolerant software techniques with respec... 详细信息
来源: 评论
Tests and tolerances for high-performance software-implemented fault detection
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2003年 第5期52卷 579-591页
作者: Turmon, M Granat, R Katz, DS Lou, JZ Jet Prop Lab Data Understanding Syst Grp Pasadena CA 91109 USA Jet Prop Lab Parallel Applicat Technol Grp Pasadena CA 91109 USA
We describe and test a software approach to fault detection in common numerical algorithms. Such result checking or algorithm-based fault tolerance (ABFT) methods may be used, for example, to overcome single-event ups... 详细信息
来源: 评论
fault-tolerant computation in groups and semigroups: applications to automata, dynamic systems and Petri nets
收藏 引用
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS 2002年 第4-5期339卷 387-430页
作者: Hadjicostis, CN Verghese, GC Univ Illinois Dept Elect & Comp Engn Urbana IL 61801 USA
The traditional approach to fault-tolerant computation has been via modular hardware redundancy. Although universal and simple, modular redundancy is inherently expensive and inefficient. By exploiting particular stru... 详细信息
来源: 评论
Decoding real-number convolutional codes: Change detection, Kalman estimation
收藏 引用
IEEE TRANSACTIONS ON INFORMATION THEORY 1997年 第6期43卷 1864-1876页
作者: Redinbo, GR Department of Electrical and Computer Engineering University of California Davis CA USA
Convolutional codes which employ real-number symbols are difficult to decode because of the size of the alphabet and the numerical and roundoff noise inherent in arithmetic operations. Such codes find applications in ... 详细信息
来源: 评论
On-line soft error correction in matrix-matrix multiplication
收藏 引用
JOURNAL OF COMPUTATIONAL SCIENCE 2013年 第6期4卷 465-472页
作者: Wu, Panruo Ding, Chong Chen, Longxiang Davies, Teresa Karlsson, Christer Chen, Zizhong Colorado Sch Mines Dept Elect Engn & Comp Sci Golden CO 80401 USA Univ Calif Riverside Dept Comp Sci & Engn Riverside CA 92521 USA
Soft errors are one-time events that corrupt the state of a computing system but not its overall functionality. Soft errors normally do not interrupt the execution of the affected program, but the affected computation... 详细信息
来源: 评论
Extending backward error assertions to tolerance of large errors in floating point computations
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 1997年 第4期46卷 505-510页
作者: Fitzpatrick, P NATL UNIV IRELAND UNIV COLL CORK NATL MICROELECT RES CTR CORK IRELAND
The use of backward error assertions combined with iterative refinement has been suggested for the correction of small fault induced errors in the floating point solution of linear systems. We extend this to the corre... 详细信息
来源: 评论
Graceful degradation in algorithm-based fault tolerant multiprocessor systems
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 1997年 第2期8卷 137-153页
作者: Yajnik, S Jha, NK PRINCETON UNIV DEPT ELECT ENGN PRINCETON NJ 08544 USA
algorithm-based fault tolerance (ABFT) is a technique which improves the reliability of a multiprocessor system by providing concurrent error detection and fault location capability to it. It encodes data at the syste... 详细信息
来源: 评论