咨询与建议

限定检索结果

文献类型

  • 51 篇 期刊文献
  • 28 篇 会议

馆藏范围

  • 79 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 78 篇 工学
    • 71 篇 计算机科学与技术...
    • 57 篇 电气工程
    • 6 篇 软件工程
    • 3 篇 电子科学与技术(可...
    • 3 篇 信息与通信工程
    • 2 篇 网络空间安全
    • 1 篇 控制科学与工程
  • 6 篇 理学
    • 5 篇 数学
    • 1 篇 物理学
  • 2 篇 管理学
    • 2 篇 管理科学与工程(可...

主题

  • 79 篇 algorithm-based ...
  • 14 篇 concurrent error...
  • 8 篇 fault tolerance
  • 8 篇 matrix multiplic...
  • 7 篇 error detection
  • 5 篇 fault tolerant s...
  • 4 篇 error correction
  • 4 篇 sparse grid comb...
  • 4 篇 checkpointing
  • 4 篇 checksum encodin...
  • 3 篇 fault diagnosis
  • 3 篇 weighted sum par...
  • 3 篇 simd
  • 3 篇 silent errors
  • 3 篇 silent data corr...
  • 3 篇 avx-512
  • 3 篇 high-performance...
  • 3 篇 parallel computi...
  • 3 篇 high performance...
  • 3 篇 pde solvers

机构

  • 6 篇 univ calif river...
  • 6 篇 princeton univ d...
  • 6 篇 univ calif davis...
  • 2 篇 princeton univ d...
  • 2 篇 univ calif river...
  • 2 篇 chinese acad sci...
  • 2 篇 australian natl ...
  • 2 篇 oak ridge natl l...
  • 1 篇 italian natl agc...
  • 1 篇 penn state univ ...
  • 1 篇 univ calif davis...
  • 1 篇 univ quebec dept...
  • 1 篇 national microel...
  • 1 篇 sungkyunkwan uni...
  • 1 篇 georgia inst tec...
  • 1 篇 oak ridge natl l...
  • 1 篇 univ lyon inria ...
  • 1 篇 politecn milan d...
  • 1 篇 carnegie mellon ...
  • 1 篇 sandia natl labs...

作者

  • 9 篇 chen zizhong
  • 8 篇 jha nk
  • 8 篇 redinbo gr
  • 4 篇 wu panruo
  • 4 篇 zhai yujia
  • 4 篇 chen jieyang
  • 4 篇 banerjee p
  • 4 篇 zhao kai
  • 3 篇 nguyen c
  • 3 篇 ouyang kaiming
  • 3 篇 liang xin
  • 3 篇 strazdins peter ...
  • 3 篇 harding brendan
  • 3 篇 li sihuan
  • 3 篇 vinnakota b
  • 3 篇 abraham ja
  • 2 篇 grover pulkit
  • 2 篇 liu jinyang
  • 2 篇 mayo jackson r.
  • 2 篇 tao dingwen

语言

  • 78 篇 英文
  • 1 篇 其他
检索条件"主题词=algorithm-based fault tolerance"
79 条 记 录,以下是61-70 订阅
排序:
Protecting wavelet lifting transforms
Protecting wavelet lifting transforms
收藏 引用
10th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC 2004)
作者: Redinbo, GR Nguyen, C Univ Calif Davis Dept Elect & Comp Engn Davis CA 95616 USA
Wavelet transforms are the central to many applications in image processing and data compression. They have banks of multi-rate filters that are difficult to protect from. computer-induced numerical errors. An efficie... 详细信息
来源: 评论
A Highly-Efficient Error Detection Technique for General Matrix Multiplication using Tiled Processing on SIMD Architecture  40
A Highly-Efficient Error Detection Technique for General Mat...
收藏 引用
IEEE 40th International Conference on Computer Design (ICCD)
作者: Mummidi, Chandra Sekhar Bal, Sandeep Goldstein, Brunno F. Srinivasan, Sudarshan Kundu, Sandip Univ Massachusetts Amherst MA 01003 USA Univ Fed Rio de Janeiro UFRJ Rio De Janeiro Brazil Intel Labs Mumbai Maharashtra India
General Matrix Multiplication (GEMM) is instrumental in myriads of scientific, high-performance computing, and machine learning applications such as computer vision, recommendation models, and weather forecasts. It is... 详细信息
来源: 评论
Highly Scalable algorithms for the Sparse Grid Combination Technique  29
Highly Scalable Algorithms for the Sparse Grid Combination T...
收藏 引用
29th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Strazdins, Peter E. Ali, Md Mohsin Harding, Brendan Australian Natl Univ Res Sch Comp Sci Canberra ACT Australia Australian Natl Univ Inst Math Sci Canberra ACT Australia
Many petascale and exascale scientific simulations involve the time evolution of systems modelled as Partial Differential Equations (PDEs). The sparse grid combination technique (SGCT) is a cost-effective method for s... 详细信息
来源: 评论
ERROR DETECTION IN DIGITAL NEURAL NETWORKS - AN algorithm-based APPROACH FOR INNER PRODUCT PROTECTION
ERROR DETECTION IN DIGITAL NEURAL NETWORKS - AN ALGORITHM-BA...
收藏 引用
5th SPIE Conference on Advanced Signal Processing - algorithms, Architectures, and Implementations
作者: BREVEGLIERI, L PIURI, V POLITECN MILAN DEPT ELECTR & INFORMATI-20133 MILANITALY
Artificial Neural Networks are an interesting solution for several real-time applications in the area of signal and image processing, in particular since recent advances in VLSI integration technologies allow for effi... 详细信息
来源: 评论
3D Coded SUMMA: Communication-Efficient and Robust Parallel Matrix Multiplication  26th
3D Coded SUMMA: Communication-Efficient and Robust Parallel ...
收藏 引用
26th International Conference on Parallel and Distributed Computing (Euro-Par)
作者: Jeong, Haewon Yang, Yaoqing Gupta, Vipul Engelmann, Christian Low, Tze Meng Cadambe, Viveck Ramchandran, Kannan Grover, Pulkit Carnegie Mellon Univ Pittsburgh PA 15213 USA Univ Calif Berkeley Berkeley CA USA Oak Ridge Natl Lab Oak Ridge TN USA Penn State Univ State Coll PA USA
In this paper, we propose a novel fault-tolerant parallel matrix multiplication algorithm called 3D Coded SUMMA that achieves higher failure-tolerance than replication-based schemes for the same amount of redundancy. ... 详细信息
来源: 评论
Combining backward and forward recovery to cope with silent errors in iterative solvers  29
Combining backward and forward recovery to cope with silent ...
收藏 引用
29th IEEE International Parallel and Distributed Processing Symposium (IPDPS)
作者: Fasi, Massimiliano Robert, Yves Ucar, Bora Ecole Normale Super Lyon Lyon France CNRS F-75700 Paris France INRIA Rocquencourt France Univ Bologna I-40126 Bologna Italy Univ Knoxville Knoxville TN USA
Several recent papers have introduced a periodic verification mechanism to detect silent errors in iterative solvers. Chen [PPoPP' 13, pp. 167-176] has shown how to combine such a verification mechanism (a stabili... 详细信息
来源: 评论
Supporting the Development of Resilient Message Passing Applications using Simulation
Supporting the Development of Resilient Message Passing Appl...
收藏 引用
22nd Euromicro International Conference on Parallel, Distributed, and Network-based Processing (PDP)
作者: Naughton, Thomas Engelmann, Christian Vallee, Geoffroy Boehm, Swen Oak Ridge Natl Lab Comp Sci & Math Div Oak Ridge TN 37831 USA
An emerging aspect of high-performance computing (HPC) hardware/software co-design is investigating performance under failure. The work in this paper extends the Extreme-scale Simulator (xSim), which was designed for ... 详细信息
来源: 评论
In-Situ Mitigation of Silent Data Corruption in PDE Solvers
In-Situ Mitigation of Silent Data Corruption in PDE Solvers
收藏 引用
ACM Workshop on fault-tolerance for HPC at Extreme Scale (FTXS)
作者: Salloum, Maher Mayo, Jackson R. Armstrong, Robert C. Sandia Natl Labs Livermore CA 94551 USA
We present algorithmic techniques for parallel PDE solvers that leverage numerical smoothness properties of physics simulation to detect and correct silent data corruption within local computations. We initially model... 详细信息
来源: 评论
A Case Study of Designing Efficient algorithm-based fault Tolerant Application for Exascale Parallelism
A Case Study of Designing Efficient Algorithm-based Fault To...
收藏 引用
26th IEEE International Parallel and Distributed Processing Symposium (IPDPS) / Workshop on High Performance Data Intensive Computing
作者: Yao, Erlin Wang, Rui Chen, Mingyu Tan, Guangming Sun, Ninghui Chinese Acad Sci Inst Comp Technol State Key Lab Comp Architecture Beijing Peoples R China
fault tolerance overhead of high performance computing (HPC) applications is becoming critical to the efficient utilization of HPC systems at large scale. Today's HPC applications typically tolerate fail-stop fail... 详细信息
来源: 评论
Concurrent error detection in fast unitary transform algorithms
Concurrent error detection in fast unitary transform algorit...
收藏 引用
International Conference on Dependable Systems and Networks (DSN 2001)
作者: Redinbo, GR Univ Calif Davis Dept Elect & Comp Engn Davis CA 95616 USA
Discrete fast unitary transform algorithms, of which the fast Fourier transform (FFT) and fast discrete Cosine transform (DCT) are practical examples, are highly susceptible to temporary calculation failures because o... 详细信息
来源: 评论