Scientific and engineering applications rely on floating-point arithmetic to approximate real numbers. Because of the rounding errors inherent in floating-point numbers, errors can propagate and accumulate during calculations, leading to serious inaccuracies that may compromise the safety and reliability of a program. In theory, the most accurate method of error detection is to exhaustively search all possible floating-point inputs, but this is infeasible in practice because of the huge search space involved. Detecting maximum floating-point errors both effectively and efficiently has therefore remained a challenge. To address it, we design and implement an error detection tool for floating-point arithmetic expressions called HSED. It modifies mantissas under the double-precision floating-point type to simulate hierarchical searches from half or single precision up to double precision. Experimental results on 32 single-parameter arithmetic expressions from the FPBench benchmark suite show that both the detection effectiveness and the performance of HSED are significantly better than those of the state-of-the-art error detection tools Herbie, S3FP and ATOMU. HSED outperforms Herbie, Herbie+, S3FP and ATOMU in 24, 19, 27 and 25 cases, respectively. The average time taken by Herbie, Herbie+ and S3FP is 1.82, 11.20 and 129.15 times that of HSED, respectively.
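The core idea, a coarse sweep followed by refinement in the mantissa neighborhood of promising inputs, can be conveyed independently of the tool. Below is a minimal sketch assuming Python with only the standard library; the expression, search range, and function names are illustrative and are not HSED's actual algorithm or interface.

    import math
    from fractions import Fraction

    def f_double(x):
        # The expression under test, evaluated in binary64; it cancels
        # catastrophically for large x.
        return 1.0 / (x + 1.0) - 1.0 / x

    def f_exact(x):
        # The same expression over exact rationals, used as the oracle.
        q = Fraction(x)
        return 1 / (q + 1) - 1 / q

    def ulp_error(x):
        got = f_double(x)
        return abs(Fraction(got) - f_exact(x)) / Fraction(math.ulp(got))

    # Stage 1: coarse scan over the input range, standing in for the
    # low-precision sweep.
    best = max((2.0 ** e for e in range(1, 40)), key=ulp_error)

    # Stage 2: refine by perturbing the mantissa, i.e. stepping through
    # neighboring doubles while the error keeps growing.
    for _ in range(1000):
        cand = math.nextafter(best, math.inf)
        if ulp_error(cand) <= ulp_error(best):
            break
        best = cand
    print(best, float(ulp_error(best)))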
Assume we use a binary floating-point arithmetic and that RN is the round-to-nearest function. Also assume that c is a constant or a real function of one or more variables, and that we have at our disposal a correctly rounded implementation of c, say ĉ = RN(c). For evaluating x·c (resp. x/c or c/x), the natural way is to replace it by RN(x·ĉ) (resp. RN(x/ĉ) or RN(ĉ/x)), that is, to call function ĉ and to perform a floating-point multiplication or division. This can be generalized to the approximation of n/d by RN(n̂/d̂) and the approximation of n·d by RN(n̂·d̂), where n̂ = RN(n) and d̂ = RN(d), and n and d are functions for which we have at our disposal a correctly rounded implementation. We discuss tight error bounds in ulps of such approximations. From our results, one immediately obtains tight error bounds for calculations such as x*pi, ln(2)/x, x/(y+z), (x+y)*z, x/sqrt(y), sqrt(x)/y, (x+y)*(z+t), (x+y)/(z+t), (x+y)/(z*t), etc. in floating-point arithmetic.
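For a concrete instance of the x·c case with c = pi, the ulp error of RN(x·ĉ) can be measured directly against a high-precision reference. A small sketch, assuming Python with the mpmath package; the function name is ours.

    import math
    from mpmath import mp, mpf

    mp.prec = 200                        # reference precision far beyond binary64

    def ulps_of_error(x):
        c_hat = math.pi                  # c_hat = RN(pi) in binary64
        computed = x * c_hat             # RN(x * c_hat)
        exact = mpf(x) * mp.pi           # near-exact reference for x * pi
        return abs(mpf(computed) - exact) / mpf(math.ulp(computed))

    for x in (1.0, 3.5, 1e10, 7e-5):
        print(x, float(ulps_of_error(x)))

The observed errors remain within a couple of ulps (each rounding contributes at most half an ulp, plus the propagated error of ĉ), which is the regime whose tight bounds the paper establishes.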
Floating-point arithmetic is a well-known and extremely efficient way of performing approximate computations over the real numbers. Although it requires some careful considerations, floating-point numbers are nowadays routinely used to prove mathematical theorems. Numerical computations have been applied in the context of formal proofs too, as illustrated by the CoqInterval library. But these computations do not benefit from the powerful floating-point units available in modern processors, since they are emulated inside the logic of the formal system. This paper experiments with the use of hardware floating-point numbers for numerically intensive proofs verified by the Coq proof assistant. This gives rise to various questions regarding the formalization, the implementation, the usability, and the level of trust. The approach has been applied to the CoqInterval and ValidSDP libraries, and demonstrates a speedup of at least one order of magnitude.
ISBN (digital): 9789819777372
ISBN (print): 9789819777365; 9789819777372
We show that there is a discrepancy between the emulated floating-point multiplication in the submission package of the digital signature Falcon and the claimed behavior. In particular, we show that some floating-point products with absolute values smaller than the smallest normal positive floating-point number are incorrectly zeroized. However, we show that the discrepancy does not affect the complex fast Fourier transform in the signature generation of Falcon by modeling the floating-point addition, subtraction, and multiplication in CryptoLine. We later implement our own floating-point multiplications in Armv7-M assembly and Jasmin and prove their equivalence with our model, demonstrating the possibility of transferring the challenging verification task (verifying highly optimized assembly) to the presumably more readable code base (Jasmin).
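The failure mode can be reproduced with a toy model; the sketch below is ours (Python), not Falcon's emulation code, and simply zeroizes any product that falls below the normal range where IEEE 754 arithmetic would produce a subnormal.

    MIN_NORMAL = 2.0 ** -1022   # smallest positive normal binary64 number

    def mul_flush_to_zero(a, b):
        # Toy emulated multiply that incorrectly zeroizes results whose
        # magnitude lies below the normal range, instead of returning a
        # subnormal as IEEE 754 multiplication does.
        p = a * b
        return 0.0 if 0.0 < abs(p) < MIN_NORMAL else p

    x, y = 2.0 ** -600, 2.0 ** -450     # exact product 2**-1050 is subnormal
    print(x * y)                        # IEEE 754: a nonzero subnormal
    print(mul_flush_to_zero(x, y))      # toy emulation: 0.0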
Motivated by the unexpected failure of the triangle intersection component of the Projection Algorithm for Nonmatching Grids (PANG), this article provides a robust version with proof of backward stability. The new triangle intersection algorithm ensures consistency and parsimony across three types of calculations. The set of intersections produced by the algorithm, called representations, is shown to match the set of geometric intersections, called models. The article concludes with a comparison between the old and new intersection algorithms for PANG using an example found to reliably generate failures in the former.
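The kind of inconsistency that breaks naive intersection code is visible already in the 2D orientation predicate. A sketch in Python; exact rational evaluation is shown as a generic remedy and is not the specific algorithm of the paper.

    from fractions import Fraction

    def orient_float(a, b, c):
        # Sign of the doubled signed area of triangle abc in binary64;
        # rounding can flip the sign for nearly collinear points, making
        # the pairwise tests of an intersection disagree with each other.
        return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])

    def orient_exact(a, b, c):
        # The same determinant over exact rationals: never inconsistent.
        fa, fb, fc = (tuple(map(Fraction, p)) for p in (a, b, c))
        return (fb[0] - fa[0]) * (fc[1] - fa[1]) - (fb[1] - fa[1]) * (fc[0] - fa[0])

A robust routine can evaluate the cheap floating-point predicate first and fall back on the exact one whenever the result is too close to zero to be trusted.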
This paper concerns test matrices for numerical linear algebra using an error-free transformation of floating-point arithmetic. For eigenvalues specified by a user, we propose methods of generating a matrix whose eigenvalues are exactly known, based on, for example, the Schur or Jordan normal form and a block diagonal form. It is also possible to produce a real matrix with specified complex eigenvalues. Such test matrices with exactly known eigenvalues are useful for checking the accuracy of results computed by numerical algorithms. In particular, exact errors of eigenvalues can be monitored. To generate test matrices, we first propose an error-free transformation for the product of three matrices YSX. We approximate S by S' to compute YS'X without a rounding error. Next, the error-free transformation is applied to the generation of test matrices with exactly known eigenvalues. Note that the exactly known eigenvalues of the constructed matrix may differ from the anticipated given eigenvalues. Finally, numerical examples are introduced to check the accuracy of numerical computations for symmetric and unsymmetric eigenvalue problems.
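The flavor of the construction can be shown with a much simpler device than the paper's YSX transformation: conjugate a diagonal matrix by an integer matrix whose inverse is also an integer matrix, so that every product is exact and the eigenvalues of the stored matrix are known exactly. A sketch assuming Python with NumPy; the particular matrices are ours.

    import numpy as np

    # X = I + 7*e2*e0^T is unimodular, so X^{-1} = I - 7*e2*e0^T exactly.
    n = 4
    D = np.diag([1, 10, 100, 1000])
    X = np.eye(n, dtype=np.int64);    X[2, 0] = 7
    Xinv = np.eye(n, dtype=np.int64); Xinv[2, 0] = -7

    # All entries are small integers, so A = X @ D @ Xinv is computed
    # exactly and is exactly representable in binary64.
    A = X @ D @ Xinv
    print(np.linalg.eigvals(A.astype(float)))   # exact truth: 1, 10, 100, 1000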
ISBN (digital): 9781665478274
ISBN (print): 9781665478274
Some recent processors are not equipped with an integer division unit. Compilers then implement division by a call to a special function supplied by the processor designers, which implements division by a loop producing one bit of quotient per iteration. This hinders compiler optimizations and results in non-constant time computation, which is a problem in some applications. We advocate instead using the processor's floating-point unit, and propose code that the compiler can easily interleave with other computations. We fully proved the correctness of our algorithm, which mixes floating-point and fixed-bitwidth integer computations, using the Coq proof assistant and successfully integrated it into the CompCert formally verified compiler.
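The gist of the floating-point approach can be sketched in a few lines. This simplified version, with an explicit correction step, is our illustration in Python and not the paper's formally verified algorithm.

    def udiv32(n, d):
        # Unsigned 32-bit division via the FPU: n and d (d != 0) are below
        # 2**32, so both convert to binary64 exactly, and the rounded
        # quotient is off by at most one; one correction step repairs it.
        q = int(n / d)               # floating-point divide, then truncate
        if q * d > n:                # quotient rounded up past the target
            q -= 1
        elif (q + 1) * d <= n:       # quotient rounded down below it
            q += 1
        return q

    assert all(udiv32(n, d) == n // d
               for n in (0, 1, 3**20, 2**32 - 1)
               for d in (1, 3, 2**31, 2**32 - 1))

Unlike the bit-by-bit loop, the floating-point route runs in a fixed number of operations, which is what makes it attractive for constant-time code.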
We present algorithms for performing the five elementary arithmetic operations (+, −, ×, ÷, and √) in floating-point arithmetic with stochastic rounding, and demonstrate the value of these algorithms by discussing various applications where stochastic rounding is beneficial. The algorithms require that the hardware be compliant with the IEEE 754 floating-point standard and that a floating-point pseudorandom number generator be available. The goal of these techniques is to emulate stochastic rounding when the underlying hardware does not support this rounding mode, as is the case for most existing CPUs and GPUs. By simulating stochastic rounding in software, one has the possibility to explore the behavior of this rounding mode and develop new algorithms even without having access to hardware implementing stochastic rounding; once such hardware becomes available, it suffices to replace the proposed algorithms by calls to the corresponding hardware routines. When stochastically rounding double precision operations, the algorithms we propose are between 7.3 and 19 times faster than the implementations that use the GNU MPFR library to simulate extended precision. We test our algorithms on various tasks, including summation algorithms and solvers for ordinary differential equations, where stochastic rounding is expected to bring advantages.
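For addition, the emulation can be sketched with an error-free transformation: 2Sum recovers the exact rounding error of the nearest-rounded sum, and that residual drives the random rounding decision. A minimal Python sketch under the same IEEE 754 assumption; the paper's algorithms, including those for division and square root, are more involved.

    import math
    import random

    def add_sr(a, b):
        # Stochastically rounded a + b, emulated in binary64.
        s = a + b                          # RN(a + b)
        t = s - a                          # 2Sum: recover the exact error,
        err = (a - (s - t)) + (b - t)      # so that a + b == s + err exactly
        if err == 0.0:
            return s
        # Round toward the neighbor in the direction of err with probability
        # |err| / gap, where gap is the distance to that neighbor.
        neighbor = math.nextafter(s, math.copysign(math.inf, err))
        gap = abs(neighbor - s)
        return neighbor if random.random() < abs(err) / gap else s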
ISBN (digital): 9798331528850
ISBN (print): 9798331528867
With the advancements in image processing and machine learning, greater challenges have been posed to parallel computing, especially in the realm of floating-point arithmetic. In the face of increasingly complex application scenarios, single-precision floating-point units (FPUs) are proving inadequate in terms of flexibility and versatility. Therefore, this paper proposes a design for a multi-precision floating-point unit. The FPU of this design supports multiple precision formats, including fp32, fp16, fp8, and variable-precision fp16, achieving high multi-precision flexibility. The design effectively reduces the calculation cycle while ensuring efficient execution performance, meeting the diverse accuracy needs of different application scenarios. In addition to basic arithmetic functions, the design also implements single instruction multiple data (SIMD) functionality, further enhancing the processing power and efficiency of the arithmetic unit. Meanwhile, it introduces a series of custom simd_fmt instructions, extending the RISC-V instruction set to support a wider range of computational operations. These instructions exhibit significant performance advantages when dealing with multithreaded tasks and vector operations.
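In software, the SIMD flavor of such a unit can be mimicked by packing two fp16 lanes into one 32-bit word and operating lane-wise. A sketch assuming Python with NumPy; the lane layout and names are ours, not the paper's hardware design.

    import numpy as np

    def simd_add_2xfp16(x, y):
        # x, y: uint32 arrays, each word holding two fp16 lanes.
        xl = x.view(np.float16).reshape(-1, 2)    # reinterpret words as lane pairs
        yl = y.view(np.float16).reshape(-1, 2)
        return (xl + yl).view(np.uint32).ravel()  # lane-wise add, then repack

    a = np.array([1.5, -2.0], dtype=np.float16).view(np.uint32)  # one packed word
    b = np.array([0.25, 4.0], dtype=np.float16).view(np.uint32)
    print(simd_add_2xfp16(a, b).view(np.float16))                # [1.75  2.0]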
ISBN (digital): 9798350316926
ISBN (print): 9798350316933
Efficient multiple precision linear numerical computation libraries such as MPLAPACK are critical in dealing with ill-conditioned problems. Specifically, there are optimization methods for matrix multiplication, such as the Strassen algorithm and the Ozaki scheme, which can be used to speed up computation. For complex matrix multiplication, the 3M method can also be used, which requires only three multiplications of real matrices, instead of the 4M method, which requires four multiplications of real matrices. In this study, we extend these optimization methods to arbitrary precision complex matrix multiplication and verify the possible increase in computation speed through benchmark tests. The optimization methods are also applied to complex LU decomposition using matrix multiplication to demonstrate that the Ozaki scheme can be used to achieve higher computation speeds.
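The 3M trick itself is compact. A NumPy sketch over ordinary double-precision matrices (the arbitrary-precision setting of MPLAPACK is assumed away here):

    import numpy as np

    def cgemm_3m(A, B):
        # (Ar + i*Ai)(Br + i*Bi) with three real products instead of four:
        # real part = t1 - t2, imaginary part = t3 - t1 - t2.
        Ar, Ai = A.real, A.imag
        Br, Bi = B.real, B.imag
        t1 = Ar @ Br
        t2 = Ai @ Bi
        t3 = (Ar + Ai) @ (Br + Bi)
        return (t1 - t2) + 1j * (t3 - t1 - t2)

    rng = np.random.default_rng(0)
    A = rng.standard_normal((50, 50)) + 1j * rng.standard_normal((50, 50))
    B = rng.standard_normal((50, 50)) + 1j * rng.standard_normal((50, 50))
    print(np.allclose(cgemm_3m(A, B), A @ B))    # True, up to rounding

The saved multiplication comes at the cost of a slightly weaker componentwise error bound, one reason benchmarks like the paper's matter before adopting 3M in high-precision libraries.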