检索结果-内蒙古大学图书馆

Radix-10 Restoring Square Root for 6-input LUTs Programmable Devices

CIRCUITS SYSTEMS AND SIGNAL PROCESSING 2021年第5期40卷 2335-2360页

作者： Vazquez, Martin Tosini, Marcelo Leiva, Lucas UNICEN Comp & Syst Dept Tandil Argentina

This paper proposes efficient fixed-point and floating-point implementations for radix-10 square root in Xilinx FPGAs devices. The method implements digit recurrence with restoring algorithm, which supports the three decimal floating-point (DFP) types specified in the IEEE 754-2008 standard. The technique used for restoring is optimal and novel. The designs use new techniques based on the efficient utilization of dedicated resources in the programmable devices. Implementations were made in Xilinx 7-series devices. For fixed-point square root, they are capable of operating up to 212 MHz for p=7, 197 MHz for p=16, and 190 MHz for p=34. As for DFP square root, the operation frequency obtained is 194 MHz for p=7, 183 MHz for p=16, and 174 MHz for p=34. The proposed architecture achieves better computation times than related works.

关键词： Square root digit-recurrence algorithm Decimal arithmetic Floating-point representation FPGA

来源：评论

学校读者我要写书评

暂无评论

An efficient FPGA architecture for integer nth root computation

引用

INTERNATIONAL JOURNAL OF ELECTRONICS 2015年第10期102卷 1675-1694页

作者： Rangel-Valdez, Nelson Hugo Barron-Zambrano, Jose Torres-Huitzil, Cesar Torres-Jimenez, Jose Univ Politecn Victoria Dept Tecnol Informac Victoria Tamaulipas Mexico CINVESTAV Tamaulipas Informat Technol Lab Victoria Tamaulipas Mexico

In embedded computing, it is common to find applications such as signal processing, image processing, computer graphics or data compression that might benefit from hardware implementation for the computation of integer roots of order [GRAPHICS] . However, the scientific literature lacks architectural designs that implement such operations for different values of N, using a low amount of resources. This article presents a parameterisable field programmable gate array (FPGA) architecture for an efficient Nth root calculator that uses only adders/subtractors and [GRAPHICS] location memory elements. The architecture was tested for different values of [GRAPHICS] , using 64-bit number representation. The results show a consumption up to 10% of the logical resources of a Xilinx XC6SLX45-CSG324C device, depending on the value of N. The hardware implementation improved the performance of its corresponding software implementations in one order of magnitude. The architecture performance varies from several thousands to seven millions of root operations per second.

关键词： eta th root algorithm arithmetic core FPGA digit-recurrence algorithm

来源：评论

学校读者我要写书评

暂无评论

Improved Decimal Floating-Point Logarithmic Converter Based on Selection by Rounding

引用

IEEE TRANSACTIONS ON COMPUTERS 2012年第5期61卷 607-621页

作者： Chen, Dongdong Han, Liu Choi, Younhee Ko, Seok-Bum Univ Saskatchewan Dept Elect & Comp Engn Sasaktoon SK S7N 5A9 Canada

This paper presents the algorithm and architecture of the decimal floating-point (DFP) logarithmic converter, based on the digit-recurrence algorithm with selection by rounding. The proposed approach can compute faithful DFP logarithm results for any one of the three DFP formats specified in the IEEE 754-2008 standard. In order to optimize the latency for the proposed design, we mainly integrate the following novel features: 1) using the redundant carry-save representation of the data path;2) reducing the number of iterations by determining the number of initial iteration;and 3) retiming and balancing the delay of the proposed architecture. The proposed architecture is synthesized with STM 90-nm standard cell library and the results show that the critical path delay and the number of clock cycles of the proposed Decimal64 logarithmic converter are 1.55 ns (34.4 FO4) and 19, respectively, and the total hardware complexity is 43,572 NAND2 gates. The delay estimation results of the proposed architecture show that its latency is close to that of the binary radix-16 logarithmic converter, and that it has a significant decrease on latency compared with a recently published high performance CORDIC implementation.

关键词： Decimal floating-point decimal logarithmic converter digit-recurrence algorithm selection by rounding

来源：评论

学校读者我要写书评

暂无评论

A 32-bit Decimal Floating-Point Logarithmic Converter

A 32-bit Decimal Floating-Point Logarithmic Converter

引用

19th IEEE Symposium on Computer Arithmetic (ARITH 2009)

作者： Chen, Dongdong Zhang, Yu Choi, Younhee Lee, Moon Ho Ko, Seok-Bum Univ Saskatchewan Dept Elect & Comp Engn Campus Dr 57 Saskatoon SK Canada Chonbuk Natl Univ Inst Informat & Commun Jeonju 561756 South Korea

ISBN: (纸本)9780769536705

This paper presents a new design and implementation of a 32-bit decimal floating-point (DFP) logarithmic converter based on the digit-recurrence algorithm. The converter can calculate accurate logarithms of 32-bit DFP numbers which are defined in the IEEE 754-2008 standard. Redundant digit e(1) is obtained by look-up table in the first iteration and the rest redundant digits e(j) are selected by rounding the scaled remainder during the succeeding iterations. The sequential architecture of the proposed 32-bit DFP logarithmic converter is implemented on Xilinx Virtex-II Pro P30 FPGA device and then synthesized with TMSC 0.18-um standard cell library. The implementation results indicate that the maximum frequency of the proposed architecture is 47.7 MHz in FPGA and 107.9 MHz in TMSC 0.18-um technology. The faithful 32-bit DFP logarithm results can be obtained in 18 cycles.

关键词： Decimal Logarithmic Converter Decimal Floating-Point digit-recurrence algorithm Selection by Rounding

来源：评论

学校读者我要写书评

暂无评论

Complex square root with operand prescaling

Complex square root with operand prescaling

引用

15th IEEE International Conference on Application-Specific Systems, Architectures and Processors

作者： Ercegovac, Milos D. Muller, Jean-Michel Univ Calif Los Angeles Dept Comp Sci Los Angeles CA 90095 USA Ecole Normale Super Lyon CNRS Lab CNRS ENSL INRIA UCBL LIP F-69364 Lyon France

ISBN: (纸本)0769522262

We propose a radix-r digit-recurrence algorithm for complex square-root. The operand is prescaled to allow the selection of square-root digits by rounding of the residual. This leads to a simple hardware implementation of digit selection. Moreover, the use of digit recurrence approach allows correct rounding of the result if needed. The algorithm, compatible with the complex division presented in Ercegovac and Muller ("Complex Division with Prescaling of the Operands," in Proc. Application-Specific Systems, Architectures, and Processors (ASAP'03), The Hague, The Netherlands, June 24-26, 2003), and its design are described. We also give rough estimates of its latency and cost with respect to implementation based on standard floating-point instructions as used in software routines for complex square root.

关键词： computer arithmetic complex square-root digit-recurrence algorithm operand prescaling

来源：评论

学校读者我要写书评

暂无评论

A hardware algorithm for fast logarithmic computation with exponential convergence rate

引用

JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS 2005年第4期28卷 749-752页

作者： Chen, CY Chen, RL Sheu, MH Feng Chia Univ Dept Informat Engn & Comp Sci Taichung 407 Taiwan Natl Yunlin Univ Sci & Technol Dept Elect Engn Yunlin 640 Taiwan

A hardware algorithm is proposed for improving the speed of the linear digit-recurrence logarithmic algorithm. The convergence rate of this logarithmic algorithm is exponential. Furthermore, the size of the lookup tables used in the algorithm is smaller than the size of the lookup tables used in the digit-recurrence algorithms. When the word length of the operand is less than or equal to 64 bits, the operations involved in each stage of the logarithmic computation only include small table lookup operation, digit-multiplication, and simple square operations. We conclude that the hardware implementation of our proposed algorithm is very efficient.

关键词： logarithmic computation digit-recurrence algorithm reciprocal approximation logarithmic number system

来源：评论

学校读者我要写书评

暂无评论

A digit-recurrence algorithm for cube rooting

引用

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES 2001年第5期E84A卷 1309-1314页

作者： Takagi, N Nagoya Univ Dept Informat Engn Nagoya Aichi 4648603 Japan

A digit-recurrence algorithm for cube rooting is proposed. In cube rooting, the digit-recurrence equation of the residual includes the square of the partial result of the cube root. In the proposed algorithm, the square of the partial result is kept, and the square, as well as the residual, is updated by addition/subtraction, shift, and multiplication by one or two digits. Different specific versions of the algorithm are possible, depending on the radix, the digit set of the cube root, and etc. Any version of the algorithm can be implemented as a sequential (folded) circuit or a combinational (unfolded) circuit, which is suitable for VLSI realization.

关键词： computer arithmetic cube rooting hardware algorithm digit-recurrence algorithm VLSI

来源：评论

学校读者我要写书评

暂无评论

Radix-4 reciprocal square-root and its combination with division and square root

引用

IEEE TRANSACTIONS ON COMPUTERS 2003年第9期52卷 1100-1114页

作者： Lang, T Antelo, E Univ Calif Irvine Dept Elect & Comp Engn Irvine CA 92697 USA Univ Santiago de Compostela Dept Elect & Comp Santiago De Compostela 15706 Spain

In this work, we present a reciprocal square root algorithm by digit recurrence and selection by a staircase function and the radix-4 implementation. As in similar algorithms for division and square root, the results are obtained correctly rounded in a straightforward manner (in constrast to existing methods to compute the reciprocal square root). Although, apparently, a single selection function can only be used for j greater than or equal to 2 (the selection constants are different for j = 0, j = 1, and j greater than or equal to 2), we show that it is possible to use a single selection function for all iterations. We perform a rough comparison with existing methods and we conclude that our implementation is a low hardware complexity solution with moderate latency, especially for exactly rounded results. We also extend the unit to support division and square root with the same selection function and with slight modifications in the initialization of the reciprocal square root unit.

关键词： combined division square root reciprocal square root digit-recurrence algorithm exact rounding staircase selection function

来源：评论

学校读者我要写书评

暂无评论

A VLSI algorithm for computing the Euclidean norm of a 3D vector

引用

IEEE TRANSACTIONS ON COMPUTERS 2000年第10期49卷 1074-1082页

作者： Takagi, N Kuwahara, S Nagoya Univ Dept Informat Engn Chikusa Ku Nagoya Aichi 4648603 Japan Toyota Motor Corp Tech Dept 1AV 14G Aichi 4718572 Japan

A digit-recurrence algorithm for computing the Euclidean norm of a three-dimensional (3D) vector which often appears in 3D computer graphics is proposed. One of the three squarings required for the usual computation is removed and the other two squarings, as well as the two additions, are overlapped with the square rooting. The Euclidean norm is computed by iteration of carry-propagation-free additions, shifts, and multiplications by one digit. Different specific versions of the algorithm are possible, depending on the radix, the redundancy factor of the digit set, and etc. Each version of the algorithm can be implemented as a sequential (folded) circuit or a combinational (unfolded) circuit, which has a regular array structure suitable for VLSI.

关键词： computer arithmetic Euclidean norm VLSI algorithm digit-recurrence algorithm computer graphics

来源：评论

学校读者我要写书评

暂无评论

Computation of √x/d in a very high radix combined division/square-root unit with scaling and selection by rounding

引用

IEEE TRANSACTIONS ON COMPUTERS 1998年第2期47卷 152-161页

作者： Antelo, E Lang, T Bruguera, JD Univ Santiago de Compostela Dept Electron & Comp Santiago De Compostela Spain Univ Calif Irvine Dept Elect & Comp Engn Irvine CA 92697 USA

A very-high radix digit-recurrence algorithm for the operation root s/d is developed, with residual scaling and digit selection by rounding. This is an extension of the division and square-root algorithms presented previously, and for which a combined unit was shown to provide a fast execution of these operations. The architecture of a combined unit to execute division, square-root, and root x/d is described, with inverse square-root as a special case. A comparison with the corresponding combined division and square-root unit shows a similar cycle time and an increase of one cycle for the extended operation with respect to square-root. To obtain an exactly rounded result for the extended operation a datapath of about 2n bits is needed. An alternative is proposed which requires approximately the same width as for square-root, but produces a result with an error of less than one ulp. The area increase with respect to the division and square root unit should be no greater than 15 percent. Consequently, whenever a Very high radix unit for division and square-root seems suitable, it might be profitable to implement the extended unit instead.

关键词： digit-recurrence algorithm division high-radix methods inverse square-root square-root

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：