Search Results - Inner Mongolia University Library

A High-Accuracy Hardware-Efficient Multiply-Accumulate (MAC) Unit Based on Dual-Mode Truncation Error Compensation for CNNs

IEEE ACCESS, 2020, vol. 8, pp. 214716-214731

Authors: Tang, Song-Nien; Han, Yu-Shin (Chung Yuan Christian Univ, Dept Informat & Comp Engn, Taoyuan 320314, Taiwan)

This paper presents a multiply-accumulate (MAC) unit that enables a dual-mode truncation error compensation (TEC) scheme based on a fixed-width Booth multiplier (FWBM) for convolutional neural network (CNN) inference operations. The proposed tailored TEC schemes of Modes 1 and 2 can achieve high MAC accuracy for a general or rectified linear unit-based CNN model with general (Mode 1) or positive/zero (Mode 2) input patterns. By pre-calculating the pre-known CNN model coefficients, the proposed dual-mode TEC scheme can be realized using minimal partial product operations with high hardware efficiency using a software/hardware codesign approach. Further, a reconfigurable architecture of the resultant MAC unit is presented to realize the proposed dual-mode TEC scheme. By evaluating the accuracy for 9-N and 25-N MAC operations (N denotes the number of times MAC is performed), a MAC operation using the proposed TEC scheme can achieve the highest accuracy for Modes 1 and 2, relative to contrast samples that directly employ the FWBM with a conventional TEC function. The hardware performances of 9-N and 25-N MAC units are also evaluated using the TSMC 40-nm standard cell library. Compared with the contrast TEC-enabled designs, the proposed MAC unit exhibits higher hardware efficiency in terms of area, delay, and power consumption and achieves a minimum reduction of more than 40% in both area-delay-error and power-delay-error products. Moreover, the resultant 9-N and 25-N MAC units are verified using a system-on-chip field-programmable gate array platform to test a CNN model for handwritten digit classification.

Keywords: Multiply-accumulate (MAC); 2D convolution; convolutional neural network (CNN) accelerator; truncation error compensation; Booth multiplier

FPGA Implementation of Spatial Image Filters using Xilinx System Generator

Procedia Engineering, 2012, vol. 38, pp. 2244-2249

Authors: V. Elamaran; Angam Praveen; Medapati Srinivasa Reddy; Lanka Venkata Aditya; Kunta Suman (Department of Electronics & Communication Engineering, School of Electrical & Electronics Engineering, SASTRA University, Thanjavur, Tamilnadu, India)

The objective of this paper is to design, model, simulate, and synthesize spatial filtering techniques, in which the operation is performed within the neighborhood of a pixel. Recent increases in Field Programmable Gate Array (FPGA) performance and size offer a new hardware acceleration opportunity. The convolution filtering operations are implemented using Xilinx System Generator (XSG), which is the industry's leading high-level tool for designing high-performance DSP systems using FPGAs. The designs are modeled using the XSG block set and synthesized onto a Virtex 6 xc6vs315t-3ff156 FPGA device. The algorithms are validated using the hardware co-simulation method.

Keywords: Spatial Filtering; FPGA; System Generator; Co-simulation; 2D convolution

Free deterministic equivalent Z-scores of compound Wishart models: A goodness of fit test of 2D ARMA models

RANDOM MATRICES-THEORY AND APPLICATIONS, 2019, vol. 8, no. 2

Author: Hayase, Tomohiro (Univ Tokyo, Grad Sch Math Sci, Meguro Ku, 3-8-1 Komaba, Tokyo 1558914, Japan)

We introduce a new method to qualify the goodness of fit of the parameter estimation of compound Wishart models. Our method is based on the free deterministic equivalent Z-score, which we introduce in this paper. Furthermore, an application to the two-dimensional autoregressive moving-average model is provided. Our proposed method is a generalization of the statistical hypothesis testing for the one-dimensional moving-average model based on fluctuations of real compound Wishart matrices by Hasegawa et al. [A. Hasegawa, N. Sakuma and H. Yoshida, Fluctuations of Marchenko-Pastur limit of random matrices with dependent entries, Statist. Probab. Lett. 127 (2017) 85-96].

Keywords: Free probability; compound Wishart matrices; second-order freeness; fluctuation of matrices; free deterministic equivalents; 2D ARMA model; 2D convolution

Fixed-Point Convolutional Neural Network for Real-Time Video Processing in FPGA

IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)

Authors: Solovyev, Roman; Kustov, Alexander; Telpukhov, Dmitry; Rukhlov, Vladimir; Kalinin, Alexandr (Russian Acad Sci, IPPM, Inst Design Problems Microelect, Moscow, Russia; Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109, USA)

ISBN: (print) 9781728103396

Modern mobile neural networks with a reduced number of weights and parameters do a good job with image classification tasks, but even they may be too complex to be implemented in an FPGA for video processing tasks. The article proposes a neural network architecture for the practical task of recognizing images from a camera, which has several advantages in terms of speed. This is achieved by reducing the number of weights, moving from floating-point to fixed-point arithmetic, and applying a number of hardware-level optimizations associated with storing weights in blocks, a shift register, and an adjustable number of convolutional blocks that work in parallel. The article also proposes methods for adapting the existing data set to solve a different task. As the experiments showed, the proposed neural network copes well with real-time video processing even on cheap FPGAs.

Keywords: Neural network hardware; Field programmable gate arrays; Fixed-point arithmetic; 2D convolution
