咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >USEFUSE: UTILE STRIDE FOR ENHA... 收藏
arXiv

USEFUSE: UTILE STRIDE FOR ENHANCED PERFORMANCE IN FUSED LAYER ARCHITECTURE OF DEEP NEURAL NETWORKS

作     者:Ibrahim, Muhammad Sohail Usman, Muhammad Lee, Jeong-A 

作者机构:Department of Mechanical Systems Engineering Kumoh National Institute of Technology Gumi-Si Korea Republic of Faculty of Informatics and Data Science University of Regensburg Regensburg Germany Department of Computer Engineering Chosun University Gwangju Korea Republic of 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2024年

核心收录:

主  题:Convolutional neural networks 

摘      要:Convolutional Neural Networks (CNNs) are crucial in various applications, but their deployment on resource-constrained edge devices poses challenges. This study presents the Sum-of-Products (SOP) units for convolution, which utilize low-latency left-to-right bit-serial arithmetic to minimize response time and enhance overall performance. The study proposes a methodology for fusing multiple convolution layers to reduce off-chip memory communication and increase overall performance. An effective mechanism detects and skips inefficient convolutions after ReLU layers, minimizing power consumption without compromising accuracy. Furthermore, efficient tile movement guarantees uniform access to the fusion pyramid. An analysis demonstrates the utile stride strategy improves operational intensity. Two designs cater to varied demands: one focuses on minimal response time for mission-critical applications, and another focuses on resource-constrained devices with comparable latency. This approach notably reduced redundant computations, improving the efficiency of CNN deployment on edge devices. © 2024, CC BY.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分