Author affiliations: Shandong Univ, Sch Informat Sci & Engn ISE, Qingdao 266237, Peoples R China; Chinese Acad Sci, Inst Microelect, Key Lab Microelect Devices & Integrated Technol, Beijing 100045, Peoples R China; NeuMem Co Ltd, Hefei 266237, Peoples R China; Peking Univ, Sch Integrated Circuits, Beijing 100871, Peoples R China
Publication: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS (IEEE Trans. Circuits Syst. Regul. Pap.)
Year/Volume/Issue: 2024
Core indexing:
Funding: National Natural Science Foundation of China [62034006, 92264201, 62104134, 91964105]; China Key Research and Development Program [2022YFB3603900]; Natural Science Foundation of Shandong Province [ZR2020JQ28]; Program of Qilu Young Scholars of Shandong University; MIND Project [MINDXZ202407]
Subject: NOR flash memory; computing in memory; general computing; non-linear partial differential solver; flash memory circuit
Abstract: Solving partial differential equations (PDEs) requires precise numerical iterations that impose significant demands on computational resources and memory capacity. These demands can be addressed by adopting a computing-in-memory (CIM) architecture, which reduces the latency and power consumption of data transmission. Among PDEs, nonlinear PDEs are harder to treat both analytically and numerically, because the nonlinear terms introduce intricate dynamics. The high-precision requirements of PDE solvers, particularly nonlinear PDE solvers, pose challenges in constructing CIM PDE solvers. In this work, a flash-based high-precision PDE solver is demonstrated for solving intractable nonlinear partial differential equations. It is based on 55 nm NOR flash technology with well-optimized Program/Erase (PE) schemes. With the proposed optimization scheme, the PE endurance is enhanced up to 10^10 cycles, a record high, with suppressed cell degradation and robust reliability. Then, by applying the Fourier neural operator (FNO) to the optimized flash-based high-precision (32-bit) CIM in the hardware system, a series of nonlinear PDEs can be solved with a high energy efficiency of ~2 TOPS/W, which is ~110x higher than that of a CPU. Our optimization strategies make it feasible to use flash-based CIM for high-precision computing with frequent weight updating, and the demonstrated PDE solver provides an energy-saving solution for implementing general-purpose computation tasks.