咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >ON THE PERFORMANCE ANALYSIS OF... 收藏
arXiv

ON THE PERFORMANCE ANALYSIS OF MOMENTUM METHOD: A FREQUENCY DOMAIN PERSPECTIVE

作     者:Li, Xianliang Luo, Jun Zheng, Zhiwei Wang, Hanxiao Luo, Li Wen, Lingkun Wu, Linlong Xu, Sheng 

作者机构:Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences China University of Chinese Academy of Sciences China University of California Berkeley United States Institute of Automation Chinese Academy of Sciences China Sun Yat-sen University China Shanghai Astronomical Observatory Chinese Academy of Sciences China University of Luxembourg Luxembourg 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2024年

核心收录:

主  题:Wiener filtering 

摘      要:Momentum-based optimizers are widely adopted for training neural networks. However, the optimal selection of momentum coefficients remains elusive. This uncertainty impedes a clear understanding of the role of momentum in stochastic gradient methods. In this paper, we present a frequency domain analysis framework that interprets the momentum method as a time-variant filter for gradients, where adjustments to momentum coefficients modify the filter characteristics. Our experiments support this perspective and provide a deeper understanding of the mechanism involved. Moreover, our analysis reveals the following significant findings: high-frequency gradient components are undesired in the late stages of training;preserving the original gradient in the early stages, and gradually amplifying low-frequency gradient components during training both enhance performance. Based on these insights, we propose Frequency Stochastic Gradient Descent with Momentum (FSGDM), a heuristic optimizer that dynamically adjusts the momentum filtering characteristic with an empirically effective dynamic magnitude response. Experimental results demonstrate the superiority of FSGDM over conventional momentum optimizers. 1 © 2024, CC BY.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分