This article deals with nonconvex stochastic optimization problems in deep learning. Appropriate learning rates, based on theory, for adaptive-learning-rate optimization algorithms (e.g., Adam and AMSGrad) to approximate the stationary points of such problems are provided. These rates are shown to allow faster convergence than previously reported for these algorithms. Specifically, the algorithms are examined in numerical experiments on text and image classification and are shown to perform better with constant learning rates than with diminishing learning rates.
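A minimal sketch of the setting the abstract describes: Adam run on a toy nonconvex stochastic objective with a constant learning rate versus a diminishing (1/sqrt(t)) schedule, using the gradient norm as a stationarity proxy. The test function, noise level, and all hyperparameter values are illustrative assumptions, not the paper's experimental setup.

```python
# Sketch only: Adam with constant vs. diminishing learning rates on a
# noisy nonconvex objective. Hyperparameters and the objective are assumed.
import numpy as np

def noisy_grad(x, rng):
    # Gradient of the nonconvex function f(x) = sum(x^2 - cos(2*pi*x)),
    # perturbed with Gaussian noise to mimic stochastic mini-batch gradients.
    g = 2.0 * x + 2.0 * np.pi * np.sin(2.0 * np.pi * x)
    return g + rng.normal(scale=0.1, size=x.shape)

def adam(lr_schedule, steps=2000, beta1=0.9, beta2=0.999, eps=1e-8, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.uniform(-2.0, 2.0, size=10)        # initial point
    m = np.zeros_like(x)                        # first-moment estimate
    v = np.zeros_like(x)                        # second-moment estimate
    for t in range(1, steps + 1):
        g = noisy_grad(x, rng)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)            # bias correction
        v_hat = v / (1 - beta2 ** t)
        x -= lr_schedule(t) * m_hat / (np.sqrt(v_hat) + eps)
    return np.linalg.norm(noisy_grad(x, rng))   # stationarity proxy

print("constant lr   :", adam(lambda t: 1e-2))
print("diminishing lr:", adam(lambda t: 1e-2 / np.sqrt(t)))
```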
This paper presents a unified algorithmic framework for nonconvex stochastic optimization, which is needed to train deep neural networks. The unified algorithm includes the existing adaptive-learning-rate optimization algorithms, such as Adaptive Moment Estimation (Adam), Adaptive Mean Square Gradient (AMSGrad), Adam with weighted gradient and dynamic bound of learning rate (GWDC), AMSGrad with weighted gradient and dynamic bound of learning rate (AMSGWDC), and adapting step sizes by the belief in observed gradients (AdaBelief). The paper also gives convergence analyses of the unified algorithm for constant and diminishing learning rates. When using a constant learning rate, the algorithm can approximate a stationary point of a nonconvex stochastic optimization problem; when using a diminishing learning rate, it converges to a stationary point of the problem. Hence, the analyses show that the existing adaptive-learning-rate optimization algorithms can, in theory, be applied to nonconvex stochastic optimization in deep neural networks. Additionally, the paper provides numerical results showing that the unified algorithm can train deep neural networks in practice. Moreover, it provides numerical comparisons, on unconstrained minimization of benchmark functions, between the unified algorithm and certain heuristic intelligent optimization algorithms. These comparisons show that a teaching-learning-based optimization algorithm and the unified algorithm perform well.
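A rough sketch of how a single adaptive-gradient update can specialize to several of the algorithms the abstract lists (Adam, AMSGrad, AdaBelief) by swapping the second-moment rule. This is not the paper's unified framework; in particular, the weighted-gradient and dynamic-bound mechanisms of GWDC and AMSGWDC are omitted, and all hyperparameter values are illustrative.

```python
# Sketch only: one update of a generic adaptive-learning-rate method whose
# "variant" argument selects Adam-, AMSGrad-, or AdaBelief-style behavior.
import numpy as np

def unified_step(x, g, state, variant="adam",
                 lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    m, v, v_max, t = state
    t += 1
    m = beta1 * m + (1 - beta1) * g
    if variant == "adabelief":
        # AdaBelief tracks the variance of (g - m), the "belief" in the gradient.
        v = beta2 * v + (1 - beta2) * (g - m) ** 2
    else:
        v = beta2 * v + (1 - beta2) * g ** 2
    if variant == "amsgrad":
        # AMSGrad keeps the running maximum of v so the effective step never grows.
        v_max = np.maximum(v_max, v)
        denom = np.sqrt(v_max) + eps
    else:
        denom = np.sqrt(v / (1 - beta2 ** t)) + eps
    m_hat = m / (1 - beta1 ** t)                 # bias-corrected first moment
    x = x - lr * m_hat / denom
    return x, (m, v, v_max, t)

# Usage: one step on a dummy gradient.
x = np.zeros(3)
state = (np.zeros(3), np.zeros(3), np.zeros(3), 0)
x, state = unified_step(x, np.array([0.5, -1.0, 2.0]), state, variant="amsgrad")
print(x)
```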