加速梯度下降的统一设计

Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen
{"title":"加速梯度下降的统一设计","authors":"Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen","doi":"10.1115/detc2019-97624","DOIUrl":null,"url":null,"abstract":"\n Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.","PeriodicalId":166402,"journal":{"name":"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications","volume":"13 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"On the Unified Design of Accelerated Gradient Descent\",\"authors\":\"Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen\",\"doi\":\"10.1115/detc2019-97624\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.\",\"PeriodicalId\":166402,\"journal\":{\"name\":\"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications\",\"volume\":\"13 8\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2019-97624\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2019-97624","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

如今,不同类型的问题,如建模、最优控制和机器学习,都可以表述为优化问题。梯度下降法是解决这类问题最常用的方法,为了提高性能,人们设计了许多加速梯度下降法。本文将从系统的角度对基本梯度下降、动量梯度下降和Nesterov加速梯度下降进行分析,发现它们都可以表示为跟踪极值点的反馈控制问题。在此基础上,给出了考虑高阶传递函数的统一梯度下降设计方法。进一步,作为推广,考虑了分数阶积分器和一般分数阶传递函数,从而得到分数阶梯度下降。由于分数阶系统的无限维特性,利用数值拉普拉斯逆变换和Matlab命令stmcb()实现分数阶梯度下降的有限阶实现。除了简化了设计程序外,仿真结果还表明分数阶梯度下降法的收敛速度对步长具有更强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
On the Unified Design of Accelerated Gradient Descent
Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Fractional-Order Extreme Learning Machine With Mittag-Leffler Distribution Heuristic Approach for Warehouse Resources and Production Planning Optimization: An Industry Case Study Electrode-Skin Impedance Component Estimation in the Time-Domain Chattering-Free Finite-Time Stability of a Class of Fractional-Order Nonlinear Systems Modeling, Simulation and Assessment of a Hybrid Electric Ferry: Case Study for Mid-Size Ferry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1