加速梯度下降的统一设计

Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications Pub Date : 2019-11-25 DOI:10.1115/detc2019-97624

Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen

{"title":"加速梯度下降的统一设计","authors":"Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen","doi":"10.1115/detc2019-97624","DOIUrl":null,"url":null,"abstract":"\n Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.","PeriodicalId":166402,"journal":{"name":"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications","volume":"13 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"On the Unified Design of Accelerated Gradient Descent\",\"authors\":\"Yuquan Chen, Yiheng Wei, Yong Wang, Yangquan Chen\",\"doi\":\"10.1115/detc2019-97624\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.\",\"PeriodicalId\":166402,\"journal\":{\"name\":\"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications\",\"volume\":\"13 8\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2019-97624\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2019-97624","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

如今，不同类型的问题，如建模、最优控制和机器学习，都可以表述为优化问题。梯度下降法是解决这类问题最常用的方法，为了提高性能，人们设计了许多加速梯度下降法。本文将从系统的角度对基本梯度下降、动量梯度下降和Nesterov加速梯度下降进行分析，发现它们都可以表示为跟踪极值点的反馈控制问题。在此基础上，给出了考虑高阶传递函数的统一梯度下降设计方法。进一步，作为推广，考虑了分数阶积分器和一般分数阶传递函数，从而得到分数阶梯度下降。由于分数阶系统的无限维特性，利用数值拉普拉斯逆变换和Matlab命令stmcb()实现分数阶梯度下降的有限阶实现。除了简化了设计程序外，仿真结果还表明分数阶梯度下降法的收敛速度对步长具有更强的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

On the Unified Design of Accelerated Gradient Descent

Nowadays, different kinds of problems such as modeling, optimal control, and machine learning can be formulated as an optimization problem. Gradient descent is the most popular method to solve such problem and many accelerated gradient descents have been designed to improve the performance. In this paper, we will analyze the basic gradient descent, momentum gradient descent, and Nesterov accelerated gradient descent from the system perspective and it is found that all of them can be formulated as a feedback control problem for tracking an extreme point. On this basis, a unified gradient descent design procedure is given, where a high order transfer function is considered. Furthermore, as an extension, both a fractional integrator and a general fractional transfer function are considered, which resulting in the fractional gradient descent. Due to the infinite-dimensional property of fractional order systems, numerical inverse Laplace transform and Matlab command stmcb() are used to realize a finite-order implementation for the fractional gradient descent. Besides the simplified design procedure, it is found that the convergence rate of fractional gradient descent is more robust to the step size by simulating results.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Volume 9: 15th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications

自引率

0.00%

发文量