Rocket Powered Landing Guidance Using Proximal Policy Optimization

Yifan Chen, Lin Ma
{"title":"Rocket Powered Landing Guidance Using Proximal Policy Optimization","authors":"Yifan Chen, Lin Ma","doi":"10.1145/3351917.3351935","DOIUrl":null,"url":null,"abstract":"Rocket recovery requires advanced guidance algorithms to achieve pinpoint landing while satisfying multiple stringent constraints. In this paper, we design a guidance law based on reinforcement learning for the powered landing phase of vertical take-off and vertical landing reusable rocket. To this end, we apply the proximal policy optimization algorithm to develop a control policy that drives the rocket to land at a specified location. The policy parameterized using a neural network is updated by performing gradient ascent algorithm. After abundant amount of training, the learned policy is evaluated in a simulation of the rocket powered landing scenario considering aerodynamic drag, and the result demonstrates the ability of the proposed guidance method to successfully land the rocket from a random initial state.","PeriodicalId":367885,"journal":{"name":"Proceedings of the 2019 4th International Conference on Automation, Control and Robotics Engineering","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 4th International Conference on Automation, Control and Robotics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3351917.3351935","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Rocket recovery requires advanced guidance algorithms to achieve pinpoint landing while satisfying multiple stringent constraints. In this paper, we design a guidance law based on reinforcement learning for the powered landing phase of vertical take-off and vertical landing reusable rocket. To this end, we apply the proximal policy optimization algorithm to develop a control policy that drives the rocket to land at a specified location. The policy parameterized using a neural network is updated by performing gradient ascent algorithm. After abundant amount of training, the learned policy is evaluated in a simulation of the rocket powered landing scenario considering aerodynamic drag, and the result demonstrates the ability of the proposed guidance method to successfully land the rocket from a random initial state.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于近端策略优化的火箭动力着陆制导
火箭回收需要先进的制导算法来实现精确着陆,同时满足多个严格的约束条件。针对可重复使用火箭垂直起降动力着陆阶段,设计了一种基于强化学习的制导律。为此,我们应用近端策略优化算法制定控制策略,驱动火箭在指定位置着陆。采用神经网络参数化策略,通过梯度上升算法对策略进行更新。经过大量的训练,在考虑气动阻力的火箭动力着陆场景的仿真中对学习策略进行了评估,结果证明了所提出的制导方法能够从随机初始状态成功着陆火箭。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Arm Shape of Manipulator Based on Trajectory and End Posture Type Synthesis of Multi-mode Mobile Parallel Mechanism Based on Planar 4R Single-loop Kinematic Chains Stabilization Control of Underactuated Unmanned Surface Vessels Based on Backstepping and Lyapunov Direct Method Design and Test of Multi-Information Storage Testing System in Complex Physical Environment Vibration Analysis of the Bogie of Medium-speed Maglev Trains with Mid-mounted Linear Motor
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1