基于强化学习的高超声速飞行器最优跟踪控制:一种无模型方法

Xiaoxiang Hu, Kejun Dong, Teng-Chieh Yang, Bing Xiao
{"title":"基于强化学习的高超声速飞行器最优跟踪控制:一种无模型方法","authors":"Xiaoxiang Hu, Kejun Dong, Teng-Chieh Yang, Bing Xiao","doi":"10.1109/INDIN51773.2022.9976071","DOIUrl":null,"url":null,"abstract":"The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.","PeriodicalId":359190,"journal":{"name":"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach\",\"authors\":\"Xiaoxiang Hu, Kejun Dong, Teng-Chieh Yang, Bing Xiao\",\"doi\":\"10.1109/INDIN51773.2022.9976071\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.\",\"PeriodicalId\":359190,\"journal\":{\"name\":\"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN51773.2022.9976071\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN51773.2022.9976071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文讨论了高超声速飞行器的跟踪控制问题,并假设高超声速飞行器的非线性模型完全未知。由于缺乏先验知识,这一问题无疑具有挑战性,但由于难以获得HFV的确切模式,这一问题更接近现实。提出了一种基于强化学习(RL)的最优控制器用于HFV的跟踪控制。首先提出了一种基于模型的强化学习算法,然后在此基础上构造了无模型强化学习算法。为了放松环境条件,采用神经网络(NN)对批评家和行动者进行逼近,并推导出基于贪心策略的神经网络更新学习律。通过对HFV非线性模型的分析,验证了该控制策略的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach
The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Sentiment Analysis of Board Secretaries’ Q&R Data Offset Estimation Based on ARIMA-LSTM for Time Synchronization in Single Twisted Pair Ethernet Dynamic Task Offloading Approach for Task Delay Reduction in the IoT-enabled Fog Computing Systems Fuzzy PID Control for Multi-joint Robotic Arm Graph Attention Network for Financial Aspect-based Sentiment Classification with Contrastive Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1