{"title":"基于强化学习的高超声速飞行器最优跟踪控制:一种无模型方法","authors":"Xiaoxiang Hu, Kejun Dong, Teng-Chieh Yang, Bing Xiao","doi":"10.1109/INDIN51773.2022.9976071","DOIUrl":null,"url":null,"abstract":"The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.","PeriodicalId":359190,"journal":{"name":"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach\",\"authors\":\"Xiaoxiang Hu, Kejun Dong, Teng-Chieh Yang, Bing Xiao\",\"doi\":\"10.1109/INDIN51773.2022.9976071\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.\",\"PeriodicalId\":359190,\"journal\":{\"name\":\"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN51773.2022.9976071\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 20th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN51773.2022.9976071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach
The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.