A. Tsourdos, Ir. Adhi Dharma Permana, Dew Budiarti, Hyo-Sang Shin, Chang-hun Lee
Title: Developing Flight Control Policy Using Deep Deterministic Policy Gradient
Published in: 2019 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology (ICARES)
Publication date: 2019-10-01
DOI: 10.1109/ICARES.2019.8914343 (https://doi.org/10.1109/ICARES.2019.8914343)
Citations: 4
Abstract
Developing a flight control system for a 6-degree-of-freedom aircraft remains a considerable task that requires time and effort to gather all the necessary data. This paper proposes a policy based on reinforcement learning with Deep Deterministic Policy Gradient (DDPG) and presents its application to Unmanned Aerial System (UAS) control. Previous research has reported some difficulty in training a DDPG learning agent for a system with multiple agents. A learning strategy is introduced that implicitly guides the learning agent to use all three control surfaces while still producing a converging policy. The DDPG learning agent is trained over several training sets to generate the policy best suited to controlling the aircraft. The final policy resulting from the training procedure is then extracted and tested. The results show that DDPG can be used to develop a flight control policy.
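The abstract does not give implementation details, so the following is only a minimal sketch of two generic DDPG ingredients: a deterministic actor that maps a state to three bounded control-surface commands, and the soft (Polyak) update of a target network. The state size, the linear actor, the noise level, the update rate `TAU`, and all function names are assumptions made for illustration, not the authors' setup. The critic and the gradient steps are omitted.

```python
import math
import random

STATE_DIM = 4    # hypothetical size of the observed state vector
ACTION_DIM = 3   # three control surfaces, as stated in the abstract
TAU = 0.005      # assumed soft-update rate for the target network


def make_actor(rng):
    """Build a small random weight matrix (ACTION_DIM x STATE_DIM)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(STATE_DIM)]
            for _ in range(ACTION_DIM)]


def act(weights, state, noise_std=0.0, rng=None):
    """Deterministic policy: tanh bounds each command to [-1, 1].

    Gaussian exploration noise is added during training only,
    then the result is clipped back into the valid range.
    """
    action = []
    for row in weights:
        command = math.tanh(sum(w * s for w, s in zip(row, state)))
        if noise_std > 0.0 and rng is not None:
            command += rng.gauss(0.0, noise_std)
        action.append(max(-1.0, min(1.0, command)))
    return action


def soft_update(target, online, tau=TAU):
    """Polyak averaging: target <- tau * online + (1 - tau) * target."""
    return [[tau * o + (1.0 - tau) * t for o, t in zip(o_row, t_row)]
            for o_row, t_row in zip(online, target)]


rng = random.Random(0)
actor = make_actor(rng)
target_actor = [row[:] for row in actor]   # target starts as a copy

state = [0.1, -0.2, 0.05, 0.0]             # made-up state for illustration
greedy_command = act(actor, state)         # evaluation: no noise
noisy_command = act(actor, state, noise_std=0.2, rng=rng)  # training
target_actor = soft_update(target_actor, actor)
```

Because a single actor outputs all three commands jointly, a training strategy like the one the abstract describes would act on how this one policy is rewarded or explored, not on separate controllers per surface.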