{"title":"基于积分强化学习的线性二次跟踪器","authors":"On Park, Hyo-Sang Shin, A. Tsourdos","doi":"10.1109/REDUAS47371.2019.8999679","DOIUrl":null,"url":null,"abstract":"This paper describes a Reinforcement Learning (RL) application using Linear Quadratic Regulator (LQR) based tracking controller, which is augmented with a tracking error term. In order to deal with the steady-state errors, Linear Quadratic Tracker with Integrator (LQTI) is designed by adding an integration term of the tracking error in the state variable. Based on the LQTI, an online learning using the Integral Reinforcement Learning (IRL) is applied for the tracking problem to find the optimal control on the partially unknown continuous-time systems by regulating the augmented state variable. The optimal control solution and the performance of the method are verified through numerical simulation on two applications.","PeriodicalId":351115,"journal":{"name":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Linear Quadratic Tracker with Integrator using Integral Reinforcement Learning\",\"authors\":\"On Park, Hyo-Sang Shin, A. Tsourdos\",\"doi\":\"10.1109/REDUAS47371.2019.8999679\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a Reinforcement Learning (RL) application using Linear Quadratic Regulator (LQR) based tracking controller, which is augmented with a tracking error term. In order to deal with the steady-state errors, Linear Quadratic Tracker with Integrator (LQTI) is designed by adding an integration term of the tracking error in the state variable. Based on the LQTI, an online learning using the Integral Reinforcement Learning (IRL) is applied for the tracking problem to find the optimal control on the partially unknown continuous-time systems by regulating the augmented state variable. The optimal control solution and the performance of the method are verified through numerical simulation on two applications.\",\"PeriodicalId\":351115,\"journal\":{\"name\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/REDUAS47371.2019.8999679\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REDUAS47371.2019.8999679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Linear Quadratic Tracker with Integrator using Integral Reinforcement Learning
This paper describes a Reinforcement Learning (RL) application using Linear Quadratic Regulator (LQR) based tracking controller, which is augmented with a tracking error term. In order to deal with the steady-state errors, Linear Quadratic Tracker with Integrator (LQTI) is designed by adding an integration term of the tracking error in the state variable. Based on the LQTI, an online learning using the Integral Reinforcement Learning (IRL) is applied for the tracking problem to find the optimal control on the partially unknown continuous-time systems by regulating the augmented state variable. The optimal control solution and the performance of the method are verified through numerical simulation on two applications.