Guillermo Puriel-Gil, Wen Yu, Juan Humberto Sossa Azuela
{"title":"Reinforcement Learning Compensation based PD Control for Inverted Pendulum","authors":"Guillermo Puriel-Gil, Wen Yu, Juan Humberto Sossa Azuela","doi":"10.1109/ICEEE.2018.8533946","DOIUrl":null,"url":null,"abstract":"In this paper, we present a Control Algorithm based on Reinforcement Learning for an inverted pendulum. By implementing the Q-Learning techniques in the PD control scheme, the pendulum is enabled to improve its online performance and adapt to changes in the parameters of the pendulum model. In a first step, Q-Learning is used so that the control can balance the pendulum towards its inverted vertical position; In a second step, we combine hybrid techniques of Q-Learning and PD control. With this combination, we can bring the pendulum to its inverted vertical position, regardless of the applied disturbance. Finally, the results of the simulation show the effectiveness of the proposed controller.","PeriodicalId":6924,"journal":{"name":"2018 15th International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEE.2018.8533946","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
In this paper, we present a Control Algorithm based on Reinforcement Learning for an inverted pendulum. By implementing the Q-Learning techniques in the PD control scheme, the pendulum is enabled to improve its online performance and adapt to changes in the parameters of the pendulum model. In a first step, Q-Learning is used so that the control can balance the pendulum towards its inverted vertical position; In a second step, we combine hybrid techniques of Q-Learning and PD control. With this combination, we can bring the pendulum to its inverted vertical position, regardless of the applied disturbance. Finally, the results of the simulation show the effectiveness of the proposed controller.