{"title":"Deep Deterministic Policy Gradient for Magnetic Levitation Control","authors":"S. Wongsa, Nitis Kowkasai","doi":"10.1109/ECTI-CON49241.2020.9158096","DOIUrl":null,"url":null,"abstract":"MagLev ball motion control is a challenging problem because of its nonlinear and unstable behaviour. It also requires to have a large operating range. In this work, we have extended the current success of a deep reinforcement learning algorithm in continuous control using the Deep Deterministic Policy Gradient (DDPG), to magnetic levitation (MagLev) control. In reinforcement learning, the reward function is very critical for learning the optimal policy. We propose a novel and simple reward function for DDPG and evaluate our method on MagLev control. Our DDPG-based controller shows the capability to learn from the system with comparable performance to the previously proposed reward function. It can also find the solution to control the ball at all positions, including the ones that were not shown during the learning stage.","PeriodicalId":371552,"journal":{"name":"2020 17th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 17th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECTI-CON49241.2020.9158096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
MagLev ball motion control is a challenging problem because of its nonlinear and unstable behaviour. It also requires to have a large operating range. In this work, we have extended the current success of a deep reinforcement learning algorithm in continuous control using the Deep Deterministic Policy Gradient (DDPG), to magnetic levitation (MagLev) control. In reinforcement learning, the reward function is very critical for learning the optimal policy. We propose a novel and simple reward function for DDPG and evaluate our method on MagLev control. Our DDPG-based controller shows the capability to learn from the system with comparable performance to the previously proposed reward function. It can also find the solution to control the ball at all positions, including the ones that were not shown during the learning stage.