Florian Jaensch, A. Csiszar, Annika Kienzlen, A. Verl
{"title":"Reinforcement Learning of Material Flow Control Logic Using Hardware-in-the-Loop Simulation","authors":"Florian Jaensch, A. Csiszar, Annika Kienzlen, A. Verl","doi":"10.1109/AI4I.2018.8665712","DOIUrl":null,"url":null,"abstract":"In this paper the concept of reinforcement learning agent is presented, which can deduce the correct control policy of a plant by acting in its digital twin (the HiL simulation). This way the agent substitutes a real control system. By using reinforcement learning methods, a proof of concept application is presented for a simplistic material flow system, with the same type of access to the digital twin which a PLC controller-hardware would have. With the presented approach the agent is able to find the correct control policy.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AI4I.2018.8665712","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
In this paper the concept of reinforcement learning agent is presented, which can deduce the correct control policy of a plant by acting in its digital twin (the HiL simulation). This way the agent substitutes a real control system. By using reinforcement learning methods, a proof of concept application is presented for a simplistic material flow system, with the same type of access to the digital twin which a PLC controller-hardware would have. With the presented approach the agent is able to find the correct control policy.