A Model-Free Optimal Control Method With Fixed Terminal States and Delay
Authors: Mi Zhou, Erik Verriest, Chaouki Abdallah
Published in: arXiv - EE - Systems and Control
Publication date: 2024-09-16
DOI: arxiv-2409.10722
Citations: 0
Abstract
Model-free algorithms have entered control systems research with the
emergence of reinforcement learning. However, reinforcement learning-based
methods face two practical challenges. First, learning by interacting with
the environment is highly complex. Second, constraints on the states
(boundary conditions) require additional care, since the state trajectory
is defined only implicitly by the inputs and the system dynamics. To address
these problems, this paper proposes a new model-free algorithm based on
basis functions, gradient estimation, and the Lagrange method. Several
examples with state-dependent switches and time delays demonstrate the
favorable performance of the proposed algorithm.
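
To make the three ingredients named in the abstract concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a toy scalar plant x' = -x + u that the optimizer never inspects, a shifted Legendre basis for the input, central finite differences for gradient estimation, and an augmented-Lagrangian multiplier update for the fixed terminal state. All names (`rollout`, `estimate_gradient`, `solve`) and all numerical settings are hypothetical choices made for illustration.

```python
import numpy as np

T, N = 1.0, 50                      # horizon and number of Euler steps (illustrative)
dt = T / N
t = np.arange(N) * dt
# Assumed input basis: shifted Legendre polynomials of degree 0..2 on [0, 1].
Phi = np.stack([np.ones(N), 2 * t - 1, 6 * t**2 - 6 * t + 1], axis=1)

def rollout(c):
    """Black-box experiment: apply u = Phi @ c, return (terminal state, input energy).
    The toy plant x' = -x + u stands in for the unknown system."""
    u = Phi @ c
    x = 0.0
    for uk in u:
        x += dt * (-x + uk)         # forward Euler step of the hidden dynamics
    return x, dt * float(u @ u)

def aug_lagrangian(c, lam, rho, x_target):
    """Cost plus multiplier and penalty terms for the constraint x(T) = x_target."""
    x_T, cost = rollout(c)
    g = x_T - x_target
    return cost + lam * g + 0.5 * rho * g**2

def estimate_gradient(f, c, eps=1e-4):
    """Central finite-difference gradient, using only evaluations of f."""
    grad = np.zeros_like(c)
    for i in range(c.size):
        e = np.zeros_like(c)
        e[i] = eps
        grad[i] = (f(c + e) - f(c - e)) / (2 * eps)
    return grad

def solve(x_target=1.0, rho=5.0, step=0.2, outer=30, inner=100):
    c, lam = np.zeros(Phi.shape[1]), 0.0
    for _ in range(outer):
        f = lambda v: aug_lagrangian(v, lam, rho, x_target)
        for _ in range(inner):      # gradient descent on the basis coefficients
            c = c - step * estimate_gradient(f, c)
        lam += rho * (rollout(c)[0] - x_target)   # multiplier update
    return c, lam

c_opt, lam_opt = solve()
x_T, cost = rollout(c_opt)
print(f"terminal state {x_T:.6f}, input energy {cost:.4f}")
```

The optimizer touches the plant only through `rollout`, so the same loop could in principle be pointed at a simulator with switches or delays; the step size and penalty weight here were picked for this toy plant and would need retuning elsewhere.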