{"title":"通过强化学习实现波浪能转换器的无模型线性非因果最优控制","authors":"Siyuan Zhan;John V. Ringwood","doi":"10.1109/TCST.2024.3401863","DOIUrl":null,"url":null,"abstract":"This article introduces a novel reinforcement learning (RL) method for wave energy converters (WECs), which directly generates linear noncausal optimal control (LNOC) policies on continuous action space. Unlike other existing WEC RL algorithms looking at the problem mainly from a learning perspective, the proposed RL approach adopts a control-theoretic approach by delving into the underlying WEC energy maximization (EM) optimal control problem (OCP). This leads to control-informed decisions on choosing the RL state, as well as developing the RL structure. The proposed model-free LNOC (MF-LNOC) offers substantial advantages, including significantly improved performance due to the use of noncausal information, a simplified RL with linear actor and quadratic critic structures, and remarkable fast convergence speeds, achieved using less than 150 s of data points, for a benchmarked point absorber, which can be further shortened using the replay technique. This reduction in training time allows for controller reconfiguration in pace with sea changes. Demonstrative numerical simulations are presented to verify the efficacy of the proposed methods. The proposed MF-LNOC also shows robustness against wave prediction inaccuracies and changing sea conditions. The MF-LNOC methodology can be highly attractive for WEC developers who want to design an efficient and reliable controller for WECs but also hope to avoid the challenge of establishing a control-oriented model that can preserve high fidelity over a wide range of sea conditions.","PeriodicalId":13103,"journal":{"name":"IEEE Transactions on Control Systems Technology","volume":"32 6","pages":"2164-2177"},"PeriodicalIF":4.9000,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10541046","citationCount":"0","resultStr":"{\"title\":\"Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning\",\"authors\":\"Siyuan Zhan;John V. Ringwood\",\"doi\":\"10.1109/TCST.2024.3401863\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article introduces a novel reinforcement learning (RL) method for wave energy converters (WECs), which directly generates linear noncausal optimal control (LNOC) policies on continuous action space. Unlike other existing WEC RL algorithms looking at the problem mainly from a learning perspective, the proposed RL approach adopts a control-theoretic approach by delving into the underlying WEC energy maximization (EM) optimal control problem (OCP). This leads to control-informed decisions on choosing the RL state, as well as developing the RL structure. The proposed model-free LNOC (MF-LNOC) offers substantial advantages, including significantly improved performance due to the use of noncausal information, a simplified RL with linear actor and quadratic critic structures, and remarkable fast convergence speeds, achieved using less than 150 s of data points, for a benchmarked point absorber, which can be further shortened using the replay technique. This reduction in training time allows for controller reconfiguration in pace with sea changes. Demonstrative numerical simulations are presented to verify the efficacy of the proposed methods. The proposed MF-LNOC also shows robustness against wave prediction inaccuracies and changing sea conditions. The MF-LNOC methodology can be highly attractive for WEC developers who want to design an efficient and reliable controller for WECs but also hope to avoid the challenge of establishing a control-oriented model that can preserve high fidelity over a wide range of sea conditions.\",\"PeriodicalId\":13103,\"journal\":{\"name\":\"IEEE Transactions on Control Systems Technology\",\"volume\":\"32 6\",\"pages\":\"2164-2177\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10541046\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Control Systems Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10541046/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Control Systems Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10541046/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning
This article introduces a novel reinforcement learning (RL) method for wave energy converters (WECs), which directly generates linear noncausal optimal control (LNOC) policies on continuous action space. Unlike other existing WEC RL algorithms looking at the problem mainly from a learning perspective, the proposed RL approach adopts a control-theoretic approach by delving into the underlying WEC energy maximization (EM) optimal control problem (OCP). This leads to control-informed decisions on choosing the RL state, as well as developing the RL structure. The proposed model-free LNOC (MF-LNOC) offers substantial advantages, including significantly improved performance due to the use of noncausal information, a simplified RL with linear actor and quadratic critic structures, and remarkable fast convergence speeds, achieved using less than 150 s of data points, for a benchmarked point absorber, which can be further shortened using the replay technique. This reduction in training time allows for controller reconfiguration in pace with sea changes. Demonstrative numerical simulations are presented to verify the efficacy of the proposed methods. The proposed MF-LNOC also shows robustness against wave prediction inaccuracies and changing sea conditions. The MF-LNOC methodology can be highly attractive for WEC developers who want to design an efficient and reliable controller for WECs but also hope to avoid the challenge of establishing a control-oriented model that can preserve high fidelity over a wide range of sea conditions.
期刊介绍:
The IEEE Transactions on Control Systems Technology publishes high quality technical papers on technological advances in control engineering. The word technology is from the Greek technologia. The modern meaning is a scientific method to achieve a practical purpose. Control Systems Technology includes all aspects of control engineering needed to implement practical control systems, from analysis and design, through simulation and hardware. A primary purpose of the IEEE Transactions on Control Systems Technology is to have an archival publication which will bridge the gap between theory and practice. Papers are published in the IEEE Transactions on Control System Technology which disclose significant new knowledge, exploratory developments, or practical applications in all aspects of technology needed to implement control systems, from analysis and design through simulation, and hardware.