Tianhao Qie, Xinan Zhang, Chaoqun Xiang, Herbert Ho Ching Iu, Tyrone Fernando
{"title":"基于在线强化学习的新型三电平中性点箝位直流/交流逆变器线性二次调节器","authors":"Tianhao Qie, Xinan Zhang, Chaoqun Xiang, Herbert Ho Ching Iu, Tyrone Fernando","doi":"10.1049/enc2.12132","DOIUrl":null,"url":null,"abstract":"<p>This article proposes a novel online reinforcement learning-based linear quadratic regulator for the three-level neutral-point clamped DC/AC voltage source inverter. The proposed controller employs online updated fixed-weight recurrent neural network (NN) and policy iteration to dynamically adjust the optimal control gains based on real-time measurements without any knowledge of the system model or offline pre-training. Moreover, it produces a constant switching frequency with low current harmonics. Compared to the existing control methods, it provides superior control performance, guaranteed control stability, and simplified NN design. Experimental results are presented to verify the effectiveness of the proposed control method.</p>","PeriodicalId":100467,"journal":{"name":"Energy Conversion and Economics","volume":"5 5","pages":"281-292"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/enc2.12132","citationCount":"0","resultStr":"{\"title\":\"A novel online reinforcement learning-based linear quadratic regulator for three-level neutral-point clamped DC/AC inverter\",\"authors\":\"Tianhao Qie, Xinan Zhang, Chaoqun Xiang, Herbert Ho Ching Iu, Tyrone Fernando\",\"doi\":\"10.1049/enc2.12132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This article proposes a novel online reinforcement learning-based linear quadratic regulator for the three-level neutral-point clamped DC/AC voltage source inverter. The proposed controller employs online updated fixed-weight recurrent neural network (NN) and policy iteration to dynamically adjust the optimal control gains based on real-time measurements without any knowledge of the system model or offline pre-training. Moreover, it produces a constant switching frequency with low current harmonics. Compared to the existing control methods, it provides superior control performance, guaranteed control stability, and simplified NN design. Experimental results are presented to verify the effectiveness of the proposed control method.</p>\",\"PeriodicalId\":100467,\"journal\":{\"name\":\"Energy Conversion and Economics\",\"volume\":\"5 5\",\"pages\":\"281-292\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/enc2.12132\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy Conversion and Economics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/enc2.12132\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Conversion and Economics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/enc2.12132","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文为三电平中性点箝位直流/交流电压源逆变器提出了一种新颖的基于在线强化学习的线性二次调节器。该控制器采用在线更新的固定权重递归神经网络(NN)和策略迭代,根据实时测量结果动态调整最佳控制增益,而无需了解系统模型或进行离线预训练。此外,它还能产生恒定的开关频率和较低的电流谐波。与现有的控制方法相比,它的控制性能更优越,控制稳定性更有保障,而且简化了 NN 的设计。实验结果验证了所提控制方法的有效性。
A novel online reinforcement learning-based linear quadratic regulator for three-level neutral-point clamped DC/AC inverter
This article proposes a novel online reinforcement learning-based linear quadratic regulator for the three-level neutral-point clamped DC/AC voltage source inverter. The proposed controller employs online updated fixed-weight recurrent neural network (NN) and policy iteration to dynamically adjust the optimal control gains based on real-time measurements without any knowledge of the system model or offline pre-training. Moreover, it produces a constant switching frequency with low current harmonics. Compared to the existing control methods, it provides superior control performance, guaranteed control stability, and simplified NN design. Experimental results are presented to verify the effectiveness of the proposed control method.