AdapSafe2: Prior-Free Safe-Certified Reinforcement Learning for Multi-Area Frequency Control

IF 7.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Power Systems Pub Date : 2024-10-21 DOI:10.1109/TPWRS.2024.3483994
Xu Wan;Mingyang Sun
{"title":"AdapSafe2: Prior-Free Safe-Certified Reinforcement Learning for Multi-Area Frequency Control","authors":"Xu Wan;Mingyang Sun","doi":"10.1109/TPWRS.2024.3483994","DOIUrl":null,"url":null,"abstract":"Safe Reinforcement Learning (RL) has been widely investigated to conduct power systems frequency control under high renewable energy resources penetration. Nevertheless, existing safe RL-based frequency control methods still face two fundamental challenges to achieving safety guarantees: (1) operating in non-stationary environments without the prior knowledge of the system parameters and (2) simultaneously satisfying high-dimensional and time-varying safety constraints in multi-area cases. To this end, this paper proposes a prior-free reinforcement learning-based frequency control method with guaranteed safety for multi-area power systems named AdapSafe2. To tackle Challenge (1), a meta-based environmental learning algorithm is developed to automatically capture and rapidly adapts to non-stationary system parameters without relying on a predefined nominal model. Furthermore, a meta-RL framework is established to achieve a self-adaptive frequency control strategy without prior knowledge. Moreover, for Challenge (2), a novel safety-critic network and a safe-certified compensator based on the control barrier function are designed to identify time-varying safety constraints. Leveraging risk assessments from the safety-critic network, the compensator performs dynamic safety compensations only for areas with risk, thereby enhancing the efficiency of solving under high-dimensional safety constraints. Numerical simulations conducted under 2-Area and 3-Area wind-aggregated low-inertia power systems demonstrate that the proposed AdapSafe2 can outperform the state-of-the-art approaches while effectively satisfying the dynamic safety constraints.","PeriodicalId":13373,"journal":{"name":"IEEE Transactions on Power Systems","volume":"40 3","pages":"2244-2257"},"PeriodicalIF":7.2000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Power Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10726595/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Safe Reinforcement Learning (RL) has been widely investigated to conduct power systems frequency control under high renewable energy resources penetration. Nevertheless, existing safe RL-based frequency control methods still face two fundamental challenges to achieving safety guarantees: (1) operating in non-stationary environments without the prior knowledge of the system parameters and (2) simultaneously satisfying high-dimensional and time-varying safety constraints in multi-area cases. To this end, this paper proposes a prior-free reinforcement learning-based frequency control method with guaranteed safety for multi-area power systems named AdapSafe2. To tackle Challenge (1), a meta-based environmental learning algorithm is developed to automatically capture and rapidly adapts to non-stationary system parameters without relying on a predefined nominal model. Furthermore, a meta-RL framework is established to achieve a self-adaptive frequency control strategy without prior knowledge. Moreover, for Challenge (2), a novel safety-critic network and a safe-certified compensator based on the control barrier function are designed to identify time-varying safety constraints. Leveraging risk assessments from the safety-critic network, the compensator performs dynamic safety compensations only for areas with risk, thereby enhancing the efficiency of solving under high-dimensional safety constraints. Numerical simulations conducted under 2-Area and 3-Area wind-aggregated low-inertia power systems demonstrate that the proposed AdapSafe2 can outperform the state-of-the-art approaches while effectively satisfying the dynamic safety constraints.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AdapSafe2:用于多区域频率控制的无先验安全认证强化学习
安全强化学习(RL)在高可再生能源渗透率下进行电力系统频率控制已得到广泛研究。然而,现有的基于rl的安全频率控制方法在实现安全保证方面仍然面临两个基本挑战:(1)在没有系统参数先验知识的非平稳环境中运行;(2)同时满足多区域情况下的高维时变安全约束。为此,本文提出了一种基于无先验强化学习的多区域电力系统安全变频控制方法AdapSafe2。为了解决挑战(1),开发了一种基于元的环境学习算法来自动捕获和快速适应非平稳系统参数,而不依赖于预定义的标称模型。在此基础上,建立了元强化学习框架,实现了无先验知识的自适应频率控制策略。此外,对于挑战(2),设计了一种新的安全批评网络和基于控制障碍函数的安全认证补偿器来识别时变安全约束。补偿器利用安全批判网络的风险评估,仅对存在风险的区域进行动态安全补偿,从而提高了高维安全约束下的求解效率。在2区和3区风聚集低惯性电力系统下进行的数值模拟表明,所提出的AdapSafe2方法在满足动态安全约束的同时,优于现有方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Power Systems
IEEE Transactions on Power Systems 工程技术-工程:电子与电气
CiteScore
15.80
自引率
7.60%
发文量
696
审稿时长
3 months
期刊介绍: The scope of IEEE Transactions on Power Systems covers the education, analysis, operation, planning, and economics of electric generation, transmission, and distribution systems for general industrial, commercial, public, and domestic consumption, including the interaction with multi-energy carriers. The focus of this transactions is the power system from a systems viewpoint instead of components of the system. It has five (5) key areas within its scope with several technical topics within each area. These areas are: (1) Power Engineering Education, (2) Power System Analysis, Computing, and Economics, (3) Power System Dynamic Performance, (4) Power System Operations, and (5) Power System Planning and Implementation.
期刊最新文献
A Robust and Fast Operational Risk Assessment Method for Composite Power Systems with High Wind Power Penetration Asynchronous Distributed Scheduling of Active Distribution Network and Thermostatically Controlled Loads With MPC-Based Aggregation Sequential Convex Approximation Approach for Chance-Constrained AC Optimal Power Flow under Arbitrary Random Distribution with Mild Conditions Customising Electricity Contracts at Scale with Large Language Models Small-Signal Stability and Inertia Constrained Optimal Configuration Method of Grid-Forming Energy Storage Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1