A Memory-Based Graph Reinforcement Learning Method for Critical Load Restoration With Uncertainties of Distributed Energy Resource

IF 9.8 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Smart Grid Pub Date : 2024-10-17 DOI:10.1109/TSG.2024.3482696
Bangji Fan;Xinghua Liu;Gaoxi Xiao;Yan Xu;Xiang Yang;Peng Wang
{"title":"A Memory-Based Graph Reinforcement Learning Method for Critical Load Restoration With Uncertainties of Distributed Energy Resource","authors":"Bangji Fan;Xinghua Liu;Gaoxi Xiao;Yan Xu;Xiang Yang;Peng Wang","doi":"10.1109/TSG.2024.3482696","DOIUrl":null,"url":null,"abstract":"The integration of distributed energy resources into distribution networks, marked by its inherent uncertainties, presents a substantial challenge for devising load restoration strategies. To tackle this challenge, we develop a memory-based graph reinforcement learning approach, designed to train the agent to acquire a critical load restoration strategy in a distribution network under uncertainties. Specifically, the restoration problem under uncertainties is formulated as a novel partially observable Markov decision process, where a multimodal feature-based observation space is proposed. This space includes graph-structured data of the environment and memory information of the agent. The graph-structured data contain potential features of the current observation, thus enhancing the observable domain, while the memory information incorporates temporal correlations between sample sequences to address the partial observability of the environment. Based on the proposed Markov process, we put forth a maximum entropy-based recurrent graph soft actor-critic algorithm to train the agent in partially observable environments through a recursive structure, where entropy regularization is utilized to facilitate a more extensive exploration of possibilities in a state space with high uncertainties. The performance of the proposed approach is validated by a comparative study versus existing results on the IEEE 123-bus system containing wind power and photovoltaic sources.","PeriodicalId":13331,"journal":{"name":"IEEE Transactions on Smart Grid","volume":"16 2","pages":"1706-1718"},"PeriodicalIF":9.8000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Smart Grid","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10720914/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The integration of distributed energy resources into distribution networks, marked by its inherent uncertainties, presents a substantial challenge for devising load restoration strategies. To tackle this challenge, we develop a memory-based graph reinforcement learning approach, designed to train the agent to acquire a critical load restoration strategy in a distribution network under uncertainties. Specifically, the restoration problem under uncertainties is formulated as a novel partially observable Markov decision process, where a multimodal feature-based observation space is proposed. This space includes graph-structured data of the environment and memory information of the agent. The graph-structured data contain potential features of the current observation, thus enhancing the observable domain, while the memory information incorporates temporal correlations between sample sequences to address the partial observability of the environment. Based on the proposed Markov process, we put forth a maximum entropy-based recurrent graph soft actor-critic algorithm to train the agent in partially observable environments through a recursive structure, where entropy regularization is utilized to facilitate a more extensive exploration of possibilities in a state space with high uncertainties. The performance of the proposed approach is validated by a comparative study versus existing results on the IEEE 123-bus system containing wind power and photovoltaic sources.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于记忆的图强化学习方法,用于在分布式能源资源不确定的情况下恢复关键负荷
分布式能源集成到配电网中,由于其固有的不确定性,对负荷恢复策略的设计提出了重大挑战。为了解决这一挑战,我们开发了一种基于记忆的图强化学习方法,旨在训练智能体在不确定的配电网中获得临界负荷恢复策略。具体而言,将不确定条件下的恢复问题表述为一种新的部分可观测马尔可夫决策过程,提出了基于多模态特征的观测空间。该空间包括环境的图结构数据和代理的内存信息。图结构数据包含当前观测的潜在特征,从而增强了可观察域,而记忆信息包含样本序列之间的时间相关性,以解决环境的部分可观察性。基于所提出的马尔可夫过程,我们提出了一种基于最大熵的递归图软actor-critic算法,通过递归结构在部分可观察环境中训练agent,其中利用熵正则化来促进在具有高不确定性的状态空间中更广泛地探索可能性。通过与现有的IEEE 123总线系统中包含风电和光伏源的对比研究,验证了所提出方法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Smart Grid
IEEE Transactions on Smart Grid ENGINEERING, ELECTRICAL & ELECTRONIC-
CiteScore
22.10
自引率
9.40%
发文量
526
审稿时长
6 months
期刊介绍: The IEEE Transactions on Smart Grid is a multidisciplinary journal that focuses on research and development in the field of smart grid technology. It covers various aspects of the smart grid, including energy networks, prosumers (consumers who also produce energy), electric transportation, distributed energy resources, and communications. The journal also addresses the integration of microgrids and active distribution networks with transmission systems. It publishes original research on smart grid theories and principles, including technologies and systems for demand response, Advance Metering Infrastructure, cyber-physical systems, multi-energy systems, transactive energy, data analytics, and electric vehicle integration. Additionally, the journal considers surveys of existing work on the smart grid that propose new perspectives on the history and future of intelligent and active grids.
期刊最新文献
Region-Driven Two-Layer Refined Scheduling for Multi-stack-integrated Alkaline Electrolyzer in Wind-Hydrogen System Bayesian-Optimized Deep Learning for Adaptive Real-Time FDIA Detection in Smart Grid Demand Response Systems Mixed Competitive—Cooperative Pricing for EV Charging Stations: A Multi-Agent Reinforcement Learning Approach with Heterogeneous Hierarchical Attention Distributed Adaptive Unified Fault-Tolerant Control Scheme for Multiple VSGs in Islanded Microgrids Hierarchical Distributed Control with Projected Gradient Descent for Real-Time Congestion Mitigation in MVDC Distribution Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1