A Memory-Based Graph Reinforcement Learning Method for Critical Load Restoration With Uncertainties of Distributed Energy Resource

IF 9.8 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Smart Grid Pub Date : 2024-10-17 DOI:10.1109/TSG.2024.3482696

Bangji Fan;Xinghua Liu;Gaoxi Xiao;Yan Xu;Xiang Yang;Peng Wang

{"title":"A Memory-Based Graph Reinforcement Learning Method for Critical Load Restoration With Uncertainties of Distributed Energy Resource","authors":"Bangji Fan;Xinghua Liu;Gaoxi Xiao;Yan Xu;Xiang Yang;Peng Wang","doi":"10.1109/TSG.2024.3482696","DOIUrl":null,"url":null,"abstract":"The integration of distributed energy resources into distribution networks, marked by its inherent uncertainties, presents a substantial challenge for devising load restoration strategies. To tackle this challenge, we develop a memory-based graph reinforcement learning approach, designed to train the agent to acquire a critical load restoration strategy in a distribution network under uncertainties. Specifically, the restoration problem under uncertainties is formulated as a novel partially observable Markov decision process, where a multimodal feature-based observation space is proposed. This space includes graph-structured data of the environment and memory information of the agent. The graph-structured data contain potential features of the current observation, thus enhancing the observable domain, while the memory information incorporates temporal correlations between sample sequences to address the partial observability of the environment. Based on the proposed Markov process, we put forth a maximum entropy-based recurrent graph soft actor-critic algorithm to train the agent in partially observable environments through a recursive structure, where entropy regularization is utilized to facilitate a more extensive exploration of possibilities in a state space with high uncertainties. The performance of the proposed approach is validated by a comparative study versus existing results on the IEEE 123-bus system containing wind power and photovoltaic sources.","PeriodicalId":13331,"journal":{"name":"IEEE Transactions on Smart Grid","volume":"16 2","pages":"1706-1718"},"PeriodicalIF":9.8000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Smart Grid","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10720914/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The integration of distributed energy resources into distribution networks, marked by its inherent uncertainties, presents a substantial challenge for devising load restoration strategies. To tackle this challenge, we develop a memory-based graph reinforcement learning approach, designed to train the agent to acquire a critical load restoration strategy in a distribution network under uncertainties. Specifically, the restoration problem under uncertainties is formulated as a novel partially observable Markov decision process, where a multimodal feature-based observation space is proposed. This space includes graph-structured data of the environment and memory information of the agent. The graph-structured data contain potential features of the current observation, thus enhancing the observable domain, while the memory information incorporates temporal correlations between sample sequences to address the partial observability of the environment. Based on the proposed Markov process, we put forth a maximum entropy-based recurrent graph soft actor-critic algorithm to train the agent in partially observable environments through a recursive structure, where entropy regularization is utilized to facilitate a more extensive exploration of possibilities in a state space with high uncertainties. The performance of the proposed approach is validated by a comparative study versus existing results on the IEEE 123-bus system containing wind power and photovoltaic sources.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种基于记忆的图强化学习方法，用于在分布式能源资源不确定的情况下恢复关键负荷

分布式能源集成到配电网中，由于其固有的不确定性，对负荷恢复策略的设计提出了重大挑战。为了解决这一挑战，我们开发了一种基于记忆的图强化学习方法，旨在训练智能体在不确定的配电网中获得临界负荷恢复策略。具体而言，将不确定条件下的恢复问题表述为一种新的部分可观测马尔可夫决策过程，提出了基于多模态特征的观测空间。该空间包括环境的图结构数据和代理的内存信息。图结构数据包含当前观测的潜在特征，从而增强了可观察域，而记忆信息包含样本序列之间的时间相关性，以解决环境的部分可观察性。基于所提出的马尔可夫过程，我们提出了一种基于最大熵的递归图软actor-critic算法，通过递归结构在部分可观察环境中训练agent，其中利用熵正则化来促进在具有高不确定性的状态空间中更广泛地探索可能性。通过与现有的IEEE 123总线系统中包含风电和光伏源的对比研究，验证了所提出方法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Smart Grid ENGINEERING, ELECTRICAL & ELECTRONIC-

CiteScore

22.10

自引率

9.40%

发文量

526

审稿时长

6 months

期刊介绍： The IEEE Transactions on Smart Grid is a multidisciplinary journal that focuses on research and development in the field of smart grid technology. It covers various aspects of the smart grid, including energy networks, prosumers (consumers who also produce energy), electric transportation, distributed energy resources, and communications. The journal also addresses the integration of microgrids and active distribution networks with transmission systems. It publishes original research on smart grid theories and principles, including technologies and systems for demand response, Advance Metering Infrastructure, cyber-physical systems, multi-energy systems, transactive energy, data analytics, and electric vehicle integration. Additionally, the journal considers surveys of existing work on the smart grid that propose new perspectives on the history and future of intelligent and active grids.