Coordinated Reactive Power Optimization for Transmission and Distribution System With Gross Prediction Errors: A Modified Belief Markov Decision Process-Based Reinforcement Learning Methodology
{"title":"Coordinated Reactive Power Optimization for Transmission and Distribution System With Gross Prediction Errors: A Modified Belief Markov Decision Process-Based Reinforcement Learning Methodology","authors":"Yaru Gu;Xueliang Huang","doi":"10.1109/TPWRS.2025.3528428","DOIUrl":null,"url":null,"abstract":"Given the significant uncertainty of distributed generations (DGs) in the transmission and distribution (T&D) system, we propose a novel Modified Belief Markov Decision Process-based (MBMDP-based) Reinforcement Learning scheme (namely, MBMRL) for Day-ahead Coordinated Reactive Power Optimization Problem (DCRPOP) with gross prediction errors. Firstly, we characterize DCRPOP as a Partially Observable Markov Decision Process (POMDP) model embedded with a belief state, which utilizes the probability distribution of the observed state with errors to portray the precise state. Secondly, the POMDP model is transformed into the MBMDP model by introducing the misestimated belief state probability vector. A misestimated belief state probability vector is incorporated into the belief state update process to enhance the confidence level in circumstances of a significant data discrepancy. Then, the MBMDP block with a high confidence level for the precise state is inputted into the underlying network architecture of the multi-agent actor-attention-critic algorithm, assisting agents in independently capturing features and outputting optimal decision-making actions even with significant data errors. Case studies are conducted in two T&D systems with different scales. The training dataset is constructed based on a real historical database from Suzhou, China. Simulation results validate the superior performance and scalability of the proposed methodology.","PeriodicalId":13373,"journal":{"name":"IEEE Transactions on Power Systems","volume":"40 5","pages":"3619-3631"},"PeriodicalIF":7.2000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Power Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10839083/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Given the significant uncertainty of distributed generations (DGs) in the transmission and distribution (T&D) system, we propose a novel Modified Belief Markov Decision Process-based (MBMDP-based) Reinforcement Learning scheme (namely, MBMRL) for Day-ahead Coordinated Reactive Power Optimization Problem (DCRPOP) with gross prediction errors. Firstly, we characterize DCRPOP as a Partially Observable Markov Decision Process (POMDP) model embedded with a belief state, which utilizes the probability distribution of the observed state with errors to portray the precise state. Secondly, the POMDP model is transformed into the MBMDP model by introducing the misestimated belief state probability vector. A misestimated belief state probability vector is incorporated into the belief state update process to enhance the confidence level in circumstances of a significant data discrepancy. Then, the MBMDP block with a high confidence level for the precise state is inputted into the underlying network architecture of the multi-agent actor-attention-critic algorithm, assisting agents in independently capturing features and outputting optimal decision-making actions even with significant data errors. Case studies are conducted in two T&D systems with different scales. The training dataset is constructed based on a real historical database from Suzhou, China. Simulation results validate the superior performance and scalability of the proposed methodology.
期刊介绍:
The scope of IEEE Transactions on Power Systems covers the education, analysis, operation, planning, and economics of electric generation, transmission, and distribution systems for general industrial, commercial, public, and domestic consumption, including the interaction with multi-energy carriers. The focus of this transactions is the power system from a systems viewpoint instead of components of the system. It has five (5) key areas within its scope with several technical topics within each area. These areas are: (1) Power Engineering Education, (2) Power System Analysis, Computing, and Economics, (3) Power System Dynamic Performance, (4) Power System Operations, and (5) Power System Planning and Implementation.