Learning the optimal power flow: Environment design matters

Energy and AI, Volume 17, Article 100410 (IF 9.6; JCR Q1: Computer Science, Artificial Intelligence) | Pub Date: 2024-09-01 | Epub Date: 2024-08-13 | DOI: 10.1016/j.egyai.2024.100410

Thomas Wolgast, Astrid Nieße


Citations: 0

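Abstract

To solve the optimal power flow (OPF) problem, reinforcement learning (RL) emerges as a promising new approach. However, the RL-OPF literature is strongly divided regarding the exact formulation of the OPF problem as an RL environment. In this work, we collect and implement diverse environment design decisions from the literature regarding training data, observation space, episode definition, and reward function choice. In an experimental analysis, we show the significant impact of these environment design options on RL-OPF training performance. Further, we derive some first recommendations regarding the choice of these design decisions. The created environment framework is fully open-source and can serve as a benchmark for future research in the RL-OPF field.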
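To make the four design axes named in the abstract concrete, the following is a minimal sketch of how they could appear as switches on a toy environment. It is not the authors' open-source framework: the class `ToyOpfEnv`, the single-bus dispatch problem, the cost coefficients, the load samples, and the two reward variants labeled "summation" and "replacement" are all hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class ToyOpfEnv:
    """Hypothetical single-bus dispatch toy problem (not the paper's framework)."""
    data_mode: str = "uniform"       # training data: "uniform" sampling or fixed "dataset"
    obs_mode: str = "load"           # observation space: "load" or "load+limits"
    steps_per_episode: int = 1       # episode definition: 1-step or n-step
    reward_mode: str = "summation"   # reward function: "summation" or "replacement"
    penalty_factor: float = 10.0     # weight of the constraint violation
    cost_bound: float = 7.0          # assumed upper bound on the cost of any action
    dataset: tuple = (0.6, 0.9, 1.2, 1.5)  # made-up load samples (MW)
    p_max: float = 1.0               # limit of the controllable generator (MW)
    slack_max: float = 0.8           # limit of the slack exchange (MW), the constraint
    seed: int = 0

    def __post_init__(self):
        self._rng = random.Random(self.seed)
        self._load = 0.0
        self._t = 0

    def _sample_load(self):
        # Training data choice: fixed samples vs. uniform random sampling.
        if self.data_mode == "dataset":
            return self._rng.choice(self.dataset)
        return self._rng.uniform(0.5, 1.6)

    def _observe(self):
        # Observation space choice: only the load, or the load plus constant limits.
        if self.obs_mode == "load+limits":
            return [self._load, self.p_max, self.slack_max]
        return [self._load]

    def reset(self):
        self._t = 0
        self._load = self._sample_load()
        return self._observe()

    def step(self, action):
        # The action in [0, 1] scales the generator setpoint; the slack covers the rest.
        p_gen = min(max(action, 0.0), 1.0) * self.p_max
        slack = self._load - p_gen
        cost = 2.0 * p_gen ** 2 + 3.0 * abs(slack)
        violation = max(0.0, abs(slack) - self.slack_max)

        if self.reward_mode == "replacement" and violation > 0.0:
            # Infeasible: ignore the cost, return the penalty shifted below any feasible reward.
            reward = -self.cost_bound - self.penalty_factor * violation
        else:
            # Summation: negative cost plus weighted penalty (zero when feasible).
            reward = -cost - self.penalty_factor * violation

        # Episode definition choice: the load stays fixed until the episode ends.
        self._t += 1
        done = self._t >= self.steps_per_episode
        return self._observe(), reward, done


if __name__ == "__main__":
    env = ToyOpfEnv(data_mode="dataset", reward_mode="replacement", steps_per_episode=3)
    obs, done = env.reset(), False
    while not done:
        obs, reward, done = env.step(0.5)
        print(obs, round(reward, 3))
```

Each constructor argument corresponds to one of the design decisions the abstract lists, so comparing two configurations amounts to training the same agent on two instances that differ in a single argument.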

Source journal