Guojun Nan, Zixiang Shen, Haibo Du, Lanlin Yu, Wenwu Zhu
{"title":"基于 D3QN-PER 算法的智能输电线路规划方法","authors":"Guojun Nan, Zixiang Shen, Haibo Du, Lanlin Yu, Wenwu Zhu","doi":"10.1049/cth2.12689","DOIUrl":null,"url":null,"abstract":"The planning of power transmission line projects encompasses vast and complex geographical terrains. To address the complexity of transmission line planning and achieve lower line costs, this study proposes a novel intelligent line planning method. For the first time, it combines the Dueling Double Deep Q Network (D3QN) with the prioritized experience replay (PER) mechanism. First, correlate the reward function with metrics such as line length, number of corner points, and geographical environmental data, which are pertinent to the construction costs of power transmission line. Second, the D3QN algorithm is formulated by integrating Double DQN and Dueling DQN. The network's input information is divided into two components during training, aligning with the characteristics of power transmission line planning projects. Finally, the convergence efficiency of the algorithm is improved by using the PER mechanism for the problem of cost difference due to the different number of corner points in the planning path. In order to test the feasibility of the algorithm, we conducted experiments using real maps. Compared with the traditional ant colony optimization (ACO) algorithm, the D3QN‐PER deep reinforcement learning algorithm reduces the line length by more than 4% and the number of corner points by more than 60%.","PeriodicalId":502998,"journal":{"name":"IET Control Theory & Applications","volume":" 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Smart line planning method for power transmission based on D3QN‐PER algorithm\",\"authors\":\"Guojun Nan, Zixiang Shen, Haibo Du, Lanlin Yu, Wenwu Zhu\",\"doi\":\"10.1049/cth2.12689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The planning of power transmission line projects encompasses vast and complex geographical terrains. To address the complexity of transmission line planning and achieve lower line costs, this study proposes a novel intelligent line planning method. For the first time, it combines the Dueling Double Deep Q Network (D3QN) with the prioritized experience replay (PER) mechanism. First, correlate the reward function with metrics such as line length, number of corner points, and geographical environmental data, which are pertinent to the construction costs of power transmission line. Second, the D3QN algorithm is formulated by integrating Double DQN and Dueling DQN. The network's input information is divided into two components during training, aligning with the characteristics of power transmission line planning projects. Finally, the convergence efficiency of the algorithm is improved by using the PER mechanism for the problem of cost difference due to the different number of corner points in the planning path. In order to test the feasibility of the algorithm, we conducted experiments using real maps. Compared with the traditional ant colony optimization (ACO) algorithm, the D3QN‐PER deep reinforcement learning algorithm reduces the line length by more than 4% and the number of corner points by more than 60%.\",\"PeriodicalId\":502998,\"journal\":{\"name\":\"IET Control Theory & Applications\",\"volume\":\" 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Control Theory & Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1049/cth2.12689\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory & Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/cth2.12689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
输电线路项目规划涉及广阔而复杂的地理地形。为解决输电线路规划的复杂性并降低线路成本,本研究提出了一种新颖的智能线路规划方法。它首次将决斗双深 Q 网络(D3QN)与优先经验重放(PER)机制相结合。首先,将奖励函数与线路长度、转角点数量和地理环境数据等指标相关联,这些指标与输电线路的建设成本息息相关。其次,通过整合双DQN和决斗DQN,制定了D3QN算法。在训练过程中,根据输电线路规划项目的特点,将网络的输入信息分为两部分。最后,针对规划路径中角点数量不同导致的成本差异问题,利用 PER 机制提高了算法的收敛效率。为了检验算法的可行性,我们使用真实地图进行了实验。与传统的蚁群优化(ACO)算法相比,D3QN-PER 深度强化学习算法的线路长度减少了 4% 以上,角点数量减少了 60% 以上。
Smart line planning method for power transmission based on D3QN‐PER algorithm
The planning of power transmission line projects encompasses vast and complex geographical terrains. To address the complexity of transmission line planning and achieve lower line costs, this study proposes a novel intelligent line planning method. For the first time, it combines the Dueling Double Deep Q Network (D3QN) with the prioritized experience replay (PER) mechanism. First, correlate the reward function with metrics such as line length, number of corner points, and geographical environmental data, which are pertinent to the construction costs of power transmission line. Second, the D3QN algorithm is formulated by integrating Double DQN and Dueling DQN. The network's input information is divided into two components during training, aligning with the characteristics of power transmission line planning projects. Finally, the convergence efficiency of the algorithm is improved by using the PER mechanism for the problem of cost difference due to the different number of corner points in the planning path. In order to test the feasibility of the algorithm, we conducted experiments using real maps. Compared with the traditional ant colony optimization (ACO) algorithm, the D3QN‐PER deep reinforcement learning algorithm reduces the line length by more than 4% and the number of corner points by more than 60%.