Cost-Effective Power Delivery via Deep Reinforcement Learning-Based Dynamic Electric Vehicle Transportation

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Internet of Things Journal Pub Date : 2025-03-19 DOI:10.1109/JIOT.2025.3552823

Zheng Bao;Changbing Tang;Xinghuo Yu;Feilong Lin;Guanghui Wen;Zhonglong Zheng

{"title":"Cost-Effective Power Delivery via Deep Reinforcement Learning-Based Dynamic Electric Vehicle Transportation","authors":"Zheng Bao;Changbing Tang;Xinghuo Yu;Feilong Lin;Guanghui Wen;Zhonglong Zheng","doi":"10.1109/JIOT.2025.3552823","DOIUrl":null,"url":null,"abstract":"Power delivery issues are increasingly evident in cyber-physical smart grid systems as energy transactions frequently overlook the physical constraints of distribution, leading to transmission congestion and compromising network security and reliability. This article presents a novel and cost-effective solution to power delivery challenges by utilizing electric vehicles (EVs) with dynamic transportation capabilities as free carriers. Unlike traditional approaches, a deep reinforcement learning (DRL)-based optimization framework is designed to effectively manage incomplete information in real-time. Our method first introduces an investment-free model that leverages existing EV routes to transport energy during congestion, operating in a “free-riding” transmission mode. This not only enhances network reliability but also curtails costs. Then, we develop a Markov decision process (MDP) for sequential decision-making of 24-h optimal control, aimed at minimizing operational losses including load shedding and battery degradation. To deal with the stochastic nature of energy requests and EV routes in the control problem, we employ a model-free DRL algorithm to tackle the challenge of incomplete information. An Actor-Critic network, combining value-based and policy-based approaches, helps discover approximately optimal strategies in a continuous action space. Finally, the simulation results numerically demonstrate the performance of the proposed method.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 13","pages":"23245-23256"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10934055/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Power delivery issues are increasingly evident in cyber-physical smart grid systems as energy transactions frequently overlook the physical constraints of distribution, leading to transmission congestion and compromising network security and reliability. This article presents a novel and cost-effective solution to power delivery challenges by utilizing electric vehicles (EVs) with dynamic transportation capabilities as free carriers. Unlike traditional approaches, a deep reinforcement learning (DRL)-based optimization framework is designed to effectively manage incomplete information in real-time. Our method first introduces an investment-free model that leverages existing EV routes to transport energy during congestion, operating in a “free-riding” transmission mode. This not only enhances network reliability but also curtails costs. Then, we develop a Markov decision process (MDP) for sequential decision-making of 24-h optimal control, aimed at minimizing operational losses including load shedding and battery degradation. To deal with the stochastic nature of energy requests and EV routes in the control problem, we employ a model-free DRL algorithm to tackle the challenge of incomplete information. An Actor-Critic network, combining value-based and policy-based approaches, helps discover approximately optimal strategies in a continuous action space. Finally, the simulation results numerically demonstrate the performance of the proposed method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度强化学习的动态电动汽车运输的经济高效电力输送

由于能源交易经常忽略分配的物理约束，导致传输拥塞，损害网络的安全性和可靠性，因此，在网络物理智能电网系统中，电力传输问题日益明显。本文提出了一种新颖的、具有成本效益的解决方案，即利用具有动态运输能力的电动汽车作为免费载体来解决电力输送挑战。与传统方法不同，基于深度强化学习（DRL）的优化框架旨在有效地实时管理不完整信息。我们的方法首先引入了一个无投资模型，该模型利用现有的电动汽车路线在拥堵期间运输能源，以“免费搭乘”的传输模式运行。这不仅提高了网络的可靠性，而且降低了成本。然后，我们开发了一个马尔可夫决策过程（MDP）用于24小时最优控制的顺序决策，旨在最大限度地减少包括负载减少和电池退化在内的运行损失。为了处理控制问题中能量请求和EV路径的随机性，我们采用无模型DRL算法来解决信息不完全的挑战。行动者-评论家网络结合了基于价值和基于策略的方法，有助于在连续的行动空间中发现近似最优的策略。最后，通过数值仿真验证了所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.