A multi-objective reinforcement learning-based velocity optimization approach for electric trucks considering battery degradation mitigation

Ruo Jia, Kun Gao, Shaohua Cui, Jing Chen, Jelena Andric
{"title":"A multi-objective reinforcement learning-based velocity optimization approach for electric trucks considering battery degradation mitigation","authors":"Ruo Jia, Kun Gao, Shaohua Cui, Jing Chen, Jelena Andric","doi":"10.1016/j.tre.2024.103885","DOIUrl":null,"url":null,"abstract":"Electrification of commercial vehicles for more sustainable logistic systems has been promoted in the past decades. This study proposes a deep reinforcement learning method for velocity optimization and battery degradation minimization during operation for battery-powered electric trucks (BETs), aiming to achieve a safe, efficient, and comfortable driving control policy for BETs. To obtain an optimal solution considering both calendar and cyclic battery degradation, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient (TD3) approaches are integrated within a simulation environment. To optimize overall BET velocity performance, a trade-off among safety, efficiency, comfort, and battery degradation is incorporated into the reward function of reinforcement learning using Mixture of Experts (MoE) model. The results indicate that the proposed TD3-MoE model achieves safe, efficient, and comfortable car-following control while optimizing total battery degradation. Specifically, the model achieves reductions in total battery capacity loss ranging from 2.4% to 8.3% at different states of charge (SoC) of battery compared to human-driven scenarios. Moreover, despite calendar battery degradation being inevitable, the cyclic battery degradation is effectively mitigated by 27.7% to 29.6% compared to the same SoCs in human-driving data. Furthermore, the TD3-MoE model achieves significant energy consumption reductions, ranging from 35.3% to 39.8% compared to real car-following trajectories.","PeriodicalId":49418,"journal":{"name":"Transportation Research Part E-Logistics and Transportation Review","volume":"28 10 1","pages":""},"PeriodicalIF":8.3000,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part E-Logistics and Transportation Review","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.tre.2024.103885","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

Electrification of commercial vehicles for more sustainable logistic systems has been promoted in the past decades. This study proposes a deep reinforcement learning method for velocity optimization and battery degradation minimization during operation for battery-powered electric trucks (BETs), aiming to achieve a safe, efficient, and comfortable driving control policy for BETs. To obtain an optimal solution considering both calendar and cyclic battery degradation, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient (TD3) approaches are integrated within a simulation environment. To optimize overall BET velocity performance, a trade-off among safety, efficiency, comfort, and battery degradation is incorporated into the reward function of reinforcement learning using Mixture of Experts (MoE) model. The results indicate that the proposed TD3-MoE model achieves safe, efficient, and comfortable car-following control while optimizing total battery degradation. Specifically, the model achieves reductions in total battery capacity loss ranging from 2.4% to 8.3% at different states of charge (SoC) of battery compared to human-driven scenarios. Moreover, despite calendar battery degradation being inevitable, the cyclic battery degradation is effectively mitigated by 27.7% to 29.6% compared to the same SoCs in human-driving data. Furthermore, the TD3-MoE model achieves significant energy consumption reductions, ranging from 35.3% to 39.8% compared to real car-following trajectories.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于强化学习的电动卡车多目标速度优化方法,考虑到电池退化缓解问题
在过去的几十年里,商用车的电气化一直在推动更可持续的物流系统。本研究提出了一种基于深度强化学习的电动卡车运行速度优化和电池退化最小化方法,旨在实现电动卡车安全、高效、舒适的驾驶控制策略。为了获得考虑日历和循环电池退化的最优解,在仿真环境中集成了深度确定性策略梯度和双延迟深度确定性策略梯度(TD3)方法。为了优化整体BET速度性能,使用混合专家(MoE)模型将安全性、效率、舒适性和电池退化之间的权衡纳入强化学习的奖励函数中。结果表明,TD3-MoE模型在优化电池总退化的同时,实现了安全、高效、舒适的跟车控制。具体而言,与人为驱动的场景相比,该模型在电池不同充电状态(SoC)下实现了电池总容量损失减少2.4%至8.3%。此外,尽管日历电池退化是不可避免的,但与人类驾驶数据中相同的soc相比,循环电池退化有效缓解了27.7%至29.6%。此外,与实际车辆跟随轨迹相比,TD3-MoE模型实现了显著的能耗降低,降幅为35.3%至39.8%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
16.20
自引率
16.00%
发文量
285
审稿时长
62 days
期刊介绍: Transportation Research Part E: Logistics and Transportation Review is a reputable journal that publishes high-quality articles covering a wide range of topics in the field of logistics and transportation research. The journal welcomes submissions on various subjects, including transport economics, transport infrastructure and investment appraisal, evaluation of public policies related to transportation, empirical and analytical studies of logistics management practices and performance, logistics and operations models, and logistics and supply chain management. Part E aims to provide informative and well-researched articles that contribute to the understanding and advancement of the field. The content of the journal is complementary to other prestigious journals in transportation research, such as Transportation Research Part A: Policy and Practice, Part B: Methodological, Part C: Emerging Technologies, Part D: Transport and Environment, and Part F: Traffic Psychology and Behaviour. Together, these journals form a comprehensive and cohesive reference for current research in transportation science.
期刊最新文献
Multi-agent deep reinforcement learning-based truck-drone collaborative routing with dynamic emergency response Enhancing resilience in supply chains through resource orchestration and AI assimilation: An empirical exploration An interpretable machine learning framework for enhancing road transportation safety Shared use of dedicated lanes by connected and automated buses and private vehicles: A multi-green-wave signal control scheme Is operational flexibility a viable strategy during major supply chain disruptions? Evidence from the COVID-19 pandemic
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1