Performance comparison of explainable DQN and DDPG models for cooperative lane change decision-making in multi-intelligent industrial IoT vehicles

IF 6 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Internet of Things Pub Date : 2025-03-01 DOI:10.1016/j.iot.2025.101552
Hao-bai ZHAN
{"title":"Performance comparison of explainable DQN and DDPG models for cooperative lane change decision-making in multi-intelligent industrial IoT vehicles","authors":"Hao-bai ZHAN","doi":"10.1016/j.iot.2025.101552","DOIUrl":null,"url":null,"abstract":"<div><div>With the rapid advancement of intelligent connected vehicles (ICVs) technology, efficient and safe vehicular lane-changing decisions have become a focal point of interest for intelligent transportation systems (ITS). This paper investigates the application of explainable artificial intelligence (XAI) techniques to deep reinforcement learning algorithms, specifically deep Q-networks (DQN) and deep deterministic policy gradient (DDPG), for lane-changing decisions in industrial internet of things (IIoT) vehicles. By integrating innovative reward functions, the study assesses the performance differences between these models under various traffic densities and ICV counts in a three-lane highway scenario. The use of XAI feature representations enhances the transparency and interpretability of the models, providing insights into the decision-making process. XAI helps to elucidate how the models arrive at their decisions, improving trust and reliability in automated systems. The research reveals that although the DQN model demonstrates initial superior performance in the early phases of experimentation, the DDPG model outperforms in crucial performance metrics such as average fleet speed, headway, and stability during later stages of training. The DDPG model maintains better control over fleet speed and vehicle spacing in both low-density and high-density traffic environments, showcasing its superior adaptability and efficiency. These findings highlight the DDPG model's enhanced capability to manage dynamic and complex driving environments, attributed to its refined policy learning approach which adeptly balances exploration and exploitation. The novel reward function significantly promotes cooperative lane-changing behaviors among ICVs, optimizing lane change decisions and improving overall traffic flow efficiency. This study not only provides valuable technical support for lane-changing decisions in smart vehicular networks but also lays a theoretical and empirical foundation for the advancement of future ITS. The insights gained from comparing DQN and DDPG models contribute to the ongoing discussion on effective deep learning strategies for real-world ITS applications, potentially guiding future developments in autonomous driving technologies.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"31 ","pages":"Article 101552"},"PeriodicalIF":6.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525000654","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

With the rapid advancement of intelligent connected vehicles (ICVs) technology, efficient and safe vehicular lane-changing decisions have become a focal point of interest for intelligent transportation systems (ITS). This paper investigates the application of explainable artificial intelligence (XAI) techniques to deep reinforcement learning algorithms, specifically deep Q-networks (DQN) and deep deterministic policy gradient (DDPG), for lane-changing decisions in industrial internet of things (IIoT) vehicles. By integrating innovative reward functions, the study assesses the performance differences between these models under various traffic densities and ICV counts in a three-lane highway scenario. The use of XAI feature representations enhances the transparency and interpretability of the models, providing insights into the decision-making process. XAI helps to elucidate how the models arrive at their decisions, improving trust and reliability in automated systems. The research reveals that although the DQN model demonstrates initial superior performance in the early phases of experimentation, the DDPG model outperforms in crucial performance metrics such as average fleet speed, headway, and stability during later stages of training. The DDPG model maintains better control over fleet speed and vehicle spacing in both low-density and high-density traffic environments, showcasing its superior adaptability and efficiency. These findings highlight the DDPG model's enhanced capability to manage dynamic and complex driving environments, attributed to its refined policy learning approach which adeptly balances exploration and exploitation. The novel reward function significantly promotes cooperative lane-changing behaviors among ICVs, optimizing lane change decisions and improving overall traffic flow efficiency. This study not only provides valuable technical support for lane-changing decisions in smart vehicular networks but also lays a theoretical and empirical foundation for the advancement of future ITS. The insights gained from comparing DQN and DDPG models contribute to the ongoing discussion on effective deep learning strategies for real-world ITS applications, potentially guiding future developments in autonomous driving technologies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Internet of Things
Internet of Things Multiple-
CiteScore
3.60
自引率
5.10%
发文量
115
审稿时长
37 days
期刊介绍: Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT. The journal will place a high priority on timely publication, and provide a home for high quality. Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.
期刊最新文献
Novel RSSI-Based localization in LoRaWAN using probability density estimation similarity-based techniques Quantum-resistant hardware-accelerated IoT traffic encryptor CONCERN: A model-based monitoring infrastructure Towards privacy-preserving split learning: Destabilizing adversarial inference and reconstruction attacks in the cloud A secure image encryption mechanism using biased Fourier quantum walk and addition-crossover structure in the Internet of Things
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1