{"title":"Human-Like Interactive Lane-Change Modeling Based on Reward-Guided Diffusive Predictor and Planner","authors":"Kehua Chen;Yuhao Luo;Meixin Zhu;Hai Yang","doi":"10.1109/TITS.2024.3520613","DOIUrl":null,"url":null,"abstract":"Lane changing presents a dynamic scenario characterized by intricate interactions among vehicles. Within mixed-autonomy traffic environment, modeling a human-like lane-change trajectory enables human drivers to better understand and predict autonomous vehicles’ behaviors, thereby enhancing road safety and travel efficiency. In this study, we achieve human-like interactive lane-change modeling based on a novel framework named Diff-LC. The human-like modeling of LCV behaviors relies on an advanced diffusive planner, and the implemented trajectory is selected based on the recovered LCV reward function learned through Multi-Agent Adversarial Inverse Reinforcement Learning (MA-AIRL). To account for interactions between FVs and LCVs, we further employ a diffusive predictor to forecast future behaviors of FVs conditioned on both historical and planned trajectories. Additionally, we leverage the recovered reward function of FVs to enable controllable prediction of trajectories. In the experimental part, we begin by analyzing the significance of features in the recovered reward functions and then proceed to compare the distinctions between the LCV and the FV. To validate the effectiveness of the proposed framework, we compare the diffusive predictor and planner with several state-of-the-art methods. The results demonstrate that motions planned by Diff-LC closely reach the intended positions with small displacement errors and exhibit highly similar speed and jerk distributions to those of human drivers. We also conduct a dynamic simulation to evaluate Diff-LC’s performance across different traffic conditions. Finally, we explore customized generation using the Diffusion Posterior Sampling method. The codes can be found at <uri>https://github.com/zeonchen/Diff-LC/</uri>.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 3","pages":"3903-3916"},"PeriodicalIF":7.9000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10819263/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Lane changing presents a dynamic scenario characterized by intricate interactions among vehicles. Within mixed-autonomy traffic environment, modeling a human-like lane-change trajectory enables human drivers to better understand and predict autonomous vehicles’ behaviors, thereby enhancing road safety and travel efficiency. In this study, we achieve human-like interactive lane-change modeling based on a novel framework named Diff-LC. The human-like modeling of LCV behaviors relies on an advanced diffusive planner, and the implemented trajectory is selected based on the recovered LCV reward function learned through Multi-Agent Adversarial Inverse Reinforcement Learning (MA-AIRL). To account for interactions between FVs and LCVs, we further employ a diffusive predictor to forecast future behaviors of FVs conditioned on both historical and planned trajectories. Additionally, we leverage the recovered reward function of FVs to enable controllable prediction of trajectories. In the experimental part, we begin by analyzing the significance of features in the recovered reward functions and then proceed to compare the distinctions between the LCV and the FV. To validate the effectiveness of the proposed framework, we compare the diffusive predictor and planner with several state-of-the-art methods. The results demonstrate that motions planned by Diff-LC closely reach the intended positions with small displacement errors and exhibit highly similar speed and jerk distributions to those of human drivers. We also conduct a dynamic simulation to evaluate Diff-LC’s performance across different traffic conditions. Finally, we explore customized generation using the Diffusion Posterior Sampling method. The codes can be found at https://github.com/zeonchen/Diff-LC/.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.