Automated lane changing control in mixed traffic: An adaptive dynamic programming approach

IF 5.8 1区 工程技术 Q1 ECONOMICS Transportation Research Part B-Methodological Pub Date : 2024-08-01 DOI:10.1016/j.trb.2024.103026
Sayan Chakraborty , Leilei Cui , Kaan Ozbay , Zhong-Ping Jiang
{"title":"Automated lane changing control in mixed traffic: An adaptive dynamic programming approach","authors":"Sayan Chakraborty ,&nbsp;Leilei Cui ,&nbsp;Kaan Ozbay ,&nbsp;Zhong-Ping Jiang","doi":"10.1016/j.trb.2024.103026","DOIUrl":null,"url":null,"abstract":"<div><p>The majority of the past research dealing with lane-changing controller design of autonomous vehicles (<span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s) is based on the assumption of full knowledge of the model dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> and the surrounding vehicles. However, in the real world, this is not a very realistic assumption as accurate dynamic models are difficult to obtain. Also, the dynamic model parameters might change over time due to various factors. Thus, there is a need for a learning-based lane change controller design methodology that can learn the optimal control policy in real time using sensor data. In this paper, we have addressed this need by introducing an optimal learning-based control methodology that can solve the real-time lane-changing problem of <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s, where the input-state data of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is utilized to generate a near-optimal lane-changing controller by approximate/adaptive dynamic programming (ADP) technique. In the case of this type of complex lane-changing maneuver, the lateral dynamics depend on the longitudinal velocity of the vehicle. If the longitudinal velocity is assumed constant, a linear parameter invariant model can be used. However, assuming constant velocity while performing a lane-changing maneuver is not a realistic assumption. This assumption might increase the risk of accidents, especially in the case of lane abortion when the surrounding vehicles are not cooperative. Thus, in this paper, the dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> are assumed to be a linear parameter-varying system. Thus we have two challenges for the lane-changing controller design: parameter-varying, and unknown dynamics. With the help of both gain scheduling and ADP techniques combined, a learning-based control algorithm that can generate a near-optimal lane-changing controller without having to know the accurate dynamic model of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is proposed. The inclusion of a gain scheduling approach with ADP makes the controller applicable to non-linear and/or parameter-varying <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> dynamics. The stability of the learning-based gain scheduling controller has also been rigorously proved. Moreover, a data-driven lane-changing decision-making algorithm is introduced that can make the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> perform a lane abortion if safety conditions are violated during a lane change. Finally, the proposed learning-based gain scheduling controller design algorithm and the lane-changing decision-making methodology are numerically validated using MATLAB, SUMO simulations, and the NGSIM dataset.</p></div>","PeriodicalId":54418,"journal":{"name":"Transportation Research Part B-Methodological","volume":"187 ","pages":"Article 103026"},"PeriodicalIF":5.8000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part B-Methodological","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0191261524001504","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

The majority of the past research dealing with lane-changing controller design of autonomous vehicles (AVs) is based on the assumption of full knowledge of the model dynamics of the AV and the surrounding vehicles. However, in the real world, this is not a very realistic assumption as accurate dynamic models are difficult to obtain. Also, the dynamic model parameters might change over time due to various factors. Thus, there is a need for a learning-based lane change controller design methodology that can learn the optimal control policy in real time using sensor data. In this paper, we have addressed this need by introducing an optimal learning-based control methodology that can solve the real-time lane-changing problem of AVs, where the input-state data of the AV is utilized to generate a near-optimal lane-changing controller by approximate/adaptive dynamic programming (ADP) technique. In the case of this type of complex lane-changing maneuver, the lateral dynamics depend on the longitudinal velocity of the vehicle. If the longitudinal velocity is assumed constant, a linear parameter invariant model can be used. However, assuming constant velocity while performing a lane-changing maneuver is not a realistic assumption. This assumption might increase the risk of accidents, especially in the case of lane abortion when the surrounding vehicles are not cooperative. Thus, in this paper, the dynamics of the AV are assumed to be a linear parameter-varying system. Thus we have two challenges for the lane-changing controller design: parameter-varying, and unknown dynamics. With the help of both gain scheduling and ADP techniques combined, a learning-based control algorithm that can generate a near-optimal lane-changing controller without having to know the accurate dynamic model of the AV is proposed. The inclusion of a gain scheduling approach with ADP makes the controller applicable to non-linear and/or parameter-varying AV dynamics. The stability of the learning-based gain scheduling controller has also been rigorously proved. Moreover, a data-driven lane-changing decision-making algorithm is introduced that can make the AV perform a lane abortion if safety conditions are violated during a lane change. Finally, the proposed learning-based gain scheduling controller design algorithm and the lane-changing decision-making methodology are numerically validated using MATLAB, SUMO simulations, and the NGSIM dataset.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
混合交通中的自动变道控制:自适应动态编程方法
过去有关自动驾驶车辆变道控制器设计的大部分研究都是基于对自动驾驶车辆和周围车辆的模型动态完全了解的假设。然而,在现实世界中,这种假设并不现实,因为很难获得精确的动态模型。而且,动态模型参数可能会因各种因素而随时间发生变化。因此,需要一种基于学习的变道控制器设计方法,这种方法可以利用传感器数据实时学习最佳控制策略。针对这一需求,我们在本文中介绍了一种基于学习的最优控制方法,该方法可以解决 s 的实时变道问题,即利用 s 的输入状态数据,通过近似/自适应动态编程(ADP)技术生成一个接近最优的变道控制器。在这种复杂的变道机动中,横向动态取决于车辆的纵向速度。如果假设纵向速度不变,则可以使用线性参数不变模型。但是,在进行变道机动时假设速度恒定并不现实。这种假设可能会增加事故风险,特别是在周围车辆不配合的情况下,尤其如此。因此,本文假定变道器的动态是一个线性参数变化系统。因此,变道控制器的设计面临两个挑战:参数变化和未知动态。在增益调度和 ADP 技术相结合的帮助下,本文提出了一种基于学习的控制算法,该算法可以生成接近最优的变道控制器,而无需知道变道系统的精确动态模型。增益调度方法与 ADP 的结合使控制器适用于非线性和/或参数变化动态。基于学习的增益调度控制器的稳定性也得到了严格证明。此外,还引入了一种数据驱动的变道决策算法,如果在变道过程中违反了安全条件,该算法可以使变道执行流产。最后,利用 MATLAB、SUMO 仿真和 NGSIM 数据集对所提出的基于学习的增益调度控制器设计算法和变道决策方法进行了数值验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Transportation Research Part B-Methodological
Transportation Research Part B-Methodological 工程技术-工程:土木
CiteScore
12.40
自引率
8.80%
发文量
143
审稿时长
14.1 weeks
期刊介绍: Transportation Research: Part B publishes papers on all methodological aspects of the subject, particularly those that require mathematical analysis. The general theme of the journal is the development and solution of problems that are adequately motivated to deal with important aspects of the design and/or analysis of transportation systems. Areas covered include: traffic flow; design and analysis of transportation networks; control and scheduling; optimization; queuing theory; logistics; supply chains; development and application of statistical, econometric and mathematical models to address transportation problems; cost models; pricing and/or investment; traveler or shipper behavior; cost-benefit methodologies.
期刊最新文献
Modelling the impacts of en-route ride-pooling service in a mixed pooling and non-pooling market Amachine learning technique embedded reference-dependent choice model for explanatory power improvement: Shifting of reference point as a key factor in vehicle purchase decision-making Making the most of your private parking slot: Strategy-proof double auctions-enabled staggered sharing schemes Editorial Board Safety, liability, and insurance markets in the age of automated driving
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1