{"title":"混合交通中的自动变道控制:自适应动态编程方法","authors":"Sayan Chakraborty , Leilei Cui , Kaan Ozbay , Zhong-Ping Jiang","doi":"10.1016/j.trb.2024.103026","DOIUrl":null,"url":null,"abstract":"<div><p>The majority of the past research dealing with lane-changing controller design of autonomous vehicles (<span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s) is based on the assumption of full knowledge of the model dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> and the surrounding vehicles. However, in the real world, this is not a very realistic assumption as accurate dynamic models are difficult to obtain. Also, the dynamic model parameters might change over time due to various factors. Thus, there is a need for a learning-based lane change controller design methodology that can learn the optimal control policy in real time using sensor data. In this paper, we have addressed this need by introducing an optimal learning-based control methodology that can solve the real-time lane-changing problem of <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s, where the input-state data of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is utilized to generate a near-optimal lane-changing controller by approximate/adaptive dynamic programming (ADP) technique. In the case of this type of complex lane-changing maneuver, the lateral dynamics depend on the longitudinal velocity of the vehicle. If the longitudinal velocity is assumed constant, a linear parameter invariant model can be used. However, assuming constant velocity while performing a lane-changing maneuver is not a realistic assumption. This assumption might increase the risk of accidents, especially in the case of lane abortion when the surrounding vehicles are not cooperative. Thus, in this paper, the dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> are assumed to be a linear parameter-varying system. Thus we have two challenges for the lane-changing controller design: parameter-varying, and unknown dynamics. With the help of both gain scheduling and ADP techniques combined, a learning-based control algorithm that can generate a near-optimal lane-changing controller without having to know the accurate dynamic model of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is proposed. The inclusion of a gain scheduling approach with ADP makes the controller applicable to non-linear and/or parameter-varying <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> dynamics. The stability of the learning-based gain scheduling controller has also been rigorously proved. Moreover, a data-driven lane-changing decision-making algorithm is introduced that can make the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> perform a lane abortion if safety conditions are violated during a lane change. Finally, the proposed learning-based gain scheduling controller design algorithm and the lane-changing decision-making methodology are numerically validated using MATLAB, SUMO simulations, and the NGSIM dataset.</p></div>","PeriodicalId":54418,"journal":{"name":"Transportation Research Part B-Methodological","volume":"187 ","pages":"Article 103026"},"PeriodicalIF":5.8000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated lane changing control in mixed traffic: An adaptive dynamic programming approach\",\"authors\":\"Sayan Chakraborty , Leilei Cui , Kaan Ozbay , Zhong-Ping Jiang\",\"doi\":\"10.1016/j.trb.2024.103026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The majority of the past research dealing with lane-changing controller design of autonomous vehicles (<span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s) is based on the assumption of full knowledge of the model dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> and the surrounding vehicles. However, in the real world, this is not a very realistic assumption as accurate dynamic models are difficult to obtain. Also, the dynamic model parameters might change over time due to various factors. Thus, there is a need for a learning-based lane change controller design methodology that can learn the optimal control policy in real time using sensor data. In this paper, we have addressed this need by introducing an optimal learning-based control methodology that can solve the real-time lane-changing problem of <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span>s, where the input-state data of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is utilized to generate a near-optimal lane-changing controller by approximate/adaptive dynamic programming (ADP) technique. In the case of this type of complex lane-changing maneuver, the lateral dynamics depend on the longitudinal velocity of the vehicle. If the longitudinal velocity is assumed constant, a linear parameter invariant model can be used. However, assuming constant velocity while performing a lane-changing maneuver is not a realistic assumption. This assumption might increase the risk of accidents, especially in the case of lane abortion when the surrounding vehicles are not cooperative. Thus, in this paper, the dynamics of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> are assumed to be a linear parameter-varying system. Thus we have two challenges for the lane-changing controller design: parameter-varying, and unknown dynamics. With the help of both gain scheduling and ADP techniques combined, a learning-based control algorithm that can generate a near-optimal lane-changing controller without having to know the accurate dynamic model of the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> is proposed. The inclusion of a gain scheduling approach with ADP makes the controller applicable to non-linear and/or parameter-varying <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> dynamics. The stability of the learning-based gain scheduling controller has also been rigorously proved. Moreover, a data-driven lane-changing decision-making algorithm is introduced that can make the <span><math><mrow><mi>A</mi><mi>V</mi></mrow></math></span> perform a lane abortion if safety conditions are violated during a lane change. Finally, the proposed learning-based gain scheduling controller design algorithm and the lane-changing decision-making methodology are numerically validated using MATLAB, SUMO simulations, and the NGSIM dataset.</p></div>\",\"PeriodicalId\":54418,\"journal\":{\"name\":\"Transportation Research Part B-Methodological\",\"volume\":\"187 \",\"pages\":\"Article 103026\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part B-Methodological\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0191261524001504\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part B-Methodological","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0191261524001504","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
摘要
过去有关自动驾驶车辆变道控制器设计的大部分研究都是基于对自动驾驶车辆和周围车辆的模型动态完全了解的假设。然而,在现实世界中,这种假设并不现实,因为很难获得精确的动态模型。而且,动态模型参数可能会因各种因素而随时间发生变化。因此,需要一种基于学习的变道控制器设计方法,这种方法可以利用传感器数据实时学习最佳控制策略。针对这一需求,我们在本文中介绍了一种基于学习的最优控制方法,该方法可以解决 s 的实时变道问题,即利用 s 的输入状态数据,通过近似/自适应动态编程(ADP)技术生成一个接近最优的变道控制器。在这种复杂的变道机动中,横向动态取决于车辆的纵向速度。如果假设纵向速度不变,则可以使用线性参数不变模型。但是,在进行变道机动时假设速度恒定并不现实。这种假设可能会增加事故风险,特别是在周围车辆不配合的情况下,尤其如此。因此,本文假定变道器的动态是一个线性参数变化系统。因此,变道控制器的设计面临两个挑战:参数变化和未知动态。在增益调度和 ADP 技术相结合的帮助下,本文提出了一种基于学习的控制算法,该算法可以生成接近最优的变道控制器,而无需知道变道系统的精确动态模型。增益调度方法与 ADP 的结合使控制器适用于非线性和/或参数变化动态。基于学习的增益调度控制器的稳定性也得到了严格证明。此外,还引入了一种数据驱动的变道决策算法,如果在变道过程中违反了安全条件,该算法可以使变道执行流产。最后,利用 MATLAB、SUMO 仿真和 NGSIM 数据集对所提出的基于学习的增益调度控制器设计算法和变道决策方法进行了数值验证。
Automated lane changing control in mixed traffic: An adaptive dynamic programming approach
The majority of the past research dealing with lane-changing controller design of autonomous vehicles (s) is based on the assumption of full knowledge of the model dynamics of the and the surrounding vehicles. However, in the real world, this is not a very realistic assumption as accurate dynamic models are difficult to obtain. Also, the dynamic model parameters might change over time due to various factors. Thus, there is a need for a learning-based lane change controller design methodology that can learn the optimal control policy in real time using sensor data. In this paper, we have addressed this need by introducing an optimal learning-based control methodology that can solve the real-time lane-changing problem of s, where the input-state data of the is utilized to generate a near-optimal lane-changing controller by approximate/adaptive dynamic programming (ADP) technique. In the case of this type of complex lane-changing maneuver, the lateral dynamics depend on the longitudinal velocity of the vehicle. If the longitudinal velocity is assumed constant, a linear parameter invariant model can be used. However, assuming constant velocity while performing a lane-changing maneuver is not a realistic assumption. This assumption might increase the risk of accidents, especially in the case of lane abortion when the surrounding vehicles are not cooperative. Thus, in this paper, the dynamics of the are assumed to be a linear parameter-varying system. Thus we have two challenges for the lane-changing controller design: parameter-varying, and unknown dynamics. With the help of both gain scheduling and ADP techniques combined, a learning-based control algorithm that can generate a near-optimal lane-changing controller without having to know the accurate dynamic model of the is proposed. The inclusion of a gain scheduling approach with ADP makes the controller applicable to non-linear and/or parameter-varying dynamics. The stability of the learning-based gain scheduling controller has also been rigorously proved. Moreover, a data-driven lane-changing decision-making algorithm is introduced that can make the perform a lane abortion if safety conditions are violated during a lane change. Finally, the proposed learning-based gain scheduling controller design algorithm and the lane-changing decision-making methodology are numerically validated using MATLAB, SUMO simulations, and the NGSIM dataset.
期刊介绍:
Transportation Research: Part B publishes papers on all methodological aspects of the subject, particularly those that require mathematical analysis. The general theme of the journal is the development and solution of problems that are adequately motivated to deal with important aspects of the design and/or analysis of transportation systems. Areas covered include: traffic flow; design and analysis of transportation networks; control and scheduling; optimization; queuing theory; logistics; supply chains; development and application of statistical, econometric and mathematical models to address transportation problems; cost models; pricing and/or investment; traveler or shipper behavior; cost-benefit methodologies.