Qianwen Li;Peng Zhang;Handong Yao;Zhiwei Chen;Xiaopeng Li
{"title":"用于互联和自动驾驶车辆的基于在线学习的模型预测轨迹控制:建模与物理测试","authors":"Qianwen Li;Peng Zhang;Handong Yao;Zhiwei Chen;Xiaopeng Li","doi":"10.26599/JICV.2023.9210026","DOIUrl":null,"url":null,"abstract":"Motivated by the promising benefits of connected and autonomous vehicles (CAVs) in improving fuel efficiency, mitigating congestion, and enhancing safety, numerous theoretical models have been proposed to plan CAV multiple-step trajectories (time-specific speed/location trajectories) to accomplish various operations. However, limited efforts have been made to develop proper trajectory control techniques to regulate vehicle movements to follow multiple-step trajectories and test the performance of theoretical trajectory planning models with field experiments. Without an effective control method, the benefits of theoretical models for CAV trajectory planning can be difficult to harvest. This study proposes an online learning-based model predictive vehicle trajectory control structure to follow time-specific speed and location profiles. Unlike single-step controllers that are dominantly used in the literature, a multiple-step model predictive controller is adopted to control the vehicle's longitudinal movements for higher accuracy. The model predictive controller output (speed) cannot be interpreted by vehicles. A reinforcement learning agent is used to convert the speed value to the vehicle's direct control variable (i.e., throttle/brake). The reinforcement learning agent captures real-time changes in the operating environment. This is valuable in saving parameter calibration resources and improving trajectory control accuracy. A line tracking controller keeps vehicles on track. The proposed control structure is tested using reduced-scale robot cars. The adaptivity of the proposed control structure is demonstrated by changing the vehicle load. Then, experiments on two fundamental CAV platoon operations (i.e., platooning and split) show the effectiveness of the proposed trajectory control structure in regulating robot movements to follow time-specific reference trajectories.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"7 2","pages":"86-96"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10586903","citationCount":"0","resultStr":"{\"title\":\"Online Learning-Based Model Predictive Trajectory Control for Connected and Autonomous Vehicles: Modeling and Physical Tests\",\"authors\":\"Qianwen Li;Peng Zhang;Handong Yao;Zhiwei Chen;Xiaopeng Li\",\"doi\":\"10.26599/JICV.2023.9210026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivated by the promising benefits of connected and autonomous vehicles (CAVs) in improving fuel efficiency, mitigating congestion, and enhancing safety, numerous theoretical models have been proposed to plan CAV multiple-step trajectories (time-specific speed/location trajectories) to accomplish various operations. However, limited efforts have been made to develop proper trajectory control techniques to regulate vehicle movements to follow multiple-step trajectories and test the performance of theoretical trajectory planning models with field experiments. Without an effective control method, the benefits of theoretical models for CAV trajectory planning can be difficult to harvest. This study proposes an online learning-based model predictive vehicle trajectory control structure to follow time-specific speed and location profiles. Unlike single-step controllers that are dominantly used in the literature, a multiple-step model predictive controller is adopted to control the vehicle's longitudinal movements for higher accuracy. The model predictive controller output (speed) cannot be interpreted by vehicles. A reinforcement learning agent is used to convert the speed value to the vehicle's direct control variable (i.e., throttle/brake). The reinforcement learning agent captures real-time changes in the operating environment. This is valuable in saving parameter calibration resources and improving trajectory control accuracy. A line tracking controller keeps vehicles on track. The proposed control structure is tested using reduced-scale robot cars. The adaptivity of the proposed control structure is demonstrated by changing the vehicle load. Then, experiments on two fundamental CAV platoon operations (i.e., platooning and split) show the effectiveness of the proposed trajectory control structure in regulating robot movements to follow time-specific reference trajectories.\",\"PeriodicalId\":100793,\"journal\":{\"name\":\"Journal of Intelligent and Connected Vehicles\",\"volume\":\"7 2\",\"pages\":\"86-96\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10586903\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent and Connected Vehicles\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10586903/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent and Connected Vehicles","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10586903/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Online Learning-Based Model Predictive Trajectory Control for Connected and Autonomous Vehicles: Modeling and Physical Tests
Motivated by the promising benefits of connected and autonomous vehicles (CAVs) in improving fuel efficiency, mitigating congestion, and enhancing safety, numerous theoretical models have been proposed to plan CAV multiple-step trajectories (time-specific speed/location trajectories) to accomplish various operations. However, limited efforts have been made to develop proper trajectory control techniques to regulate vehicle movements to follow multiple-step trajectories and test the performance of theoretical trajectory planning models with field experiments. Without an effective control method, the benefits of theoretical models for CAV trajectory planning can be difficult to harvest. This study proposes an online learning-based model predictive vehicle trajectory control structure to follow time-specific speed and location profiles. Unlike single-step controllers that are dominantly used in the literature, a multiple-step model predictive controller is adopted to control the vehicle's longitudinal movements for higher accuracy. The model predictive controller output (speed) cannot be interpreted by vehicles. A reinforcement learning agent is used to convert the speed value to the vehicle's direct control variable (i.e., throttle/brake). The reinforcement learning agent captures real-time changes in the operating environment. This is valuable in saving parameter calibration resources and improving trajectory control accuracy. A line tracking controller keeps vehicles on track. The proposed control structure is tested using reduced-scale robot cars. The adaptivity of the proposed control structure is demonstrated by changing the vehicle load. Then, experiments on two fundamental CAV platoon operations (i.e., platooning and split) show the effectiveness of the proposed trajectory control structure in regulating robot movements to follow time-specific reference trajectories.