A model predictive control trajectory tracking lateral controller for autonomous vehicles combined with deep deterministic policy gradient

IF 1.7 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS Transactions of the Institute of Measurement and Control Pub Date : 2023-10-13 DOI:10.1177/01423312231197854
Zhaokang Xie, Xiaoci Huang, Suyun Luo, Ruoping Zhang, Fang Ma
{"title":"A model predictive control trajectory tracking lateral controller for autonomous vehicles combined with deep deterministic policy gradient","authors":"Zhaokang Xie, Xiaoci Huang, Suyun Luo, Ruoping Zhang, Fang Ma","doi":"10.1177/01423312231197854","DOIUrl":null,"url":null,"abstract":"To solve the problem of trajectory tracking lateral control in autonomous driving technology, a model predictive control (MPC) controller trajectory tracking lateral control method combined with a deep deterministic policy gradient algorithm (DDPG) is proposed in this paper. This method inputs the real-time state of the vehicle into DDPG to achieve real-time automatic optimization of the prediction time domain and control time domain parameters of the MPC controller, and then affects the specific performance of the MPC controller in trajectory tracking lateral control. Specifically, the state space, action space, and reward function of DDPG are defined, and the automatic driving trajectory tracking lateral controller is designed in combination with the vehicle dynamics model. To reduce the exploration space of DDPG and improve the training efficiency of the entire model, the technique of advantage-disadvantage experience separation and extraction is introduced. Finally, the proposed method was trained and verified in various scenarios, and compared with two other lateral control methods for autonomous driving. The results showed that the learning and training time of the trajectory tracking lateral control method based on DDPG-MPC was shorter than that of the DDPG-based method, and the evaluation indicators in the trajectory tracking control process were better than those of the DDPG-based method and original MPC-based method.","PeriodicalId":49426,"journal":{"name":"Transactions of the Institute of Measurement and Control","volume":"53 1","pages":"0"},"PeriodicalIF":1.7000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions of the Institute of Measurement and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/01423312231197854","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

To solve the problem of trajectory tracking lateral control in autonomous driving technology, a model predictive control (MPC) controller trajectory tracking lateral control method combined with a deep deterministic policy gradient algorithm (DDPG) is proposed in this paper. This method inputs the real-time state of the vehicle into DDPG to achieve real-time automatic optimization of the prediction time domain and control time domain parameters of the MPC controller, and then affects the specific performance of the MPC controller in trajectory tracking lateral control. Specifically, the state space, action space, and reward function of DDPG are defined, and the automatic driving trajectory tracking lateral controller is designed in combination with the vehicle dynamics model. To reduce the exploration space of DDPG and improve the training efficiency of the entire model, the technique of advantage-disadvantage experience separation and extraction is introduced. Finally, the proposed method was trained and verified in various scenarios, and compared with two other lateral control methods for autonomous driving. The results showed that the learning and training time of the trajectory tracking lateral control method based on DDPG-MPC was shorter than that of the DDPG-based method, and the evaluation indicators in the trajectory tracking control process were better than those of the DDPG-based method and original MPC-based method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
结合深度确定性策略梯度的自动驾驶汽车模型预测控制轨迹跟踪横向控制器
为解决自动驾驶技术中的轨迹跟踪横向控制问题,提出了一种结合深度确定性策略梯度算法的模型预测控制(MPC)控制器轨迹跟踪横向控制方法。该方法将车辆的实时状态输入到DDPG中,实现MPC控制器预测时域和控制时域参数的实时自动优化,进而影响MPC控制器在轨迹跟踪横向控制中的具体性能。具体而言,定义了DDPG的状态空间、动作空间和奖励函数,结合车辆动力学模型设计了自动驾驶轨迹跟踪横向控制器。为了减小DDPG的探索空间,提高整个模型的训练效率,引入了优劣势经验分离提取技术。最后,对该方法进行了各种场景下的训练和验证,并与另外两种自动驾驶横向控制方法进行了比较。结果表明,基于DDPG-MPC的轨迹跟踪横向控制方法的学习和训练时间比基于ddpg的方法短,轨迹跟踪控制过程中的评价指标优于基于ddpg的方法和原始基于mpc的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.10
自引率
16.70%
发文量
203
审稿时长
3.4 months
期刊介绍: Transactions of the Institute of Measurement and Control is a fully peer-reviewed international journal. The journal covers all areas of applications in instrumentation and control. Its scope encompasses cutting-edge research and development, education and industrial applications.
期刊最新文献
Quantized guaranteed cost dynamic output feedback control for uncertain nonlinear networked systems with external disturbance Event-triggered control of switched 2D continuous-discrete systems Prescribed-time leader-following consensus and containment control for second-order multiagent systems with only position measurements Distributed nonsingular terminal sliding mode control–based RBFNN for heterogeneous vehicular platoons with input saturation Event-triggered adaptive command-filtered trajectory tracking control for underactuated surface vessels based on multivariate finite-time disturbance observer under actuator faults and input saturation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1