Learning-Based NMPC Adaptation for Autonomous Driving Using Parallelized Digital Twin

IF 3.9 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Control Systems Technology Pub Date : 2024-08-14 DOI:10.1109/TCST.2024.3437163
Jean Pierre Allamaa;Panagiotis Patrinos;Herman Van der Auweraer;Tong Duy Son
{"title":"Learning-Based NMPC Adaptation for Autonomous Driving Using Parallelized Digital Twin","authors":"Jean Pierre Allamaa;Panagiotis Patrinos;Herman Van der Auweraer;Tong Duy Son","doi":"10.1109/TCST.2024.3437163","DOIUrl":null,"url":null,"abstract":"In this work, we focus on the challenge of transferring an autonomous driving (AD) controller from simulation to reality (Sim2Real). We propose a data-efficient method for online and on-the-fly adaptation of parametrizable control architectures such that the target closed-loop performance is optimized while accounting for uncertainties such as model mismatches, changes in the environment, and task variations. The novelty of the approach resides in leveraging black-box optimization enabled by executable digital twins (xDTs) for data-driven parameter calibration through derivative-free methods to directly adapt the controller in real time (RT). The xDTs are augmented with domain randomization (DR) for robustness and allow for safe parameter exploration. The proposed method requires a minimal amount of interaction with the real world as it pushes the exploration toward the xDTs. We validate our approach through real-world experiments, demonstrating its effectiveness in transferring and fine-tuning a nonlinear model predictive control (NMPC) with nine parameters, in under 10 min. This eliminates the need for hours-long manual tuning and lengthy machine learning training and data collection phases. Our results show that the online adapted NMPC directly compensates for the Sim2Real gap and avoids overtuning in simulation. Importantly, a 75% improvement in tracking performance is achieved, and the Sim2Real gap over the target performance is reduced from a factor of 876 to 1.033.","PeriodicalId":13103,"journal":{"name":"IEEE Transactions on Control Systems Technology","volume":"33 4","pages":"1317-1332"},"PeriodicalIF":3.9000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Control Systems Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10636183/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this work, we focus on the challenge of transferring an autonomous driving (AD) controller from simulation to reality (Sim2Real). We propose a data-efficient method for online and on-the-fly adaptation of parametrizable control architectures such that the target closed-loop performance is optimized while accounting for uncertainties such as model mismatches, changes in the environment, and task variations. The novelty of the approach resides in leveraging black-box optimization enabled by executable digital twins (xDTs) for data-driven parameter calibration through derivative-free methods to directly adapt the controller in real time (RT). The xDTs are augmented with domain randomization (DR) for robustness and allow for safe parameter exploration. The proposed method requires a minimal amount of interaction with the real world as it pushes the exploration toward the xDTs. We validate our approach through real-world experiments, demonstrating its effectiveness in transferring and fine-tuning a nonlinear model predictive control (NMPC) with nine parameters, in under 10 min. This eliminates the need for hours-long manual tuning and lengthy machine learning training and data collection phases. Our results show that the online adapted NMPC directly compensates for the Sim2Real gap and avoids overtuning in simulation. Importantly, a 75% improvement in tracking performance is achieved, and the Sim2Real gap over the target performance is reduced from a factor of 876 to 1.033.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用并行化数字孪生系统为自动驾驶提供基于学习的 NMPC 适应性
在这项工作中,我们专注于将自动驾驶(AD)控制器从模拟转移到现实(Sim2Real)的挑战。我们提出了一种数据高效的方法,用于在线和动态适应可参数化控制体系结构,使目标闭环性能得到优化,同时考虑模型不匹配、环境变化和任务变化等不确定性。该方法的新颖之处在于利用可执行数字双胞胎(xdt)支持的黑盒优化,通过无导数方法进行数据驱动的参数校准,直接实时调整控制器。xdt通过域随机化(DR)增强了鲁棒性,并允许安全的参数探索。所提出的方法需要与现实世界进行最少的交互,因为它将探索推向了xdt。我们通过现实世界的实验验证了我们的方法,证明了它在10分钟内传递和微调具有9个参数的非线性模型预测控制(NMPC)的有效性。这消除了长达数小时的手动调整和冗长的机器学习训练和数据收集阶段的需要。结果表明,在线自适应NMPC直接补偿了Sim2Real的间隙,避免了仿真中的反调谐。重要的是,跟踪性能提高了75%,Sim2Real与目标性能的差距从876降低到1.033。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Control Systems Technology
IEEE Transactions on Control Systems Technology 工程技术-工程:电子与电气
CiteScore
10.70
自引率
2.10%
发文量
218
审稿时长
6.7 months
期刊介绍: The IEEE Transactions on Control Systems Technology publishes high quality technical papers on technological advances in control engineering. The word technology is from the Greek technologia. The modern meaning is a scientific method to achieve a practical purpose. Control Systems Technology includes all aspects of control engineering needed to implement practical control systems, from analysis and design, through simulation and hardware. A primary purpose of the IEEE Transactions on Control Systems Technology is to have an archival publication which will bridge the gap between theory and practice. Papers are published in the IEEE Transactions on Control System Technology which disclose significant new knowledge, exploratory developments, or practical applications in all aspects of technology needed to implement control systems, from analysis and design through simulation, and hardware.
期刊最新文献
Structured Fault-Relevant Statistical Analysis for Incipient Fault Detection Hybrid Feedback Control for Global Navigation With Locally Optimal Obstacle Avoidance in n-Dimensional Spaces Adaptive Multirate Moving Horizon Estimator for Real-Time Battery State and Parameter Estimation Data-Driven Orientation Tracking Control of Magnetically Actuated Capsule Endoscopy With Unknown Input Constraint Navigating Robot Swarm Through a Virtual Tube With Flow-Adaptive Distribution Control
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1