Learning-Based NMPC Adaptation for Autonomous Driving Using Parallelized Digital Twin

IF 3.9 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Control Systems Technology Pub Date : 2024-08-14 DOI:10.1109/TCST.2024.3437163

Jean Pierre Allamaa;Panagiotis Patrinos;Herman Van der Auweraer;Tong Duy Son

{"title":"Learning-Based NMPC Adaptation for Autonomous Driving Using Parallelized Digital Twin","authors":"Jean Pierre Allamaa;Panagiotis Patrinos;Herman Van der Auweraer;Tong Duy Son","doi":"10.1109/TCST.2024.3437163","DOIUrl":null,"url":null,"abstract":"In this work, we focus on the challenge of transferring an autonomous driving (AD) controller from simulation to reality (Sim2Real). We propose a data-efficient method for online and on-the-fly adaptation of parametrizable control architectures such that the target closed-loop performance is optimized while accounting for uncertainties such as model mismatches, changes in the environment, and task variations. The novelty of the approach resides in leveraging black-box optimization enabled by executable digital twins (xDTs) for data-driven parameter calibration through derivative-free methods to directly adapt the controller in real time (RT). The xDTs are augmented with domain randomization (DR) for robustness and allow for safe parameter exploration. The proposed method requires a minimal amount of interaction with the real world as it pushes the exploration toward the xDTs. We validate our approach through real-world experiments, demonstrating its effectiveness in transferring and fine-tuning a nonlinear model predictive control (NMPC) with nine parameters, in under 10 min. This eliminates the need for hours-long manual tuning and lengthy machine learning training and data collection phases. Our results show that the online adapted NMPC directly compensates for the Sim2Real gap and avoids overtuning in simulation. Importantly, a 75% improvement in tracking performance is achieved, and the Sim2Real gap over the target performance is reduced from a factor of 876 to 1.033.","PeriodicalId":13103,"journal":{"name":"IEEE Transactions on Control Systems Technology","volume":"33 4","pages":"1317-1332"},"PeriodicalIF":3.9000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Control Systems Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10636183/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In this work, we focus on the challenge of transferring an autonomous driving (AD) controller from simulation to reality (Sim2Real). We propose a data-efficient method for online and on-the-fly adaptation of parametrizable control architectures such that the target closed-loop performance is optimized while accounting for uncertainties such as model mismatches, changes in the environment, and task variations. The novelty of the approach resides in leveraging black-box optimization enabled by executable digital twins (xDTs) for data-driven parameter calibration through derivative-free methods to directly adapt the controller in real time (RT). The xDTs are augmented with domain randomization (DR) for robustness and allow for safe parameter exploration. The proposed method requires a minimal amount of interaction with the real world as it pushes the exploration toward the xDTs. We validate our approach through real-world experiments, demonstrating its effectiveness in transferring and fine-tuning a nonlinear model predictive control (NMPC) with nine parameters, in under 10 min. This eliminates the need for hours-long manual tuning and lengthy machine learning training and data collection phases. Our results show that the online adapted NMPC directly compensates for the Sim2Real gap and avoids overtuning in simulation. Importantly, a 75% improvement in tracking performance is achieved, and the Sim2Real gap over the target performance is reduced from a factor of 876 to 1.033.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用并行化数字孪生系统为自动驾驶提供基于学习的 NMPC 适应性

在这项工作中，我们专注于将自动驾驶（AD）控制器从模拟转移到现实（Sim2Real）的挑战。我们提出了一种数据高效的方法，用于在线和动态适应可参数化控制体系结构，使目标闭环性能得到优化，同时考虑模型不匹配、环境变化和任务变化等不确定性。该方法的新颖之处在于利用可执行数字双胞胎（xdt）支持的黑盒优化，通过无导数方法进行数据驱动的参数校准，直接实时调整控制器。xdt通过域随机化（DR）增强了鲁棒性，并允许安全的参数探索。所提出的方法需要与现实世界进行最少的交互，因为它将探索推向了xdt。我们通过现实世界的实验验证了我们的方法，证明了它在10分钟内传递和微调具有9个参数的非线性模型预测控制（NMPC）的有效性。这消除了长达数小时的手动调整和冗长的机器学习训练和数据收集阶段的需要。结果表明，在线自适应NMPC直接补偿了Sim2Real的间隙，避免了仿真中的反调谐。重要的是，跟踪性能提高了75%，Sim2Real与目标性能的差距从876降低到1.033。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Control Systems Technology 工程技术-工程：电子与电气

CiteScore

10.70

自引率

2.10%

发文量

218

审稿时长

6.7 months

期刊介绍： The IEEE Transactions on Control Systems Technology publishes high quality technical papers on technological advances in control engineering. The word technology is from the Greek technologia. The modern meaning is a scientific method to achieve a practical purpose. Control Systems Technology includes all aspects of control engineering needed to implement practical control systems, from analysis and design, through simulation and hardware. A primary purpose of the IEEE Transactions on Control Systems Technology is to have an archival publication which will bridge the gap between theory and practice. Papers are published in the IEEE Transactions on Control System Technology which disclose significant new knowledge, exploratory developments, or practical applications in all aspects of technology needed to implement control systems, from analysis and design through simulation, and hardware.