Hierarchical Control for Cooperative Teams in Competitive Autonomous Racing

IF 14 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Intelligent Vehicles Pub Date : 2024-02-07 DOI:10.1109/TIV.2024.3363177

Rishabh Saumil Thakkar;Aryaman Singh Samyal;David Fridovich-Keil;Zhe Xu;Ufuk Topcu

{"title":"Hierarchical Control for Cooperative Teams in Competitive Autonomous Racing","authors":"Rishabh Saumil Thakkar;Aryaman Singh Samyal;David Fridovich-Keil;Zhe Xu;Ufuk Topcu","doi":"10.1109/TIV.2024.3363177","DOIUrl":null,"url":null,"abstract":"We investigate the problem of autonomous racing among teams of cooperative agents that are subject to realistic racing rules. Our work extends previous research on hierarchical control in head-to-head autonomous racing by considering a generalized version of the problem while maintaining the two-level hierarchical control structure. A high-level tactical planner constructs a discrete game that encodes the complex rules using simplified dynamics to produce a sequence of target waypoints. The low-level path planner uses these waypoints as a reference trajectory and computes high-resolution control inputs by solving a simplified formulation of a racing game with a simplified representation of the realistic racing rules. We explore two approaches for the low-level path planner: training a multi-agent reinforcement learning (MARL) policy and solving a linear-quadratic Nash game (LQNG) approximation. We evaluate our controllers on simple and complex tracks against three baselines: an end-to-end MARL controller, a MARL controller tracking a fixed racing line, and an LQNG controller tracking a fixed racing line. Quantitative results show our hierarchical methods outperform the baselines in terms of race wins, overall team performance, and compliance with the rules. Qualitatively, we observe the hierarchical controllers mimic actions performed by expert human drivers such as coordinated overtaking, defending against multiple opponents, and long-term planning for delayed advantages.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 5","pages":"4845-4860"},"PeriodicalIF":14.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Vehicles","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10423812/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

We investigate the problem of autonomous racing among teams of cooperative agents that are subject to realistic racing rules. Our work extends previous research on hierarchical control in head-to-head autonomous racing by considering a generalized version of the problem while maintaining the two-level hierarchical control structure. A high-level tactical planner constructs a discrete game that encodes the complex rules using simplified dynamics to produce a sequence of target waypoints. The low-level path planner uses these waypoints as a reference trajectory and computes high-resolution control inputs by solving a simplified formulation of a racing game with a simplified representation of the realistic racing rules. We explore two approaches for the low-level path planner: training a multi-agent reinforcement learning (MARL) policy and solving a linear-quadratic Nash game (LQNG) approximation. We evaluate our controllers on simple and complex tracks against three baselines: an end-to-end MARL controller, a MARL controller tracking a fixed racing line, and an LQNG controller tracking a fixed racing line. Quantitative results show our hierarchical methods outperform the baselines in terms of race wins, overall team performance, and compliance with the rules. Qualitatively, we observe the hierarchical controllers mimic actions performed by expert human drivers such as coordinated overtaking, defending against multiple opponents, and long-term planning for delayed advantages.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

竞技自主赛车中合作团队的分层控制

我们研究了合作代理团队之间的自主赛车问题，这些代理团队必须遵守现实的赛车规则。我们的研究在保持两级分层控制结构的基础上，考虑了问题的一般化版本，从而扩展了之前关于正面交锋自主赛车中分层控制的研究。高级战术规划器构建了一个离散博弈，利用简化动力学对复杂规则进行编码，从而生成目标航点序列。低级路径规划器将这些航点作为参考轨迹，通过求解赛车游戏的简化表述和现实赛车规则的简化表示来计算高分辨率控制输入。我们为低级路径规划器探索了两种方法：训练多代理强化学习（MARL）策略和求解线性-二次纳什博弈（LQNG）近似值。我们在简单和复杂的赛道上对我们的控制器进行了评估，并与三种基线进行了比较：端到端 MARL 控制器、跟踪固定赛道的 MARL 控制器和跟踪固定赛道的 LQNG 控制器。定量结果显示，我们的分层方法在比赛胜利、团队整体表现和遵守规则方面都优于基准方法。从质量上讲，我们观察到分层控制器模仿了人类专业驾驶员的操作，例如协调超车、抵御多个对手以及为延迟优势进行长期规划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Intelligent Vehicles Mathematics-Control and Optimization

CiteScore

12.10

自引率

13.40%

发文量

177

期刊介绍： The IEEE Transactions on Intelligent Vehicles (T-IV) is a premier platform for publishing peer-reviewed articles that present innovative research concepts, application results, significant theoretical findings, and application case studies in the field of intelligent vehicles. With a particular emphasis on automated vehicles within roadway environments, T-IV aims to raise awareness of pressing research and application challenges. Our focus is on providing critical information to the intelligent vehicle community, serving as a dissemination vehicle for IEEE ITS Society members and others interested in learning about the state-of-the-art developments and progress in research and applications related to intelligent vehicles. Join us in advancing knowledge and innovation in this dynamic field.

期刊最新文献

Table of Contents Introducing IEEE Collabratec The Autonomous Right of Way: Smart Governance for Smart Mobility With Intelligent Vehicles TechRxiv: Share Your Preprint Research with the World! Blank