Zhongyu Rao;Yingfeng Cai;Hai Wang;Long Chen;Yicheng Li;Qingchao Liu
{"title":"A Camera-Based End-to-End Autonomous Driving Framework Combined With Meta-Based Multi-Task Optimization","authors":"Zhongyu Rao;Yingfeng Cai;Hai Wang;Long Chen;Yicheng Li;Qingchao Liu","doi":"10.1109/TTE.2024.3462449","DOIUrl":null,"url":null,"abstract":"Most existing autonomous driving pipelines can be divided into two broad categories: those based on a modular framework, which can result in error transmission, and those based on an end-to-end framework, which lack interpretability. To overcome these challenges, we propose a novel vision-based multi-task framework that incorporates motion planning, bird’s eye view (BEV) map generation, BEV object prediction, depth estimation, semantic segmentation, and velocity prediction. In particular, we present an improved view transformation module that transforms feature maps into BEV space and predicts future waypoints in BEV space. The multi-task framework can improve performance by sharing information across tasks, and the results of the multi-tasks also improve interpretability. In addition, to address the negative transfer, we introduce intertask affinity, which provides a rough estimate of the relationship between tasks. Moreover, because these relationships may change during training, we use a meta-based multi-task optimization method to dynamically adjust multi-task weighting. We evaluate the performance of our proposed model using the Longest6 and Town05 Long benchmarks of the CARLA simulator. Our model outperforms the current state-of-the-art camera-based models and achieves competitive results with other multimodal methods on both the benchmarks. These results demonstrate the considerable potential of our proposed model for autonomous driving systems. We have also prepared an autonomous driving demonstration using the CARLA simulator which is presented at <uri>https://www.youtube.com/watch?v=ctngFH4GSBc</uri>.","PeriodicalId":56269,"journal":{"name":"IEEE Transactions on Transportation Electrification","volume":"11 1","pages":"4443-4455"},"PeriodicalIF":8.3000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Transportation Electrification","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10682509/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Most existing autonomous driving pipelines can be divided into two broad categories: those based on a modular framework, which can result in error transmission, and those based on an end-to-end framework, which lack interpretability. To overcome these challenges, we propose a novel vision-based multi-task framework that incorporates motion planning, bird’s eye view (BEV) map generation, BEV object prediction, depth estimation, semantic segmentation, and velocity prediction. In particular, we present an improved view transformation module that transforms feature maps into BEV space and predicts future waypoints in BEV space. The multi-task framework can improve performance by sharing information across tasks, and the results of the multi-tasks also improve interpretability. In addition, to address the negative transfer, we introduce intertask affinity, which provides a rough estimate of the relationship between tasks. Moreover, because these relationships may change during training, we use a meta-based multi-task optimization method to dynamically adjust multi-task weighting. We evaluate the performance of our proposed model using the Longest6 and Town05 Long benchmarks of the CARLA simulator. Our model outperforms the current state-of-the-art camera-based models and achieves competitive results with other multimodal methods on both the benchmarks. These results demonstrate the considerable potential of our proposed model for autonomous driving systems. We have also prepared an autonomous driving demonstration using the CARLA simulator which is presented at https://www.youtube.com/watch?v=ctngFH4GSBc.
期刊介绍:
IEEE Transactions on Transportation Electrification is focused on components, sub-systems, systems, standards, and grid interface technologies related to power and energy conversion, propulsion, and actuation for all types of electrified vehicles including on-road, off-road, off-highway, and rail vehicles, airplanes, and ships.