The Design of a Multi-Task Learning System for Autonomous Vehicles

Khanh-Phong Bui, Hoang-Lam Ngoc Le, Quang-Thang Le, Dinh-Hiep Huynh, Vu-Hoang Tran
{"title":"The Design of a Multi-Task Learning System for Autonomous Vehicles","authors":"Khanh-Phong Bui, Hoang-Lam Ngoc Le, Quang-Thang Le, Dinh-Hiep Huynh, Vu-Hoang Tran","doi":"10.1109/GTSD54989.2022.9989259","DOIUrl":null,"url":null,"abstract":"Recently, a lot of research and applications regarding autonomous vehicles have been invested in and developed. These applications have many complex scenarios to handle such as lane segmentation, object detection, traffic sign recognition, and steering control prediction. Many methods handle these tasks separately. Despite the excellent performance these methods achieve, processing these tasks one after another takes a longer time than tackling them all at once. So, in this paper, to reduce the inference time of the autonomous driving system, we proposed a multi-task framework to conduct three tasks: lane segmentation, object detection, and traffic sign recognition simultaneously. Our framework is composed of one encoder for feature extraction and two decoders to handle specific tasks. We only use one encoder for multiple tasks because these tasks complement each other, we hope that the information can be shared among these tasks through the single encoder to improve the performance of each task and also to reduce the amount of data required for training. The decoders include a detection decoder and a segmentation decoder. The detection decoder is designed to detect objects and recognize traffic signs. On the other hand, the segmentation decoder is designed to focus solely on the task of separating the drivable area. By testing on the challenging Carla dataset, our model shows that it can achieve better results compared to state-of-the-art methods. Besides, experimental results also show that, compared with solving tasks independently, our framework can achieve similar performance but greatly reduce processing time.","PeriodicalId":125445,"journal":{"name":"2022 6th International Conference on Green Technology and Sustainable Development (GTSD)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Green Technology and Sustainable Development (GTSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GTSD54989.2022.9989259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, a lot of research and applications regarding autonomous vehicles have been invested in and developed. These applications have many complex scenarios to handle such as lane segmentation, object detection, traffic sign recognition, and steering control prediction. Many methods handle these tasks separately. Despite the excellent performance these methods achieve, processing these tasks one after another takes a longer time than tackling them all at once. So, in this paper, to reduce the inference time of the autonomous driving system, we proposed a multi-task framework to conduct three tasks: lane segmentation, object detection, and traffic sign recognition simultaneously. Our framework is composed of one encoder for feature extraction and two decoders to handle specific tasks. We only use one encoder for multiple tasks because these tasks complement each other, we hope that the information can be shared among these tasks through the single encoder to improve the performance of each task and also to reduce the amount of data required for training. The decoders include a detection decoder and a segmentation decoder. The detection decoder is designed to detect objects and recognize traffic signs. On the other hand, the segmentation decoder is designed to focus solely on the task of separating the drivable area. By testing on the challenging Carla dataset, our model shows that it can achieve better results compared to state-of-the-art methods. Besides, experimental results also show that, compared with solving tasks independently, our framework can achieve similar performance but greatly reduce processing time.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自动驾驶汽车多任务学习系统的设计
近年来,人们对自动驾驶汽车进行了大量的研究和应用。这些应用程序有许多复杂的场景需要处理,如车道分割、目标检测、交通标志识别和转向控制预测。许多方法分别处理这些任务。尽管这些方法实现了出色的性能,但一个接一个地处理这些任务比一次处理所有任务需要更长的时间。因此,为了减少自动驾驶系统的推理时间,本文提出了一个多任务框架,同时进行车道分割、目标检测和交通标志识别三个任务。我们的框架由一个用于特征提取的编码器和两个用于处理特定任务的解码器组成。对于多个任务,我们只使用一个编码器,因为这些任务是相互补充的,我们希望通过单个编码器可以在这些任务之间共享信息,以提高每个任务的性能,也减少训练所需的数据量。解码器包括检测解码器和分割解码器。检测解码器设计用于检测物体和识别交通标志。另一方面,分割解码器被设计成只专注于分离可驱动区域的任务。通过对具有挑战性的Carla数据集的测试,我们的模型表明,与最先进的方法相比,它可以获得更好的结果。此外,实验结果也表明,与独立解决任务相比,我们的框架可以达到相似的性能,但大大减少了处理时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design a Fuel Battery Operation Model for a Car Application for Training Key Information Extraction from Mobile-Captured Vietnamese Receipt Images using Graph Neural Networks Approach Indoor Mobile Robot Positioning using Sensor Fusion A Steering Strategy for Self-Driving Automobile Systems Based on Lane-Line Detection The Improved Sliding Mode Observer for Sensorless Speed Control of Permanent Magnet Synchronous Motor
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1