多模态注意力引导实时车道检测

X. Zhang, Yansheng Gong, Zhiwei Li, Xuan Liu, Shuyue Pan, Jun Li
{"title":"多模态注意力引导实时车道检测","authors":"X. Zhang, Yansheng Gong, Zhiwei Li, Xuan Liu, Shuyue Pan, Jun Li","doi":"10.1109/ICARM52023.2021.9536157","DOIUrl":null,"url":null,"abstract":"Multimodal data fusion is becoming a trend for the field of autonomous driving, especially for lane detection. In the process of driving, sensors often encounter problems such as modality imbalance, changing illumination and so on. Therefore, it is worthwhile to study the problems of applying multimodal fusion for lane detection and modality imbalance in the fusion process. In this paper, we propose a novel multimodal model for lane detection, in which attention mechanism is embedded into network to balance multimodal feature fusion and to improve detection capability. In addition, we use multi-frame input and long short-term memory (LSTM) network to solve the shadow interference, vehicles occlusion and mark degradation. At the same time, the network can be applied to the task of lane detection. In order to verify the effect of multimodal application and attention mechanism on fusion, we have designed adequate experiments on processed continuous scene KITTI dataset. The results show that precision increases by about 15% when LiDAR is added compared with RGB only. Besides, attention mechanism obviously improves the performance of multi-modal detection by balancing multi-modal features.","PeriodicalId":367307,"journal":{"name":"2021 6th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM)","volume":"56 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Multi-Modal Attention Guided Real-Time Lane Detection\",\"authors\":\"X. Zhang, Yansheng Gong, Zhiwei Li, Xuan Liu, Shuyue Pan, Jun Li\",\"doi\":\"10.1109/ICARM52023.2021.9536157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multimodal data fusion is becoming a trend for the field of autonomous driving, especially for lane detection. In the process of driving, sensors often encounter problems such as modality imbalance, changing illumination and so on. Therefore, it is worthwhile to study the problems of applying multimodal fusion for lane detection and modality imbalance in the fusion process. In this paper, we propose a novel multimodal model for lane detection, in which attention mechanism is embedded into network to balance multimodal feature fusion and to improve detection capability. In addition, we use multi-frame input and long short-term memory (LSTM) network to solve the shadow interference, vehicles occlusion and mark degradation. At the same time, the network can be applied to the task of lane detection. In order to verify the effect of multimodal application and attention mechanism on fusion, we have designed adequate experiments on processed continuous scene KITTI dataset. The results show that precision increases by about 15% when LiDAR is added compared with RGB only. Besides, attention mechanism obviously improves the performance of multi-modal detection by balancing multi-modal features.\",\"PeriodicalId\":367307,\"journal\":{\"name\":\"2021 6th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM)\",\"volume\":\"56 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 6th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICARM52023.2021.9536157\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARM52023.2021.9536157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

多模态数据融合已成为自动驾驶领域的发展趋势,尤其是车道检测。在驾驶过程中,传感器经常会遇到模态不平衡、光照变化等问题。因此,应用多模态融合进行车道检测以及融合过程中的模态不平衡问题是值得研究的问题。本文提出了一种新的多模态车道检测模型,该模型将注意力机制嵌入到网络中,以平衡多模态特征融合,提高检测能力。此外,我们采用多帧输入和LSTM网络来解决阴影干扰、车辆遮挡和标记退化问题。同时,该网络可以应用于车道检测任务。为了验证多模态应用和注意机制对融合的影响,我们在处理过的连续场景KITTI数据集上设计了充分的实验。结果表明,加入激光雷达后,精度比仅加入RGB时提高了15%左右。此外,注意机制通过平衡多模态特征,明显提高了多模态检测的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multi-Modal Attention Guided Real-Time Lane Detection
Multimodal data fusion is becoming a trend for the field of autonomous driving, especially for lane detection. In the process of driving, sensors often encounter problems such as modality imbalance, changing illumination and so on. Therefore, it is worthwhile to study the problems of applying multimodal fusion for lane detection and modality imbalance in the fusion process. In this paper, we propose a novel multimodal model for lane detection, in which attention mechanism is embedded into network to balance multimodal feature fusion and to improve detection capability. In addition, we use multi-frame input and long short-term memory (LSTM) network to solve the shadow interference, vehicles occlusion and mark degradation. At the same time, the network can be applied to the task of lane detection. In order to verify the effect of multimodal application and attention mechanism on fusion, we have designed adequate experiments on processed continuous scene KITTI dataset. The results show that precision increases by about 15% when LiDAR is added compared with RGB only. Besides, attention mechanism obviously improves the performance of multi-modal detection by balancing multi-modal features.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Non-model Friction Disturbance Compensation of a Pan-tilt Based on MUAV for Aerial Remote Sensing Application Multi-Modal Attention Guided Real-Time Lane Detection Amphibious Robot with a Novel Composite Propulsion Mechanism Iterative Learning Control of Impedance Parameters for a Soft Exosuit Triple-step Nonlinear Controller with MLFNN for a Lower Limb Rehabilitation Robot
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1