Sensor modality fusion with CNNs for UGV autonomous driving in indoor environments

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pub Date : 2017-12-13 DOI:10.1109/IROS.2017.8205958

Naman Patel, A. Choromańska, P. Krishnamurthy, F. Khorrami

{"title":"Sensor modality fusion with CNNs for UGV autonomous driving in indoor environments","authors":"Naman Patel, A. Choromańska, P. Krishnamurthy, F. Khorrami","doi":"10.1109/IROS.2017.8205958","DOIUrl":null,"url":null,"abstract":"We present a novel end-to-end learning framework to enable ground vehicles to autonomously navigate unknown environments by fusing raw pixels from cameras and depth measurements from a LiDAR. A deep neural network architecture is introduced to effectively perform modality fusion and reliably predict steering commands even in the presence of sensor failures. The proposed network is trained on our own dataset, from LiDAR and a camera mounted on a UGV taken in an indoor corridor environment. Comprehensive experimental evaluation to demonstrate the robustness of our network architecture is performed to show that the proposed deep learning neural network is able to autonomously navigate in the corridor environment. Furthermore, we demonstrate that the fusion of the camera and LiDAR modalities provides further benefits beyond robustness to sensor failures. Specifically, the multimodal fused system shows a potential to navigate around static and dynamic obstacles and to handle changes in environment geometry without being trained for these tasks.","PeriodicalId":6658,"journal":{"name":"2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"57 1","pages":"1531-1536"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"50","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.2017.8205958","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 50

Abstract

We present a novel end-to-end learning framework to enable ground vehicles to autonomously navigate unknown environments by fusing raw pixels from cameras and depth measurements from a LiDAR. A deep neural network architecture is introduced to effectively perform modality fusion and reliably predict steering commands even in the presence of sensor failures. The proposed network is trained on our own dataset, from LiDAR and a camera mounted on a UGV taken in an indoor corridor environment. Comprehensive experimental evaluation to demonstrate the robustness of our network architecture is performed to show that the proposed deep learning neural network is able to autonomously navigate in the corridor environment. Furthermore, we demonstrate that the fusion of the camera and LiDAR modalities provides further benefits beyond robustness to sensor failures. Specifically, the multimodal fused system shows a potential to navigate around static and dynamic obstacles and to handle changes in environment geometry without being trained for these tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

传感器模态与cnn融合用于UGV室内自动驾驶

我们提出了一种新颖的端到端学习框架，通过融合来自相机的原始像素和来自激光雷达的深度测量，使地面车辆能够自主导航未知环境。引入深度神经网络架构，有效地进行模态融合，并在传感器故障的情况下可靠地预测转向命令。所提出的网络是在我们自己的数据集上进行训练的，这些数据集来自激光雷达和安装在室内走廊环境中的UGV上的摄像头。综合实验评估证明了我们的网络架构的鲁棒性，表明所提出的深度学习神经网络能够在走廊环境中自主导航。此外，我们证明了相机和LiDAR模式的融合除了对传感器故障的鲁棒性之外，还提供了进一步的好处。具体来说，多模态融合系统显示出绕过静态和动态障碍物以及处理环境几何变化的潜力，而无需针对这些任务进行训练。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

自引率

0.00%

发文量

期刊最新文献

Direct visual SLAM fusing proprioception for a humanoid robot Upper limb motion intent recognition using tactile sensing Soft fluidic rotary actuator with improved actuation properties Underwater 3D structures as semantic landmarks in SONAR mapping Adaptive perception: Learning from sensory predictions to extract object shape with a biomimetic fingertip