{"title":"IGFNet: Illumination-Guided Fusion Network for Semantic Scene Understanding using RGB-Thermal Images","authors":"Haotian Li, Yuxiang Sun","doi":"10.1109/ROBIO58561.2023.10354613","DOIUrl":null,"url":null,"abstract":"Semantic scene understanding is a fundamental task for autonomous driving. It serves as a build block for many downstream tasks. Under challenging illumination conditions, thermal images can provide complementary information for RGB images. Many multi-modal fusion networks have been proposed using RGB-Thermal data for semantic scene understanding. However, current state-of-the-art methods simply use networks to fuse features on multi-modality inexplicably, rather than designing a fusion method based on the intrinsic characteristics of RGB images and thermal images. To address this issue, we propose IGFNet, an illumination-guided fusion network for RGB-Thermal semantic scene understanding, which utilizes a weight mask generated by the illumination estimation module to weight the RGB and thermal feature maps at different stages. Experimental results show that our network outperforms the state-of-the-art methods on the MFNet dataset. Our code is available at: https://github.com/lab-sun/IGFNet.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"81 6","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO58561.2023.10354613","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Semantic scene understanding is a fundamental task for autonomous driving. It serves as a build block for many downstream tasks. Under challenging illumination conditions, thermal images can provide complementary information for RGB images. Many multi-modal fusion networks have been proposed using RGB-Thermal data for semantic scene understanding. However, current state-of-the-art methods simply use networks to fuse features on multi-modality inexplicably, rather than designing a fusion method based on the intrinsic characteristics of RGB images and thermal images. To address this issue, we propose IGFNet, an illumination-guided fusion network for RGB-Thermal semantic scene understanding, which utilizes a weight mask generated by the illumination estimation module to weight the RGB and thermal feature maps at different stages. Experimental results show that our network outperforms the state-of-the-art methods on the MFNet dataset. Our code is available at: https://github.com/lab-sun/IGFNet.