{"title":"一种高效且稳健的协同单目视觉Slam框架","authors":"Dipanjan Das, Soumyadip Maity, B. Dhara","doi":"10.1109/AVSS52988.2021.9663736","DOIUrl":null,"url":null,"abstract":"Visual SLAM (VSLAM) has shown remarkable performance in robot navigation and its practical applicability can be enriched by building a multi-robot collaboration framework called Visual collaborative SLAM (CoSLAM). CoSLAM extends the usage of SLAM for navigating in larger areas for certain applications like inspection etc. using multiple vehicles which not only saves time but also power. Visual CoSLAM framework suffers from problems like i) Robot can start from anywhere in the scene using their own VSLAM which save both time and power ii) making the framework independent of the choice of SLAM for greater applicability of different SLAMs, iii) avoiding collision with other robots by a robust merging of two noisy maps, when the visual overlap is detected. Very few works are available in the literature which addresses the above problems in a single framework in a practical sense. In this paper, we present a framework for CoSLAM using monocular cameras addressing all the above problems. Unlike existing systems which work only on ORB SLAM, our framework is truly independent of SLAMs. We propose a deep learning based algorithm to find out the visually overlapped scene required for merging two or more 3D maps. Our Map Merging is robust in presence of outliers as we compute similarity transforms using both structural information as well as camera-camera relationships and choose one based on a statistical inference. Experimental results show that our framework is robust and works well for any individual SLAM where we demonstrate our result on ORB and EdgeSLAM which are prototypical extremes methods for map merging in a CoSLAM framework.","PeriodicalId":246327,"journal":{"name":"2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Efficient And Robust Framework For Collaborative Monocular Visual Slam\",\"authors\":\"Dipanjan Das, Soumyadip Maity, B. Dhara\",\"doi\":\"10.1109/AVSS52988.2021.9663736\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Visual SLAM (VSLAM) has shown remarkable performance in robot navigation and its practical applicability can be enriched by building a multi-robot collaboration framework called Visual collaborative SLAM (CoSLAM). CoSLAM extends the usage of SLAM for navigating in larger areas for certain applications like inspection etc. using multiple vehicles which not only saves time but also power. Visual CoSLAM framework suffers from problems like i) Robot can start from anywhere in the scene using their own VSLAM which save both time and power ii) making the framework independent of the choice of SLAM for greater applicability of different SLAMs, iii) avoiding collision with other robots by a robust merging of two noisy maps, when the visual overlap is detected. Very few works are available in the literature which addresses the above problems in a single framework in a practical sense. In this paper, we present a framework for CoSLAM using monocular cameras addressing all the above problems. Unlike existing systems which work only on ORB SLAM, our framework is truly independent of SLAMs. We propose a deep learning based algorithm to find out the visually overlapped scene required for merging two or more 3D maps. Our Map Merging is robust in presence of outliers as we compute similarity transforms using both structural information as well as camera-camera relationships and choose one based on a statistical inference. Experimental results show that our framework is robust and works well for any individual SLAM where we demonstrate our result on ORB and EdgeSLAM which are prototypical extremes methods for map merging in a CoSLAM framework.\",\"PeriodicalId\":246327,\"journal\":{\"name\":\"2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AVSS52988.2021.9663736\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AVSS52988.2021.9663736","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Efficient And Robust Framework For Collaborative Monocular Visual Slam
Visual SLAM (VSLAM) has shown remarkable performance in robot navigation and its practical applicability can be enriched by building a multi-robot collaboration framework called Visual collaborative SLAM (CoSLAM). CoSLAM extends the usage of SLAM for navigating in larger areas for certain applications like inspection etc. using multiple vehicles which not only saves time but also power. Visual CoSLAM framework suffers from problems like i) Robot can start from anywhere in the scene using their own VSLAM which save both time and power ii) making the framework independent of the choice of SLAM for greater applicability of different SLAMs, iii) avoiding collision with other robots by a robust merging of two noisy maps, when the visual overlap is detected. Very few works are available in the literature which addresses the above problems in a single framework in a practical sense. In this paper, we present a framework for CoSLAM using monocular cameras addressing all the above problems. Unlike existing systems which work only on ORB SLAM, our framework is truly independent of SLAMs. We propose a deep learning based algorithm to find out the visually overlapped scene required for merging two or more 3D maps. Our Map Merging is robust in presence of outliers as we compute similarity transforms using both structural information as well as camera-camera relationships and choose one based on a statistical inference. Experimental results show that our framework is robust and works well for any individual SLAM where we demonstrate our result on ORB and EdgeSLAM which are prototypical extremes methods for map merging in a CoSLAM framework.