{"title":"有监督和无监督深度学习在视觉SLAM中的应用综述","authors":"U. Ukaegbu, L. Tartibu, Chee Wah Lim","doi":"10.1115/imece2022-95685","DOIUrl":null,"url":null,"abstract":"\n Visual Simultaneous Localization and Mapping (V-SLAM) is a trending robotics research concept as well as the basis for autonomous and smart navigation. It is an integral part of vision-based applications which include virtual reality, unmanned aerial vehicles, augmented reality, and unmanned ground vehicles. V-SLAM carries out localization and mapping by learning relevant feature points from images and estimating their pose based on the correlation between the camera and the feature points. It also represents the ability of a robot to effectively navigate itself, employing visual sensors and prior information of the given location, in an uncharted environment while updating and constructing a coordinated map of the scene. However, due to the challenges of data association triggered by illumination, different viewpoints and environment dynamics, there has been rapid adoption of deep learning in the area of feature extraction/description, pose/depth estimation, mapping, loop closure detection and global optimization as it concerns visual SLAM. This paper sets out to elucidate diverse applications of supervised and unsupervised deep learning methods in all aspects of visual SLAM. It also briefly explains a case study regarding the application of both deep learning and SLAM for underground mining applications. It highlights recent research developments in addition to limitations hindering their effective application and investigates how a combination of deep learning with other methods offers a promising direction for visual SLAM research.","PeriodicalId":146276,"journal":{"name":"Volume 3: Advanced Materials: Design, Processing, Characterization and Applications; Advances in Aerospace Technology","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supervised and Unsupervised Deep Learning Applications for Visual SLAM: A Review\",\"authors\":\"U. Ukaegbu, L. Tartibu, Chee Wah Lim\",\"doi\":\"10.1115/imece2022-95685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Visual Simultaneous Localization and Mapping (V-SLAM) is a trending robotics research concept as well as the basis for autonomous and smart navigation. It is an integral part of vision-based applications which include virtual reality, unmanned aerial vehicles, augmented reality, and unmanned ground vehicles. V-SLAM carries out localization and mapping by learning relevant feature points from images and estimating their pose based on the correlation between the camera and the feature points. It also represents the ability of a robot to effectively navigate itself, employing visual sensors and prior information of the given location, in an uncharted environment while updating and constructing a coordinated map of the scene. However, due to the challenges of data association triggered by illumination, different viewpoints and environment dynamics, there has been rapid adoption of deep learning in the area of feature extraction/description, pose/depth estimation, mapping, loop closure detection and global optimization as it concerns visual SLAM. This paper sets out to elucidate diverse applications of supervised and unsupervised deep learning methods in all aspects of visual SLAM. It also briefly explains a case study regarding the application of both deep learning and SLAM for underground mining applications. It highlights recent research developments in addition to limitations hindering their effective application and investigates how a combination of deep learning with other methods offers a promising direction for visual SLAM research.\",\"PeriodicalId\":146276,\"journal\":{\"name\":\"Volume 3: Advanced Materials: Design, Processing, Characterization and Applications; Advances in Aerospace Technology\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 3: Advanced Materials: Design, Processing, Characterization and Applications; Advances in Aerospace Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/imece2022-95685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 3: Advanced Materials: Design, Processing, Characterization and Applications; Advances in Aerospace Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/imece2022-95685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Supervised and Unsupervised Deep Learning Applications for Visual SLAM: A Review
Visual Simultaneous Localization and Mapping (V-SLAM) is a trending robotics research concept as well as the basis for autonomous and smart navigation. It is an integral part of vision-based applications which include virtual reality, unmanned aerial vehicles, augmented reality, and unmanned ground vehicles. V-SLAM carries out localization and mapping by learning relevant feature points from images and estimating their pose based on the correlation between the camera and the feature points. It also represents the ability of a robot to effectively navigate itself, employing visual sensors and prior information of the given location, in an uncharted environment while updating and constructing a coordinated map of the scene. However, due to the challenges of data association triggered by illumination, different viewpoints and environment dynamics, there has been rapid adoption of deep learning in the area of feature extraction/description, pose/depth estimation, mapping, loop closure detection and global optimization as it concerns visual SLAM. This paper sets out to elucidate diverse applications of supervised and unsupervised deep learning methods in all aspects of visual SLAM. It also briefly explains a case study regarding the application of both deep learning and SLAM for underground mining applications. It highlights recent research developments in addition to limitations hindering their effective application and investigates how a combination of deep learning with other methods offers a promising direction for visual SLAM research.