{"title":"ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and Mapping","authors":"M'aty'as Sz'ant'o, Gyorgy R. Bog'ar, L. Vajta","doi":"10.3311/PPee.20437","DOIUrl":null,"url":null,"abstract":"In this paper, a novel solution is introduced for visual Simultaneous Localization and Mapping (vSLAM) that is built up of Deep Learning components. The proposed architecture is a highly modular framework in which each component offers state of the art results in their respective fields of vision-based Deep Learning solutions. The paper shows that with the synergic integration of these individual building blocks, a functioning and efficient all-through deep neural (ATDN) vSLAM system can be created. The Embedding Distance Loss function is introduced and using it the ATDN architecture is trained. The resulting system managed to achieve 4.4% translation and 0.0176 deg/m rotational error on a subset of the KITTI dataset. The proposed architecture can be used for efficient and low-latency autonomous driving (AD) aiding database creation as well as a basis for autonomous vehicle (AV) control.","PeriodicalId":37664,"journal":{"name":"Periodica polytechnica Electrical engineering and computer science","volume":"83 1","pages":"236-247"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Periodica polytechnica Electrical engineering and computer science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3311/PPee.20437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, a novel solution is introduced for visual Simultaneous Localization and Mapping (vSLAM) that is built up of Deep Learning components. The proposed architecture is a highly modular framework in which each component offers state of the art results in their respective fields of vision-based Deep Learning solutions. The paper shows that with the synergic integration of these individual building blocks, a functioning and efficient all-through deep neural (ATDN) vSLAM system can be created. The Embedding Distance Loss function is introduced and using it the ATDN architecture is trained. The resulting system managed to achieve 4.4% translation and 0.0176 deg/m rotational error on a subset of the KITTI dataset. The proposed architecture can be used for efficient and low-latency autonomous driving (AD) aiding database creation as well as a basis for autonomous vehicle (AV) control.
期刊介绍:
The main scope of the journal is to publish original research articles in the wide field of electrical engineering and informatics fitting into one of the following five Sections of the Journal: (i) Communication systems, networks and technology, (ii) Computer science and information theory, (iii) Control, signal processing and signal analysis, medical applications, (iv) Components, Microelectronics and Material Sciences, (v) Power engineering and mechatronics, (vi) Mobile Software, Internet of Things and Wearable Devices, (vii) Solid-state lighting and (viii) Vehicular Technology (land, airborne, and maritime mobile services; automotive, radar systems; antennas and radio wave propagation).