Feature Based Landmark Extraction for Real Time Visual SLAM

Natesh Srinivasan
2010 International Conference on Advances in Recent Technologies in Communication and Computing
Published 2010-10-16 · DOI: 10.1109/ARTCOM.2010.10
Recently, there has been a marked increase in the use of machine learning techniques for object detection, owing to their robustness to noise and background variation. SLAM (Simultaneous Localization and Mapping) is an approach in which a robot maps the environment it moves through by using landmarks, much as the human visual system does. A robust object detection system can be extended into the field of SLAM by using detected objects as powerful visual landmarks. While traditional approaches to SLAM, based on sensors such as sonar or laser rangefinders, provide good depth perception, they do not form effective landmarks: the output of these devices contains only range data mapped onto a 2D space, so a landmark must be geometrically prominent to show up as a pattern, and hence only such landmarks get extracted. Visual information, by contrast, is more than sufficient to form very good landmarks, but the required computation goes well beyond the capacity of present-day embedded processors. We use GPUs (Graphics Processing Units) to process the visual information, since they have been very successful at real-time rendering for graphics applications, which involve similar mathematics. Their large number of cores makes this a challenging programming problem, as exploiting the full bandwidth of these processors can be quite complex. Much work is under way to integrate these units into embedded devices, which would make visual SLAM feasible on such platforms.
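The abstract does not spell out the paper's detection pipeline, so as a hedged, illustrative stand-in for feature-based landmark extraction, the sketch below detects Harris corners (a classic interest-point detector often used to seed visual landmarks) in a synthetic image using only NumPy. The function name, window size, and thresholds are our own assumptions for the sketch, not taken from the paper.

```python
import numpy as np

def harris_corners(img, k=0.04, window=3, thresh=0.01):
    """Return (row, col) coordinates whose Harris response exceeds
    thresh * max_response. img is a 2D array; parameters are illustrative."""
    # Image gradients via central finite differences
    Iy, Ix = np.gradient(img.astype(float))
    Ixx, Iyy, Ixy = Ix * Ix, Iy * Iy, Ix * Iy

    # Sum the gradient products over a local window (box filter
    # implemented with a zero-padded summed-area table)
    def box(a):
        pad = window // 2
        ap = np.pad(a, pad, mode='edge')
        c = np.cumsum(np.cumsum(ap, axis=0), axis=1)
        c = np.pad(c, ((1, 0), (1, 0)))          # zero row/col for the SAT
        return (c[window:, window:] - c[:-window, window:]
                - c[window:, :-window] + c[:-window, :-window])

    Sxx, Syy, Sxy = box(Ixx), box(Iyy), box(Ixy)

    # Harris response: det(M) - k * trace(M)^2; large and positive
    # only where the gradient varies in two directions (a corner)
    R = (Sxx * Syy - Sxy ** 2) - k * (Sxx + Syy) ** 2
    ys, xs = np.where(R > thresh * R.max())
    return list(zip(ys, xs))

# Synthetic test image: a bright square on a dark background,
# whose four corners should be the only detections
img = np.zeros((32, 32))
img[8:24, 8:24] = 1.0
corners = harris_corners(img)
```

A real pipeline would run a detector like this (or a learned object detector, as the paper proposes) per frame and feed the surviving responses into the SLAM back end as landmark observations; edges and flat regions score at or below zero under the Harris response, which is why range-style data with weak texture makes for poor landmarks.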