基于特征的实时视觉SLAM地标提取

Natesh Srinivasan
{"title":"基于特征的实时视觉SLAM地标提取","authors":"Natesh Srinivasan","doi":"10.1109/ARTCOM.2010.10","DOIUrl":null,"url":null,"abstract":"Recently, there has been a marked increase in using machine learning techniques for object detection because of its immunity to noise and variations in backgrounds. SLAM (Simultaneous Localization and Mapping) is an approach to mapping the environment in which the robot moves, by using landmarks, much like the human visual system. The application of a robust object detection system can be extended into the field of SLAM by using these as powerful visual landmarks. While the traditional approach to SLAM (based on sensors like the SONARs or LASERs) can provide a good perception of depth, they cannot form effective landmarks. The output of these devices contain the range data mapped on a 2D space. The landmark has to be significant to show up as a pattern and hence only significant landmarks get extracted. While the visual information may be more than enough to form very good landmarks, the required computational resource increases way beyond the realm of the present day embedded processors. We use GPUs (Graphic Processing Units) to process the visual information since they have been very successful in doing real-time rendering for graphics application which involve similar mathematics. The presence of a large number of cores makes this a challenging problem to solve as programming them can be quite complex to exploit the full bandwidth of these processors. Much work is going on to integrate these units into embedded devices which make it feasible to solve the problem of visual SLAM.","PeriodicalId":398854,"journal":{"name":"2010 International Conference on Advances in Recent Technologies in Communication and Computing","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Feature Based Landmark Extraction for Real Time Visual SLAM\",\"authors\":\"Natesh Srinivasan\",\"doi\":\"10.1109/ARTCOM.2010.10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, there has been a marked increase in using machine learning techniques for object detection because of its immunity to noise and variations in backgrounds. SLAM (Simultaneous Localization and Mapping) is an approach to mapping the environment in which the robot moves, by using landmarks, much like the human visual system. The application of a robust object detection system can be extended into the field of SLAM by using these as powerful visual landmarks. While the traditional approach to SLAM (based on sensors like the SONARs or LASERs) can provide a good perception of depth, they cannot form effective landmarks. The output of these devices contain the range data mapped on a 2D space. The landmark has to be significant to show up as a pattern and hence only significant landmarks get extracted. While the visual information may be more than enough to form very good landmarks, the required computational resource increases way beyond the realm of the present day embedded processors. We use GPUs (Graphic Processing Units) to process the visual information since they have been very successful in doing real-time rendering for graphics application which involve similar mathematics. The presence of a large number of cores makes this a challenging problem to solve as programming them can be quite complex to exploit the full bandwidth of these processors. Much work is going on to integrate these units into embedded devices which make it feasible to solve the problem of visual SLAM.\",\"PeriodicalId\":398854,\"journal\":{\"name\":\"2010 International Conference on Advances in Recent Technologies in Communication and Computing\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Advances in Recent Technologies in Communication and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ARTCOM.2010.10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Advances in Recent Technologies in Communication and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ARTCOM.2010.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

最近,由于机器学习技术对噪声和背景变化的免疫,使用机器学习技术进行对象检测的情况明显增加。SLAM(同步定位和映射)是一种通过使用地标来绘制机器人移动环境的方法,就像人类的视觉系统一样。利用这些强大的视觉标志,鲁棒目标检测系统的应用可以扩展到SLAM领域。虽然传统的SLAM方法(基于SONARs或激光器等传感器)可以提供良好的深度感知,但它们无法形成有效的地标。这些设备的输出包含映射在二维空间上的距离数据。路标必须是显著的才能显示为一个模式,因此只有显著的路标被提取出来。虽然视觉信息可能足以形成非常好的地标,但所需的计算资源的增加远远超出了当今嵌入式处理器的范围。我们使用gpu(图形处理单元)来处理视觉信息,因为它们在涉及类似数学的图形应用程序的实时渲染方面非常成功。大量核心的存在使得这成为一个具有挑战性的问题,因为编程它们可能非常复杂,以利用这些处理器的全部带宽。将这些单元集成到嵌入式设备中,使解决视觉SLAM问题成为可能,目前正在进行大量工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Feature Based Landmark Extraction for Real Time Visual SLAM
Recently, there has been a marked increase in using machine learning techniques for object detection because of its immunity to noise and variations in backgrounds. SLAM (Simultaneous Localization and Mapping) is an approach to mapping the environment in which the robot moves, by using landmarks, much like the human visual system. The application of a robust object detection system can be extended into the field of SLAM by using these as powerful visual landmarks. While the traditional approach to SLAM (based on sensors like the SONARs or LASERs) can provide a good perception of depth, they cannot form effective landmarks. The output of these devices contain the range data mapped on a 2D space. The landmark has to be significant to show up as a pattern and hence only significant landmarks get extracted. While the visual information may be more than enough to form very good landmarks, the required computational resource increases way beyond the realm of the present day embedded processors. We use GPUs (Graphic Processing Units) to process the visual information since they have been very successful in doing real-time rendering for graphics application which involve similar mathematics. The presence of a large number of cores makes this a challenging problem to solve as programming them can be quite complex to exploit the full bandwidth of these processors. Much work is going on to integrate these units into embedded devices which make it feasible to solve the problem of visual SLAM.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Image Compression Using PCA and Improved Technique with MLP Neural Network A Static Improvement of Predictive Control for Single Phase Voltage Fed Power Factor Correction Converters Design of Fractional Order Differentiators and Integrators Using Indirect Discretization Approach Performance Analysis of UMTS and WLAN Interworking with Multi-service Load Stock Market Prediction Using a Hybrid Neuro-fuzzy System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1