Accurate and Scalable Contour-based Camera Pose Estimation Using Deep Learning with Synthetic Data

Ilyar Asl Sabbaghian Hokmabadi, M. Ai, Chrysostomos Minaretzis, M. Sideris, N. El-Sheimy
{"title":"Accurate and Scalable Contour-based Camera Pose Estimation Using Deep Learning with Synthetic Data","authors":"Ilyar Asl Sabbaghian Hokmabadi, M. Ai, Chrysostomos Minaretzis, M. Sideris, N. El-Sheimy","doi":"10.1109/PLANS53410.2023.10139984","DOIUrl":null,"url":null,"abstract":"Pose detection of objects is an important topic in object-level mapping and indoor localization. In the past, pose estimation methods were performed either with the help of artificial markers or natural features found on the object. However, due to the fact that the markers can only be utilized in controlled environment experiments, the application of marker-based approaches is very limited. Furthermore, methods that depend on the object's natural visual features require texture on the object and lack robustness to illumination and camera viewpoint variations. With the advent of Deep Learning (DL), the classical pose estimation methods have been outperformed. The DL-based pose estimation can detect deep features of the object and exhibits higher robustness to many distortions and variabilities caused by the changes in the illumination and viewpoint conditions. However, the massive training data set requirement is the main challenge with most DL-based methods. The training set is often a real set of images that have been manually labeled or annotated. In addition, such methods face problems related to the degradation of their predicted accuracy in the presence of uncertainties due to the symmetrical structure of many objects. To address the aforementioned issues, a novel and very fast method for generating synthetic data, as well as a contour-based technique for accurate pose estimation (that can handle pose ambiguities for a symmetrical object) are proposed in this paper. The tests that are conducted in multiple indoor scenarios demonstrate not only the effectiveness of the synthetic data generation but also exhibit, in many cases, the very high accuracy of the proposed pose estimation method.","PeriodicalId":344794,"journal":{"name":"2023 IEEE/ION Position, Location and Navigation Symposium (PLANS)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ION Position, Location and Navigation Symposium (PLANS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PLANS53410.2023.10139984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Pose detection of objects is an important topic in object-level mapping and indoor localization. In the past, pose estimation methods were performed either with the help of artificial markers or natural features found on the object. However, due to the fact that the markers can only be utilized in controlled environment experiments, the application of marker-based approaches is very limited. Furthermore, methods that depend on the object's natural visual features require texture on the object and lack robustness to illumination and camera viewpoint variations. With the advent of Deep Learning (DL), the classical pose estimation methods have been outperformed. The DL-based pose estimation can detect deep features of the object and exhibits higher robustness to many distortions and variabilities caused by the changes in the illumination and viewpoint conditions. However, the massive training data set requirement is the main challenge with most DL-based methods. The training set is often a real set of images that have been manually labeled or annotated. In addition, such methods face problems related to the degradation of their predicted accuracy in the presence of uncertainties due to the symmetrical structure of many objects. To address the aforementioned issues, a novel and very fast method for generating synthetic data, as well as a contour-based technique for accurate pose estimation (that can handle pose ambiguities for a symmetrical object) are proposed in this paper. The tests that are conducted in multiple indoor scenarios demonstrate not only the effectiveness of the synthetic data generation but also exhibit, in many cases, the very high accuracy of the proposed pose estimation method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用合成数据的深度学习进行精确和可扩展的基于轮廓的相机姿态估计
物体的姿态检测是物体级映射和室内定位中的一个重要课题。过去,姿态估计方法要么借助于人工标记,要么借助于物体上的自然特征。然而,由于标记只能用于受控环境实验,基于标记的方法的应用非常有限。此外,依赖于物体的自然视觉特征的方法需要物体的纹理,并且缺乏对照明和相机视点变化的鲁棒性。随着深度学习(DL)的出现,经典的姿态估计方法已经被超越。基于dl的姿态估计可以检测到目标的深层特征,并且对光照和视点条件变化引起的许多畸变和可变性具有更高的鲁棒性。然而,海量的训练数据集需求是大多数基于dl的方法面临的主要挑战。训练集通常是手动标记或注释的真实图像集。此外,由于许多物体的对称结构存在不确定性,这种方法面临着预测精度下降的问题。为了解决上述问题,本文提出了一种新的、非常快速的合成数据生成方法,以及一种基于轮廓的精确姿态估计技术(可以处理对称物体的姿态模糊)。在多个室内场景中进行的测试不仅证明了合成数据生成的有效性,而且在许多情况下显示了所提出的姿态估计方法的非常高的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Simulations using LEO-PNT systems: A Brief Survey Signal Mode Transition Detection in Starlink LEO Satellite Downlink Signals Terrain-Relative Navigation with Neuro-Inspired Elevation Encoding Random Finite Set Approach to Signal Strength Based Passive Localization and Tracking Prediction of Ground Wave Propagation Delays in Terrestrial Radio Navigation Systems Based on Soil Texture Maps
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1