Data Fusion for Sparse Semantic Localization Based on Object Detection

Pub Date: 2024-04-20 DOI: 10.20965/jrm.2024.p0375

Irem Uygur, Renato Miyagusuku, Sarthak Pathak, Hajime Asama, Atsushi Yamashita

Citations: 0

Abstract

Semantic information is increasingly used in localization methods to introduce non-geometric distinctions in the environment. However, how to integrate this information efficiently remains an open question. We propose an approach that fuses data from different object classes by analyzing the posterior for each object class, improving the robustness and accuracy of self-localization. Our system uses the bearing angle to each object's center and the object's class name as sensor model input to localize the user on a 2D annotated map consisting of object class names and center coordinates. The sensor model input is obtained by running an object detector on equirectangular images from a camera with a 360° field of view. Because object detection performance varies with location and object class, different object classes generate different likelihoods. We account for this with weights generated by a Gaussian process model trained on our posterior analysis. Our approach provides a systematic way to fuse data from different object classes and use the result as the likelihood function of a Monte Carlo localization (MCL) algorithm.
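The pipeline in the abstract (bearing angles from an equirectangular image, a per-class likelihood, and a weighted fusion used as the MCL measurement update) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the Gaussian bearing error model, the nearest-bearing data association, the weighted log-likelihood sum, and the fixed `class_weights` dictionary are all assumptions made for illustration. In the paper the weights come from a Gaussian process model, which is replaced here by constants.

```python
import math


def bearing_from_column(u, image_width):
    """Map pixel column u of an equirectangular image to a bearing in [-pi, pi).

    Assumes (hypothetically) that the image centre column faces forward.
    """
    return (u / image_width) * 2.0 * math.pi - math.pi


def wrap_angle(a):
    """Wrap an angle to the interval [-pi, pi)."""
    return (a + math.pi) % (2.0 * math.pi) - math.pi


def class_log_likelihood(pose, bearings, landmarks, sigma):
    """Log-likelihood of the observed bearings of ONE object class at a pose.

    pose: (x, y, theta); landmarks: list of (x, y) centres from the 2D map.
    Each observation is matched to the closest expected bearing (assumed
    association rule) and scored with a Gaussian error model (assumed).
    """
    x, y, theta = pose
    expected = [wrap_angle(math.atan2(ly - y, lx - x) - theta)
                for lx, ly in landmarks]
    log_l = 0.0
    for z in bearings:
        best = min(abs(wrap_angle(z - e)) for e in expected)
        log_l += -0.5 * (best / sigma) ** 2
    return log_l


def fused_particle_weights(particles, detections, semantic_map,
                           class_weights, sigma=0.2):
    """Fuse per-class log-likelihoods into normalized particle weights.

    detections: {class_name: [bearing, ...]} from the object detector.
    semantic_map: {class_name: [(x, y), ...]} annotated 2D map.
    class_weights: {class_name: weight}; constants standing in for the
    Gaussian-process-generated weights described in the abstract.
    """
    log_w = []
    for pose in particles:
        total = 0.0
        for cls, bearings in detections.items():
            landmarks = semantic_map.get(cls)
            if not landmarks:
                continue  # class is not on the map, so it carries no information
            total += class_weights.get(cls, 1.0) * class_log_likelihood(
                pose, bearings, landmarks, sigma)
        log_w.append(total)
    peak = max(log_w)  # subtract the maximum for numerical stability
    w = [math.exp(v - peak) for v in log_w]
    s = sum(w)
    return [v / s for v in w]


if __name__ == "__main__":
    semantic_map = {"door": [(2.0, 0.0)], "chair": [(0.0, 2.0)]}
    detections = {"door": [0.0], "chair": [math.pi / 2]}
    particles = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (0.0, 0.0, math.pi)]
    weights = fused_particle_weights(
        particles, detections, semantic_map, {"door": 1.0, "chair": 0.5})
    print(weights)
```

In this toy setup the first particle coincides with the pose that produced the detections, so it receives the largest weight. A full MCL loop would follow this step with resampling and a motion update.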
