Multi-Plane Projection for Extending Perspective Image Object Detection Models to 360° Images

Yasuto Nagase, Y. Babazaki, Katsuhiko Takahashi
{"title":"Multi-Plane Projection for Extending Perspective Image Object Detection Models to 360° Images","authors":"Yasuto Nagase, Y. Babazaki, Katsuhiko Takahashi","doi":"10.23919/MVA57639.2023.10215689","DOIUrl":null,"url":null,"abstract":"Since 360° cameras are still in their diffusion phase, there are no large annotated datasets or models trained on them as there are for perspective cameras. Creating new 360°-specific datasets and training recognition models for each domain and tasks have a significant barrier for many users aiming at practical applications. Therefore, we propose a novel technique to effectively adapt the existing models to 360° images. The 360° images are projected to multiple planes and adapted to the existing model, and the detected results are unified in a spherical coordinate system. In experiments, we evaluated our method on an object detection task and compared it to baselines, which showed an improvement in recognition accuracy of up to 6.7%.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 18th International Conference on Machine Vision and Applications (MVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MVA57639.2023.10215689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Since 360° cameras are still in their diffusion phase, there are no large annotated datasets or models trained on them as there are for perspective cameras. Creating new 360°-specific datasets and training recognition models for each domain and tasks have a significant barrier for many users aiming at practical applications. Therefore, we propose a novel technique to effectively adapt the existing models to 360° images. The 360° images are projected to multiple planes and adapted to the existing model, and the detected results are unified in a spherical coordinate system. In experiments, we evaluated our method on an object detection task and compared it to baselines, which showed an improvement in recognition accuracy of up to 6.7%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多平面投影扩展透视图像对象检测模型到360°图像
由于360°相机仍处于扩散阶段,因此没有像透视相机那样对它们进行大型注释数据集或模型训练。为每个领域和任务创建新的360°特定数据集和训练识别模型对于许多针对实际应用的用户来说是一个很大的障碍。因此,我们提出了一种新的技术,可以有效地使现有模型适应360°图像。将360°图像投影到多个平面并适应现有模型,将检测结果统一到球坐标系中。在实验中,我们对目标检测任务进行了评估,并将其与基线进行了比较,结果表明该方法的识别准确率提高了6.7%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Small Object Detection for Birds with Swin Transformer CG-based dataset generation and adversarial image conversion for deep cucumber recognition Uncertainty Criteria in Active Transfer Learning for Efficient Video-Specific Human Pose Estimation Joint Learning with Group Relation and Individual Action Diabetic Retinopathy Grading based on a Sparse Network Fusion of Heterogeneous ConvNeXt Models with Category Attention
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1