Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks

K. Cunha, Lucas Maggi, V. Teichrieb, J. P. Lima, J. Quintino, F. Q. Silva, André L. M. Santos, Helder Pinho
{"title":"Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks","authors":"K. Cunha, Lucas Maggi, V. Teichrieb, J. P. Lima, J. Quintino, F. Q. Silva, André L. M. Santos, Helder Pinho","doi":"10.1109/SIBGRAPI.2018.00023","DOIUrl":null,"url":null,"abstract":"In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.","PeriodicalId":208985,"journal":{"name":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIBGRAPI.2018.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Patch PlaNet:使用卷积神经网络进行斑块分类的地标识别
在这项工作中,我们解决了地标识别的问题。我们扩展了PlaNet,这是一个基于深度神经网络的模型,它将地标识别问题作为分类问题来处理,并对世界各地的地点进行识别。我们提出了PlaNet技术的扩展,其中我们使用投票方案来执行分类,将图像划分为先前定义的区域,并根据这些区域推断地标。该模型的预测不仅依赖于深度卷积神经网络架构在训练过程中学习到的特征信息,而且还使用了图像中每个分类区域的局部信息。为了验证我们的建议,我们使用一个由Flickr图像构建的数据库对原始行星模型和我们的变体进行了训练,并在巴黎和牛津建筑数据集中评估了模型。可以注意到,图像分割和投票结构的加入使模型的准确率结果平均提高了5-11个百分点,减少了模型推理过程中发现的模糊程度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Graph Spectral Filtering for Network Simplification A Photon Tracing Approach to Solve Inverse Rendering Problems Asynchronous Stroboscopic Structured Lighting Image Processing Using Low-Cost Cameras Scene Conversion for Physically-Based Renderers Multicenter Imaging Studies: Automated Approach to Evaluating Data Variability and the Role of Outliers
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1