Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks

2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) Pub Date : 2018-10-01 DOI:10.1109/SIBGRAPI.2018.00023

K. Cunha, Lucas Maggi, V. Teichrieb, J. P. Lima, J. Quintino, F. Q. Silva, André L. M. Santos, Helder Pinho

{"title":"Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks","authors":"K. Cunha, Lucas Maggi, V. Teichrieb, J. P. Lima, J. Quintino, F. Q. Silva, André L. M. Santos, Helder Pinho","doi":"10.1109/SIBGRAPI.2018.00023","DOIUrl":null,"url":null,"abstract":"In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.","PeriodicalId":208985,"journal":{"name":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIBGRAPI.2018.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Patch PlaNet:使用卷积神经网络进行斑块分类的地标识别

在这项工作中，我们解决了地标识别的问题。我们扩展了PlaNet，这是一个基于深度神经网络的模型，它将地标识别问题作为分类问题来处理，并对世界各地的地点进行识别。我们提出了PlaNet技术的扩展，其中我们使用投票方案来执行分类，将图像划分为先前定义的区域，并根据这些区域推断地标。该模型的预测不仅依赖于深度卷积神经网络架构在训练过程中学习到的特征信息，而且还使用了图像中每个分类区域的局部信息。为了验证我们的建议，我们使用一个由Flickr图像构建的数据库对原始行星模型和我们的变体进行了训练，并在巴黎和牛津建筑数据集中评估了模型。可以注意到，图像分割和投票结构的加入使模型的准确率结果平均提高了5-11个百分点，减少了模型推理过程中发现的模糊程度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

自引率

0.00%

发文量

期刊最新文献

Graph Spectral Filtering for Network Simplification A Photon Tracing Approach to Solve Inverse Rendering Problems Asynchronous Stroboscopic Structured Lighting Image Processing Using Low-Cost Cameras Scene Conversion for Physically-Based Renderers Multicenter Imaging Studies: Automated Approach to Evaluating Data Variability and the Role of Outliers