Visual Landmarks Recognition of Urban Structures using Convolutional Neural Network

S. K. Raza, Tabarka Rajab, Syed Jowaid Ahmed, M. Khurram
{"title":"Visual Landmarks Recognition of Urban Structures using Convolutional Neural Network","authors":"S. K. Raza, Tabarka Rajab, Syed Jowaid Ahmed, M. Khurram","doi":"10.1109/iCoMET48670.2020.9073795","DOIUrl":null,"url":null,"abstract":"The landmark Recognition of any urban structure is a frustrating task to accomplish. This paper presents a deep learning based approach for recognition of the urban structures. Urban structures are basically the usual offices, houses, apartments etc. A generalized word for all these buildings can be urban structures. This classification problem was identified by our team while making our way into the visual positioning system, we saw that the visual landmark recognition of urban structures was an unexplored domain by using deep learning approaches. We also Identified that unlike the landmarks such as the Eiffel Tower, Statue Of Liberty etc, urban structures were difficult to learn by a neural network, the data requirements were quite distinct compared to the ones mentioned for all these famous landmarks around the globe. Most of all the landmark database was created, augmented, labelled and preprocessed in lab. The transfer learning based approaches were not that useful since the overall size of the model would always be so enormous and integration of a transfer learning based model would be meaningless for any deployed web app or even for a local mobile based app. By collecting and engineering our own data using a data collection app, we have proposed a neural architecture which can be configured to learn multiple number of urban landmarks while retaining its overall size under 60MBs. This was achieved by a Convolutional Neural Network based binary classification model trained on landmarks existing in our university. Later this was deployed on a web based application and further on it is being made into an app.","PeriodicalId":431051,"journal":{"name":"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iCoMET48670.2020.9073795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The landmark Recognition of any urban structure is a frustrating task to accomplish. This paper presents a deep learning based approach for recognition of the urban structures. Urban structures are basically the usual offices, houses, apartments etc. A generalized word for all these buildings can be urban structures. This classification problem was identified by our team while making our way into the visual positioning system, we saw that the visual landmark recognition of urban structures was an unexplored domain by using deep learning approaches. We also Identified that unlike the landmarks such as the Eiffel Tower, Statue Of Liberty etc, urban structures were difficult to learn by a neural network, the data requirements were quite distinct compared to the ones mentioned for all these famous landmarks around the globe. Most of all the landmark database was created, augmented, labelled and preprocessed in lab. The transfer learning based approaches were not that useful since the overall size of the model would always be so enormous and integration of a transfer learning based model would be meaningless for any deployed web app or even for a local mobile based app. By collecting and engineering our own data using a data collection app, we have proposed a neural architecture which can be configured to learn multiple number of urban landmarks while retaining its overall size under 60MBs. This was achieved by a Convolutional Neural Network based binary classification model trained on landmarks existing in our university. Later this was deployed on a web based application and further on it is being made into an app.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于卷积神经网络的城市建筑视觉地标识别
任何城市建筑的地标性识别都是一项令人沮丧的任务。本文提出了一种基于深度学习的城市结构识别方法。城市建筑基本上是通常的办公室、房子、公寓等。所有这些建筑的一个概括词可以是城市结构。我们的团队在进入视觉定位系统的过程中发现了这个分类问题,我们看到城市结构的视觉地标识别是一个使用深度学习方法尚未探索的领域。我们还发现,与埃菲尔铁塔、自由女神像等地标不同,城市结构很难通过神经网络学习,与全球所有这些著名地标的数据要求相比,数据要求非常不同。大多数的地标数据库都是在实验室中创建、扩增、标记和预处理的。转移为基础的学习方法并不有用,因为模型的总体规模将永远如此巨大和集成传输的学习基础模型将毫无意义的任何部署web应用程序,甚至基于本地移动应用。通过收集和工程使用数据收集自己的数据应用,我们提出了一个神经结构,学习可以配置多个数量的城市地标,同时保留其总体规模下60 mbs。这是通过一个基于卷积神经网络的二元分类模型来实现的,该模型训练了我们大学现有的地标。后来,它被部署在一个基于web的应用程序上,并进一步被制作成一个应用程序。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Detecting Faulty Sensors by Analyzing the Uncertain Data Using Probabilistic Database Construction of the Exact Solution of Ripa Model with Primitive Variable Approach A Review on Hybrid Energy Storage Systems in Microgrids Meta-model for Stress Testing on Blockchain Nodes Ethics of Artificial Intelligence: Research Challenges and Potential Solutions
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1