Large-Scale Aerial Image Categorization by Multi-Task Discriminative Topologies Discovery

WISMM '14 Pub Date : 2014-11-07 DOI:10.1145/2661714.2661718
Yingjie Xia, Luming Zhang, Suhua Tang
{"title":"Large-Scale Aerial Image Categorization by Multi-Task Discriminative Topologies Discovery","authors":"Yingjie Xia, Luming Zhang, Suhua Tang","doi":"10.1145/2661714.2661718","DOIUrl":null,"url":null,"abstract":"Fast and accurately categorizing the millions of aerial images on Google Maps is a useful technique in multimedia applications. Existing methods cannot handle this task effectively due to two reasons. 1) It is challenging to build a realtime image categorization system, as some geo-aware Apps update over 20 aerial images per second. 2) The aerial images' topologies are the key to distinguish their categories, but they cannot be encoded by the generic visual descriptors. To solve these two problems, we propose an efficient aerial image categorization system, aiming at mining discriminative topologies of aerial images under a multi-task learning framework. Particularly, we first construct a region adjacency graph (RAG) that describes the topology of each aerial image. Thereby, aerial image categorization can be formulated as RAG-to-RAG matching. Based on graph theory, RAG-to-RAG matching is conducted by comparing all their respective graphlets (i.e., small subgraphs). Because the number of graphlets is huge, a multi-task feature selection algorithm is derived to discover topologies jointly discriminative to multiple categories. The discovered topologies are used to extract the discriminative graphlets. Finally, these graphlets are integrated into an AdaBoost model for predicting aerial image categories. Experiments show that our approach is competitive several existing recognition models. Further, over 24 aerial images are categorized per second, reflecting that our system is ready for real-world applications.","PeriodicalId":365687,"journal":{"name":"WISMM '14","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"WISMM '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2661714.2661718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Fast and accurately categorizing the millions of aerial images on Google Maps is a useful technique in multimedia applications. Existing methods cannot handle this task effectively due to two reasons. 1) It is challenging to build a realtime image categorization system, as some geo-aware Apps update over 20 aerial images per second. 2) The aerial images' topologies are the key to distinguish their categories, but they cannot be encoded by the generic visual descriptors. To solve these two problems, we propose an efficient aerial image categorization system, aiming at mining discriminative topologies of aerial images under a multi-task learning framework. Particularly, we first construct a region adjacency graph (RAG) that describes the topology of each aerial image. Thereby, aerial image categorization can be formulated as RAG-to-RAG matching. Based on graph theory, RAG-to-RAG matching is conducted by comparing all their respective graphlets (i.e., small subgraphs). Because the number of graphlets is huge, a multi-task feature selection algorithm is derived to discover topologies jointly discriminative to multiple categories. The discovered topologies are used to extract the discriminative graphlets. Finally, these graphlets are integrated into an AdaBoost model for predicting aerial image categories. Experiments show that our approach is competitive several existing recognition models. Further, over 24 aerial images are categorized per second, reflecting that our system is ready for real-world applications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多任务判别拓扑发现的大规模航空图像分类
快速准确地对谷歌地图上数以百万计的航拍图像进行分类是多媒体应用中的一项有用技术。由于两个原因,现有的方法不能有效地处理这个任务。1)构建实时图像分类系统具有挑战性,因为一些地理感知应用程序每秒更新超过20张航空图像。2)航拍图像的拓扑结构是区分航拍图像类别的关键,但航拍图像的拓扑结构不能被通用的视觉描述符编码。为了解决这两个问题,我们提出了一种高效的航空图像分类系统,旨在在多任务学习框架下挖掘航空图像的判别拓扑。特别地,我们首先构造一个区域邻接图(RAG)来描述每个航空图像的拓扑结构。因此,航空图像分类可以表述为RAG-to-RAG匹配。基于图论,通过比较它们各自的所有图元(即小子图)来进行RAG-to-RAG匹配。针对石墨烯数量庞大的特点,提出了一种多任务特征选择算法,用于发现对多个类别联合判别的拓扑。将发现的拓扑用于提取鉴别石墨烯。最后,将这些小块集成到AdaBoost模型中,用于预测航空图像类别。实验表明,我们的方法与现有的几种识别模型相比具有一定的竞争力。此外,每秒可以对超过24张航拍图像进行分类,这表明我们的系统已经为现实世界的应用做好了准备。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Large-Scale Aerial Image Categorization by Multi-Task Discriminative Topologies Discovery Empirical Observation of User Activities: Check-ins, Venue Photos and Tips in Foursquare Pushing Image Recognition in the Real World: Towards Recognizing Millions of Entities Towards Storytelling by Extracting Social Information from OSN Photo's Metadata Monitoring of User Generated Video Broadcasting Services
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1