Geo-Tile2Vec: A Multi-Modal and Multi-Stage Embedding Framework for Urban Analytics

IF 1.6 Q4 REMOTE SENSING ACM Transactions on Spatial Algorithms and Systems Pub Date : 2022-11-18 DOI:10.1145/3571741

Yan-Min Luo, C. Leong, Shuhai Jiao, F. Chung, Wenjie Li, Guoping Liu

{"title":"Geo-Tile2Vec: A Multi-Modal and Multi-Stage Embedding Framework for Urban Analytics","authors":"Yan-Min Luo, C. Leong, Shuhai Jiao, F. Chung, Wenjie Li, Guoping Liu","doi":"10.1145/3571741","DOIUrl":null,"url":null,"abstract":"Cities are very complex systems. Representing urban regions are essential for exploring, understanding, and predicting properties and features of cities. The enrichment of multi-modal urban big data has provided opportunities for researchers to enhance urban region embedding. However, existing works failed to develop an integrated pipeline that fully utilizes effective and informative data sources within geographic units. In this article, we regard a geo-tile as a geographic unit and propose a multi-modal and multi-stage representation learning framework, namely Geo-Tile2Vec, for urban analytics, especially for urban region properties identification. Specifically, in the early stage, geo-tile embeddings are firstly inferred through dynamic mobility events which are combinations of point-of-interest (POI) data and trajectory data by a Word2Vec-like model and metric learning. Then, in the latter stage, we use static street-level imagery to further enrich the embedding information by metric learning. Lastly, the framework learns distributed geo-tile embeddings for the given multi-modal data. We conduct experiments on real-world urban datasets. Four downstream tasks, i.e., main POI category classification task, main land use category classification task, restaurant average price regression task, and firm number regression task, are adopted for validating the effectiveness of the proposed framework in representing geo-tiles. Our proposed framework can significantly improve the performances of all downstream tasks. In addition, we also demonstrate that geo-tiles with similar urban region properties are geometrically closer in the vector space.","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":"9 1","pages":"1 - 25"},"PeriodicalIF":1.6000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Spatial Algorithms and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571741","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 1

Abstract

Cities are very complex systems. Representing urban regions are essential for exploring, understanding, and predicting properties and features of cities. The enrichment of multi-modal urban big data has provided opportunities for researchers to enhance urban region embedding. However, existing works failed to develop an integrated pipeline that fully utilizes effective and informative data sources within geographic units. In this article, we regard a geo-tile as a geographic unit and propose a multi-modal and multi-stage representation learning framework, namely Geo-Tile2Vec, for urban analytics, especially for urban region properties identification. Specifically, in the early stage, geo-tile embeddings are firstly inferred through dynamic mobility events which are combinations of point-of-interest (POI) data and trajectory data by a Word2Vec-like model and metric learning. Then, in the latter stage, we use static street-level imagery to further enrich the embedding information by metric learning. Lastly, the framework learns distributed geo-tile embeddings for the given multi-modal data. We conduct experiments on real-world urban datasets. Four downstream tasks, i.e., main POI category classification task, main land use category classification task, restaurant average price regression task, and firm number regression task, are adopted for validating the effectiveness of the proposed framework in representing geo-tiles. Our proposed framework can significantly improve the performances of all downstream tasks. In addition, we also demonstrate that geo-tiles with similar urban region properties are geometrically closer in the vector space.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

geotile2vec:城市分析的多模态和多阶段嵌入框架

城市是非常复杂的系统。代表城市区域对于探索、理解和预测城市的特性和特征至关重要。多模态城市大数据的丰富为研究人员增强城市区域嵌入提供了机会。然而，现有的工作未能开发出一个充分利用地理单元内有效和信息丰富的数据源的综合管道。在本文中，我们将地理瓦片视为一个地理单元，并提出了一个多模式、多阶段的表示学习框架，即geo-Tile2Vec，用于城市分析，特别是城市区域属性识别。具体而言，在早期阶段，首先通过动态移动事件来推断地理瓦片嵌入，动态移动事件是兴趣点（POI）数据和轨迹数据的组合，通过类似Word2Vec的模型和度量学习。然后，在后一阶段，我们使用静态街道级图像，通过度量学习进一步丰富嵌入信息。最后，该框架学习给定多模态数据的分布式地理瓦片嵌入。我们在真实世界的城市数据集上进行实验。采用四个下游任务，即主要POI类别分类任务、主要土地利用类别分类任务，餐厅平均价格回归任务和企业数量回归任务，验证了所提出的框架在表示地理瓦片方面的有效性。我们提出的框架可以显著提高所有下游任务的性能。此外，我们还证明了具有相似城市区域属性的地理瓦片在向量空间中在几何上更接近。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Spatial Algorithms and Systems REMOTE SENSING-

CiteScore

4.40

自引率

5.30%

发文量

期刊介绍： ACM Transactions on Spatial Algorithms and Systems (TSAS) is a scholarly journal that publishes the highest quality papers on all aspects of spatial algorithms and systems and closely related disciplines. It has a multi-disciplinary perspective in that it spans a large number of areas where spatial data is manipulated or visualized (regardless of how it is specified - i.e., geometrically or textually) such as geography, geographic information systems (GIS), geospatial and spatiotemporal databases, spatial and metric indexing, location-based services, web-based spatial applications, geographic information retrieval (GIR), spatial reasoning and mining, security and privacy, as well as the related visual computing areas of computer graphics, computer vision, geometric modeling, and visualization where the spatial, geospatial, and spatiotemporal data is central.