DeepMeshCity：用于城市网格预测的深度学习模型

IF 4 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS ACM Transactions on Knowledge Discovery from Data Pub Date : 2024-03-15 DOI:10.1145/3652859

Chi Zhang, Linhao Cai, Meng Chen, Xiucheng Li, Gao Cong

{"title":"DeepMeshCity：用于城市网格预测的深度学习模型","authors":"Chi Zhang, Linhao Cai, Meng Chen, Xiucheng Li, Gao Cong","doi":"10.1145/3652859","DOIUrl":null,"url":null,"abstract":"<p>Urban grid prediction can be applied to many classic spatial-temporal prediction tasks such as air quality prediction, crowd density prediction, and traffic flow prediction, which is of great importance to smart city building. In light of its practical values, many methods have been developed for it and have achieved promising results. Despite their successes, two main challenges remain open: a) how to well capture the global dependencies and b) how to effectively model the multi-scale spatial-temporal correlations? To address these two challenges, we propose a novel method—<sans-serif>DeepMeshCity</sans-serif>, with a carefully-designed Self-Attention Citywide Grid Learner (<sans-serif>SA-CGL</sans-serif>) block comprising of a self-attention unit and a Citywide Grid Learner (<sans-serif>CGL</sans-serif>) unit. The self-attention block aims to capture the global spatial dependencies, and the <sans-serif>CGL</sans-serif> unit is responsible for learning the spatial-temporal correlations. In particular, a multi-scale memory unit is proposed to traverse all stacked <sans-serif>SA-CGL</sans-serif> blocks along a zigzag path to capture the multi-scale spatial-temporal correlations. In addition, we propose to initialize the single-scale memory units and the multi-scale memory units by using the corresponding ones in the previous fragment stack, so as to speed up the model training. We evaluate the performance of our proposed model by comparing with several state-of-the-art methods on four real-world datasets for two urban grid prediction applications. The experimental results verify the superiority of DeepMeshCity over the existing ones. The code is available at https://github.com/ILoveStudying/DeepMeshCity.</p>","PeriodicalId":49249,"journal":{"name":"ACM Transactions on Knowledge Discovery from Data","volume":"19 1","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DeepMeshCity: A Deep Learning Model for Urban Grid Prediction\",\"authors\":\"Chi Zhang, Linhao Cai, Meng Chen, Xiucheng Li, Gao Cong\",\"doi\":\"10.1145/3652859\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Urban grid prediction can be applied to many classic spatial-temporal prediction tasks such as air quality prediction, crowd density prediction, and traffic flow prediction, which is of great importance to smart city building. In light of its practical values, many methods have been developed for it and have achieved promising results. Despite their successes, two main challenges remain open: a) how to well capture the global dependencies and b) how to effectively model the multi-scale spatial-temporal correlations? To address these two challenges, we propose a novel method—<sans-serif>DeepMeshCity</sans-serif>, with a carefully-designed Self-Attention Citywide Grid Learner (<sans-serif>SA-CGL</sans-serif>) block comprising of a self-attention unit and a Citywide Grid Learner (<sans-serif>CGL</sans-serif>) unit. The self-attention block aims to capture the global spatial dependencies, and the <sans-serif>CGL</sans-serif> unit is responsible for learning the spatial-temporal correlations. In particular, a multi-scale memory unit is proposed to traverse all stacked <sans-serif>SA-CGL</sans-serif> blocks along a zigzag path to capture the multi-scale spatial-temporal correlations. In addition, we propose to initialize the single-scale memory units and the multi-scale memory units by using the corresponding ones in the previous fragment stack, so as to speed up the model training. We evaluate the performance of our proposed model by comparing with several state-of-the-art methods on four real-world datasets for two urban grid prediction applications. The experimental results verify the superiority of DeepMeshCity over the existing ones. The code is available at https://github.com/ILoveStudying/DeepMeshCity.</p>\",\"PeriodicalId\":49249,\"journal\":{\"name\":\"ACM Transactions on Knowledge Discovery from Data\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Knowledge Discovery from Data\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3652859\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Knowledge Discovery from Data","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3652859","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

城市网格预测可应用于许多经典的时空预测任务，如空气质量预测、人群密度预测、交通流量预测等，对智慧城市建设具有重要意义。鉴于其实用价值，许多方法已被开发出来，并取得了可喜的成果。尽管这些方法取得了成功，但仍存在两大挑战：a) 如何很好地捕捉全局相关性；b) 如何有效地模拟多尺度时空相关性？为了解决这两个难题，我们提出了一种新方法--DeepMeshCity，其中包含一个精心设计的自我关注全城网格学习器（SA-CGL）模块，由一个自我关注单元和一个全城网格学习器（CGL）单元组成。自我注意单元旨在捕捉全局空间依赖关系，而全城网格学习单元则负责学习时空相关性。我们特别提出了一个多尺度记忆单元，用于沿着之字形路径遍历所有堆叠的 SA-CGL 块，以捕捉多尺度时空相关性。此外，我们还建议使用前一个片段堆栈中的相应记忆单元来初始化单尺度记忆单元和多尺度记忆单元，从而加快模型训练速度。我们在两个城市网格预测应用的四个实际数据集上，通过与几种最先进的方法进行比较，评估了我们提出的模型的性能。实验结果验证了 DeepMeshCity 优于现有方法。代码可在 https://github.com/ILoveStudying/DeepMeshCity 上获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DeepMeshCity: A Deep Learning Model for Urban Grid Prediction

Urban grid prediction can be applied to many classic spatial-temporal prediction tasks such as air quality prediction, crowd density prediction, and traffic flow prediction, which is of great importance to smart city building. In light of its practical values, many methods have been developed for it and have achieved promising results. Despite their successes, two main challenges remain open: a) how to well capture the global dependencies and b) how to effectively model the multi-scale spatial-temporal correlations? To address these two challenges, we propose a novel method—DeepMeshCity, with a carefully-designed Self-Attention Citywide Grid Learner (SA-CGL) block comprising of a self-attention unit and a Citywide Grid Learner (CGL) unit. The self-attention block aims to capture the global spatial dependencies, and the CGL unit is responsible for learning the spatial-temporal correlations. In particular, a multi-scale memory unit is proposed to traverse all stacked SA-CGL blocks along a zigzag path to capture the multi-scale spatial-temporal correlations. In addition, we propose to initialize the single-scale memory units and the multi-scale memory units by using the corresponding ones in the previous fragment stack, so as to speed up the model training. We evaluate the performance of our proposed model by comparing with several state-of-the-art methods on four real-world datasets for two urban grid prediction applications. The experimental results verify the superiority of DeepMeshCity over the existing ones. The code is available at https://github.com/ILoveStudying/DeepMeshCity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM Transactions on Knowledge Discovery from Data COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

6.70

自引率

5.60%

发文量

172

审稿时长

3 months

期刊介绍： TKDD welcomes papers on a full range of research in the knowledge discovery and analysis of diverse forms of data. Such subjects include, but are not limited to: scalable and effective algorithms for data mining and big data analysis, mining brain networks, mining data streams, mining multi-media data, mining high-dimensional data, mining text, Web, and semi-structured data, mining spatial and temporal data, data mining for community generation, social network analysis, and graph structured data, security and privacy issues in data mining, visual, interactive and online data mining, pre-processing and post-processing for data mining, robust and scalable statistical methods, data mining languages, foundations of data mining, KDD framework and process, and novel applications and infrastructures exploiting data mining technology including massively parallel processing and cloud computing platforms. TKDD encourages papers that explore the above subjects in the context of large distributed networks of computers, parallel or multiprocessing computers, or new data devices. TKDD also encourages papers that describe emerging data mining applications that cannot be satisfied by the current data mining technology.