首页 > 最新文献

Geoinformatica最新文献

英文 中文
A spatial dependency based reinforcement learning model for selecting features in spatial classification 基于空间依赖性的强化学习模型,用于在空间分类中选择特征
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-05-17 DOI: 10.1007/s10707-024-00523-x
Cheng Wei, Wenhao Yu
{"title":"A spatial dependency based reinforcement learning model for selecting features in spatial classification","authors":"Cheng Wei, Wenhao Yu","doi":"10.1007/s10707-024-00523-x","DOIUrl":"https://doi.org/10.1007/s10707-024-00523-x","url":null,"abstract":"","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140964309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How opportunistic mobile monitoring can enhance air quality assessment? 机会性移动监测如何加强空气质量评估?
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-04-29 DOI: 10.1007/s10707-024-00516-w
Mohammad Abboud, Yehia Taher, Karine Zeitouni, Ana-Maria Olteanu-Raimond

The deteriorating air quality in urban areas, particularly in developing countries, has led to increased attention being paid to the issue. Daily reports of air pollution are essential to effectively manage public health risks. Pollution estimation has become crucial to expanding spatial and temporal coverage and estimating pollution levels at different locations. The emergence of low-cost sensors has enabled high-resolution data collection, either in fixed or mobile settings, and various approaches have been proposed to estimate air pollution using this technology. The objective of this study is to enhance the data from fixed stations by incorporating opportunistic mobile monitoring (OMM) data. The main research question we are dealing with is: How can we augment fixed station data through OMM? In order to address the challenge of limited OMM data availability, we leverage existing data collected during periods when the pollution maps align with those observed by the fixed stations. By combining the fixed and mobile data, we apply interpolation techniques to produce more accurate pollution maps. The efficacy of our approach is validated through experiments conducted on a real-life dataset.

城市地区,尤其是发展中国家的城市地区空气质量不断恶化,导致人们越来越关注这一问题。空气污染的日常报告对于有效管理公共健康风险至关重要。污染估算对于扩大时空覆盖范围和估算不同地点的污染水平至关重要。低成本传感器的出现使得在固定或移动环境中收集高分辨率数据成为可能。本研究的目的是通过纳入机会性移动监测(OMM)数据来增强来自固定站点的数据。我们要解决的主要研究问题是:如何增强固定监测站的数据?如何通过 OMM 增强固定监测站的数据?为了应对 OMM 数据可用性有限的挑战,我们利用了在污染地图与固定站点观测到的污染地图一致期间收集到的现有数据。通过结合固定数据和移动数据,我们采用插值技术绘制出更精确的污染地图。我们在现实数据集上进行的实验验证了我们方法的有效性。
{"title":"How opportunistic mobile monitoring can enhance air quality assessment?","authors":"Mohammad Abboud, Yehia Taher, Karine Zeitouni, Ana-Maria Olteanu-Raimond","doi":"10.1007/s10707-024-00516-w","DOIUrl":"https://doi.org/10.1007/s10707-024-00516-w","url":null,"abstract":"<p>The deteriorating air quality in urban areas, particularly in developing countries, has led to increased attention being paid to the issue. Daily reports of air pollution are essential to effectively manage public health risks. Pollution estimation has become crucial to expanding spatial and temporal coverage and estimating pollution levels at different locations. The emergence of low-cost sensors has enabled high-resolution data collection, either in fixed or mobile settings, and various approaches have been proposed to estimate air pollution using this technology. The objective of this study is to enhance the data from fixed stations by incorporating opportunistic mobile monitoring (OMM) data. The main research question we are dealing with is: How can we augment fixed station data through OMM? In order to address the challenge of limited OMM data availability, we leverage existing data collected during periods when the pollution maps align with those observed by the fixed stations. By combining the fixed and mobile data, we apply interpolation techniques to produce more accurate pollution maps. The efficacy of our approach is validated through experiments conducted on a real-life dataset.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140840735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Foresight plus: serverless spatio-temporal traffic forecasting 前瞻加:无服务器时空流量预测
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-04-26 DOI: 10.1007/s10707-024-00517-9
Joe Oakley, Chris Conlan, Gunduz Vehbi Demirci, Alexandros Sfyridis, Hakan Ferhatosmanoglu

Building a real-time spatio-temporal forecasting system is a challenging problem with many practical applications such as traffic and road network management. Most forecasting research focuses on achieving (often marginal) improvements in evaluation metrics such as MAE/MAPE on static benchmark datasets, with less attention paid to building practical pipelines which achieve timely and accurate forecasts when the network is under heavy load. Transport authorities also need to leverage dynamic data sources such as roadworks and vehicle-level flow data, while also supporting ad-hoc inference workloads at low cost. Our cloud-based forecasting solution Foresight, developed in collaboration with Transport for the West Midlands (TfWM), is able to ingest, aggregate and process streamed traffic data, enhanced with dynamic vehicle-level flow and urban event information, to produce regularly scheduled forecasts with high accuracy. In this work, we extend Foresight with several novel enhancements, into a new system which we term Foresight Plus. New features include an efficient method for extending the forecasting scale, enabling predictions further into the future. We also augment the inference architecture with a new, fully serverless design which offers a more cost-effective solution and which seamlessly handles sporadic inference workloads over multiple forecasting scales. We observe that Graph Neural Network (GNN) forecasting models are robust to extensions of the forecasting scale, achieving consistent performance up to 48 hours ahead. This is in contrast to the 1 hour forecasting periods popularly considered in this context. Further, our serverless inference solution is shown to be more cost-effective than provisioned alternatives in corresponding use-cases. We identify the optimal memory configuration of serverless resources to achieve an attractive cost-to-performance ratio.

建立实时时空预测系统是一个具有挑战性的问题,在交通和道路网络管理等许多实际应用中都存在这个问题。大多数预测研究都侧重于在静态基准数据集上实现 MAE/MAPE 等评估指标的改进(通常是微不足道的改进),而较少关注如何建立实用的管道,以便在网络处于高负荷状态时实现及时、准确的预测。交通管理部门还需要利用道路工程和车辆流量数据等动态数据源,同时以低成本支持临时推理工作负载。我们与西米德兰兹交通局(TfWM)合作开发的基于云计算的预测解决方案 Foresight 能够摄取、聚合和处理流式交通数据,并通过动态车辆级流量和城市事件信息进行增强,从而生成高精度的定期预测。在这项工作中,我们对 Foresight 进行了一些新的改进,将其扩展为一个新系统,我们称之为 Foresight Plus。新功能包括扩展预测范围的有效方法,从而能够预测更远的未来。我们还通过一种全新的、完全无服务器的设计增强了推理架构,这种设计提供了一种更具成本效益的解决方案,可在多个预测规模上无缝处理零星推理工作量。我们观察到,图形神经网络(GNN)预测模型对预测规模的扩展非常稳健,可在提前 48 小时内实现一致的性能。这与在此背景下普遍考虑的 1 小时预测期形成了鲜明对比。此外,在相应的使用案例中,我们的无服务器推理解决方案比配置替代方案更具成本效益。我们确定了无服务器资源的最佳内存配置,以实现极具吸引力的性价比。
{"title":"Foresight plus: serverless spatio-temporal traffic forecasting","authors":"Joe Oakley, Chris Conlan, Gunduz Vehbi Demirci, Alexandros Sfyridis, Hakan Ferhatosmanoglu","doi":"10.1007/s10707-024-00517-9","DOIUrl":"https://doi.org/10.1007/s10707-024-00517-9","url":null,"abstract":"<p>Building a real-time spatio-temporal forecasting system is a challenging problem with many practical applications such as traffic and road network management. Most forecasting research focuses on achieving (often marginal) improvements in evaluation metrics such as MAE/MAPE on static benchmark datasets, with less attention paid to building practical pipelines which achieve timely and accurate forecasts when the network is under heavy load. Transport authorities also need to leverage dynamic data sources such as roadworks and vehicle-level flow data, while also supporting ad-hoc inference workloads at low cost. Our cloud-based forecasting solution Foresight, developed in collaboration with Transport for the West Midlands (TfWM), is able to ingest, aggregate and process streamed traffic data, enhanced with dynamic vehicle-level flow and urban event information, to produce regularly scheduled forecasts with high accuracy. In this work, we extend Foresight with several novel enhancements, into a new system which we term <b>Foresight Plus</b>. New features include an efficient method for extending the forecasting scale, enabling predictions further into the future. We also augment the inference architecture with a new, fully serverless design which offers a more cost-effective solution and which seamlessly handles sporadic inference workloads over multiple forecasting scales. We observe that Graph Neural Network (GNN) forecasting models are robust to extensions of the forecasting scale, achieving consistent performance up to 48 hours ahead. This is in contrast to the 1 hour forecasting periods popularly considered in this context. Further, our serverless inference solution is shown to be more cost-effective than provisioned alternatives in corresponding use-cases. We identify the optimal memory configuration of serverless resources to achieve an attractive cost-to-performance ratio.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140799495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer-learning-based representation learning for trajectory similarity search 基于迁移学习的轨迹相似性搜索表征学习
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-04-13 DOI: 10.1007/s10707-024-00515-x
Danling Lai, Jianfeng Qu, Yu Sang, Xi Chen

Trajectory similarity search is one of the most fundamental tasks in spatial-temporal data analysis. Classical methods are based on predefined trajectory similarity measures, consuming high time and space costs. To accelerate similarity computation, some deep metric learning methods have recently been proposed to approximate predefined measures based on the learned representation of trajectories. However, instead of predefined measures, real applications may require personalized measures, which cannot be effectively learned by existing models due to insufficient labels. Thus, this paper proposes a transfer-learning-based model FTL-Traj, which addresses this problem by effectively transferring knowledge from several existing measures as source measures. Particularly, a ProbSparse self-attention-based GRU unit is designed to extract the spatial and structural information of each trajectory. Confronted with diverse source measures, the priority modeling assists the model for the rational ensemble. Then, sparse labels are enriched with rank knowledge and collaboration knowledge via transfer learning. Extensive experiments on two real-world datasets demonstrate the superiority of our model.

轨迹相似性搜索是时空数据分析中最基本的任务之一。传统方法基于预定义的轨迹相似性度量,耗费大量时间和空间。为了加速相似性计算,最近有人提出了一些深度度量学习方法,根据学习到的轨迹表示来近似预定义度量。然而,实际应用中可能需要个性化的度量,而不是预定义的度量,由于标签不足,现有模型无法有效地学习这些度量。因此,本文提出了一种基于迁移学习的模型 FTL-Traj,该模型通过有效迁移多个现有测量指标的知识作为源测量指标来解决这一问题。特别是设计了一个基于 ProbSparse 自注意的 GRU 单元,以提取每个轨迹的空间和结构信息。面对多样化的源测量,优先建模有助于建立合理的集合模型。然后,通过迁移学习用等级知识和协作知识丰富稀疏标签。在两个真实世界数据集上的广泛实验证明了我们模型的优越性。
{"title":"Transfer-learning-based representation learning for trajectory similarity search","authors":"Danling Lai, Jianfeng Qu, Yu Sang, Xi Chen","doi":"10.1007/s10707-024-00515-x","DOIUrl":"https://doi.org/10.1007/s10707-024-00515-x","url":null,"abstract":"<p>Trajectory similarity search is one of the most fundamental tasks in spatial-temporal data analysis. Classical methods are based on predefined trajectory similarity measures, consuming high time and space costs. To accelerate similarity computation, some deep metric learning methods have recently been proposed to approximate predefined measures based on the learned representation of trajectories. However, instead of predefined measures, real applications may require personalized measures, which cannot be effectively learned by existing models due to insufficient labels. Thus, this paper proposes a transfer-learning-based model <span>FTL-Traj</span>, which addresses this problem by effectively transferring knowledge from several existing measures as source measures. Particularly, a ProbSparse self-attention-based GRU unit is designed to extract the spatial and structural information of each trajectory. Confronted with diverse source measures, the priority modeling assists the model for the rational ensemble. Then, sparse labels are enriched with rank knowledge and collaboration knowledge via transfer learning. Extensive experiments on two real-world datasets demonstrate the superiority of our model.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140565932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey on the computation of representative trajectories 代表性轨迹计算调查
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-04-02 DOI: 10.1007/s10707-024-00514-y

Abstract

The process of computing a representative trajectory for a set of raw (or even semantically enriched) trajectories is an attractive solution to minimize several challenges related to trajectory management, like trajectory data integration or trajectory pattern analysis. We identify two main strategies for accomplishing such a process (trajectory data summarization and trajectory data fusion), but we argue that this subject is still an open issue, and we did not find a survey with such a focus. In order to fill this literature gap, this paper presents a survey that analyzes several issues around the two aforementioned strategies, like the type of representative data computed by each approach, the dimensions that are considered by the approach (spatial, temporal, and semantics), the accomplished methods of the proposed processes, and how the process is evaluated. Additionally, we compare these two research areas (trajectory summarization and trajectory fusion) in literature to analyze their relationship. Finally, some open issues related to this subject are also pointed out.

摘要 为一组原始轨迹(甚至是语义丰富的轨迹)计算代表性轨迹的过程是一个极具吸引力的解决方案,可最大限度地减少与轨迹管理相关的若干挑战,如轨迹数据整合或轨迹模式分析。我们确定了实现这一过程的两种主要策略(轨迹数据汇总和轨迹数据融合),但我们认为这一主题仍是一个未决问题,而且我们也没有找到以这一主题为重点的调查报告。为了填补这一文献空白,本文提出了一项调查,分析了围绕上述两种策略的几个问题,如每种方法计算的代表性数据的类型、方法考虑的维度(空间、时间和语义)、建议流程的完成方法以及如何评估流程。此外,我们还比较了文献中的这两个研究领域(轨迹总结和轨迹融合),分析它们之间的关系。最后,我们还指出了与本课题相关的一些有待解决的问题。
{"title":"A survey on the computation of representative trajectories","authors":"","doi":"10.1007/s10707-024-00514-y","DOIUrl":"https://doi.org/10.1007/s10707-024-00514-y","url":null,"abstract":"<h3>Abstract</h3> <p>The process of computing a representative trajectory for a set of raw (or even semantically enriched) trajectories is an attractive solution to minimize several challenges related to trajectory management, like trajectory data integration or trajectory pattern analysis. We identify two main strategies for accomplishing such a process (trajectory data summarization and trajectory data fusion), but we argue that this subject is still an open issue, and we did not find a survey with such a focus. In order to fill this literature gap, this paper presents a survey that analyzes several issues around the two aforementioned strategies, like the type of representative data computed by each approach, the dimensions that are considered by the approach (spatial, temporal, and semantics), the accomplished methods of the proposed processes, and how the process is evaluated. Additionally, we compare these two research areas (trajectory summarization and trajectory fusion) in literature to analyze their relationship. Finally, some open issues related to this subject are also pointed out.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140565934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SkyEye: continuous processing of moving spatial-keyword queries over moving objects 天眼:连续处理移动物体上的移动空间关键词查询
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-03-20 DOI: 10.1007/s10707-024-00512-0
Mariam Orabi, Zaher Al Aghbari, Ibrahim Kamel

With the spread of GPS-equipped portable devices, Location-Based Services (LBSs) flourished. Some crucial LBSs require real-time processing of moving spatial-keyword queries over moving objects, such as an ambulance seeking for volunteers. The research community proposed solutions for scenarios assuming that either the queries or the queried objects are moving, but solutions are needed assuming that both are moving. This work proposes SkyEye; a model that efficiently processes moving continuous top-k spatial-keyword queries over moving objects in a directed streets network. SkyEye computes queries’ answer sets for time intervals and smartly updates the answer sets based on the recent history. Novel optimization techniques and indexing structures are leveraged to improve SkyEye’s efficiency and scalability. The mathematical foundations of these optimization techniques are thoroughly demonstrated. Finally, extensive experiments showed that SkyEye has significant performance improvements in terms of efficiency, scalability, and accuracy compared to a baseline model.

随着装有 GPS 的便携式设备的普及,基于位置的服务(LBS)蓬勃发展。一些重要的定位服务需要实时处理移动物体上的移动空间关键词查询,例如救护车寻找志愿者。研究界针对假设查询或被查询对象都在移动的场景提出了解决方案,但还需要假设两者都在移动的解决方案。本研究提出了 SkyEye 模型,该模型可高效处理在有向街道网络中对移动物体进行的连续顶 k 空间关键词查询。SkyEye 计算查询在时间间隔内的答案集,并根据最近的历史智能更新答案集。新颖的优化技术和索引结构可提高 SkyEye 的效率和可扩展性。对这些优化技术的数学基础进行了全面论证。最后,大量实验表明,与基线模型相比,SkyEye 在效率、可扩展性和准确性方面都有显著的性能提升。
{"title":"SkyEye: continuous processing of moving spatial-keyword queries over moving objects","authors":"Mariam Orabi, Zaher Al Aghbari, Ibrahim Kamel","doi":"10.1007/s10707-024-00512-0","DOIUrl":"https://doi.org/10.1007/s10707-024-00512-0","url":null,"abstract":"<p>With the spread of GPS-equipped portable devices, Location-Based Services (LBSs) flourished. Some crucial LBSs require real-time processing of moving spatial-keyword queries over moving objects, such as an ambulance seeking for volunteers. The research community proposed solutions for scenarios assuming that either the queries or the queried objects are moving, but solutions are needed assuming that both are moving. This work proposes SkyEye; a model that efficiently processes moving continuous top-k spatial-keyword queries over moving objects in a directed streets network. SkyEye computes queries’ answer sets for time intervals and smartly updates the answer sets based on the recent history. Novel optimization techniques and indexing structures are leveraged to improve SkyEye’s efficiency and scalability. The mathematical foundations of these optimization techniques are thoroughly demonstrated. Finally, extensive experiments showed that SkyEye has significant performance improvements in terms of efficiency, scalability, and accuracy compared to a baseline model.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140173157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-domain NER in the data-poor scenarios for human mobility knowledge 在数据匮乏的情况下进行跨域 NER,以获取人类流动知识
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-03-05 DOI: 10.1007/s10707-024-00513-z
Yutong Jiang, Fusheng Jin, Mengnan Chen, Guoming Liu, He Pang, Ye Yuan

In recent years, the exploration of knowledge in large-scale human mobility has gained significant attention. In order to achieve a semantic understanding of human behavior and uncover patterns in large-scale human mobility, Named Entity Recognition (NER) is a crucial technology. The rapid advancements in IoT and CPS technologies have led to the collection of massive human mobility data from various sources. Therefore, there’s a need for Cross-domain NER which can transfer entity information from the source domain to automatically identify and classify entities in different target domain texts. In the situation of the data-poor, how could we transfer human mobility knowledge over time and space is particularly significant, therefore this paper proposes an Adaptive Text Sequence Enhancement Module (at-SAM) to help the model enhance the association between entities in sentences in the data-poor target domains. This paper also proposes a Predicted Label-Guided Dual Sequence Aware Information Module (Dual-SAM) to improve the transferability of label information. Experiments were conducted in domains that contain hidden knowledge about human mobility, the results show that this method can transfer task knowledge between multiple different domains in the data-poor scenarios and achieve SOTA performance.

近年来,对大规模人员流动中知识的探索受到了广泛关注。为了实现对人类行为的语义理解并揭示大规模人员流动的模式,命名实体识别(NER)是一项至关重要的技术。物联网和 CPS 技术的飞速发展导致从各种来源收集到大量的人类移动数据。因此,需要跨域 NER,它可以从源域传输实体信息,自动识别和分类不同目标域文本中的实体。在数据匮乏的情况下,如何在时间和空间上转移人类移动知识显得尤为重要,因此本文提出了自适应文本序列增强模块(at-SAM),以帮助模型增强数据匮乏的目标域中句子中实体之间的关联。本文还提出了预测标签引导的双序列感知信息模块(Dual-SAM),以提高标签信息的可转移性。实验在包含有关人类移动性的隐藏知识的领域中进行,结果表明该方法可以在数据贫乏的场景下在多个不同领域之间转移任务知识,并实现 SOTA 性能。
{"title":"Cross-domain NER in the data-poor scenarios for human mobility knowledge","authors":"Yutong Jiang, Fusheng Jin, Mengnan Chen, Guoming Liu, He Pang, Ye Yuan","doi":"10.1007/s10707-024-00513-z","DOIUrl":"https://doi.org/10.1007/s10707-024-00513-z","url":null,"abstract":"<p>In recent years, the exploration of knowledge in large-scale human mobility has gained significant attention. In order to achieve a semantic understanding of human behavior and uncover patterns in large-scale human mobility, Named Entity Recognition (NER) is a crucial technology. The rapid advancements in IoT and CPS technologies have led to the collection of massive human mobility data from various sources. Therefore, there’s a need for Cross-domain NER which can transfer entity information from the source domain to automatically identify and classify entities in different target domain texts. In the situation of the data-poor, how could we transfer human mobility knowledge over time and space is particularly significant, therefore this paper proposes an Adaptive Text Sequence Enhancement Module (at-SAM) to help the model enhance the association between entities in sentences in the data-poor target domains. This paper also proposes a Predicted Label-Guided Dual Sequence Aware Information Module (Dual-SAM) to improve the transferability of label information. Experiments were conducted in domains that contain hidden knowledge about human mobility, the results show that this method can transfer task knowledge between multiple different domains in the data-poor scenarios and achieve SOTA performance.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140032631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DGFormer: a physics-guided station level weather forecasting model with dynamic spatial-temporal graph neural network DGFormer:采用动态时空图神经网络的物理学指导台站级天气预报模型
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2024-02-16 DOI: 10.1007/s10707-024-00511-1
Zhewen Xu, Xiaohui Wei, Jieyun Hao, Junze Han, Hongliang Li, Changzheng Liu, Zijian Li, Dongyuan Tian, Nong Zhang

In recent years, there has been an increased interest in understanding and predicting the weather using weather station data with Spatial-Temporal Graph Neural Networks (STGNN). However, it has large prediction errors as a result of the inherent non-linearities and the influence of dynamic spatio-temporal auto-correlation. Using a continuously-varying graph topology chronologically, while embedding domain knowledge to enforce validity, can effectively resolve the issue, but the implementation of such concept constitutes an interdisciplinary challenge for researchers. A Dynamic Graph Former (DGFormer) model is proposed to address this challenge. It combines a topology learner through a deep generative layer with domain knowledge enhancement inserted into the STGNN structure, where the derived physics-guided method allows for an efficient integration with the earth system. For capture of the optimal topology, we merge a node-embedding-based similarity metric learning and the superposition principle as physical assistants into the dynamic graph module. We evaluate our model with a real-world weather dataset on short-term (12 hours) and medium-range (360 hours) prediction tasks. DGFormer achieves outstanding performance with obvious improvements by up to 34.84% at short-term prediction and by up to 23.25% at medium-range prediction compared with the state-of-the-art methods. We also conducted detailed analyses for cities in three regions and visualized the dynamic graph, revealing the characteristics, advantages, and graph visualization of our model.

近年来,人们越来越关注利用空间-时间图神经网络(STGNN)来理解和预测气象站数据。然而,由于其固有的非线性和动态时空自相关性的影响,其预测误差较大。按时间顺序使用连续变化的图拓扑,同时嵌入领域知识以加强有效性,可以有效地解决这一问题,但这一概念的实现对研究人员来说是一个跨学科的挑战。为应对这一挑战,我们提出了一种动态图形成器(DGFormer)模型。它通过深度生成层将拓扑学习器与插入 STGNN 结构的领域知识增强相结合,其中衍生的物理引导方法允许与地球系统高效集成。为了捕捉最佳拓扑结构,我们将基于节点嵌入的相似度量学习和叠加原理作为物理辅助工具融入动态图模块。我们利用真实世界的天气数据集对我们的模型进行了短期(12 小时)和中期(360 小时)预测任务的评估。与最先进的方法相比,DGFormer 在短期预测和中程预测方面分别取得了 34.84% 和 23.25% 的明显改善,表现出色。我们还对三个地区的城市进行了详细分析,并将动态图可视化,从而揭示了我们模型的特点、优势和图形可视化。
{"title":"DGFormer: a physics-guided station level weather forecasting model with dynamic spatial-temporal graph neural network","authors":"Zhewen Xu, Xiaohui Wei, Jieyun Hao, Junze Han, Hongliang Li, Changzheng Liu, Zijian Li, Dongyuan Tian, Nong Zhang","doi":"10.1007/s10707-024-00511-1","DOIUrl":"https://doi.org/10.1007/s10707-024-00511-1","url":null,"abstract":"<p>In recent years, there has been an increased interest in understanding and predicting the weather using weather station data with Spatial-Temporal Graph Neural Networks (STGNN). However, it has large prediction errors as a result of the inherent non-linearities and the influence of dynamic spatio-temporal auto-correlation. Using a continuously-varying graph topology chronologically, while embedding domain knowledge to enforce validity, can effectively resolve the issue, but the implementation of such concept constitutes an interdisciplinary challenge for researchers. A Dynamic Graph Former (DGFormer) model is proposed to address this challenge. It combines a topology learner through a deep generative layer with domain knowledge enhancement inserted into the STGNN structure, where the derived physics-guided method allows for an efficient integration with the earth system. For capture of the optimal topology, we merge a node-embedding-based similarity metric learning and the superposition principle as physical assistants into the dynamic graph module. We evaluate our model with a real-world weather dataset on short-term (12 hours) and medium-range (360 hours) prediction tasks. DGFormer achieves outstanding performance with obvious improvements by up to 34.84% at short-term prediction and by up to 23.25% at medium-range prediction compared with the state-of-the-art methods. We also conducted detailed analyses for cities in three regions and visualized the dynamic graph, revealing the characteristics, advantages, and graph visualization of our model.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139753342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient spatial queries over complex polygons with hybrid representations 使用混合表示法对复杂多边形进行高效空间查询
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-12-27 DOI: 10.1007/s10707-023-00508-2
Dejun Teng, Furqan Baig, Zhaohui Peng, Jun Kong, Fusheng Wang

One major goal of spatial query processing is to mitigate I/O costs and minimize the search space. However, geometric computation can be heavy-duty for spatial queries, in particular for complex geometries such as polygons with many edges based on a vector-based representation. Many past techniques have been provided for spatial partitioning and indexing, which are mainly built on minimal bounding boxes or other approximation methods and are not optimized for reducing geometric computation. In this paper, we propose a novel vector-raster hybrid approach through rasterization, where rich pixel-centric information is preserved to help not only filter out more candidates but also reduce geometry computation load. Based on the hybrid model, we implement four typical spatial queries, which can be generalized for other types of spatial queries. We also propose cost models to estimate the latency for those query types. Our experiments demonstrate that the hybrid model can boost the performance of spatial queries on complex polygons by up to one order of magnitude.

空间查询处理的一个主要目标是降低 I/O 成本和最小化搜索空间。然而,对于空间查询来说,几何计算可能是一项繁重的工作,尤其是对于复杂的几何图形,如基于向量表示的有许多边的多边形。过去已经提供了许多空间分区和索引技术,这些技术主要建立在最小边界框或其他近似方法上,并没有针对减少几何计算进行优化。在本文中,我们提出了一种新颖的矢量-栅格混合方法,通过栅格化,保留丰富的以像素为中心的信息,不仅有助于筛选出更多候选对象,还能减少几何计算负荷。在混合模型的基础上,我们实现了四种典型的空间查询,并可推广到其他类型的空间查询。我们还提出了成本模型来估算这些查询类型的延迟。实验证明,混合模型可以将复杂多边形的空间查询性能提高一个数量级。
{"title":"Efficient spatial queries over complex polygons with hybrid representations","authors":"Dejun Teng, Furqan Baig, Zhaohui Peng, Jun Kong, Fusheng Wang","doi":"10.1007/s10707-023-00508-2","DOIUrl":"https://doi.org/10.1007/s10707-023-00508-2","url":null,"abstract":"<p>One major goal of spatial query processing is to mitigate I/O costs and minimize the search space. However, geometric computation can be heavy-duty for spatial queries, in particular for complex geometries such as polygons with many edges based on a vector-based representation. Many past techniques have been provided for spatial partitioning and indexing, which are mainly built on minimal bounding boxes or other approximation methods and are not optimized for reducing geometric computation. In this paper, we propose a novel vector-raster hybrid approach through rasterization, where rich pixel-centric information is preserved to help not only filter out more candidates but also reduce geometry computation load. Based on the hybrid model, we implement four typical spatial queries, which can be generalized for other types of spatial queries. We also propose cost models to estimate the latency for those query types. Our experiments demonstrate that the hybrid model can boost the performance of spatial queries on complex polygons by up to one order of magnitude.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139053259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meta-learning based passenger flow prediction for newly-operated stations 基于元学习的新站客流预测
IF 2 4区 计算机科学 Q1 Social Sciences Pub Date : 2023-11-29 DOI: 10.1007/s10707-023-00510-8
Kuo Han, Jinlei Zhang, Xiaopeng Tian, Songsong Li, Chunqi Zhu

By tapping into the human mobility of the urban rail transit (URT) network to understand the travel demands and characteristics of passengers in the urban space, URT managers are able to obtain more support for decision-making to improve the effectiveness of operation and management, the travel experience of passengers, as well as public safety. However, not all URT networks have sufficient human mobility data (e.g., newly-operated URT networks). It is necessary to provide data support for mining human mobility in data-poor URT networks. Therefore, we propose a method called Meta Long Short-Term Memory Network (Meta-LSTM) for passenger flow prediction at URT stations to provide data support for networks that lack data. The Meta-LSTM is to construct a framework that increases the generalization ability of a long short-term memory network (LSTM) to various passenger flow characteristics by learning passenger flow characteristics from multiple data-rich stations and then applying the learned parameter to data-scarce stations by parameter initialization. The Meta-LSTM is applied to the URT network of Nanning, Hangzhou, and Beijing, China. The experiments on three real-world URT networks demonstrate the effectiveness of our proposed Meta-LSTM over several competitive baseline models. Results also show that our proposed Meta-LSTM has a good generalization ability to various passenger flow characteristics, which can provide a reference for passenger flow prediction in the stations with limited data.

利用城市轨道交通网络的人的移动性,了解城市空间中乘客的出行需求和特征,可以为轨道交通管理者获得更多的决策支持,从而提高运营管理的有效性,改善乘客的出行体验,提高公共安全。然而,并不是所有的轨道交通网络都有足够的人员移动数据(例如,新运营的轨道交通网络)。在数据贫乏的轨道交通网络中,为挖掘人的移动性提供数据支持是十分必要的。因此,我们提出了一种称为元长短期记忆网络(Meta- lstm)的轨道交通车站客流预测方法,为缺乏数据的网络提供数据支持。Meta-LSTM是通过学习多个数据丰富的站点的客流特征,然后通过参数初始化将学习到的参数应用于数据稀缺的站点,从而构建一个框架,提高LSTM对各种客流特征的泛化能力。Meta-LSTM应用于中国南宁、杭州和北京的轨道交通网络。在三个真实的URT网络上的实验证明了我们提出的Meta-LSTM在几个竞争基线模型上的有效性。结果还表明,本文提出的Meta-LSTM对各种客流特征具有良好的泛化能力,可为数据有限的车站客流预测提供参考。
{"title":"Meta-learning based passenger flow prediction for newly-operated stations","authors":"Kuo Han, Jinlei Zhang, Xiaopeng Tian, Songsong Li, Chunqi Zhu","doi":"10.1007/s10707-023-00510-8","DOIUrl":"https://doi.org/10.1007/s10707-023-00510-8","url":null,"abstract":"<p>By tapping into the human mobility of the urban rail transit (URT) network to understand the travel demands and characteristics of passengers in the urban space, URT managers are able to obtain more support for decision-making to improve the effectiveness of operation and management, the travel experience of passengers, as well as public safety. However, not all URT networks have sufficient human mobility data (e.g., newly-operated URT networks). It is necessary to provide data support for mining human mobility in data-poor URT networks. Therefore, we propose a method called Meta Long Short-Term Memory Network (Meta-LSTM) for passenger flow prediction at URT stations to provide data support for networks that lack data. The Meta-LSTM is to construct a framework that increases the generalization ability of a long short-term memory network (LSTM) to various passenger flow characteristics by learning passenger flow characteristics from multiple data-rich stations and then applying the learned parameter to data-scarce stations by parameter initialization. The Meta-LSTM is applied to the URT network of Nanning, Hangzhou, and Beijing, China. The experiments on three real-world URT networks demonstrate the effectiveness of our proposed Meta-LSTM over several competitive baseline models. Results also show that our proposed Meta-LSTM has a good generalization ability to various passenger flow characteristics, which can provide a reference for passenger flow prediction in the stations with limited data.</p>","PeriodicalId":55109,"journal":{"name":"Geoinformatica","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138516628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Geoinformatica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1