首页 > 最新文献

ACM Transactions on Spatial Algorithms and Systems最新文献

英文 中文
Regionalization-based Collaborative Filtering: Harnessing Geographical Information in Recommenders 基于区域化的协同过滤:在推荐器中利用地理信息
IF 1.9 Q1 Mathematics Pub Date : 2024-04-08 DOI: 10.1145/3656641
Rodrigo Alves
Regionalization, also known as spatially constrained clustering, is an unsupervised machine learning technique used to identify and define spatially contiguous regions. In this work, we introduce a methodology to regionalize recommendation systems (RSs) based on a collaborative filtering approach. Two main challenges arise when performing regionalization based on users’ preferences in RSs: (1) unstructured data, as interactions are often scarce and observed on a smaller scale; and (2) the difficulty of evaluation of the quality of the clustering results. To address these challenges, our methodology relies on inductive matrix completion (IMC), a fundamental approach to recover unknown entries of a rating matrix while utilizing region information to extract a region-based feature matrix. With this feature matrix, our method becomes adaptive and seamlessly integrates with various regionalization algorithms to create regionalization candidates. This enables us to derive more accurate recommendations that consider regionalized effects and discover interesting patterns in localized user behavior. We experimentally evaluate our model on synthetic datasets to demonstrate its efficacy in settings where our underlying assumptions are correct. Furthermore, we present a real-world case study illustrating the interpretable information the model can derive in terms of regionalized recommendation relevance.
区域化又称空间约束聚类,是一种用于识别和定义空间毗连区域的无监督机器学习技术。在这项工作中,我们介绍了一种基于协同过滤方法的推荐系统(RS)区域化方法。根据用户在 RS 中的偏好进行区域化时会遇到两个主要挑战:(1) 非结构化数据,因为交互通常很少,而且观察到的规模较小;(2) 难以评估聚类结果的质量。为了应对这些挑战,我们的方法依赖于归纳矩阵补全(IMC),这是一种恢复评级矩阵未知项的基本方法,同时利用区域信息提取基于区域的特征矩阵。有了这个特征矩阵,我们的方法就能自适应并与各种区域化算法无缝集成,从而创建区域化候选方案。这使我们能够在考虑区域化效应的基础上得出更准确的推荐,并发现本地化用户行为中的有趣模式。我们在合成数据集上对我们的模型进行了实验评估,以证明它在我们的基本假设正确的情况下的有效性。此外,我们还介绍了一个真实世界的案例研究,说明了该模型在区域化推荐相关性方面所能得出的可解释信息。
{"title":"Regionalization-based Collaborative Filtering: Harnessing Geographical Information in Recommenders","authors":"Rodrigo Alves","doi":"10.1145/3656641","DOIUrl":"https://doi.org/10.1145/3656641","url":null,"abstract":"Regionalization, also known as spatially constrained clustering, is an unsupervised machine learning technique used to identify and define spatially contiguous regions. In this work, we introduce a methodology to regionalize recommendation systems (RSs) based on a collaborative filtering approach. Two main challenges arise when performing regionalization based on users’ preferences in RSs: (1) unstructured data, as interactions are often scarce and observed on a smaller scale; and (2) the difficulty of evaluation of the quality of the clustering results. To address these challenges, our methodology relies on inductive matrix completion (IMC), a fundamental approach to recover unknown entries of a rating matrix while utilizing region information to extract a region-based feature matrix. With this feature matrix, our method becomes adaptive and seamlessly integrates with various regionalization algorithms to create regionalization candidates. This enables us to derive more accurate recommendations that consider regionalized effects and discover interesting patterns in localized user behavior. We experimentally evaluate our model on synthetic datasets to demonstrate its efficacy in settings where our underlying assumptions are correct. Furthermore, we present a real-world case study illustrating the interpretable information the model can derive in terms of regionalized recommendation relevance.","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140729425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Let’s Speak Trajectories: A Vision To Use NLP Models For Trajectory Analysis Tasks 让我们说说轨迹:将 NLP 模型用于轨迹分析任务的愿景
IF 1.9 Q1 Mathematics Pub Date : 2024-04-08 DOI: 10.1145/3656470
Mashaal Musleh, M. Mokbel
The availability of trajectory data combined with various real life practical applications have sparked the interest of the research community to design a plethora of algorithms for various trajectory analysis techniques. However, there is an apparent lack of full-fledged systems that provide the infrastructure support for trajectory analysis techniques, which hinders the applicability of most of the designed algorithms. Inspired by the tremendous success of the BERT deep learning model in solving various Natural Language Processing (NLP) tasks, our vision is to have a BERT-like system for trajectory analysis tasks. We envision that in a few years, we will have such system, where no one needs to worry again about each specific trajectory analysis operation. Whether it is trajectory imputation, similarity, clustering, or whatever, it would be one system that researchers, developers, and practitioners can deploy to get high accuracy for their trajectory operations. Our vision stands on a solid ground that trajectories in a space are highly analogous to statements in a language. We outline the challenges and the road to our vision. Exploratory results confirm the promise and possibility of our vision.
轨迹数据的可用性与各种现实生活中的实际应用相结合,激发了研究界为各种轨迹分析技术设计大量算法的兴趣。然而,为轨迹分析技术提供基础设施支持的成熟系统明显缺乏,这阻碍了大多数设计算法的适用性。受 BERT 深度学习模型在解决各种自然语言处理(NLP)任务方面取得巨大成功的启发,我们的愿景是为轨迹分析任务提供类似 BERT 的系统。我们的设想是,几年后,我们将拥有这样的系统,人们无需再为每个具体的轨迹分析操作操心。无论是轨迹估算、相似性、聚类还是其他,研究人员、开发人员和从业人员都可以部署这样一个系统,以获得高精度的轨迹操作。我们的愿景建立在一个坚实的基础之上,即空间中的轨迹与语言中的语句高度相似。我们概述了实现我们愿景的挑战和道路。探索结果证实了我们愿景的前景和可能性。
{"title":"Let’s Speak Trajectories: A Vision To Use NLP Models For Trajectory Analysis Tasks","authors":"Mashaal Musleh, M. Mokbel","doi":"10.1145/3656470","DOIUrl":"https://doi.org/10.1145/3656470","url":null,"abstract":"The availability of trajectory data combined with various real life practical applications have sparked the interest of the research community to design a plethora of algorithms for various trajectory analysis techniques. However, there is an apparent lack of full-fledged systems that provide the infrastructure support for trajectory analysis techniques, which hinders the applicability of most of the designed algorithms. Inspired by the tremendous success of the BERT deep learning model in solving various Natural Language Processing (NLP) tasks, our vision is to have a BERT-like system for trajectory analysis tasks. We envision that in a few years, we will have such system, where no one needs to worry again about each specific trajectory analysis operation. Whether it is trajectory imputation, similarity, clustering, or whatever, it would be one system that researchers, developers, and practitioners can deploy to get high accuracy for their trajectory operations. Our vision stands on a solid ground that trajectories in a space are highly analogous to statements in a language. We outline the challenges and the road to our vision. Exploratory results confirm the promise and possibility of our vision.","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140731682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper) 论 GeoAI 基础模型的机遇与挑战(远景规划论文)
IF 1.9 Q1 Mathematics Pub Date : 2024-03-20 DOI: 10.1145/3653070
Gengchen Mai, Weiming Huang, Jin Sun, Suhang Song, Deepak Mishra, Ninghao Liu, Song Gao, Tianming Liu, Gao Cong, Yingjie Hu, Chris Cundy, Ziyuan Li, Rui Zhu, Ni Lao
Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI. We first investigate the potential of many existing FMs by testing their performances on seven tasks across multiple geospatial domains including Geospatial Semantics, Health Geography, Urban Geography, and Remote Sensing. Our results indicate that on several geospatial tasks that only involve text modality such as toponym recognition, location description recognition, and US state-level/county-level dementia time series forecasting, the task-agnostic LLMs can outperform task-specific fully-supervised models in a zero-shot or few-shot learning setting. However, on other geospatial tasks, especially tasks that involve multiple data modalities (e.g., POI-based urban function classification, street view image-based urban noise intensity classification, and remote sensing image scene classification), existing foundation models still underperform task-specific models. Based on these observations, we propose that one of the major challenges of developing a foundation model for GeoAI is to address the multimodality nature of geospatial tasks. After discussing the distinct challenges of each geospatial data modality, we suggest the possibility of a multimodal foundation model which can reason over various types of geospatial data through geospatial alignments. We conclude this paper by discussing the unique risks and challenges to develop such a model for GeoAI.
大型预训练模型(也称为基础模型(FM))是在大规模数据上以与任务无关的方式进行训练的,可以通过微调、少量学习甚至零点学习来适应各种下游任务。尽管在语言和视觉任务中取得了成功,但我们尚未看到为地理空间人工智能(GeoAI)开发基础模型的尝试。在这项工作中,我们将探索为 GeoAI 开发多模态基础模型的前景和挑战。我们首先调查了许多现有基础模型的潜力,测试了它们在多个地理空间领域(包括地理空间语义学、健康地理学、城市地理学和遥感)的七项任务中的表现。我们的结果表明,在地名识别、位置描述识别和美国州级/县级痴呆症时间序列预测等只涉及文本模式的地理空间任务中,任务无关的 LLM 在零点学习或少点学习环境下的表现优于特定任务的完全监督模型。然而,在其他地理空间任务中,特别是涉及多种数据模式的任务(如基于 POI 的城市功能分类、基于街景图像的城市噪声强度分类和遥感图像场景分类),现有的基础模型仍然不如特定任务模型。基于这些观察结果,我们提出为 GeoAI 开发基础模型的主要挑战之一是解决地理空间任务的多模态特性。在讨论了每种地理空间数据模式的独特挑战之后,我们提出了多模式基础模型的可能性,该模型可以通过地理空间排列对各种类型的地理空间数据进行推理。最后,我们讨论了为 GeoAI 开发这种模型的独特风险和挑战。
{"title":"On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)","authors":"Gengchen Mai, Weiming Huang, Jin Sun, Suhang Song, Deepak Mishra, Ninghao Liu, Song Gao, Tianming Liu, Gao Cong, Yingjie Hu, Chris Cundy, Ziyuan Li, Rui Zhu, Ni Lao","doi":"10.1145/3653070","DOIUrl":"https://doi.org/10.1145/3653070","url":null,"abstract":"\u0000 Large pre-trained models, also known as\u0000 foundation models\u0000 (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI. We first investigate the potential of many existing FMs by testing their performances on seven tasks across multiple geospatial domains including Geospatial Semantics, Health Geography, Urban Geography, and Remote Sensing. Our results indicate that on several geospatial tasks that only involve text modality such as toponym recognition, location description recognition, and US state-level/county-level dementia time series forecasting, the task-agnostic LLMs can outperform task-specific fully-supervised models in a zero-shot or few-shot learning setting. However, on other geospatial tasks, especially tasks that involve multiple data modalities (e.g., POI-based urban function classification, street view image-based urban noise intensity classification, and remote sensing image scene classification), existing foundation models still underperform task-specific models. Based on these observations, we propose that one of the major challenges of developing a foundation model for GeoAI is to address the multimodality nature of geospatial tasks. After discussing the distinct challenges of each geospatial data modality, we suggest the possibility of a multimodal foundation model which can reason over various types of geospatial data through geospatial alignments. We conclude this paper by discussing the unique risks and challenges to develop such a model for GeoAI.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140226718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallel Topology-aware Mesh Simplification on Terrain Trees 地形树上的并行拓扑感知网格简化
IF 1.9 Q1 Mathematics Pub Date : 2024-03-13 DOI: 10.1145/3652602
Yunting Song, Riccardo Fellegara, F. Iuricich, Leila De Floriani
We address the problem of performing a topology-aware simplification algorithm on a compact and distributed data structure for triangle meshes, the Terrain trees. Topology-aware operators have been defined to coarsen a Triangulated Irregular Network (TIN) without affecting the topology of its underlying terrain, i.e., without modifying critical features of the terrain, such as pits, saddles, peaks, and their connectivity. However, their scalability is limited for large-scale meshes. Our proposed algorithm uses a batched processing strategy to reduce both the memory and time requirements of the simplification process and thanks to the spatial decomposition on the basis of Terrain trees, it can be easily parallelized. Also, since a Terrain tree after the simplification process becomes less compact and efficient, we propose an efficient post-processing step for updating hierarchical spatial decomposition. Our experiments on real-world TINs, derived from topographic and bathymetric LiDAR data, demonstrate the scalability and efficiency of our approach. Specifically, topology-aware simplification on Terrain trees uses 40% less memory and half the time compared to the most compact and efficient connectivity-based data structure for TINs. Furthermore, the parallel simplification algorithm on the Terrain trees exhibits a 12x speedup with an OpenMP implementation. The quality of the output mesh is not significantly affected by the distributed and parallel simplification strategy of Terrain trees, and we obtain similar quality levels compared to the global baseline method.
我们要解决的问题是,如何在三角形网格的紧凑分布式数据结构--地形树上执行拓扑感知简化算法。拓扑感知算子已被定义为在不影响底层地形拓扑的情况下粗化三角形不规则网络(TIN),即不修改地形的关键特征,如坑、鞍、峰及其连接性。然而,对于大规模网格,它们的可扩展性是有限的。我们提出的算法采用分批处理策略,减少了简化过程对内存和时间的要求,而且由于基于地形树的空间分解,该算法可以轻松实现并行化。此外,由于简化过程后的地形树变得不那么紧凑和高效,我们提出了一种高效的后处理步骤,用于更新分层空间分解。我们在真实世界的 TIN 上进行了实验,这些 TIN 来源于地形和测深 LiDAR 数据,证明了我们方法的可扩展性和高效性。具体来说,与最紧凑、最高效的基于连接性的 TIN 数据结构相比,在地形树上进行拓扑感知简化可节省 40% 的内存和一半的时间。此外,采用 OpenMP 实现的 Terrain 树并行简化算法的速度提高了 12 倍。Terrain 树的分布式并行简化策略对输出网格的质量影响不大,与全局基准方法相比,我们获得了相似的质量水平。
{"title":"Parallel Topology-aware Mesh Simplification on Terrain Trees","authors":"Yunting Song, Riccardo Fellegara, F. Iuricich, Leila De Floriani","doi":"10.1145/3652602","DOIUrl":"https://doi.org/10.1145/3652602","url":null,"abstract":"We address the problem of performing a topology-aware simplification algorithm on a compact and distributed data structure for triangle meshes, the Terrain trees. Topology-aware operators have been defined to coarsen a Triangulated Irregular Network (TIN) without affecting the topology of its underlying terrain, i.e., without modifying critical features of the terrain, such as pits, saddles, peaks, and their connectivity. However, their scalability is limited for large-scale meshes. Our proposed algorithm uses a batched processing strategy to reduce both the memory and time requirements of the simplification process and thanks to the spatial decomposition on the basis of Terrain trees, it can be easily parallelized. Also, since a Terrain tree after the simplification process becomes less compact and efficient, we propose an efficient post-processing step for updating hierarchical spatial decomposition. Our experiments on real-world TINs, derived from topographic and bathymetric LiDAR data, demonstrate the scalability and efficiency of our approach. Specifically, topology-aware simplification on Terrain trees uses 40% less memory and half the time compared to the most compact and efficient connectivity-based data structure for TINs. Furthermore, the parallel simplification algorithm on the Terrain trees exhibits a 12x speedup with an OpenMP implementation. The quality of the output mesh is not significantly affected by the distributed and parallel simplification strategy of Terrain trees, and we obtain similar quality levels compared to the global baseline method.","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140247941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Challenge of Data Analytics with Climate-Neutral Urban Mobility (Vision Paper) 气候中和城市交通数据分析的挑战(远景规划论文)
IF 1.9 Q1 Mathematics Pub Date : 2024-02-23 DOI: 10.1145/3649312
Stephan Winter, Monika Sester, M. Tomko, Alexandra Millonig
Urban mobility is a major contributor to human-induced climate change, a challenge that urban and transport planning and spatial computing academic communities have been actively addressing. In this paper we argue, however, that the common data analytics research into incremental efficiency improvements of originally non-sustainable urban mobility systems will never be able to help reach climate neutrality – the goal we must achieve by 2050 as per the Paris Agreement. This imperative is exacerbated by the observation that improvements, by data analytics, in one segment of urban mobility typically have unintended and often adverse consequences in other segments. In this vision paper we argue for a data analytics agenda to advance climate action at the core of urban mobility research. This agenda must disrupt the way we think and operate, as much as it is disrupting the accessibility issues of society in cities.
城市交通是人类引起气候变化的主要因素,城市与交通规划和空间计算学术界一直在积极应对这一挑战。然而,我们在本文中认为,对原本不可持续的城市交通系统进行增量效率改进的普通数据分析研究永远无法帮助实现气候中和--《巴黎协定》规定我们必须在 2050 年之前实现的目标。数据分析在城市交通某一环节的改进通常会给其他环节带来意想不到的、往往是不利的后果,这一观察结果加剧了这一必要性。在这份愿景文件中,我们主张将数据分析议程作为城市交通研究的核心,以推进气候行动。这一议程必须颠覆我们的思维和运作方式,就像它正在颠覆城市中的社会无障碍问题一样。
{"title":"The Challenge of Data Analytics with Climate-Neutral Urban Mobility (Vision Paper)","authors":"Stephan Winter, Monika Sester, M. Tomko, Alexandra Millonig","doi":"10.1145/3649312","DOIUrl":"https://doi.org/10.1145/3649312","url":null,"abstract":"\u0000 Urban mobility is a major contributor to human-induced climate change, a challenge that urban and transport planning and spatial computing academic communities have been actively addressing. In this paper we argue, however, that the common data analytics research into incremental efficiency improvements of originally non-sustainable urban mobility systems will never be able to help reach climate\u0000 neutrality\u0000 – the goal we must achieve by 2050 as per the Paris Agreement. This imperative is exacerbated by the observation that improvements, by data analytics, in one segment of urban mobility typically have unintended and often adverse consequences in other segments. In this vision paper we argue for a data analytics agenda to advance climate action at the core of urban mobility research. This agenda must disrupt the way we think and operate, as much as it is disrupting the accessibility issues of society in cities.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140435288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Spatio-Temporal Top-k Interaction Queries on Dynamic Communities 动态社区上的可扩展时空 Top-k 交互查询
IF 1.9 Q1 Mathematics Pub Date : 2024-02-16 DOI: 10.1145/3648374
Abdulaziz Almaslukh, Yongyi Liu, A. Magdy
Social media platforms generate massive amounts of data that reveal valuable insights about users and communities at large. Existing techniques have not fully exploited such data to help practitioners perform a deep analysis of large online communities. Lack of scalability hinders analyzing communities of large sizes and requires tremendous system resources and unacceptable runtime. This paper proposes a new analytical query that identifies the top- k posts that a given user community has interacted with during a specific time interval and within a spatial range. We propose a novel indexing framework that captures the interactions of users and communities to provide a low query latency. Moreover, we propose exact and approximate algorithms to process the query efficiently and utilize the index content to prune the search space. The extensive experimental evaluation on real data has shown the superiority of our techniques and their scalability to support large online communities.
社交媒体平台产生了大量数据,揭示了有关用户和整个社区的宝贵信息。现有技术尚未充分利用这些数据来帮助从业人员对大型在线社区进行深入分析。缺乏可扩展性阻碍了对大型社区的分析,而且需要巨大的系统资源和难以接受的运行时间。本文提出了一种新的分析查询方法,可识别特定用户社区在特定时间间隔和空间范围内互动最多的 k 个帖子。我们提出了一种新颖的索引框架,它能捕捉用户和社区的互动,从而提供较低的查询延迟。此外,我们还提出了精确和近似的算法来高效处理查询,并利用索引内容来修剪搜索空间。在真实数据上进行的广泛实验评估显示了我们技术的优越性及其支持大型在线社区的可扩展性。
{"title":"Scalable Spatio-Temporal Top-k Interaction Queries on Dynamic Communities","authors":"Abdulaziz Almaslukh, Yongyi Liu, A. Magdy","doi":"10.1145/3648374","DOIUrl":"https://doi.org/10.1145/3648374","url":null,"abstract":"\u0000 Social media platforms generate massive amounts of data that reveal valuable insights about users and communities at large. Existing techniques have not fully exploited such data to help practitioners perform a deep analysis of large online communities. Lack of scalability hinders analyzing communities of large sizes and requires tremendous system resources and unacceptable runtime. This paper proposes a new analytical query that identifies the top-\u0000 k\u0000 posts that a given user community has interacted with during a specific time interval and within a spatial range. We propose a novel indexing framework that captures the interactions of users and communities to provide a low query latency. Moreover, we propose exact and approximate algorithms to process the query efficiently and utilize the index content to prune the search space. The extensive experimental evaluation on real data has shown the superiority of our techniques and their scalability to support large online communities.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139961859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RE-Trace : Re-Identification of Modified GPS Trajectories RE-Trace : 重新识别修改后的 GPS 轨迹
IF 1.9 Q1 Mathematics Pub Date : 2024-02-05 DOI: 10.1145/3643680
Stefan Schestakov, Simon Gottschalk, Thorben Funke, Elena Demidova
GPS trajectories are a critical asset for building spatio-temporal predictive models in urban regions in the context of road safety monitoring, traffic management, and mobility services. Currently, reliable and efficient data misuse detection methods for such personal, spatio-temporal data, particularly in data breach cases, are missing. This article addresses an essential aspect of data misuse detection, namely the re-identification of leaked and potentially modified GPS trajectories. We present RE-Trace – a contrastive learning-based model that facilitates reliable and efficient re-identification of GPS trajectories and resists specific trajectory transformation attacks aimed to obscure a trajectory’s origin. RE-Trace utilizes contrastive learning with a transformer-based trajectory encoder to create trajectory representations, robust to various trajectory modifications. We present a comprehensive threat model for GPS trajectory modifications and demonstrate the effectiveness and efficiency of the RE-Trace re-identification approach on three real-world datasets. Our evaluation results demonstrate that RE-Trace significantly outperforms state-of-the-art baselines on all data sets and identifies modified GPS trajectories effectively and efficiently.
在道路安全监控、交通管理和移动服务方面,GPS 轨迹是在城市地区建立时空预测模型的重要资产。目前,针对此类个人时空数据(尤其是在数据泄露情况下)的可靠、高效的数据滥用检测方法尚属空白。本文探讨了数据滥用检测的一个重要方面,即重新识别泄露和可能被修改的 GPS 轨迹。我们介绍了 RE-Trace--一种基于对比学习的模型,它有助于可靠、高效地重新识别 GPS 轨迹,并抵御旨在掩盖轨迹来源的特定轨迹转换攻击。RE-Trace 利用对比学习和基于变换器的轨迹编码器来创建轨迹表征,并能抵御各种轨迹修改。我们提出了 GPS 轨迹修改的综合威胁模型,并在三个真实世界数据集上展示了 RE-Trace 重新识别方法的有效性和效率。我们的评估结果表明,RE-Trace 在所有数据集上的表现都明显优于最先进的基线方法,并能有效和高效地识别修改后的 GPS 轨迹。
{"title":"RE-Trace\u0000 : Re-Identification of Modified GPS Trajectories","authors":"Stefan Schestakov, Simon Gottschalk, Thorben Funke, Elena Demidova","doi":"10.1145/3643680","DOIUrl":"https://doi.org/10.1145/3643680","url":null,"abstract":"\u0000 GPS trajectories are a critical asset for building spatio-temporal predictive models in urban regions in the context of road safety monitoring, traffic management, and mobility services. Currently, reliable and efficient data misuse detection methods for such personal, spatio-temporal data, particularly in data breach cases, are missing. This article addresses an essential aspect of data misuse detection, namely the re-identification of leaked and potentially modified GPS trajectories. We present\u0000 RE-Trace\u0000 – a contrastive learning-based model that facilitates reliable and efficient re-identification of GPS trajectories and resists specific trajectory transformation attacks aimed to obscure a trajectory’s origin.\u0000 RE-Trace\u0000 utilizes contrastive learning with a transformer-based trajectory encoder to create trajectory representations, robust to various trajectory modifications. We present a comprehensive threat model for GPS trajectory modifications and demonstrate the effectiveness and efficiency of the\u0000 RE-Trace\u0000 re-identification approach on three real-world datasets. Our evaluation results demonstrate that\u0000 RE-Trace\u0000 significantly outperforms state-of-the-art baselines on all data sets and identifies modified GPS trajectories effectively and efficiently.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139864458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RE-Trace : Re-Identification of Modified GPS Trajectories RE-Trace : 重新识别修改后的 GPS 轨迹
IF 1.9 Q1 Mathematics Pub Date : 2024-02-05 DOI: 10.1145/3643680
Stefan Schestakov, Simon Gottschalk, Thorben Funke, Elena Demidova
GPS trajectories are a critical asset for building spatio-temporal predictive models in urban regions in the context of road safety monitoring, traffic management, and mobility services. Currently, reliable and efficient data misuse detection methods for such personal, spatio-temporal data, particularly in data breach cases, are missing. This article addresses an essential aspect of data misuse detection, namely the re-identification of leaked and potentially modified GPS trajectories. We present RE-Trace – a contrastive learning-based model that facilitates reliable and efficient re-identification of GPS trajectories and resists specific trajectory transformation attacks aimed to obscure a trajectory’s origin. RE-Trace utilizes contrastive learning with a transformer-based trajectory encoder to create trajectory representations, robust to various trajectory modifications. We present a comprehensive threat model for GPS trajectory modifications and demonstrate the effectiveness and efficiency of the RE-Trace re-identification approach on three real-world datasets. Our evaluation results demonstrate that RE-Trace significantly outperforms state-of-the-art baselines on all data sets and identifies modified GPS trajectories effectively and efficiently.
在道路安全监控、交通管理和移动服务方面,GPS 轨迹是在城市地区建立时空预测模型的重要资产。目前,针对此类个人时空数据(尤其是在数据泄露情况下)的可靠、高效的数据滥用检测方法尚属空白。本文探讨了数据滥用检测的一个重要方面,即重新识别泄露和可能被修改的 GPS 轨迹。我们介绍了 RE-Trace--一种基于对比学习的模型,它有助于可靠、高效地重新识别 GPS 轨迹,并抵御旨在掩盖轨迹来源的特定轨迹转换攻击。RE-Trace 利用对比学习和基于变换器的轨迹编码器来创建轨迹表征,并能抵御各种轨迹修改。我们提出了 GPS 轨迹修改的综合威胁模型,并在三个真实世界数据集上展示了 RE-Trace 重新识别方法的有效性和效率。我们的评估结果表明,RE-Trace 在所有数据集上的表现都明显优于最先进的基线方法,并能有效和高效地识别修改后的 GPS 轨迹。
{"title":"RE-Trace\u0000 : Re-Identification of Modified GPS Trajectories","authors":"Stefan Schestakov, Simon Gottschalk, Thorben Funke, Elena Demidova","doi":"10.1145/3643680","DOIUrl":"https://doi.org/10.1145/3643680","url":null,"abstract":"\u0000 GPS trajectories are a critical asset for building spatio-temporal predictive models in urban regions in the context of road safety monitoring, traffic management, and mobility services. Currently, reliable and efficient data misuse detection methods for such personal, spatio-temporal data, particularly in data breach cases, are missing. This article addresses an essential aspect of data misuse detection, namely the re-identification of leaked and potentially modified GPS trajectories. We present\u0000 RE-Trace\u0000 – a contrastive learning-based model that facilitates reliable and efficient re-identification of GPS trajectories and resists specific trajectory transformation attacks aimed to obscure a trajectory’s origin.\u0000 RE-Trace\u0000 utilizes contrastive learning with a transformer-based trajectory encoder to create trajectory representations, robust to various trajectory modifications. We present a comprehensive threat model for GPS trajectory modifications and demonstrate the effectiveness and efficiency of the\u0000 RE-Trace\u0000 re-identification approach on three real-world datasets. Our evaluation results demonstrate that\u0000 RE-Trace\u0000 significantly outperforms state-of-the-art baselines on all data sets and identifies modified GPS trajectories effectively and efficiently.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139804384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Probabilistic Spatio-temporal Fusion Transformers in Two-sided Ride-Hailing Markets 双面乘车市场中的因果概率时空融合变换器
IF 1.9 Q1 Mathematics Pub Date : 2024-02-03 DOI: 10.1145/3643848
Shixiang Wan, S. Luo, Hongtu Zhu
In this work, we address the complex problem of multi-objective time series forecasting with a focus on predicting interdependent targets such as supply and demand in ride-hailing services. Traditional machine learning techniques approach the targets independently, while deep learning strategies may use joint learning with shared representations, both neglecting inter-target causal relationships and potentially compromising the models’ generalization capabilities. Our novel CausalTrans model introduces a framework to define and leverage the temporal causal interplay between supply and demand, incorporating both temporal and spatial causality into the forecasting process. Additionally, we enhance computational efficiency by introducing an innovative fast attention mechanism that reduces the time complexity from quadratic to linear without sacrificing performance. Our comprehensive experiments show that CausalTrans significantly surpasses contemporary forecasting methods, achieving up to a 15 (% ) reduction in error, thus setting a new benchmark in the field.
在这项工作中,我们解决了多目标时间序列预测的复杂问题,重点是预测相互依存的目标,如打车服务的供需关系。传统的机器学习技术独立处理目标,而深度学习策略可能会使用共享表征的联合学习,这都会忽略目标间的因果关系,并可能损害模型的泛化能力。我们新颖的 CausalTrans 模型引入了一个框架,用于定义和利用供应与需求之间的时间因果关系,将时间和空间因果关系纳入预测过程。此外,我们还通过引入创新的快速关注机制来提高计算效率,在不影响性能的前提下将时间复杂性从二次方降低到线性。我们的综合实验表明,CausalTrans 显著超越了当代预测方法,误差减少了 15%,从而在该领域树立了新的标杆。
{"title":"Causal Probabilistic Spatio-temporal Fusion Transformers in Two-sided Ride-Hailing Markets","authors":"Shixiang Wan, S. Luo, Hongtu Zhu","doi":"10.1145/3643848","DOIUrl":"https://doi.org/10.1145/3643848","url":null,"abstract":"\u0000 In this work, we address the complex problem of multi-objective time series forecasting with a focus on predicting interdependent targets such as supply and demand in ride-hailing services. Traditional machine learning techniques approach the targets independently, while deep learning strategies may use joint learning with shared representations, both neglecting inter-target causal relationships and potentially compromising the models’ generalization capabilities. Our novel\u0000 CausalTrans\u0000 model introduces a framework to define and leverage the temporal causal interplay between supply and demand, incorporating both temporal and spatial causality into the forecasting process. Additionally, we enhance computational efficiency by introducing an innovative fast attention mechanism that reduces the time complexity from quadratic to linear without sacrificing performance. Our comprehensive experiments show that\u0000 CausalTrans\u0000 significantly surpasses contemporary forecasting methods, achieving up to a 15\u0000 \u0000 (% )\u0000 \u0000 reduction in error, thus setting a new benchmark in the field.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139808216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Probabilistic Spatio-temporal Fusion Transformers in Two-sided Ride-Hailing Markets 双面乘车市场中的因果概率时空融合变换器
IF 1.9 Q1 Mathematics Pub Date : 2024-02-03 DOI: 10.1145/3643848
Shixiang Wan, S. Luo, Hongtu Zhu
In this work, we address the complex problem of multi-objective time series forecasting with a focus on predicting interdependent targets such as supply and demand in ride-hailing services. Traditional machine learning techniques approach the targets independently, while deep learning strategies may use joint learning with shared representations, both neglecting inter-target causal relationships and potentially compromising the models’ generalization capabilities. Our novel CausalTrans model introduces a framework to define and leverage the temporal causal interplay between supply and demand, incorporating both temporal and spatial causality into the forecasting process. Additionally, we enhance computational efficiency by introducing an innovative fast attention mechanism that reduces the time complexity from quadratic to linear without sacrificing performance. Our comprehensive experiments show that CausalTrans significantly surpasses contemporary forecasting methods, achieving up to a 15 (% ) reduction in error, thus setting a new benchmark in the field.
在这项工作中,我们解决了多目标时间序列预测的复杂问题,重点是预测相互依存的目标,如打车服务的供需关系。传统的机器学习技术独立处理目标,而深度学习策略可能会使用共享表征的联合学习,这都会忽略目标间的因果关系,并可能损害模型的泛化能力。我们新颖的 CausalTrans 模型引入了一个框架,用于定义和利用供应与需求之间的时间因果关系,将时间和空间因果关系纳入预测过程。此外,我们还通过引入创新的快速关注机制来提高计算效率,在不影响性能的前提下将时间复杂性从二次方降低到线性。我们的综合实验表明,CausalTrans 显著超越了当代预测方法,误差减少了 15%,从而在该领域树立了新的标杆。
{"title":"Causal Probabilistic Spatio-temporal Fusion Transformers in Two-sided Ride-Hailing Markets","authors":"Shixiang Wan, S. Luo, Hongtu Zhu","doi":"10.1145/3643848","DOIUrl":"https://doi.org/10.1145/3643848","url":null,"abstract":"\u0000 In this work, we address the complex problem of multi-objective time series forecasting with a focus on predicting interdependent targets such as supply and demand in ride-hailing services. Traditional machine learning techniques approach the targets independently, while deep learning strategies may use joint learning with shared representations, both neglecting inter-target causal relationships and potentially compromising the models’ generalization capabilities. Our novel\u0000 CausalTrans\u0000 model introduces a framework to define and leverage the temporal causal interplay between supply and demand, incorporating both temporal and spatial causality into the forecasting process. Additionally, we enhance computational efficiency by introducing an innovative fast attention mechanism that reduces the time complexity from quadratic to linear without sacrificing performance. Our comprehensive experiments show that\u0000 CausalTrans\u0000 significantly surpasses contemporary forecasting methods, achieving up to a 15\u0000 \u0000 (% )\u0000 \u0000 reduction in error, thus setting a new benchmark in the field.\u0000","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139868129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on Spatial Algorithms and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1