首页 > 最新文献

17th International Symposium on Spatial and Temporal Databases最新文献

英文 中文
Spatial Skyline Queries on Triangulated Irregular Networks 不规则三角网的空间天际线查询
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470901
Yuta Kasai, Kento Sugiura, Y. Ishikawa
A spatial skyline query is a query to find a set of data points that are not spatially dominated by other data points, given a set of data points P and query points Q in a multidimensional space. The query enumerates the skyline points based on distance in a multidimensional space. However, existing spatial skyline queries can lead to large errors with actual travel distances in geo-spaces because the query is based on the Euclidean distance. We propose a spatial skyline query on triangulated irregular networks (TINs), which are frequently used to represent the surfaces of terrain. We define a new spatial skyline query based on more accurate travel distances considering the TIN distance instead of the Euclidean distance. We also propose an efficient solution method using indexes to find nearest-neighbor points in TIN space and reduce the numbers of unnecessary data points and TIN vertices. The proposed method achieves a computational complexity of O(|P′||Q|N′2 + |P′|2|Q|), where P′ and N′ are the reduced sets of data points and number of TIN vertices, respectively, based on the range of query points. The proposed method can process a query faster than the naive method with Θ(|P||Q|N2 + |P|2|Q|), where N is the number of TIN vertices. Moreover, experiments verify that the proposed method is faster than the naive method by using a spatial index to reduce the numbers of unnecessary data points and TIN vertices.
空间天际线查询是在多维空间中给定一组数据点P和查询点Q,以查找一组在空间上不受其他数据点支配的数据点的查询。该查询基于多维空间中的距离枚举天际线点。然而,现有的空间天际线查询可能会导致地理空间中实际旅行距离的大误差,因为查询是基于欧几里得距离的。我们提出了一个不规则三角网(TINs)的空间天际线查询,它经常被用来表示地形的表面。我们定义了一个新的基于更精确的旅行距离的空间天际线查询,考虑TIN距离而不是欧几里得距离。我们还提出了一种利用索引在TIN空间中寻找最近邻点的有效解决方法,并减少了不必要的数据点和TIN顶点的数量。该方法的计算复杂度为O(|P ' |Q|N ' 2 + |P ' |2|Q|),其中P '和N '分别为基于查询点范围的数据点和TIN顶点数的约简集。提出的方法可以比使用Θ(|P| Q|N2 + |P|2|Q|)的朴素方法更快地处理查询,其中N是TIN顶点的数量。此外,通过使用空间索引来减少不必要的数据点和TIN顶点的数量,实验验证了该方法比朴素方法更快。
{"title":"Spatial Skyline Queries on Triangulated Irregular Networks","authors":"Yuta Kasai, Kento Sugiura, Y. Ishikawa","doi":"10.1145/3469830.3470901","DOIUrl":"https://doi.org/10.1145/3469830.3470901","url":null,"abstract":"A spatial skyline query is a query to find a set of data points that are not spatially dominated by other data points, given a set of data points P and query points Q in a multidimensional space. The query enumerates the skyline points based on distance in a multidimensional space. However, existing spatial skyline queries can lead to large errors with actual travel distances in geo-spaces because the query is based on the Euclidean distance. We propose a spatial skyline query on triangulated irregular networks (TINs), which are frequently used to represent the surfaces of terrain. We define a new spatial skyline query based on more accurate travel distances considering the TIN distance instead of the Euclidean distance. We also propose an efficient solution method using indexes to find nearest-neighbor points in TIN space and reduce the numbers of unnecessary data points and TIN vertices. The proposed method achieves a computational complexity of O(|P′||Q|N′2 + |P′|2|Q|), where P′ and N′ are the reduced sets of data points and number of TIN vertices, respectively, based on the range of query points. The proposed method can process a query faster than the naive method with Θ(|P||Q|N2 + |P|2|Q|), where N is the number of TIN vertices. Moreover, experiments verify that the proposed method is faster than the naive method by using a spatial index to reduce the numbers of unnecessary data points and TIN vertices.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116040283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective Traffic Forecasting with Multi-Resolution Learning 基于多分辨率学习的有效交通预测
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470904
Abdullah AlDwyish, E. Tanin, Hairuo Xie, S. Karunasekera, K. Ramamohanarao
Traffic forecasting plays a vital role in traffic management systems. Recently, deep learning models have been applied to citywide traffic forecasting. However, the existing work models and predicts traffic at a single (dense) resolution, making it challenging to capture long-range spatial dependencies or high-level traffic dynamics. This shortcoming limits the accuracy of prediction and results in computationally expensive models. We propose a traffic forecasting model based on deep convolutional networks to improve the accuracy of citywide traffic forecasting. Our model uses a hierarchical architecture that captures traffic dynamics at multiple spatial resolutions. Based on this architecture, we apply a multi-task learning scheme, which trains the model to predict traffic at different resolutions. Our model helps provide a coherent understanding of traffic dynamics by capturing spatial dependencies between different regions of a city. Experimental results on multiple real datasets show that our model can achieve competitive results compared to complex state-of-the-art approaches while being more computationally efficient.
交通预测在交通管理系统中起着至关重要的作用。最近,深度学习模型已被应用于全市交通预测。然而,现有的工作以单一(密集)分辨率建模和预测交通,使得捕获远程空间依赖性或高级交通动态具有挑战性。这一缺点限制了预测的准确性,并导致计算昂贵的模型。为了提高城市交通预测的准确性,提出了一种基于深度卷积网络的交通预测模型。我们的模型使用分层架构,在多个空间分辨率下捕获交通动态。在此基础上,我们应用了一个多任务学习方案,该方案训练模型在不同分辨率下预测交通。我们的模型通过捕捉城市不同区域之间的空间依赖关系,帮助提供对交通动态的连贯理解。在多个真实数据集上的实验结果表明,与最先进的复杂方法相比,我们的模型可以获得具有竞争力的结果,同时计算效率更高。
{"title":"Effective Traffic Forecasting with Multi-Resolution Learning","authors":"Abdullah AlDwyish, E. Tanin, Hairuo Xie, S. Karunasekera, K. Ramamohanarao","doi":"10.1145/3469830.3470904","DOIUrl":"https://doi.org/10.1145/3469830.3470904","url":null,"abstract":"Traffic forecasting plays a vital role in traffic management systems. Recently, deep learning models have been applied to citywide traffic forecasting. However, the existing work models and predicts traffic at a single (dense) resolution, making it challenging to capture long-range spatial dependencies or high-level traffic dynamics. This shortcoming limits the accuracy of prediction and results in computationally expensive models. We propose a traffic forecasting model based on deep convolutional networks to improve the accuracy of citywide traffic forecasting. Our model uses a hierarchical architecture that captures traffic dynamics at multiple spatial resolutions. Based on this architecture, we apply a multi-task learning scheme, which trains the model to predict traffic at different resolutions. Our model helps provide a coherent understanding of traffic dynamics by capturing spatial dependencies between different regions of a city. Experimental results on multiple real datasets show that our model can achieve competitive results compared to complex state-of-the-art approaches while being more computationally efficient.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129031355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
SPRIG: A Learned Spatial Index for Range and kNN Queries 用于范围和kNN查询的学习空间索引
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470892
Songnian Zhang, S. Ray, Rongxing Lu, Yandong Zheng
A corpus of recent work has revealed that the learned index can improve query performance while reducing the storage overhead. It potentially offers an opportunity to address the spatial query processing challenges caused by the surge in location-based services. Although several learned indexes have been proposed to process spatial data, the main idea behind these approaches is to utilize the existing one-dimensional learned models, which requires either converting the spatial data into one-dimensional data or applying the learned model on individual dimensions separately. As a result, these approaches cannot fully utilize or take advantage of the information regarding the spatial distribution of the original spatial data. To this end, in this paper, we exploit it by using the spatial (multi-dimensional) interpolation function as the learned model, which can be directly employed on the spatial data. Specifically, we design an efficient SPatial inteRpolation functIon based Grid index (SPRIG) to process the range and kNN queries. Detailed experiments are conducted on real-world datasets. The results indicate that, compared to the traditional spatial indexes, our proposed learned index can significantly improve the index building and query processing performance with less storage overhead. Moreover, in the best case, our index achieves up to an order of magnitude better performance than ZM-index in range queries and is about 2.7 × , 3 × , and 9 × faster than the multi-dimensional learned index Flood in terms of index building, range queries, and kNN queries, respectively.
最近的研究表明,学习索引可以提高查询性能,同时减少存储开销。它潜在地提供了一个机会来解决由于基于位置的服务激增而带来的空间查询处理挑战。虽然已经提出了几种学习索引来处理空间数据,但这些方法背后的主要思想是利用现有的一维学习模型,这需要将空间数据转换为一维数据或将学习模型分别应用于单个维度。因此,这些方法不能充分利用或利用原始空间数据的空间分布信息。为此,本文利用空间(多维)插值函数作为学习模型,可直接应用于空间数据。具体来说,我们设计了一个高效的基于网格索引的空间插值函数(SPRIG)来处理范围和kNN查询。在真实世界的数据集上进行了详细的实验。结果表明,与传统的空间索引相比,我们提出的学习索引可以显著提高索引构建和查询处理性能,且存储开销较小。此外,在最好的情况下,我们的索引在范围查询方面的性能比ZM-index高出一个数量级,在索引建立、范围查询和kNN查询方面分别比多维学习索引Flood快2.7倍、3倍和9倍。
{"title":"SPRIG: A Learned Spatial Index for Range and kNN Queries","authors":"Songnian Zhang, S. Ray, Rongxing Lu, Yandong Zheng","doi":"10.1145/3469830.3470892","DOIUrl":"https://doi.org/10.1145/3469830.3470892","url":null,"abstract":"A corpus of recent work has revealed that the learned index can improve query performance while reducing the storage overhead. It potentially offers an opportunity to address the spatial query processing challenges caused by the surge in location-based services. Although several learned indexes have been proposed to process spatial data, the main idea behind these approaches is to utilize the existing one-dimensional learned models, which requires either converting the spatial data into one-dimensional data or applying the learned model on individual dimensions separately. As a result, these approaches cannot fully utilize or take advantage of the information regarding the spatial distribution of the original spatial data. To this end, in this paper, we exploit it by using the spatial (multi-dimensional) interpolation function as the learned model, which can be directly employed on the spatial data. Specifically, we design an efficient SPatial inteRpolation functIon based Grid index (SPRIG) to process the range and kNN queries. Detailed experiments are conducted on real-world datasets. The results indicate that, compared to the traditional spatial indexes, our proposed learned index can significantly improve the index building and query processing performance with less storage overhead. Moreover, in the best case, our index achieves up to an order of magnitude better performance than ZM-index in range queries and is about 2.7 × , 3 × , and 9 × faster than the multi-dimensional learned index Flood in terms of index building, range queries, and kNN queries, respectively.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122958227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Metro Maps on Flexible Base Grids 灵活基础网格上的地铁地图
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470899
H. Bast, P. Brosi, Sabine Storandt
We present new generic methods to efficiently draw schematized metro maps for a wide variety of layouts, including octilinear, hexalinear, and orthoradial maps. The maps are drawn by mapping the input graph to a suitable grid graph. Previous work was restricted to regular octilinear grids. In this work, we investigate a variety of grids, including triangular grids and orthoradial grids. In particular, we also construct sparse grids where the local node density adapts to the input graph (e.g. octilinear Hanan grids, which we introduce in this work). For octilinear maps, this reduces the grid size by a factor of up to 5 compared to previous work, while still achieving close-to-optimal layouts. For many maps, this reduction also leads to up to 5 times faster solution times of the underlying optimization problem. We evaluate our approach on five maps. All octilinear maps can be computed in under 0.5 seconds, all hexalinear and orthoradial maps can be computed in under 2.5 seconds.
我们提出了新的通用方法来有效地绘制各种布局的示意图地铁地图,包括八线,六线和正交地图。通过将输入图映射到合适的网格图来绘制映射。以前的工作仅限于正则的八边形网格。在这项工作中,我们研究了各种网格,包括三角形网格和正交网格。特别是,我们还构建了稀疏网格,其中局部节点密度与输入图相适应(例如,我们在本工作中引入的八线性哈南网格)。对于八线地图,与之前的工作相比,这将网格大小减少了5倍,同时仍然实现了接近最佳的布局。对于许多地图,这种减少还导致底层优化问题的解决速度提高了5倍。我们在五张地图上评估我们的方法。所有的八边形地图都可以在0.5秒内计算完成,所有的六边形和直角地图都可以在2.5秒内计算完成。
{"title":"Metro Maps on Flexible Base Grids","authors":"H. Bast, P. Brosi, Sabine Storandt","doi":"10.1145/3469830.3470899","DOIUrl":"https://doi.org/10.1145/3469830.3470899","url":null,"abstract":"We present new generic methods to efficiently draw schematized metro maps for a wide variety of layouts, including octilinear, hexalinear, and orthoradial maps. The maps are drawn by mapping the input graph to a suitable grid graph. Previous work was restricted to regular octilinear grids. In this work, we investigate a variety of grids, including triangular grids and orthoradial grids. In particular, we also construct sparse grids where the local node density adapts to the input graph (e.g. octilinear Hanan grids, which we introduce in this work). For octilinear maps, this reduces the grid size by a factor of up to 5 compared to previous work, while still achieving close-to-optimal layouts. For many maps, this reduction also leads to up to 5 times faster solution times of the underlying optimization problem. We evaluate our approach on five maps. All octilinear maps can be computed in under 0.5 seconds, all hexalinear and orthoradial maps can be computed in under 2.5 seconds.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116483124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Attribute Propagation for Utilities 实用程序的属性传播
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470907
Dev Oliver, P. Bakalov, Sangho Kim, E. Hoel
Utility systems such as electric, fiber/telco, gas, and water require the realistic modeling of network attributes or values over distance. For example, consider hydraulic pressure in a pipe network; as water flows away from the reservoir or pump, pressure decreases due to pipe friction, leakage, consumption, etc. Attribute propagation is the process whereby network attributes that change over distance (e.g., maximum allowable operating pressure, phase, etc.) are calculated and maintained. This is important for improving safety as well as efficiency. However, attribute propagation is challenging due to the size of the data, which could have tens of millions of nodes and edges per utility, and billions of nodes and edges at the nationwide scale. Additionally, results may need to be calculated and available quickly for interactive analysis. Previous approaches require immediate updates to all nodes and edges downstream of a node/edge being edited (to account for changes in attribute values), which could be computationally intensive and result in a slow user experience for editing attribute values. This paper presents Propagators, which feature an in-memory approach to attribute propagation. Propagators leverage a network index as well as a heuristic based on colocated sources with similar attribute values to increase computational savings. We present experiments that demonstrate the scalability of Propagators, which have been implemented in ArcGIS Pro and ArcGIS Enterprise.
公用事业系统,如电力、光纤/电信、天然气和水,需要对网络属性或值进行实时建模。例如,考虑管网中的液压压力;当水从储层或泵流出时,由于管道摩擦、泄漏、消耗等原因,压力降低。属性传播是计算和维护网络属性随距离变化的过程(例如,最大允许工作压力,相位等)。这对提高安全性和效率都很重要。然而,由于数据的大小,属性传播是具有挑战性的,每个实用程序可能有数千万个节点和边,在全国范围内可能有数十亿个节点和边。此外,结果可能需要被计算出来,并且可以快速地用于交互分析。以前的方法需要立即更新正在编辑的节点/边缘下游的所有节点和边缘(以考虑属性值的变化),这可能是计算密集型的,并且导致编辑属性值的用户体验缓慢。本文提出了传播器,它的特点是在内存中进行属性传播。传播器利用网络索引以及基于具有相似属性值的并置源的启发式方法来增加计算节省。通过实验验证了传播器的可扩展性,并在ArcGIS Pro和ArcGIS Enterprise中实现。
{"title":"Attribute Propagation for Utilities","authors":"Dev Oliver, P. Bakalov, Sangho Kim, E. Hoel","doi":"10.1145/3469830.3470907","DOIUrl":"https://doi.org/10.1145/3469830.3470907","url":null,"abstract":"Utility systems such as electric, fiber/telco, gas, and water require the realistic modeling of network attributes or values over distance. For example, consider hydraulic pressure in a pipe network; as water flows away from the reservoir or pump, pressure decreases due to pipe friction, leakage, consumption, etc. Attribute propagation is the process whereby network attributes that change over distance (e.g., maximum allowable operating pressure, phase, etc.) are calculated and maintained. This is important for improving safety as well as efficiency. However, attribute propagation is challenging due to the size of the data, which could have tens of millions of nodes and edges per utility, and billions of nodes and edges at the nationwide scale. Additionally, results may need to be calculated and available quickly for interactive analysis. Previous approaches require immediate updates to all nodes and edges downstream of a node/edge being edited (to account for changes in attribute values), which could be computationally intensive and result in a slow user experience for editing attribute values. This paper presents Propagators, which feature an in-memory approach to attribute propagation. Propagators leverage a network index as well as a heuristic based on colocated sources with similar attribute values to increase computational savings. We present experiments that demonstrate the scalability of Propagators, which have been implemented in ArcGIS Pro and ArcGIS Enterprise.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"123 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122479034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MaSEC: Discovering Anchorages and Co-movement Patterns on Streaming Vessel Trajectories 发现流船轨迹上的锚地和共同运动模式
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470909
A. Tritsarolis, Y. Kontoulis, N. Pelekis, Y. Theodoridis
The massive-scale data generation of positioning (tracking) messages, collected by various surveillance means, has posed new challenges in the field of mobility data analytics in terms of extracting valuable knowledge out of this data. One of these challenges is online cluster analysis, where the goal is to unveil hidden patterns of collective behaviour from streaming trajectories, such as co-movement and co-stationary (aka anchorage) patterns. Towards this direction, in this paper, we demonstrate MaSEC (Moving and Stationary Evolving Clusters), a system that discovers valuable behavioural patterns as above. In particular, our system provides a unified solution that discovers both moving and stationary evolving clusters on streaming vessel position data in an online mode. The functionality of our system is evaluated over two real-world datasets from the maritime domain.
通过各种监控手段收集的定位(跟踪)信息的大规模数据生成,在从这些数据中提取有价值的知识方面,对移动数据分析领域提出了新的挑战。其中一个挑战是在线聚类分析,其目标是从流轨迹中揭示隐藏的集体行为模式,例如共同运动和共同静止(又名锚定)模式。朝着这个方向,在本文中,我们展示了MaSEC(移动和静止进化集群),一个发现上述有价值的行为模式的系统。特别是,我们的系统提供了一个统一的解决方案,可以在线模式发现流船位置数据的移动和静止演变集群。我们的系统的功能是在两个来自海洋领域的真实数据集上进行评估的。
{"title":"MaSEC: Discovering Anchorages and Co-movement Patterns on Streaming Vessel Trajectories","authors":"A. Tritsarolis, Y. Kontoulis, N. Pelekis, Y. Theodoridis","doi":"10.1145/3469830.3470909","DOIUrl":"https://doi.org/10.1145/3469830.3470909","url":null,"abstract":"The massive-scale data generation of positioning (tracking) messages, collected by various surveillance means, has posed new challenges in the field of mobility data analytics in terms of extracting valuable knowledge out of this data. One of these challenges is online cluster analysis, where the goal is to unveil hidden patterns of collective behaviour from streaming trajectories, such as co-movement and co-stationary (aka anchorage) patterns. Towards this direction, in this paper, we demonstrate MaSEC (Moving and Stationary Evolving Clusters), a system that discovers valuable behavioural patterns as above. In particular, our system provides a unified solution that discovers both moving and stationary evolving clusters on streaming vessel position data in an online mode. The functionality of our system is evaluated over two real-world datasets from the maritime domain.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123730775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Geolocating Traffic Signs using Large Imagery Datasets 使用大型图像数据集定位交通标志
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470900
Kasper F. Pedersen, K. Torp
Maintaining a database with the type, location, and direction of traffic signs is a labor-intensive part of asset management for many road authorities. Today there are high-quality cameras in cell-phones that can add location (EXIF) metadata to the images. This makes it efficient and cheap to collect large geo-located imagery datasets. Detecting traffic signs from imagery is also much simpler today due to the availability of several high-quality open-source object-detection solutions. In this paper, we use the detection of traffic signs to find both the location and the direction of physical traffic signs. Five approaches to cluster the detections are presented. An extensive experimental evaluation shows that it is important to consider both the location and the direction. The evaluation is done on a novel dataset with 21,565 images that is available free for download. This includes the ground-truth location of 277 traffic signs and all source code. The conclusion is that traffic signs are detected with an F1 score of 0.8889, a location accuracy of 5.097-meter (MAE), and a direction accuracy of ± 11.375°(MAE). Only data from two trips are needed to get these results.
维护一个包含交通标志的类型、位置和方向的数据库是许多道路管理部门资产管理的劳动密集型部分。如今,手机中的高质量摄像头可以为图像添加位置(EXIF)元数据。这使得收集大型地理定位图像数据集变得高效和廉价。由于一些高质量的开源对象检测解决方案的可用性,从图像中检测交通标志也变得简单得多。在本文中,我们使用交通标志检测来寻找物理交通标志的位置和方向。提出了五种聚类检测的方法。广泛的实验评估表明,同时考虑位置和方向是很重要的。评估是在一个包含21,565张图像的新数据集上完成的,该数据集可以免费下载。这包括277个交通标志的真实位置和所有源代码。结果表明,该方法检测到的交通标志F1值为0.8889,定位精度为5.097 m (MAE),方向精度为±11.375°(MAE)。只需要两次行程的数据就可以得到这些结果。
{"title":"Geolocating Traffic Signs using Large Imagery Datasets","authors":"Kasper F. Pedersen, K. Torp","doi":"10.1145/3469830.3470900","DOIUrl":"https://doi.org/10.1145/3469830.3470900","url":null,"abstract":"Maintaining a database with the type, location, and direction of traffic signs is a labor-intensive part of asset management for many road authorities. Today there are high-quality cameras in cell-phones that can add location (EXIF) metadata to the images. This makes it efficient and cheap to collect large geo-located imagery datasets. Detecting traffic signs from imagery is also much simpler today due to the availability of several high-quality open-source object-detection solutions. In this paper, we use the detection of traffic signs to find both the location and the direction of physical traffic signs. Five approaches to cluster the detections are presented. An extensive experimental evaluation shows that it is important to consider both the location and the direction. The evaluation is done on a novel dataset with 21,565 images that is available free for download. This includes the ground-truth location of 277 traffic signs and all source code. The conclusion is that traffic signs are detected with an F1 score of 0.8889, a location accuracy of 5.097-meter (MAE), and a direction accuracy of ± 11.375°(MAE). Only data from two trips are needed to get these results.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128309574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Where have all the larvae gone? Towards Fast Main Pathway Identification from Geospatial Trajectories 幼虫都到哪里去了?基于地理空间轨迹的主路径快速识别研究
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470896
Carola Trahms, P. Handmann, W. Rath, M. Visbeck, M. Renz
The distribution of passively drifting particles within highly turbulent flows is a classic problem in marine sciences. The use of trajectory clustering on huge amounts of simulated marine trajectory data to identify main pathways of drifting particles has not been widely investigated from a data science perspective yet. In this paper, we propose a fast and computationally light method to efficiently identify main pathways in large amounts of trajectory data. It aims at overcoming some of the issues of probabilistic maps and existing trajectory clustering approaches. Our approach is evaluated against simulated larvae dispersion data based on a real-world model that have been produced as part of work in the marine science domain.
强湍流中被动漂移粒子的分布是海洋科学中的一个经典问题。从数据科学的角度来看,利用大量模拟海洋轨迹数据的轨迹聚类来识别漂流粒子的主要路径还没有得到广泛的研究。在本文中,我们提出了一种快速且计算量小的方法来有效地识别大量轨迹数据中的主要路径。它旨在克服概率映射和现有轨迹聚类方法的一些问题。我们的方法是根据模拟的幼虫分散数据进行评估的,该数据基于一个现实世界的模型,该模型是海洋科学领域工作的一部分。
{"title":"Where have all the larvae gone? Towards Fast Main Pathway Identification from Geospatial Trajectories","authors":"Carola Trahms, P. Handmann, W. Rath, M. Visbeck, M. Renz","doi":"10.1145/3469830.3470896","DOIUrl":"https://doi.org/10.1145/3469830.3470896","url":null,"abstract":"The distribution of passively drifting particles within highly turbulent flows is a classic problem in marine sciences. The use of trajectory clustering on huge amounts of simulated marine trajectory data to identify main pathways of drifting particles has not been widely investigated from a data science perspective yet. In this paper, we propose a fast and computationally light method to efficiently identify main pathways in large amounts of trajectory data. It aims at overcoming some of the issues of probabilistic maps and existing trajectory clustering approaches. Our approach is evaluated against simulated larvae dispersion data based on a real-world model that have been produced as part of work in the marine science domain.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133821935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Clustering of Adverse Events of Post-Market Approved Drugs 上市后批准药物不良事件的聚类
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470903
Ahmed Askar, Andreas Zuefle
Adverse side effects of a drug may vary over space and time due to different populations, environments, and drug quality. Discovering all side effects during the development process is impossible. Once a drug is approved, observed adverse effects are reported by doctors and patients and made available in the Adverse Event Reporting System provided by the U.S. Food and Drug Administration . Mining such records of reported adverse effects, this study proposes a spatial clustering approach to identify regions that exhibit similar adverse effects. We apply a topic modeling approach on textual representations of reported adverse effects using Latent Dirichlet Allocation. By describing a spatial region as a mixture of the resulting latent topics, we find clusters of regions that exhibit similar (topics of) adverse events for the same drug using Hierarchical Agglomerative Clustering. We investigate the resulting clusters for spatial autocorrelation to test the hypothesis that certain (topics of) adverse effects may occur only in certain spatial regions using Moran’s I measure of spatial autocorrelation. Our experimental evaluation exemplary applies our proposed framework to a number of blood-thinning drugs, showing that some drugs exhibit more coherent textual topics among their reported adverse effects than other drugs, but showing no significant spatial autocorrelation of these topics. Our approach can be applied to other drugs or vaccines to study if spatially localized adverse effects may justify further investigation.
由于不同的人群、环境和药物质量,药物的不良副作用可能随时间和空间而变化。在开发过程中发现所有副作用是不可能的。一旦药物被批准,观察到的不良反应将由医生和患者报告,并在美国食品和药物管理局提供的不良事件报告系统中提供。挖掘这些报告的不良影响记录,本研究提出了一种空间聚类方法来识别表现出类似不良影响的区域。我们使用潜在狄利克雷分配对报告的不利影响的文本表示应用主题建模方法。通过将空间区域描述为由此产生的潜在主题的混合物,我们发现使用分层聚集聚类的区域簇表现出相同药物的类似(主题)不良事件。我们研究了空间自相关的结果簇,以检验使用Moran 's I空间自相关测量的某些(主题)不利影响可能仅在某些空间区域发生的假设。我们的实验评估示例将我们提出的框架应用于许多血液稀释药物,表明一些药物在其报告的不良反应中表现出比其他药物更连贯的文本主题,但这些主题没有显着的空间自相关性。我们的方法可以应用于其他药物或疫苗,以研究空间局部不良反应是否值得进一步研究。
{"title":"Clustering of Adverse Events of Post-Market Approved Drugs","authors":"Ahmed Askar, Andreas Zuefle","doi":"10.1145/3469830.3470903","DOIUrl":"https://doi.org/10.1145/3469830.3470903","url":null,"abstract":"Adverse side effects of a drug may vary over space and time due to different populations, environments, and drug quality. Discovering all side effects during the development process is impossible. Once a drug is approved, observed adverse effects are reported by doctors and patients and made available in the Adverse Event Reporting System provided by the U.S. Food and Drug Administration . Mining such records of reported adverse effects, this study proposes a spatial clustering approach to identify regions that exhibit similar adverse effects. We apply a topic modeling approach on textual representations of reported adverse effects using Latent Dirichlet Allocation. By describing a spatial region as a mixture of the resulting latent topics, we find clusters of regions that exhibit similar (topics of) adverse events for the same drug using Hierarchical Agglomerative Clustering. We investigate the resulting clusters for spatial autocorrelation to test the hypothesis that certain (topics of) adverse effects may occur only in certain spatial regions using Moran’s I measure of spatial autocorrelation. Our experimental evaluation exemplary applies our proposed framework to a number of blood-thinning drugs, showing that some drugs exhibit more coherent textual topics among their reported adverse effects than other drugs, but showing no significant spatial autocorrelation of these topics. Our approach can be applied to other drugs or vaccines to study if spatially localized adverse effects may justify further investigation.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129220841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatial Dimensions of Algorithmic Transparency: A Summary 算法透明度的空间维度:综述
Pub Date : 2021-08-23 DOI: 10.1145/3469830.3470898
Jayant Gupta, A. Long, C. Xu, Tian Tang, S. Shekhar
Spatial data brings an important dimension to AI’s quest for algorithmic transparency. For example, data driven computer-aided policy-decisions use measures of segregation (e.g., dissimilarity index) or income-inequality (e.g., Gini index), and these measures are affected by space partitioning choice. This may lead policymakers to underestimate the level of inequality or segregation within a region. The problem stems from the fact that many segregation based analyses use aggregated census data but do not report result sensitivity to choice of spatial partitioning (e.g., census block, tract). Beyond the well-known Modifiable Areal Unit Problem, this paper shows (via mathematical proofs as well as case studies with census data and census based synthetic micro-population data) that values of many measures (e.g., Gini index, dissimilarity index) diminish monotonically with increasing spatial-unit size in a hierarchical space partitioning (e.g., block, block-group, tract), however the ranking based on spatially aggregated measures remain sensitive to the scale of spatial partitions (e.g., block, block group). This paper highlights the need for social scientists to report how rankings of inequality are affected by the choice of spatial partitions.
空间数据为人工智能对算法透明度的追求带来了一个重要的维度。例如,数据驱动的计算机辅助决策使用隔离度量(例如,不相似指数)或收入不平等度量(例如,基尼指数),而这些度量受到空间划分选择的影响。这可能会导致政策制定者低估一个地区内部的不平等或隔离程度。这个问题源于这样一个事实,即许多基于隔离的分析使用汇总的人口普查数据,但没有报告结果对空间划分选择的敏感性(例如,人口普查块,地区)。除了众所周知的可修改面积单位问题之外,本文通过数学证明以及人口普查数据和基于人口普查的合成微人口数据的案例研究表明,在分层空间划分(例如块,块群,区域)中,许多度量(例如基尼指数,不相似指数)的值随着空间单元大小的增加而单调减小。然而,基于空间聚合措施的排名仍然对空间分区的规模(例如,块,块组)敏感。这篇论文强调了社会科学家报告不平等排名如何受到空间划分选择的影响的必要性。
{"title":"Spatial Dimensions of Algorithmic Transparency: A Summary","authors":"Jayant Gupta, A. Long, C. Xu, Tian Tang, S. Shekhar","doi":"10.1145/3469830.3470898","DOIUrl":"https://doi.org/10.1145/3469830.3470898","url":null,"abstract":"Spatial data brings an important dimension to AI’s quest for algorithmic transparency. For example, data driven computer-aided policy-decisions use measures of segregation (e.g., dissimilarity index) or income-inequality (e.g., Gini index), and these measures are affected by space partitioning choice. This may lead policymakers to underestimate the level of inequality or segregation within a region. The problem stems from the fact that many segregation based analyses use aggregated census data but do not report result sensitivity to choice of spatial partitioning (e.g., census block, tract). Beyond the well-known Modifiable Areal Unit Problem, this paper shows (via mathematical proofs as well as case studies with census data and census based synthetic micro-population data) that values of many measures (e.g., Gini index, dissimilarity index) diminish monotonically with increasing spatial-unit size in a hierarchical space partitioning (e.g., block, block-group, tract), however the ranking based on spatially aggregated measures remain sensitive to the scale of spatial partitions (e.g., block, block group). This paper highlights the need for social scientists to report how rankings of inequality are affected by the choice of spatial partitions.","PeriodicalId":206910,"journal":{"name":"17th International Symposium on Spatial and Temporal Databases","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129447932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
17th International Symposium on Spatial and Temporal Databases
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1