Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems最新文献

英文中文

Computing highly occluded paths using a sparse network 使用稀疏网络计算高度闭塞的路径

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666394

Niel Lebeck, Thomas Mølhave, P. Agarwal

Computing paths over a terrain that are highly occluded with respect to observers is an important problem in GIS. Given a fast algorithm for computing the visibility map, the path-planning step becomes the bottleneck. In this paper, we present an approach for quickly computing occluded paths over a terrain using a sparse network, a sparse 1-dimensional network over the terrain. We present different strategies for constructing the sparse network. Experimental results show that our approach results in significantly improved time for computing highly occluded paths between two query points, and that the different strategies offer a tradeoff between higher-quality paths and lower preprocessing times. Furthermore, there are strategies that achieve near-optimal paths with small preprocessing cost.

在地理信息系统中，计算高度遮挡的地形上的路径是一个重要的问题。在给定快速可见性映射计算算法的情况下，路径规划步骤成为瓶颈。在本文中，我们提出了一种使用稀疏网络快速计算地形上遮挡路径的方法，即地形上的稀疏一维网络。我们提出了构建稀疏网络的不同策略。实验结果表明，我们的方法显著提高了计算两个查询点之间高度闭塞路径的时间，并且不同的策略提供了更高质量路径和更低预处理时间之间的权衡。此外，还有一些策略可以用较小的预处理成本实现接近最优的路径。

引用次数: 4

TREADS: a safe route recommender using social media mining and text summarization TREADS:使用社交媒体挖掘和文本摘要的安全路线推荐

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666368

Kaiqun Fu, Yen-Cheng Lu, Chang-Tien Lu

This paper presents TREADS, a novel travel route recommendation system that suggests safe travel itineraries in real time by incorporating social media data resources and points of interest review summarization techniques. The system consists of an efficient route recommendation service that considers safety and user interest factors, a transportation related tweets retriever with high accuracy, and a novel text summarization module that provides summaries of location based Twitter data and Yelp reviews to enhance our route recommendation service. We demonstrate the system by utilizing crime and points of interest data in the Washington DC area. TREADS is targeted to provide safe, effective, and convenient travel strategies for commuters and tourists. Our proposed system, integrated with multiple social media resources, can greatly improve the travel experience for tourists in unfamiliar cities.

本文介绍了一种新的旅行路线推荐系统TREADS，该系统通过整合社交媒体数据资源和兴趣点审查汇总技术，实时推荐安全的旅行路线。该系统包括一个考虑安全和用户兴趣因素的高效路线推荐服务，一个高精度的交通相关推文检索器，以及一个新颖的文本摘要模块，该模块提供基于位置的Twitter数据和Yelp评论的摘要，以增强我们的路线推荐服务。我们通过利用华盛顿特区的犯罪和兴趣点数据来演示该系统。TREADS旨在为通勤者和游客提供安全、有效、便捷的出行策略。我们提出的系统与多种社交媒体资源相结合，可以极大地改善游客在陌生城市的旅行体验。

引用次数: 33

Efficient itinerary planning with category constraints 有效的行程规划与类别约束

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666411

P. Bolzoni, S. Helmer, Kevin Wellenzohn, J. Gamper, Periklis Andritsos

We propose a more realistic approach to trip planning for tourist applications by adding category information to points of interest (POIs). This makes it easier for tourists to formulate their preferences by stating constraints on categories rather than individual POIs. However, solving this problem is not just a matter of extending existing algorithms. In our approach we exploit the fact that POIs are usually not evenly distributed but tend to appear in clusters. We develop a group of efficient algorithms based on clustering with guaranteed theoretical bounds. We also evaluate our algorithms experimentally, using real-world data sets, showing that in practice the results are better than the theoretical guarantees and very close to the optimal solution.

我们提出了一种更现实的旅游应用程序旅行规划方法，即在兴趣点(poi)中添加类别信息。这使得游客更容易通过说明类别限制而不是单独的poi来制定他们的偏好。然而，解决这个问题不仅仅是扩展现有算法的问题。在我们的方法中，我们利用poi通常不是均匀分布的事实，而是倾向于出现在集群中。我们开发了一组基于保证理论界聚类的高效算法。我们还通过实验评估了我们的算法，使用真实世界的数据集，表明在实践中结果比理论保证更好，并且非常接近最优解。

引用次数: 37

A demonstration of GeomSMS: an SMS framework for sharing geospatial features 演示GeomSMS:一个共享地理空间特征的SMS框架

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666372

Chandan Misra, A. Dasgupta, S. Ghosh, D. Bhattacharyya

This work presents GeomSMS as the first full-fledged SMS framework with the native support for geometric objects for sharing spatial information ubiquitously across mobile users. GeomSMS is an extension to Open GeoSMS Standard by Open GeoSpatial Consortium (OGC) that provides developers a Short Message Service (SMS) encoding for sharing only location information, namely latitude and longitude, between location based services (LBS) and applications. GeomSMS keeps the GeoSMS standard as it is, but adds support for sharing two other geometric features: line and polygon, apart from existing point feature. GeomSMS shares these features in the SMS payload without altering the GeoSMS standard. We describe the architecture of the system that utilizes the framework and demonstrates a real-life mobile application BeckonMe with one example from each of line and polygon feature.

这项工作提出了GeomSMS作为第一个成熟的SMS框架，具有对几何对象的原生支持，可以在移动用户之间无处不在地共享空间信息。GeomSMS是开放地理空间联盟(OGC)对开放地理空间管理标准的扩展，它为开发人员提供了一种短消息服务(SMS)编码，用于在基于位置的服务(LBS)和应用程序之间共享位置信息，即纬度和经度。GeomSMS保留了GeoSMS标准，但除了现有的点特征外，还增加了对共享另外两个几何特征的支持:直线和多边形。在不改变GeoSMS标准的情况下，GeomSMS在SMS有效载荷中共享这些功能。我们描述了利用该框架的系统架构，并演示了一个现实生活中的移动应用程序BeckonMe，其中包括每个线和多边形特征的一个示例。

引用次数: 1

Ambiguity and plausibility: managing classification quality in volunteered geographic information 模糊与似是而非:地理信息分类质量管理

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666392

Ahmed Loai Ali, Falko Schmid, R. Al-Salman, Tomi Kauppinen

With the ubiquity of technology and tools, current Volunteered Geographic Information (VGI) projects allow the public to contribute, maintain, and use geo-spatial data. One of the most prominent and successful VGI project is OpenStreetMap (OSM), where more than one million volunteers collected and contributed data that is obtainable for everybody. However, this kind of contribution mechanism is usually associated with data quality issues, e.g., geographic entities such as gardens or parks can be assigned with inappropriate classification by volunteers. Based on the observation that geographic features usually inherit certain properties and characteristics, we propose a novel classification-based approach allowing the identification of entities with inappropriate classification. We use the rich data set of OSM to analyze the properties of geographic entities with respect to their implicit characteristics in order to develop classifiers based on them. Our developed classifiers show high detection accuracies. However, due to the absence of proper training data we additionally performed a user study to verify our findings by means of intra-user-agreement. The results of our study support the detections of our classifiers and show that our classification-based approaches can be a valuable tool for managing and improving VGI data.

随着技术和工具的普及，当前的志愿地理信息(VGI)项目允许公众贡献、维护和使用地理空间数据。最突出和最成功的VGI项目之一是OpenStreetMap (OSM)，其中有超过一百万志愿者收集并贡献了每个人都可以获得的数据。然而，这种贡献机制通常与数据质量问题有关，例如，志愿人员可能以不适当的分类分配花园或公园等地理实体。基于地理特征通常继承某些属性和特征的观察，我们提出了一种新的基于分类的方法，允许识别不适当分类的实体。我们使用OSM的丰富数据集来分析地理实体的隐式特征属性，从而开发基于它们的分类器。我们开发的分类器显示出较高的检测精度。然而，由于缺乏适当的培训数据，我们另外进行了一项用户研究，通过用户内部协议来验证我们的发现。我们的研究结果支持我们的分类器的检测，并表明我们基于分类的方法可以成为管理和改进VGI数据的有价值的工具。

{"title":"Ambiguity and plausibility: managing classification quality in volunteered geographic information","authors":"Ahmed Loai Ali, Falko Schmid, R. Al-Salman, Tomi Kauppinen","doi":"10.1145/2666310.2666392","DOIUrl":"https://doi.org/10.1145/2666310.2666392","url":null,"abstract":"With the ubiquity of technology and tools, current Volunteered Geographic Information (VGI) projects allow the public to contribute, maintain, and use geo-spatial data. One of the most prominent and successful VGI project is OpenStreetMap (OSM), where more than one million volunteers collected and contributed data that is obtainable for everybody. However, this kind of contribution mechanism is usually associated with data quality issues, e.g., geographic entities such as gardens or parks can be assigned with inappropriate classification by volunteers. Based on the observation that geographic features usually inherit certain properties and characteristics, we propose a novel classification-based approach allowing the identification of entities with inappropriate classification. We use the rich data set of OSM to analyze the properties of geographic entities with respect to their implicit characteristics in order to develop classifiers based on them. Our developed classifiers show high detection accuracies. However, due to the absence of proper training data we additionally performed a user study to verify our findings by means of intra-user-agreement. The results of our study support the detections of our classifiers and show that our classification-based approaches can be a valuable tool for managing and improving VGI data.","PeriodicalId":153031,"journal":{"name":"Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129607042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 52

Spatio-temporal trajectory simplification for inferring travel paths 用于推断旅行路径的时空轨迹简化

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666409

Hengfeng Li, L. Kulik, K. Ramamohanarao

Mining GPS trajectories of moving vehicles has led to many research directions, such as traffic modeling and driving predication. An important challenge is how to map GPS traces to a road network accurately under noisy conditions. However, to the best of our knowledge, there is no existing work that first simplifies a trajectory to improve map matching. In this paper we propose three trajectory simplification algorithms that can deal with both offline and online trajectory data. We use weighting functions to incorporate spatial knowledge, such as segment lengths and turning angles, into our simplification algorithms. In addition, we measure the noise degree of a GPS point based on its spatio-temporal relationship to its neighbors. The effectiveness of our algorithms is comprehensively evaluated on real trajectory datasets with varying the noise levels and sampling rates. Our evaluation shows that under highly noisy conditions, our proposed algorithms considerably improve map matching accuracy and reduce computational costs compared to the state-of-the-art methods.

GPS运动轨迹的挖掘已成为交通建模和驾驶预测等领域的重要研究方向。一个重要的挑战是如何在噪声条件下准确地将GPS轨迹映射到道路网络。然而，据我们所知，目前还没有一项工作是首先简化轨迹来提高地图匹配。本文提出了三种可以同时处理离线和在线轨迹数据的轨迹简化算法。我们使用加权函数将空间知识(如路段长度和转弯角度)纳入我们的简化算法中。此外，我们还基于GPS点与其相邻点的时空关系来测量其噪声程度。在不同噪声水平和采样率的真实轨迹数据集上全面评估了算法的有效性。我们的评估表明，在高噪声条件下，与最先进的方法相比，我们提出的算法显着提高了地图匹配精度并降低了计算成本。

引用次数: 14

Secure mutual proximity zone enclosure evaluation 安全相互接近区域外壳评估

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666384

Sunoh Choi, Gabriel Ghinita, E. Bertino

Mobile users engage in novel and exciting location-based social media applications (e.g., geosocial networks, spatial crowdsourcing) in which they interact with other users situated in their proximity. In several application scenarios, users define their own proximity zones of interest (typically in the form of polygonal regions, such as a collection of city blocks), and want to find other users with whom they are in a mutual enclosure relationship with respect to their respective proximity zones. This boils down to evaluating two point-in-polygon enclosure conditions, which is easy to achieve for revealed user locations and proximity zones. However, users may be reluctant to share their whereabouts with their friends and with social media service providers, as location data can help one infer sensitive details such as an individual's health status, financial situation or lifestyle choices. In this paper, we propose a mechanism that allows users to securely evaluate mutual proximity zone enclosure on encrypted location data. Our solution uses homomorphic encryption, and supports convex polygonal proximity zones. We provide a security analysis of the proposed solution, we investigate performance optimizations, and we show experimentally that our approach scales well for datasets of millions of users.

移动用户参与新颖和令人兴奋的基于位置的社交媒体应用程序(例如，地理社交网络，空间众包)，他们在其中与位于他们附近的其他用户互动。在一些应用场景中，用户定义自己感兴趣的邻近区域(通常以多边形区域的形式，例如城市街区的集合)，并希望找到与其各自邻近区域处于相互封闭关系的其他用户。这可以归结为评估两个多边形中点的封闭条件，这对于显示的用户位置和邻近区域来说很容易实现。然而，用户可能不愿意与朋友和社交媒体服务提供商分享他们的行踪，因为位置数据可以帮助人们推断出个人的健康状况、财务状况或生活方式选择等敏感细节。在本文中，我们提出了一种机制，允许用户安全地评估加密位置数据上的相互接近区域。我们的解决方案使用同态加密，并支持凸多边形邻近区。我们对所建议的解决方案进行了安全性分析，研究了性能优化，并通过实验表明，我们的方法可以很好地适用于数百万用户的数据集。

{"title":"Secure mutual proximity zone enclosure evaluation","authors":"Sunoh Choi, Gabriel Ghinita, E. Bertino","doi":"10.1145/2666310.2666384","DOIUrl":"https://doi.org/10.1145/2666310.2666384","url":null,"abstract":"Mobile users engage in novel and exciting location-based social media applications (e.g., geosocial networks, spatial crowdsourcing) in which they interact with other users situated in their proximity. In several application scenarios, users define their own proximity zones of interest (typically in the form of polygonal regions, such as a collection of city blocks), and want to find other users with whom they are in a mutual enclosure relationship with respect to their respective proximity zones. This boils down to evaluating two point-in-polygon enclosure conditions, which is easy to achieve for revealed user locations and proximity zones. However, users may be reluctant to share their whereabouts with their friends and with social media service providers, as location data can help one infer sensitive details such as an individual's health status, financial situation or lifestyle choices. In this paper, we propose a mechanism that allows users to securely evaluate mutual proximity zone enclosure on encrypted location data. Our solution uses homomorphic encryption, and supports convex polygonal proximity zones. We provide a security analysis of the proposed solution, we investigate performance optimizations, and we show experimentally that our approach scales well for datasets of millions of users.","PeriodicalId":153031,"journal":{"name":"Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116021235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

BioenergyKDF: enabling spatiotemporal data synthesis and research collaboration 生物能源kdf:实现时空数据综合和研究合作

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666488

Aaron T. Myers, S. Movva, R. Karthik, B. Bhaduri, D. White, N. Thomas, Adrian S Z Chase

The Bioenergy Knowledge Discovery Framework (BioenergyKDF) is a scalable, web-based collaborative environment for scientists working on bioenergy related research in which the connections between data, literature, and models can be explored and more clearly understood. The fully-operational and deployed system, built on multiple open source libraries and architectures, stores contributions from the community of practice and makes them easy to find, but that is just its base functionality. The BioenergyKDF provides a national spatiotemporal decision support capability that enables data sharing, analysis, modeling, and visualization as well as fosters the development and management of the U.S. bioenergy infrastructure, which is an essential component of the national energy infrastructure. The BioenergyKDF is built on a flexible, customizable platform that can be extended to support the requirements of any user community---especially those that work with spatiotemporal data. While there are several community data-sharing software platforms available, some developed and distributed by national governments, none of them have the full suite of capabilities available in BioenergyKDF. For example, this component-based platform and database independent architecture allows it to be quickly deployed to existing infrastructure and to connect to existing data repositories (spatial or otherwise). As new data, analysis, and features are added; the BioenergyKDF will help lead research and support decisions concerning bioenergy into the future, but will also enable the development and growth of additional communities of practice both inside and outside of the Department of Energy. These communities will be able to leverage the substantial investment the agency has made in the KDF platform to quickly stand up systems that are customized to their data and research needs.

生物能源知识发现框架(BioenergyKDF)是一个可扩展的、基于网络的协作环境，供从事生物能源相关研究的科学家使用，在这个环境中，可以探索和更清楚地理解数据、文献和模型之间的联系。完全可操作和部署的系统，构建在多个开源库和架构上，存储来自实践社区的贡献，并使它们易于找到，但这只是它的基本功能。生物能源kdf提供国家时空决策支持能力，使数据共享、分析、建模和可视化成为可能，同时促进美国生物能源基础设施的开发和管理，这是国家能源基础设施的重要组成部分。BioenergyKDF建立在一个灵活的、可定制的平台上，可以扩展以支持任何用户群体的需求，特别是那些处理时空数据的用户群体。虽然有几个可用的社区数据共享软件平台，其中一些是由国家政府开发和分发的，但它们都没有BioenergyKDF提供的全套功能。例如，这种基于组件的平台和独立于数据库的体系结构允许它快速部署到现有的基础设施，并连接到现有的数据存储库(空间或其他)。随着新数据、分析和特性的加入;生物能源kdf将帮助领导未来有关生物能源的研究和支持决策，但也将使能源部内外的其他实践社区的发展和成长成为可能。这些社区将能够利用该机构在KDF平台上所做的大量投资，快速建立根据其数据和研究需求定制的系统。

{"title":"BioenergyKDF: enabling spatiotemporal data synthesis and research collaboration","authors":"Aaron T. Myers, S. Movva, R. Karthik, B. Bhaduri, D. White, N. Thomas, Adrian S Z Chase","doi":"10.1145/2666310.2666488","DOIUrl":"https://doi.org/10.1145/2666310.2666488","url":null,"abstract":"The Bioenergy Knowledge Discovery Framework (BioenergyKDF) is a scalable, web-based collaborative environment for scientists working on bioenergy related research in which the connections between data, literature, and models can be explored and more clearly understood. The fully-operational and deployed system, built on multiple open source libraries and architectures, stores contributions from the community of practice and makes them easy to find, but that is just its base functionality. The BioenergyKDF provides a national spatiotemporal decision support capability that enables data sharing, analysis, modeling, and visualization as well as fosters the development and management of the U.S. bioenergy infrastructure, which is an essential component of the national energy infrastructure. The BioenergyKDF is built on a flexible, customizable platform that can be extended to support the requirements of any user community---especially those that work with spatiotemporal data. While there are several community data-sharing software platforms available, some developed and distributed by national governments, none of them have the full suite of capabilities available in BioenergyKDF. For example, this component-based platform and database independent architecture allows it to be quickly deployed to existing infrastructure and to connect to existing data repositories (spatial or otherwise). As new data, analysis, and features are added; the BioenergyKDF will help lead research and support decisions concerning bioenergy into the future, but will also enable the development and growth of additional communities of practice both inside and outside of the Department of Energy. These communities will be able to leverage the substantial investment the agency has made in the KDF platform to quickly stand up systems that are customized to their data and research needs.","PeriodicalId":153031,"journal":{"name":"Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128635152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Index-supported pattern matching on symbolic trajectories 符号轨迹上索引支持的模式匹配

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666402

Fabio Valdés, R. H. Güting

Recording mobility data with GPS-enabled devices, e.g., smart phones or vehicles, has become a common issue for private persons, companies, and institutions. Consequently, the requirements for managing these enormous datasets have increased drastically, so trajectory management has become an active research field. In order to avoid querying raw trajectories, which is neither convenient nor efficient, a symbolic representation of the geometric data has been introduced. A comprehensive framework for describing and querying symbolic trajectories including an expressive pattern language as well as an efficient matching algorithm was presented lately. A symbolic trajectory, basically being a time-dependent symbolic value (e.g., a label), can contain names of traversed roads, a speed profile, transportation modes, behaviors of animals, or cells inside a cellular network. The quality and efficiency of transportation systems, targeted advertising, animal research, crime investigation, etc. may be improved by analyzing such data. The main contribution of this paper is an improvement of our previous approach, featuring algorithms and data structures optimizing the matching of symbolic trajectories for any kind of pattern with the help of two indexes. More specifically, a trie is applied for the symbolic values (i.e., labels or places), while the time intervals are stored in a one-dimensional R-tree. Hence, we avoid the linear scan of every trajectory, being necessary without index support. As a result, the computation cost for the pattern matching is nearly independent from the trajectory size. Our work details the concept and the implementation of the new approach, followed by an experimental evaluation.

使用具有gps功能的设备(例如智能手机或车辆)记录移动数据已成为个人、公司和机构的共同问题。因此，管理这些庞大数据集的需求急剧增加，因此轨迹管理已成为一个活跃的研究领域。为了避免查询原始轨迹既不方便又不高效，引入了几何数据的符号表示。最近提出了一种描述和查询符号轨迹的综合框架，包括一种表达模式语言和一种高效的匹配算法。符号轨迹，基本上是一个与时间相关的符号值(例如，一个标签)，可以包含走过的道路的名称，速度轮廓，运输方式，动物的行为，或蜂窝网络中的细胞。通过分析这些数据，可以提高交通系统、定向广告、动物研究、犯罪调查等的质量和效率。本文的主要贡献是对我们以前的方法的改进，其特点是算法和数据结构在两个索引的帮助下优化任何类型模式的符号轨迹匹配。更具体地说，对符号值(即标签或位置)应用树，而时间间隔存储在一维r树中。因此，我们避免了在没有索引支持的情况下对每个轨迹进行线性扫描。因此，模式匹配的计算成本几乎与轨迹大小无关。我们的工作详细介绍了新方法的概念和实施，然后进行了实验评估。

{"title":"Index-supported pattern matching on symbolic trajectories","authors":"Fabio Valdés, R. H. Güting","doi":"10.1145/2666310.2666402","DOIUrl":"https://doi.org/10.1145/2666310.2666402","url":null,"abstract":"Recording mobility data with GPS-enabled devices, e.g., smart phones or vehicles, has become a common issue for private persons, companies, and institutions. Consequently, the requirements for managing these enormous datasets have increased drastically, so trajectory management has become an active research field. In order to avoid querying raw trajectories, which is neither convenient nor efficient, a symbolic representation of the geometric data has been introduced. A comprehensive framework for describing and querying symbolic trajectories including an expressive pattern language as well as an efficient matching algorithm was presented lately. A symbolic trajectory, basically being a time-dependent symbolic value (e.g., a label), can contain names of traversed roads, a speed profile, transportation modes, behaviors of animals, or cells inside a cellular network. The quality and efficiency of transportation systems, targeted advertising, animal research, crime investigation, etc. may be improved by analyzing such data. The main contribution of this paper is an improvement of our previous approach, featuring algorithms and data structures optimizing the matching of symbolic trajectories for any kind of pattern with the help of two indexes. More specifically, a trie is applied for the symbolic values (i.e., labels or places), while the time intervals are stored in a one-dimensional R-tree. Hence, we avoid the linear scan of every trajectory, being necessary without index support. As a result, the computation cost for the pattern matching is nearly independent from the trajectory size. Our work details the concept and the implementation of the new approach, followed by an experimental evaluation.","PeriodicalId":153031,"journal":{"name":"Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127588981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

SATO: a spatial data partitioning framework for scalable query processing SATO:用于可扩展查询处理的空间数据分区框架

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pub Date : 2014-11-04 DOI: 10.1145/2666310.2666365

Hoang Vo, Ablimit Aji, Fusheng Wang

Scalable spatial query processing relies on effective spatial data partitioning for query parallelization, data pruning, and load balancing. These are often challenged by the intrinsic characteristics of spatial data, such as high skew in data distribution and high complexity of irregular multi-dimensional objects. In this demo, we present SATO, a spatial data partitioning framework that can quickly analyze and partition spatial data with an optimal spatial partitioning strategy for scalable query processing. SATO works in following steps: 1) Sample, which samples a small fraction of input data for analysis, 2) Analyze, which quickly analyzes sampled data to find an optimal partition strategy, 3) Tear, which provides data skew aware partitioning and supports MapReduce based scalable partitioning, and 4) Optimize, which collects succinct partition statistics for potential query optimization. SATO also provides multiple level partitioning, which can be used to significantly improve window based queries in cloud based spatial query processing systems. SATO comes with a visualization component that provides heat maps and histograms for qualitative evaluation. SATO has been implemented within the Hadoop-GIS, a high performance spatial data warehousing system over MapReduce. SATO is also released as an independent software package to support various scalable spatial query processing systems. Our experiments have demonstrated that SATO can generate much balanced partitioning that can significantly improve spatial query performance with MapReduce comparing to traditional spatial partitioning approaches.

可扩展的空间查询处理依赖于有效的空间数据分区来实现查询并行化、数据修剪和负载平衡。这些往往受到空间数据的固有特性的挑战，如数据分布的高度偏态和不规则多维对象的高度复杂性。在这个演示中，我们介绍了SATO，这是一个空间数据分区框架，可以使用可扩展查询处理的最佳空间分区策略快速分析和分区空间数据。SATO的工作步骤如下:1)Sample(采样一小部分输入数据进行分析)，2)Analyze(快速分析采样数据以找到最优分区策略)，3)Tear(提供数据倾斜感知分区并支持基于MapReduce的可扩展分区)，以及4)Optimize(收集简洁的分区统计数据以进行潜在的查询优化)。SATO还提供多级分区，可用于显著改进基于云的空间查询处理系统中基于窗口的查询。SATO附带一个可视化组件，提供热图和直方图进行定性评估。SATO已经在Hadoop-GIS中实现，这是一个基于MapReduce的高性能空间数据仓库系统。SATO还作为一个独立的软件包发布，以支持各种可扩展的空间查询处理系统。我们的实验表明，与传统的空间分区方法相比，SATO可以生成更加平衡的分区，可以显着提高MapReduce的空间查询性能。

{"title":"SATO: a spatial data partitioning framework for scalable query processing","authors":"Hoang Vo, Ablimit Aji, Fusheng Wang","doi":"10.1145/2666310.2666365","DOIUrl":"https://doi.org/10.1145/2666310.2666365","url":null,"abstract":"Scalable spatial query processing relies on effective spatial data partitioning for query parallelization, data pruning, and load balancing. These are often challenged by the intrinsic characteristics of spatial data, such as high skew in data distribution and high complexity of irregular multi-dimensional objects. In this demo, we present SATO, a spatial data partitioning framework that can quickly analyze and partition spatial data with an optimal spatial partitioning strategy for scalable query processing. SATO works in following steps: 1) Sample, which samples a small fraction of input data for analysis, 2) Analyze, which quickly analyzes sampled data to find an optimal partition strategy, 3) Tear, which provides data skew aware partitioning and supports MapReduce based scalable partitioning, and 4) Optimize, which collects succinct partition statistics for potential query optimization. SATO also provides multiple level partitioning, which can be used to significantly improve window based queries in cloud based spatial query processing systems. SATO comes with a visualization component that provides heat maps and histograms for qualitative evaluation. SATO has been implemented within the Hadoop-GIS, a high performance spatial data warehousing system over MapReduce. SATO is also released as an independent software package to support various scalable spatial query processing systems. Our experiments have demonstrated that SATO can generate much balanced partitioning that can significantly improve spatial query performance with MapReduce comparing to traditional spatial partitioning approaches.","PeriodicalId":153031,"journal":{"name":"Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"214 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121471351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 69

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀