首页 > 最新文献

Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems最新文献

英文 中文
Gloria
Rachid Kachemir, Brad Kellett, Krishna Behara
Indexing and delivering spatial data to a massive user base composed of over a billion devices around the world stretches the limits of traditional infrastructure and operational tools. For instance, offline bulk indexing and loading fall short of viable solutions when it comes to data at scale; Integration with distributed systems such as Apache Hadoop© or Spark© is sparse, while data loading is often performed in a sub-optimal fashion by relying on intermediate file formats. We present in this paper an approach toward a hybrid on- line/offline indexing framework called Gloria that has been running in production settings for the past year at over 350k requests per seconds with lookup latencies under 5μs. The resulting output is an in-memory key-value store and we show that by leveraging higher level MapReduce [7] constructs as defined in FlumeJava [5], Gloria can achieve large scale key-value offline indexing in a fraction of the time required by traditional datastores while maintaining similar operational performance. Gloria also provides a spatial layer based on improvements to pointer-less quadtrees [12] and locational identifiers we call shift key that reduces the nearest neighbor problem in spatial data to simple key-value lookups. Shift keys have shown to outperform well established solutions such as Google S2 with locational key operations.
{"title":"Gloria","authors":"Rachid Kachemir, Brad Kellett, Krishna Behara","doi":"10.1145/2996913.2997013","DOIUrl":"https://doi.org/10.1145/2996913.2997013","url":null,"abstract":"Indexing and delivering spatial data to a massive user base composed of over a billion devices around the world stretches the limits of traditional infrastructure and operational tools. For instance, offline bulk indexing and loading fall short of viable solutions when it comes to data at scale; Integration with distributed systems such as Apache Hadoop© or Spark© is sparse, while data loading is often performed in a sub-optimal fashion by relying on intermediate file formats. We present in this paper an approach toward a hybrid on- line/offline indexing framework called Gloria that has been running in production settings for the past year at over 350k requests per seconds with lookup latencies under 5μs. The resulting output is an in-memory key-value store and we show that by leveraging higher level MapReduce [7] constructs as defined in FlumeJava [5], Gloria can achieve large scale key-value offline indexing in a fraction of the time required by traditional datastores while maintaining similar operational performance. Gloria also provides a spatial layer based on improvements to pointer-less quadtrees [12] and locational identifiers we call shift key that reduces the nearest neighbor problem in spatial data to simple key-value lookups. Shift keys have shown to outperform well established solutions such as Google S2 with locational key operations.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81523982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deducing individual driving preferences for user-aware navigation 为用户感知导航推断个人驾驶偏好
S. Funke, S. Laue, Sabine Storandt
We study the problem of learning individual route preferences of drivers. Most current route planning services only compute shortest or quickest paths. But many other criteria might play a role for a user to prefer a certain route, as, e.g., fuel consumption, jam likeliness, road conditions, scenicness of the route, turns, allowed maximum speeds, toll costs and many more. Specifying the importance of each criterion manually is a non-trivial, unintuitive and time consuming undertaking for a user. Therefore, we develop approaches that deduce such preferences automatically based on paths previously driven by the user. We present an LP-formulation of the problem making use of a Dijkstra-based separation oracle. The resulting algorithm runs in polynomial time and allows for the user preference computation in few seconds even if several hundred routes are taken into account. Our experiments show that new route suggestions based on these learned preferences reflect the users definition of an optimal route very well.
我们研究了驾驶员个体路线偏好的学习问题。目前大多数路由规划服务只计算最短或最快的路径。但是,许多其他标准可能会对用户选择某条路线起作用,例如,燃料消耗,堵塞可能性,道路状况,路线的风景,转弯,允许的最大速度,收费费用等等。对于用户来说,手动指定每个标准的重要性是一项重要的、不直观的、耗时的工作。因此,我们开发了基于用户先前驱动的路径自动推断此类偏好的方法。我们提出了一个lp公式的问题,利用dijkstra为基础的分离预言。结果算法在多项式时间内运行,即使考虑数百条路由,也可以在几秒钟内计算出用户偏好。我们的实验表明,基于这些学习偏好的新路线建议很好地反映了用户对最优路线的定义。
{"title":"Deducing individual driving preferences for user-aware navigation","authors":"S. Funke, S. Laue, Sabine Storandt","doi":"10.1145/2996913.2997004","DOIUrl":"https://doi.org/10.1145/2996913.2997004","url":null,"abstract":"We study the problem of learning individual route preferences of drivers. Most current route planning services only compute shortest or quickest paths. But many other criteria might play a role for a user to prefer a certain route, as, e.g., fuel consumption, jam likeliness, road conditions, scenicness of the route, turns, allowed maximum speeds, toll costs and many more. Specifying the importance of each criterion manually is a non-trivial, unintuitive and time consuming undertaking for a user. Therefore, we develop approaches that deduce such preferences automatically based on paths previously driven by the user. We present an LP-formulation of the problem making use of a Dijkstra-based separation oracle. The resulting algorithm runs in polynomial time and allows for the user preference computation in few seconds even if several hundred routes are taken into account. Our experiments show that new route suggestions based on these learned preferences reflect the users definition of an optimal route very well.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"36 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82719057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Personalized location models with adaptive mixtures 具有自适应混合的个性化位置模型
Moshe Lichman, Dimitrios Kotzias, Padhraic Smyth
Personalization is increasingly important for a range of applications that rely on location-based modeling. A key aspect in building personalized models is using population-level information to smooth noisy sparse data at the individual level. In this paper we develop a general mixture model framework for learning individual-level location models where the model adaptively combines different types of smoothing information. In a series of experiments with Twitter geolocation data and Gowalla check-in data we demonstrate that the proposed approach can be significantly more accurate than more traditional smoothing and matrix factorization techniques. The improvement in performance over matrix factorization is pronounced and may be explained by the tendency of dimensionality reduction methods to over-smooth and not retain enough detail at the individual level.
对于依赖于基于位置的建模的一系列应用程序来说,个性化变得越来越重要。建立个性化模型的一个关键方面是使用种群级信息来平滑个体级的噪声稀疏数据。在本文中,我们开发了一个通用的混合模型框架,用于学习个人层面的位置模型,该模型自适应地组合了不同类型的平滑信息。在Twitter地理位置数据和Gowalla签到数据的一系列实验中,我们证明了所提出的方法比传统的平滑和矩阵分解技术要准确得多。与矩阵分解相比,性能的提高是明显的,这可能是由于降维方法倾向于过于平滑,而在个体层面上没有保留足够的细节。
{"title":"Personalized location models with adaptive mixtures","authors":"Moshe Lichman, Dimitrios Kotzias, Padhraic Smyth","doi":"10.1145/2996913.2996953","DOIUrl":"https://doi.org/10.1145/2996913.2996953","url":null,"abstract":"Personalization is increasingly important for a range of applications that rely on location-based modeling. A key aspect in building personalized models is using population-level information to smooth noisy sparse data at the individual level. In this paper we develop a general mixture model framework for learning individual-level location models where the model adaptively combines different types of smoothing information. In a series of experiments with Twitter geolocation data and Gowalla check-in data we demonstrate that the proposed approach can be significantly more accurate than more traditional smoothing and matrix factorization techniques. The improvement in performance over matrix factorization is pronounced and may be explained by the tendency of dimensionality reduction methods to over-smooth and not retain enough detail at the individual level.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"66 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82869965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Managing massive trajectories on the cloud 管理云上的大量轨迹
Jie Bao, Ruiyuan Li, Xiuwen Yi, Yu Zheng
With advances in location-acquisition techniques, such as GPS- embedded phones, an enormous volume of trajectory data is generated, by people, vehicles, and animals. This trajectory data is one of the most important data sources in many urban computing applications, e.g., traffic modeling, user profiling analysis, air quality inference, and resource allocation. To utilize large scale trajectory data efficiently and effectively, cloud computing platforms, e.g., Microsoft Azure, are the most convenient and economic way. However, traditional cloud computing platforms are not designed to deal with spatio-temporal data, such as trajectories. To this end, we design and implement a holistic cloud-based trajectory data management system on Microsoft Azure to bridge the gap between trajectory data and urban applications. Our system can efficiently store, index, and query large trajectory data with three functions: 1) trajectory ID-temporal query, 2) trajectory spatio-temporal query, and 3) trajectory mapmatching. The efficiency of the system is tested and tuned based on real-time trajectory data feeds. The system is currently used in many internal urban applications, as we will illustrate using case studies.
随着位置获取技术的进步,例如嵌入GPS的手机,大量的轨迹数据由人、车辆和动物产生。这些轨迹数据是许多城市计算应用中最重要的数据源之一,例如交通建模、用户分析、空气质量推断和资源分配。为了高效、有效地利用大规模轨迹数据,云计算平台,如微软Azure,是最方便、最经济的方式。然而,传统的云计算平台并不是为处理时空数据而设计的,比如轨迹。为此,我们在Microsoft Azure上设计并实现了一个基于云的整体轨迹数据管理系统,以弥合轨迹数据与城市应用之间的差距。我们的系统能够高效地存储、索引和查询大型轨迹数据,实现了三个功能:1)轨迹id -时间查询,2)轨迹时空查询,3)轨迹映射匹配。基于实时轨迹数据馈送,测试和调整了系统的效率。该系统目前在许多城市内部应用中使用,我们将使用案例研究来说明。
{"title":"Managing massive trajectories on the cloud","authors":"Jie Bao, Ruiyuan Li, Xiuwen Yi, Yu Zheng","doi":"10.1145/2996913.2996916","DOIUrl":"https://doi.org/10.1145/2996913.2996916","url":null,"abstract":"With advances in location-acquisition techniques, such as GPS- embedded phones, an enormous volume of trajectory data is generated, by people, vehicles, and animals. This trajectory data is one of the most important data sources in many urban computing applications, e.g., traffic modeling, user profiling analysis, air quality inference, and resource allocation. To utilize large scale trajectory data efficiently and effectively, cloud computing platforms, e.g., Microsoft Azure, are the most convenient and economic way. However, traditional cloud computing platforms are not designed to deal with spatio-temporal data, such as trajectories. To this end, we design and implement a holistic cloud-based trajectory data management system on Microsoft Azure to bridge the gap between trajectory data and urban applications. Our system can efficiently store, index, and query large trajectory data with three functions: 1) trajectory ID-temporal query, 2) trajectory spatio-temporal query, and 3) trajectory mapmatching. The efficiency of the system is tested and tuned based on real-time trajectory data feeds. The system is currently used in many internal urban applications, as we will illustrate using case studies.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89114553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
RxSpatial: the reactive spatial library RxSpatial:响应空间库
Youying Shi, Abdeltawab M. Hendawi, Jayant Gupta, H. Fattah, Mohamed H. Ali
The spatial libraries that have been developed by Microsoft, IBM and Oracle have substantially changed the capabilities of geospatial computing. These libraries implement several functionalities that include intersection, distance, and area for various geospatial objects. These libraries came out to address a wealth of use cases that were challenging in that era. As time goes by, GPS devices and location-aware mobile technologies increased the demand for geospatial computing, in general, and for real time geostreaming, in particular. Existing commercial spatial libraries were originally designed to support operations on stationary objects with limited or no capabilities for moving objects. In this paper, we introduce the RxSpatial library, a real time reactive spatial library for spatiotemporal stream query processing. RxSpatial provides, (1) a front-end, which is a programming interface for developers who are familiar with the Microsoft. NET Reactive framework and the Microsoft SQL Server Spatial Library, and (2) a back-end for processing spatial operations in a streaming fashion. RxSpatial provides the programming convenience at the front end and the query processing efficiency at the back end.
由Microsoft、IBM和Oracle开发的空间库极大地改变了地理空间计算的能力。这些库实现了一些功能,包括各种地理空间对象的交集、距离和面积。这些库的出现是为了解决在那个时代具有挑战性的大量用例。随着时间的推移,GPS设备和位置感知移动技术增加了对地理空间计算的需求,特别是对实时地理流的需求。现有的商业空间库最初是为了支持对固定对象的操作而设计的,对移动对象的操作能力有限或没有。本文介绍了RxSpatial库,这是一个用于时空流查询处理的实时响应空间库。RxSpatial提供了(1)前端,它是为熟悉Microsoft的开发人员提供的编程接口。. NET响应式框架和Microsoft SQL Server空间库,以及(2)以流方式处理空间操作的后端。RxSpatial提供了前端的编程便利性和后端的查询处理效率。
{"title":"RxSpatial: the reactive spatial library","authors":"Youying Shi, Abdeltawab M. Hendawi, Jayant Gupta, H. Fattah, Mohamed H. Ali","doi":"10.1145/2996913.2996948","DOIUrl":"https://doi.org/10.1145/2996913.2996948","url":null,"abstract":"The spatial libraries that have been developed by Microsoft, IBM and Oracle have substantially changed the capabilities of geospatial computing. These libraries implement several functionalities that include intersection, distance, and area for various geospatial objects. These libraries came out to address a wealth of use cases that were challenging in that era. As time goes by, GPS devices and location-aware mobile technologies increased the demand for geospatial computing, in general, and for real time geostreaming, in particular. Existing commercial spatial libraries were originally designed to support operations on stationary objects with limited or no capabilities for moving objects. In this paper, we introduce the RxSpatial library, a real time reactive spatial library for spatiotemporal stream query processing. RxSpatial provides, (1) a front-end, which is a programming interface for developers who are familiar with the Microsoft. NET Reactive framework and the Microsoft SQL Server Spatial Library, and (2) a back-end for processing spatial operations in a streaming fashion. RxSpatial provides the programming convenience at the front end and the query processing efficiency at the back end.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"219 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89118045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
LOCAl: a personalized cache mechanism for location-based social networks LOCAl:基于位置的社交网络的个性化缓存机制
Dimitrios Tomaras, Ioannis Boutsis, V. Kalogeraki, D. Gunopulos
Recommending nearby Points of Interest (POI) has received growing interest in mobile location-based networks today, where users share content embedded with location information. In this work, we propose a novel caching framework to support personalised proactive caching for mobile location-based social networks. We propose "LOCAI", which uses a probabilistic approach in order to predict the POIs that users will access and retrieve the appropriate data objects that will fulfill user preferences. Our detailed experimental evaluation, using data from the Foursquare location-based social network, illustrates that LOCAI minimizes the user latency to retrieve the data objects they are interested in, is efficient and practical.
推荐附近的兴趣点(POI)在今天的基于移动位置的网络中受到越来越多的关注,用户可以分享嵌入位置信息的内容。在这项工作中,我们提出了一个新的缓存框架来支持基于移动位置的社交网络的个性化主动缓存。我们提出“LOCAI”,它使用概率方法来预测用户将访问的poi,并检索满足用户偏好的适当数据对象。我们详细的实验评估使用了基于Foursquare位置的社交网络的数据,表明LOCAI最大限度地减少了用户检索他们感兴趣的数据对象的延迟,是高效和实用的。
{"title":"LOCAl: a personalized cache mechanism for location-based social networks","authors":"Dimitrios Tomaras, Ioannis Boutsis, V. Kalogeraki, D. Gunopulos","doi":"10.1145/2996913.2996981","DOIUrl":"https://doi.org/10.1145/2996913.2996981","url":null,"abstract":"Recommending nearby Points of Interest (POI) has received growing interest in mobile location-based networks today, where users share content embedded with location information. In this work, we propose a novel caching framework to support personalised proactive caching for mobile location-based social networks. We propose \"LOCAI\", which uses a probabilistic approach in order to predict the POIs that users will access and retrieve the appropriate data objects that will fulfill user preferences. Our detailed experimental evaluation, using data from the Foursquare location-based social network, illustrates that LOCAI minimizes the user latency to retrieve the data objects they are interested in, is efficient and practical.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"97 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80756935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A visual and computational analysis approach for exploring significant locations and time periods along a bus route 一种可视化和计算分析方法,用于探索沿公交路线的重要位置和时间段
J. Mazimpaka, S. Timpf
Understanding human mobility is important for planning and delivering various services in urban area. An important element for mobility understanding is to understand the context in which the movement takes place. In this direction, we propose a method for identifying significant locations and time periods along a bus route. The significance is based on special characteristics that locations have during specific time periods as determined from their effect of these locations on the movement of the bus. The method extracts discriminative features from the space, time and other selected attributes and then classifies locations and time periods into 5 significance classes. The classes are then rendered in different views for discovering and understanding patterns. The novelty of the method is an explicit consideration of the time dimension at different granularity levels and a visualization that facilitates comparison across the space and time dimensions while avoiding a visual clutter. We demonstrate the applicability of our approach by applying it on a large set of bus trajectories.
了解人类的流动性对于规划和提供城市地区的各种服务非常重要。了解移动性的一个重要因素是了解移动性发生的背景。在这个方向上,我们提出了一种确定公交路线上重要位置和时间段的方法。其重要性是基于特定时间段内位置的特殊特征,这些特征是由这些位置对公共汽车运动的影响决定的。该方法从空间、时间和其他选定的属性中提取判别特征,然后将地点和时间段分为5个显著性类。然后在不同的视图中呈现这些类,以便发现和理解模式。该方法的新颖之处在于明确地考虑了不同粒度级别上的时间维度,以及便于跨空间和时间维度进行比较的可视化,同时避免了视觉混乱。我们通过将我们的方法应用于一组大型公共汽车轨迹来证明其适用性。
{"title":"A visual and computational analysis approach for exploring significant locations and time periods along a bus route","authors":"J. Mazimpaka, S. Timpf","doi":"10.1145/2996913.2996936","DOIUrl":"https://doi.org/10.1145/2996913.2996936","url":null,"abstract":"Understanding human mobility is important for planning and delivering various services in urban area. An important element for mobility understanding is to understand the context in which the movement takes place. In this direction, we propose a method for identifying significant locations and time periods along a bus route. The significance is based on special characteristics that locations have during specific time periods as determined from their effect of these locations on the movement of the bus. The method extracts discriminative features from the space, time and other selected attributes and then classifies locations and time periods into 5 significance classes. The classes are then rendered in different views for discovering and understanding patterns. The novelty of the method is an explicit consideration of the time dimension at different granularity levels and a visualization that facilitates comparison across the space and time dimensions while avoiding a visual clutter. We demonstrate the applicability of our approach by applying it on a large set of bus trajectories.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86140283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform GCMF:基于GPGPU平台的大型多边形数据集的高效端到端空间连接系统
D. Aghajarian, S. Puri, S. Prasad
Given two layers of large polygonal datasets, detecting those pairs of cross-layer polygons which satisfy a join predicate, such as intersection or contain, is one of the most computationally intensive primitive operations in the spatial domain applications. In this work, we introduce GCMF, an end-to-end software system, that is able to handle spatial join (with ST_Intersect operation) over non-indexed polygonal datasets with over 3 GB file size comprising more than 600, 000 polygons on a single GPU within less than 8 sec by applying innovative filter and refinement techniques. GCMF performs a two-step filtering phase. 1) A sort-based Minimum Bounding Rectangle (MBR) filtering step detects potentially overlapping polygon pairs up to 20 times faster than the optimized GEOS library routine. 2) A linear time Common MBR filtering step (based on the overlapping area of two given MBRs) that not only eliminates two-third of the candidate polygon pairs but also reduces the number of edges to be considered in the refinement phase by 40-fold on an average based on our experimental results with real datasets. Furthermore, for the refinement phase, GCMF implements a load-balanced parallel point-in-polygon and edge-intersection tests over GPU. Our experimental results with three different real datasets show up to 39-fold end-to- end speedup versus optimized sequential routines of GEOS C++ library as well as PostgreSQL spatial database with PostGIS.
给定两层大型多边形数据集,检测满足连接谓词(如交集或包含)的跨层多边形对是空间域应用中计算量最大的基本运算之一。在这项工作中,我们介绍了GCMF,一个端到端软件系统,它能够通过应用创新的过滤器和细化技术,在不到8秒的时间内处理非索引多边形数据集的空间连接(使用ST_Intersect操作),这些数据集的文件大小超过3gb,在单个GPU上包含超过600,000个多边形。GCMF执行两步过滤阶段。1)基于排序的最小边界矩形(MBR)滤波步骤检测潜在重叠多边形对的速度比优化的GEOS库例程快20倍。2)一个线性时间通用MBR滤波步骤(基于两个给定MBR的重叠面积),不仅消除了三分之二的候选多边形对,而且根据我们在真实数据集上的实验结果,在细化阶段平均减少了40倍要考虑的边缘数量。此外,在细化阶段,GCMF在GPU上实现了负载平衡并行多边形点和边交测试。我们在三个不同的真实数据集上的实验结果表明,与优化后的GEOS c++库顺序例程和PostgreSQL空间数据库与PostGIS相比,端到端加速高达39倍。
{"title":"GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform","authors":"D. Aghajarian, S. Puri, S. Prasad","doi":"10.1145/2996913.2996982","DOIUrl":"https://doi.org/10.1145/2996913.2996982","url":null,"abstract":"Given two layers of large polygonal datasets, detecting those pairs of cross-layer polygons which satisfy a join predicate, such as intersection or contain, is one of the most computationally intensive primitive operations in the spatial domain applications. In this work, we introduce GCMF, an end-to-end software system, that is able to handle spatial join (with ST_Intersect operation) over non-indexed polygonal datasets with over 3 GB file size comprising more than 600, 000 polygons on a single GPU within less than 8 sec by applying innovative filter and refinement techniques. GCMF performs a two-step filtering phase. 1) A sort-based Minimum Bounding Rectangle (MBR) filtering step detects potentially overlapping polygon pairs up to 20 times faster than the optimized GEOS library routine. 2) A linear time Common MBR filtering step (based on the overlapping area of two given MBRs) that not only eliminates two-third of the candidate polygon pairs but also reduces the number of edges to be considered in the refinement phase by 40-fold on an average based on our experimental results with real datasets. Furthermore, for the refinement phase, GCMF implements a load-balanced parallel point-in-polygon and edge-intersection tests over GPU. Our experimental results with three different real datasets show up to 39-fold end-to- end speedup versus optimized sequential routines of GEOS C++ library as well as PostgreSQL spatial database with PostGIS.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86568832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A distantly supervised method for extracting spatio-temporal information from text 一种从文本中提取时空信息的远程监督方法
Seyed Iman Mirrezaei, Bruno Martins, I. Cruz
This paper describes Triplex-ST, a novel information extraction system for collecting spatio-temporal information from textual resources. Triplex-ST is based on a distantly supervised approach, which leverages rich linguistic annotations together with information in existing knowledge bases. In particular, we leverage triples associated with temporal and/or spatial contexts, e.g., as available from the YAGO knowledge base, so as to infer templates that capture new facts from previously unseen sentences.
本文介绍了一种从文本资源中采集时空信息的新型信息提取系统Triplex-ST。Triplex-ST基于远程监督方法,它利用丰富的语言注释以及现有知识库中的信息。特别是,我们利用与时间和/或空间上下文相关的三元组,例如,从YAGO知识库中可用的三元组,从而推断出从以前未见过的句子中捕获新事实的模板。
{"title":"A distantly supervised method for extracting spatio-temporal information from text","authors":"Seyed Iman Mirrezaei, Bruno Martins, I. Cruz","doi":"10.1145/2996913.2996967","DOIUrl":"https://doi.org/10.1145/2996913.2996967","url":null,"abstract":"This paper describes Triplex-ST, a novel information extraction system for collecting spatio-temporal information from textual resources. Triplex-ST is based on a distantly supervised approach, which leverages rich linguistic annotations together with information in existing knowledge bases. In particular, we leverage triples associated with temporal and/or spatial contexts, e.g., as available from the YAGO knowledge base, so as to infer templates that capture new facts from previously unseen sentences.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89307439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems 第24届ACM SIGSPATIAL国际地理信息系统进展会议论文集
Mohamed H. Ali, S. Newsam, S. Ravada, M. Renz, Goce Trajcevski
These proceedings contain the papers from the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2016), held in the San Francisco Bay Area, California, USA, October 31 through November 3, 2016. The conference started as a series of symposia and workshops back in 1993 with the aim of promoting interdisciplinary discussions among researchers, developers, users, and practitioners and fostering research in all aspects of geographic information systems, especially in relation to novel systems based on geospatial data and knowledge. It provides a forum for original research contributions covering all conceptual, design, and implementation aspects of geospatial data ranging from applications, user interfaces and visualization, to data storage, query processing, indexing and data mining. The conference is the premier annual event of the ACM Special Interest Group on Spatial Information (ACM SIGSPATIAL).
这些论文集包含了第24届ACM SIGSPATIAL国际地理信息系统进展会议(ACM SIGSPATIAL 2016)的论文,该会议于2016年10月31日至11月3日在美国加利福尼亚州旧金山湾区举行。该会议于1993年以一系列专题讨论会和工作坊开始,旨在促进研究人员、开发人员、用户和实践者之间的跨学科讨论,并促进地理信息系统各个方面的研究,特别是与基于地理空间数据和知识的新系统有关的研究。它为原始研究贡献提供了一个论坛,涵盖地理空间数据的所有概念、设计和实现方面,从应用程序、用户界面和可视化到数据存储、查询处理、索引和数据挖掘。该会议是ACM空间信息特别兴趣小组(ACM SIGSPATIAL)的首要年度活动。
{"title":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","authors":"Mohamed H. Ali, S. Newsam, S. Ravada, M. Renz, Goce Trajcevski","doi":"10.1145/2996913","DOIUrl":"https://doi.org/10.1145/2996913","url":null,"abstract":"These proceedings contain the papers from the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2016), held in the San Francisco Bay Area, California, USA, October 31 through November 3, 2016. The conference started as a series of symposia and workshops back in 1993 with the aim of promoting interdisciplinary discussions among researchers, developers, users, and practitioners and fostering research in all aspects of geographic information systems, especially in relation to novel systems based on geospatial data and knowledge. It provides a forum for original research contributions covering all conceptual, design, and implementation aspects of geospatial data ranging from applications, user interfaces and visualization, to data storage, query processing, indexing and data mining. The conference is the premier annual event of the ACM Special Interest Group on Spatial Information (ACM SIGSPATIAL).","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"80 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80071580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1