GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform

D. Aghajarian, S. Puri, S. Prasad
{"title":"GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform","authors":"D. Aghajarian, S. Puri, S. Prasad","doi":"10.1145/2996913.2996982","DOIUrl":null,"url":null,"abstract":"Given two layers of large polygonal datasets, detecting those pairs of cross-layer polygons which satisfy a join predicate, such as intersection or contain, is one of the most computationally intensive primitive operations in the spatial domain applications. In this work, we introduce GCMF, an end-to-end software system, that is able to handle spatial join (with ST_Intersect operation) over non-indexed polygonal datasets with over 3 GB file size comprising more than 600, 000 polygons on a single GPU within less than 8 sec by applying innovative filter and refinement techniques. GCMF performs a two-step filtering phase. 1) A sort-based Minimum Bounding Rectangle (MBR) filtering step detects potentially overlapping polygon pairs up to 20 times faster than the optimized GEOS library routine. 2) A linear time Common MBR filtering step (based on the overlapping area of two given MBRs) that not only eliminates two-third of the candidate polygon pairs but also reduces the number of edges to be considered in the refinement phase by 40-fold on an average based on our experimental results with real datasets. Furthermore, for the refinement phase, GCMF implements a load-balanced parallel point-in-polygon and edge-intersection tests over GPU. Our experimental results with three different real datasets show up to 39-fold end-to- end speedup versus optimized sequential routines of GEOS C++ library as well as PostgreSQL spatial database with PostGIS.","PeriodicalId":20525,"journal":{"name":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2996913.2996982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

Given two layers of large polygonal datasets, detecting those pairs of cross-layer polygons which satisfy a join predicate, such as intersection or contain, is one of the most computationally intensive primitive operations in the spatial domain applications. In this work, we introduce GCMF, an end-to-end software system, that is able to handle spatial join (with ST_Intersect operation) over non-indexed polygonal datasets with over 3 GB file size comprising more than 600, 000 polygons on a single GPU within less than 8 sec by applying innovative filter and refinement techniques. GCMF performs a two-step filtering phase. 1) A sort-based Minimum Bounding Rectangle (MBR) filtering step detects potentially overlapping polygon pairs up to 20 times faster than the optimized GEOS library routine. 2) A linear time Common MBR filtering step (based on the overlapping area of two given MBRs) that not only eliminates two-third of the candidate polygon pairs but also reduces the number of edges to be considered in the refinement phase by 40-fold on an average based on our experimental results with real datasets. Furthermore, for the refinement phase, GCMF implements a load-balanced parallel point-in-polygon and edge-intersection tests over GPU. Our experimental results with three different real datasets show up to 39-fold end-to- end speedup versus optimized sequential routines of GEOS C++ library as well as PostgreSQL spatial database with PostGIS.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GCMF:基于GPGPU平台的大型多边形数据集的高效端到端空间连接系统
给定两层大型多边形数据集,检测满足连接谓词(如交集或包含)的跨层多边形对是空间域应用中计算量最大的基本运算之一。在这项工作中,我们介绍了GCMF,一个端到端软件系统,它能够通过应用创新的过滤器和细化技术,在不到8秒的时间内处理非索引多边形数据集的空间连接(使用ST_Intersect操作),这些数据集的文件大小超过3gb,在单个GPU上包含超过600,000个多边形。GCMF执行两步过滤阶段。1)基于排序的最小边界矩形(MBR)滤波步骤检测潜在重叠多边形对的速度比优化的GEOS库例程快20倍。2)一个线性时间通用MBR滤波步骤(基于两个给定MBR的重叠面积),不仅消除了三分之二的候选多边形对,而且根据我们在真实数据集上的实验结果,在细化阶段平均减少了40倍要考虑的边缘数量。此外,在细化阶段,GCMF在GPU上实现了负载平衡并行多边形点和边交测试。我们在三个不同的真实数据集上的实验结果表明,与优化后的GEOS c++库顺序例程和PostgreSQL空间数据库与PostGIS相比,端到端加速高达39倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Location corroborations by mobile devices without traces Knowledge-based trajectory completion from sparse GPS samples Particle filter for real-time human mobility prediction following unprecedented disaster Pyspatiotemporalgeom: a python library for spatiotemporal types and operations Fast transportation network traversal with hyperedges: (industrial paper)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1