"Knowledge-based trajectory completion from sparse GPS samples"
Yongni Li, Yangyan Li, D. Gunopulos, L. Guibas. Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2016. DOI: 10.1145/2996913.2996924

Traffic trajectories collected from GPS-enabled mobile devices or vehicles are widely used in urban planning, traffic management, and location-based services. The performance of these applications often relies on having dense trajectories. However, due to power and bandwidth limitations on these devices, collecting dense trajectories is too costly at large scale. We show that by exploiting structural regularity in large trajectory data, the complete geometry of trajectories can be inferred from sparse GPS samples without information about the underlying road network, a process called trajectory completion. In this paper, we present a knowledge-based approach for completing traffic trajectories. Our method extracts a network of road junctions and estimates traffic flows across junctions. GPS samples within each flow cluster are then used to achieve fine-level completion of individual trajectories. Finally, we demonstrate that our method is effective for trajectory completion on both synthesized and real traffic trajectories. On average, 72.7% of real trajectories with a sampling rate of 60 seconds per sample are completed without map information. Compared with map matching, over 89% of points on completed trajectories lie within 15 meters of the map-matched path.
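The paper's knowledge-based pipeline (junction network, flow clusters) is not reproduced here, but the naive geometric baseline it improves on, densifying a sparse track by straight-line interpolation between consecutive samples, can be sketched in a few lines. The function names and the 50 m spacing are illustrative choices, not from the paper.

```python
from math import radians, sin, cos, asin, sqrt

def haversine_m(p, q):
    """Great-circle distance in meters between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(radians, (*p, *q))
    a = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371000 * asin(sqrt(a))

def densify(track, step_m=50.0):
    """Insert linearly interpolated points so consecutive samples are <= step_m apart.

    This straight-line completion ignores the road geometry entirely, which is
    exactly the shortcoming the knowledge-based approach addresses.
    """
    out = [track[0]]
    for p, q in zip(track, track[1:]):
        n = int(haversine_m(p, q) // step_m)
        for i in range(1, n + 1):
            t = i / (n + 1)
            out.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
        out.append(q)
    return out
```

Two samples 60 seconds apart at urban speeds are often hundreds of meters apart, so this baseline cuts corners at every turn; the paper's flow clusters supply the missing geometry instead.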
"Polygon consensus: smart crowdsourcing for extracting building footprints from historical maps"
B. Budig, Thomas C. van Dijk, F. Feitsch, M. Arteaga. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996951

Over the course of three years, the New York Public Library has run a crowdsourcing project to extract polygonal representations of building footprints from insurance atlases of the 19th and early 20th centuries. As is common in crowdsourcing projects, the overall problem was decomposed into small user tasks, and each task was given to multiple users. In the case of polygons representing building footprints, it is unclear how best to integrate the answers into a majority vote: given a set of polygons ostensibly describing the same footprint, what is the consensus? We discuss desirable properties of such a "consensus polygon" and arrive at an efficient algorithm. We manually evaluated the algorithm on approximately 3,000 polygons corresponding to 200 footprints and observed that our algorithmic consensus polygons are correct for 96% of the footprints, whereas only 85% of the (input) crowd polygons are correct.
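As a toy illustration of one possible consensus notion (not necessarily the authors' algorithm), the candidate polygons can be rasterized onto a grid, keeping the cells whose centers fall inside a majority of them:

```python
def point_in_polygon(x, y, poly):
    """Ray-casting test; poly is a list of (x, y) vertices in order."""
    inside = False
    n = len(poly)
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge crosses the horizontal ray at height y
            xin = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < xin:
                inside = not inside
    return inside

def consensus_cells(polys, xs, ys):
    """Grid-cell centers that lie inside a strict majority of the input polygons."""
    need = len(polys) / 2
    return {(x, y) for x in xs for y in ys
            if sum(point_in_polygon(x, y, p) for p in polys) > need}
```

The resulting cell set approximates an "area-majority" region; the paper instead works with the polygons directly and evaluates which consensus definition best matches the ground-truth footprints.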
"Real-time detection and classification of traffic jams from probe data"
Bo Xu, Tiffany Barkley, Andrew P. Lewis, Jane Macfarlane, D. Pietrobon, Matei Stroila. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996988

In this paper we present our experience detecting and classifying traffic jams in real time from probe data. We classify traffic jams at two levels. At the higher level, we classify jams into recurring and non-recurring jams. At the lower level, we identify accidents among the non-recurring jams based on features that characterize upstream and downstream traffic patterns. Accidents are highly unpredictable and usually create heavy, long-lasting congestion, and are therefore particularly worth detecting. We discuss the challenges of detecting accidents in real time, as well as our approaches and results.
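The higher-level split the abstract describes, recurring versus non-recurring, could be approximated by comparing the current speed on a segment to its historical distribution for the same time slot. The function name and thresholds below are hypothetical, not from the paper:

```python
from statistics import mean, stdev

def classify_jam(current_speed, historical_speeds, jam_kmh=20.0, z_thresh=2.0):
    """Label a road segment's state from probe speeds (km/h).

    'recurring'      -- slow, but typical for this segment and time slot
    'non_recurring'  -- far slower than the historical pattern (e.g., accident)
    """
    if current_speed >= jam_kmh:
        return "no_jam"
    mu, sigma = mean(historical_speeds), stdev(historical_speeds)
    z = (mu - current_speed) / sigma if sigma > 0 else float("inf")
    return "non_recurring" if z > z_thresh else "recurring"
```

The paper's lower-level accident detector goes further, using upstream/downstream pattern features rather than a single-segment deviation score.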
"A simple efficient approximation algorithm for dynamic time warping"
Rex Ying, Jiangwei Pan, K. Fox, P. Agarwal. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996954

Dynamic time warping (DTW) is a widely used curve similarity measure. We present a simple and efficient (1 + ε)-approximation algorithm for DTW between a pair of point sequences, say P and Q, each of which is sampled from a curve. We prove that the running time of the algorithm is O([EQUATION] n log σ) for a pair of k-packed curves with a total of n points, assuming that the spreads of P and Q are bounded by σ. The spread of a point set is the ratio of the maximum to the minimum pairwise distance, and a curve is called k-packed if the length of its intersection with any disk of radius r is at most kr. Although an algorithm with similar asymptotic time complexity was presented in [1], our algorithm is considerably simpler and more efficient in practice. We have implemented our algorithm. Our experiments on both synthetic and real-world data sets show that it is an order of magnitude faster than the standard exact DP algorithm on point sequences of length 5,000 or more, while keeping the approximation error within 5--10%. We demonstrate the efficacy of our algorithm in two applications: computing the k most similar trajectories to a query trajectory, and running the iterative closest point method for a pair of trajectories. We show that using our algorithm as a subroutine in these applications achieves an 8--12 times speedup without compromising much accuracy.
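The "standard exact DP algorithm" that the approximation is benchmarked against is the classic quadratic-time recurrence, which fills an (n+1) x (m+1) table of prefix alignment costs:

```python
def dtw(P, Q, dist=lambda p, q: abs(p - q)):
    """Exact O(|P| * |Q|) dynamic-programming DTW between two sequences.

    D[i][j] = cost of best alignment of P[:i] with Q[:j]; each step extends
    the alignment by matching P[i-1] to Q[j-1] after advancing one or both
    sequences.
    """
    INF = float("inf")
    n, m = len(P), len(Q)
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            D[i][j] = dist(P[i - 1], Q[j - 1]) + min(
                D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
    return D[n][m]
```

It is this full-table computation that becomes the bottleneck on sequences of 5,000+ points, motivating the paper's approximation, which avoids examining every (i, j) pair.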
"LEDS"
Zhi Liu, Yan Huang, Joshua R. Trampier. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996928

Twitter is one of the most popular social media platforms, where people share their opinions, thoughts, interests, and whereabouts. In this work, we propose a Local Event Discovery and Summarization (LEDS) framework to detect local events from Twitter. Many existing algorithms for event detection focus on larger-scale events and are not sensitive to smaller-scale local events; most of the local events they detect are major events such as important sports games, shows, or large natural disasters. In this paper, we propose the LEDS framework to detect both larger and smaller events. LEDS contains three key steps: 1) detecting possible event-related terms by monitoring abnormal distributions across different locations and times; 2) clustering tweets based on their key terms, time, and location distribution; and 3) extracting descriptions of local events, including time, location, and key sentences, from the clusters. The framework is evaluated on a real-world Twitter dataset with more than 60 million tweets. The results show that, compared with previous work, LEDS detects smaller-scale local events and a greater variety of them. More than 43 percent of the detected local events have no official organizer, do not appear in news media, and attract attention from only a small group of people.
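Step 1, flagging terms with abnormal distributions, could be approximated by a simple burst-ratio test of a (location, time) window against a background corpus. The thresholds and add-one smoothing here are illustrative assumptions, not LEDS's actual statistics:

```python
from collections import Counter

def bursty_terms(window_tweets, background_tweets, min_count=3, ratio=5.0):
    """Terms whose rate in the current window far exceeds their background rate.

    These over-represented terms are candidate local-event keywords that a
    pipeline like LEDS would then cluster by time and location.
    """
    win = Counter(w for t in window_tweets for w in t.lower().split())
    bg = Counter(w for t in background_tweets for w in t.lower().split())
    n_win = max(sum(win.values()), 1)
    n_bg = max(sum(bg.values()), 1)
    out = []
    for term, c in win.items():
        if c < min_count:
            continue                          # ignore rare noise
        bg_rate = (bg[term] + 1) / n_bg       # add-one smoothing for unseen terms
        if (c / n_win) / bg_rate >= ratio:
            out.append(term)
    return out
```

A term like "fire" tweeted repeatedly in one neighborhood within an hour would surface here even though it is globally unremarkable, which is what makes small local events detectable.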
"SOLEV: a video generation framework for solar events from mixed data sources (demo paper)"
S. F. Boubrahimi, Berkay Aydin, Dustin J. Kempton, R. Angryk. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996963

One of the main strengths of Geographical Information Systems (GIS) is the analysis of spatial and attributive data. Spatiotemporal interpolation techniques allow collected data to be extended to sites where no samples are available. In the context of GIS, the data, whether interpolated or collected, are visual in nature and hard to understand in raw form. Visualization of complex evolving region trajectories is often used as an aid to better understand the data and its underlying patterns. In this work, we created SOLEV, a solar event video generation framework that integrates multiple data sources of solar images. It is the first framework of its kind that visualizes not only spatial solar event boundaries but also the tracked and interpolated spatiotemporal trajectories they form over time.
"Price-aware real-time ride-sharing at scale: an auction-based approach"
M. Asghari, Dingxiong Deng, C. Shahabi, Ugur Demiryurek, Yaguang Li. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996974

Real-time ride-sharing, which enables on-the-fly matching between riders and drivers (even en route), is an important problem due to its environmental and societal benefits. With the emergence of many ride-sharing platforms (e.g., Uber and Lyft), designing a scalable framework that matches riders and drivers under their various constraints while maximizing the overall profit of the platform becomes a distinguishing business strategy. A key challenge for such a framework is satisfying both types of users in the system, e.g., reducing both riders' and drivers' travel distances. However, the majority of existing approaches focus only on minimizing the total travel distance of drivers, which is not always equivalent to shorter trips for riders. Hence, we propose a fair pricing model that simultaneously satisfies both the riders' and drivers' constraints and preferences (formulated as their profiles). In particular, we introduce a distributed auction-based framework in which each driver's mobile app automatically bids on every nearby request, taking into account factors such as the driver's and riders' profiles, their itineraries, the pricing model, and the current number of riders in the vehicle. The server then determines the highest bidder and assigns the rider to that driver. We show that this framework is scalable and efficient, processing hundreds of tasks per second in the presence of thousands of drivers. We compare our framework with state-of-the-art approaches from both industry and academia through experiments on New York City's taxi dataset. Our results show that our framework matches more riders to drivers (i.e., a higher service rate) by engaging drivers more effectively. Moreover, our framework schedules shorter trips for riders (i.e., better service quality). Finally, as a consequence of the higher service rate and shorter trips, our framework increases the overall profit of the ride-sharing platform.
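The bid-and-award loop can be sketched as follows, with a deliberately simplified 1-D geometry and a hypothetical bid function; the paper's bids additionally weigh rider/driver profiles, full itineraries, and vehicle capacity:

```python
def bid(driver, request):
    """A hypothetical driver-side bid: platform fare minus the driver's cost
    of driving to the pickup and on to the dropoff (positions are scalars
    purely for illustration)."""
    detour = abs(driver["pos"] - request["pickup"]) + \
             abs(request["pickup"] - request["dropoff"])
    return request["fare"] - driver["cost_per_km"] * detour

def assign(request, drivers):
    """The server awards the request to the highest positive bidder,
    or leaves it unassigned if no driver can serve it profitably."""
    best = max(drivers, key=lambda d: bid(d, request), default=None)
    if best is None or bid(best, request) <= 0:
        return None
    return best["id"]
```

Because each driver's app computes its own bid locally, the server only has to compare bids, which is what makes the auction distributed and scalable.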
"On supporting compilation in spatial query engines (vision paper)"
Ruby Y. Tahboub, Tiark Rompf. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996945

Today's "big" spatial computing and analytics workloads are largely processed in memory. Still, query evaluation in prominent spatial query engines is neither fully optimized for modern platforms nor takes full advantage of compilation (i.e., generating low-level query code). Query compilation has been rapidly rising inside in-memory relational database management systems (RDBMSs), achieving remarkable speedups; how can we bring similar benefits to spatial query engines? In this research, we bring in proven programming-languages (PL) approaches, e.g., partial evaluation and generative programming, and leverage the power of modern hardware to extend query compilation to spatial query engines. We envision a fully compiled spatial query engine that is efficient yet feasible to implement in a high-level language. We describe LB2-Spatial, a prototype of a fully compiled spatial query engine that employs generative and multi-stage programming to realize query compilation. Furthermore, we discuss challenges and conduct a preliminary experiment to highlight the potential gains of compilation. Finally, we sketch potential avenues for supporting spatial query compilation in Postgres/PostGIS, a traditional RDBMS, and in Spark/Spark SQL, a main-memory cluster computing framework.
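The core idea of query compilation, specializing executable code to one query instead of interpreting a plan row by row, can be illustrated by generating and exec-ing source code for a single filter predicate. This is a toy sketch: real engines such as LB2-Spatial use staged compilation to low-level code, and a production version would validate `op` against a whitelist before splicing it into source text:

```python
def compile_filter(column, op, value):
    """Generate a specialized scan function for one predicate.

    The column name, operator, and constant are baked directly into the
    generated source, so the 'compiled' loop has no per-row interpretation
    overhead -- the essence of query compilation / partial evaluation.
    """
    src = (
        "def run(rows):\n"
        "    out = []\n"
        "    for r in rows:\n"
        f"        if r[{column!r}] {op} {value!r}:\n"
        "            out.append(r)\n"
        "    return out\n"
    )
    ns = {}
    exec(src, ns)  # compile the generated source into a callable
    return ns["run"]
```

An interpreted engine would re-dispatch on the operator and re-look-up the column for every row; the generated `run` does neither, which is where compiled engines recover their speedups.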
"Simba: spatial in-memory big data analysis"
Dong Xie, Feifei Li, Bin Yao, Gefei Li, Zhongpu Chen, Liang Zhou, M. Guo. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996935

We present Simba (Spatial In-Memory Big data Analytics), a system that offers scalable and efficient in-memory spatial query processing and analytics for big spatial data. Simba natively extends the Spark SQL engine to support rich spatial queries and analytics through both SQL and the DataFrame API. It enables the construction of indexes over RDDs inside the engine in order to work with big spatial data and complex spatial operations. Simba also comes with an effective query optimizer that leverages its indexes and novel spatial-aware optimizations to achieve both low latency and high throughput in big spatial data analysis. This demonstration proposal describes the key ideas in Simba's design and presents a demonstration plan.
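Simba's actual RDD indexing API is not shown in the abstract, but the benefit of spatial indexing it relies on can be illustrated with a plain grid index, where a range query inspects only the cells overlapping the query rectangle instead of scanning every point:

```python
from collections import defaultdict

def build_grid_index(points, cell=1.0):
    """Bucket (x, y) points by grid cell so queries touch only nearby cells."""
    idx = defaultdict(list)
    for p in points:
        idx[(int(p[0] // cell), int(p[1] // cell))].append(p)
    return idx

def range_query(idx, xmin, ymin, xmax, ymax, cell=1.0):
    """Return points inside the rectangle, visiting only overlapping cells."""
    out = []
    for cx in range(int(xmin // cell), int(xmax // cell) + 1):
        for cy in range(int(ymin // cell), int(ymax // cell) + 1):
            out.extend(p for p in idx[(cx, cy)]
                       if xmin <= p[0] <= xmax and ymin <= p[1] <= ymax)
    return out
```

In a distributed setting like Simba's, the same idea appears twice: a global index routes a query to the relevant data partitions, and a local index prunes within each partition.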
"Who goes there?: approaches to mapping facial appearance diversity"
Zachary Bessinger, C. Stauffer, Nathan Jacobs. Proc. 24th ACM SIGSPATIAL, 2016. DOI: 10.1145/2996913.2996997

Geotagged imagery, from satellite, aerial, and ground-level cameras, provides a rich record of how the appearance of scenes and objects differs across the globe. Modern web-based mapping software makes it easy to see how different places around the world look, from both satellite and ground-level views. Unfortunately, interfaces for exploring how the appearance of objects depends on geographic location are quite limited. In this work, we focus on a particularly common object, the human face, and propose learning generative models that relate facial appearance to geographic location. We train these models using a novel dataset of geotagged face imagery we constructed for this task. We present qualitative and quantitative results demonstrating that these models capture meaningful trends in appearance. We also describe a framework for constructing a web-based visualization that captures the geospatial distribution of human facial appearance.