
Proceedings of the 2016 International Conference on Management of Data: Latest Publications

Range Thresholding on Streams
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2915965
Miao Qiao, Junhao Gan, Yufei Tao
This paper studies a type of continuous query called range thresholding on streams (RTS). Imagine the stream as an unbounded sequence of elements, each of which is a real value. A query registers an interval, and must be notified as soon as a certain number of incoming elements fall into the interval. The system needs to support multiple queries simultaneously, and aims to minimize the space consumption and computation time. Currently, all the solutions to this problem entail quadratic time O(nm) to process n stream elements and m queries, which severely limits their applicability to only a small number of queries. We propose the first algorithm that breaks the quadratic barrier, reducing the computation cost dramatically to O(n + m), subject only to a polylogarithmic factor. The algorithm is general enough to guarantee the same bound on weighted versions of the queries, even in d-dimensional space of any constant d. Its vast advantage over the previous methods in practical environments has been confirmed through extensive experimentation.
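To make the query semantics concrete, the sketch below (in Python, with hypothetical class and method names rather than the authors' interface) implements the naive per-element baseline whose O(nm) cost the paper eliminates:

```python
class RangeThresholdQuery:
    """A registered RTS query: fire once, as soon as at least
    `threshold` stream elements have fallen inside [lo, hi]."""
    def __init__(self, lo, hi, threshold, callback):
        self.lo, self.hi = lo, hi
        self.threshold = threshold
        self.callback = callback
        self.count = 0
        self.fired = False

class NaiveRTS:
    """Baseline that checks every query for every element: O(nm) time
    for n elements and m queries; this is the quadratic barrier that
    the paper's O(n + m) (up to polylog factors) algorithm breaks."""
    def __init__(self):
        self.queries = []

    def register(self, query):
        self.queries.append(query)

    def on_element(self, value):
        for q in self.queries:
            if not q.fired and q.lo <= value <= q.hi:
                q.count += 1
                if q.count >= q.threshold:
                    q.fired = True
                    q.callback(q)
```

Registering, for example, RangeThresholdQuery(10.0, 20.0, 3, print) fires after the third element in [10, 20] arrives; the paper's contribution is achieving the same semantics without the inner loop over all m queries.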
Citations: 5
Elastic Pipelining in an In-Memory Database Cluster
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2882904
Li Wang, Minqi Zhou, Zhenjie Zhang, Y. Yang, Aoying Zhou, D. Bitton
An in-memory database cluster consists of multiple interconnected nodes with a large capacity of RAM and modern multi-core CPUs. As a conventional query processing strategy, pipelining remains a promising solution for in-memory parallel database systems, as it avoids expensive intermediate result materialization and parallelizes the data processing among nodes. However, to fully unleash the power of pipelining in a cluster with multi-core nodes, it is crucial for the query optimizer to generate good query plans with appropriate intra-node parallelism, in order to maximize CPU and network bandwidth utilization. A suboptimal plan, on the contrary, causes load imbalance in the pipelines and consequently degrades the query performance. Parallelism assignment optimization at compile time is nearly impossible, as the workload in each node is affected by numerous factors and is highly dynamic during query evaluation. To tackle this problem, we propose elastic pipelining, which makes it possible to optimize intra-node parallelism assignments in the pipelines based on the actual workload at runtime. It is achieved with the adoption of a new elastic iterator model and a fully optimized dynamic scheduler. The elastic iterator model upgrades the traditional iterator model with a new dynamic multi-core execution adjustment capability. The dynamic scheduler efficiently provisions CPU cores to query execution segments in the pipelines based on light-weight measurements of the operators. Extensive experiments on real and synthetic (TPC-H) data show that our proposal achieves almost full CPU utilization on typical decision-making analytical queries, outperforming state-of-the-art open-source systems by a huge margin.
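As a rough illustration of what the elastic iterator model enables (changing intra-node parallelism while an operator is still running), here is a minimal Python sketch; the names and the batch-at-a-time structure are our assumptions, not the paper's actual design:

```python
from concurrent.futures import ThreadPoolExecutor

class ElasticIterator:
    """Toy iterator whose degree of parallelism (dop) can be raised or
    lowered between rounds by an external scheduler, mimicking the
    dynamic multi-core execution adjustment described in the abstract."""
    def __init__(self, child_batches, operator, dop=1):
        self.child_batches = child_batches  # upstream input batches
        self.operator = operator            # per-batch processing function
        self.dop = dop                      # current degree of parallelism

    def set_dop(self, dop):
        # Invoked by the scheduler when its light-weight measurements
        # indicate load imbalance in the pipeline.
        self.dop = max(1, dop)

    def __iter__(self):
        batches = iter(self.child_batches)
        while True:
            # Pull up to `dop` batches; a fresh pool per round lets a
            # changed dop take effect at the next round.
            chunk = [b for _, b in zip(range(self.dop), batches)]
            if not chunk:
                return
            with ThreadPoolExecutor(max_workers=self.dop) as pool:
                yield from pool.map(self.operator, chunk)
```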
Citations: 21
Dynamic Prefetching of Data Tiles for Interactive Visualization
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2882919
L. Battle, Remco Chang, M. Stonebraker
In this paper, we present ForeCache, a general-purpose tool for exploratory browsing of large datasets. ForeCache utilizes a client-server architecture, where the user interacts with a lightweight client-side interface to browse datasets, and the data to be browsed is retrieved from a DBMS running on a back-end server. We assume a detail-on-demand browsing paradigm, and optimize the back-end support for this paradigm by inserting a separate middleware layer in front of the DBMS. To improve response times, the middleware layer fetches data ahead of the user as she explores a dataset. We consider two different mechanisms for prefetching: (a) learning what to fetch from the user's recent movements, and (b) using data characteristics (e.g., histograms) to find data similar to what the user has viewed in the past. We incorporate these mechanisms into a single prediction engine that adjusts its prediction strategies over time, based on changes in the user's behavior. We evaluated our prediction engine with a user study, and found that our dynamic prefetching strategy provides: (1) significant improvements in overall latency when compared with non-prefetching systems (430% improvement); and (2) substantial improvements in both prediction accuracy (25% improvement) and latency (88% improvement) relative to existing prefetching techniques.
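The following sketch shows one way the two mechanisms could be combined into a single engine that shifts weight between them as the user's behavior changes; all names and the accuracy-weighting scheme are illustrative assumptions, not ForeCache's actual implementation:

```python
class HybridPrefetcher:
    """Blend a movement-based predictor with a data-similarity
    predictor, allocating the prefetch budget by recent accuracy."""
    def __init__(self, predictors):
        # predictors: name -> function(trace) -> ranked list of tiles
        self.predictors = predictors
        self.hits = {name: 1 for name in predictors}   # smoothed counts
        self.tries = {name: 2 for name in predictors}

    def prefetch_candidates(self, trace, budget):
        # Each predictor gets a share of the budget proportional to
        # how often its recent predictions were actually requested.
        weights = {n: self.hits[n] / self.tries[n] for n in self.predictors}
        total = sum(weights.values())
        ranked = []
        for name, predict in self.predictors.items():
            share = max(1, round(budget * weights[name] / total))
            ranked.extend(predict(trace)[:share])
        return ranked[:budget]

    def record_outcome(self, name, was_hit):
        # Feedback loop: adjusts the strategy mix over time.
        self.tries[name] += 1
        self.hits[name] += 1 if was_hit else 0
```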
Citations: 157
Publishing Graph Degree Distribution with Node Differential Privacy
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2926745
Wei-Yen Day, Ninghui Li, Min Lyu
Graph data publishing under node-differential privacy (node-DP) is challenging due to the huge sensitivity of queries. However, since a node in graph data oftentimes represents a person, node-DP is necessary to achieve personal data protection. In this paper, we investigate the problem of publishing the degree distribution of a graph under node-DP by exploring the projection approach to reduce the sensitivity. We propose two approaches, based on aggregation and the cumulative histogram, to publish the degree distribution. The experiments demonstrate that our approaches greatly reduce the error of approximating the true degree distribution and show significant improvements over existing work. We also present an introspective analysis for understanding the factors that affect publishing the degree distribution under node-DP.
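For intuition, here is a minimal sketch of the cumulative-histogram idea, assuming the graph has already been projected to a maximum degree theta and taking the node-DP sensitivity of the cumulative histogram as a parameter (the paper derives the actual bound; the function names are ours):

```python
import numpy as np

def noisy_cumulative_degree_hist(degrees, theta, epsilon, sensitivity):
    """Publish a degree histogram via a noisy cumulative histogram.

    Assumes `degrees` comes from a graph projected so every node has
    degree at most `theta`, and that `sensitivity` is the node-DP L1
    sensitivity of the cumulative histogram on theta-bounded graphs.
    """
    degrees = np.minimum(np.asarray(degrees, dtype=np.int64), theta)
    hist = np.bincount(degrees, minlength=theta + 1)
    cum = np.cumsum(hist)                 # C[d] = #nodes with degree <= d
    cum = cum + np.random.laplace(0.0, sensitivity / epsilon, cum.shape)
    # Post-process: restore monotonicity, then difference back into a
    # (non-negative) per-degree histogram.
    cum = np.maximum.accumulate(cum)
    return np.clip(np.diff(cum, prepend=0.0), 0, None)
```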
Citations: 144
GTS: A Fast and Scalable Graph Processing Method based on Streaming Topology to GPUs
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2915204
Min-Soo Kim, K. An, Himchan Park, Hyunseok Seo, Jinwook Kim
A fast and scalable graph processing method becomes increasingly important as graphs become popular in a wide range of applications and their sizes grow rapidly. Most distributed graph processing methods require many machines equipped with a total of thousands of CPU cores and a few terabytes of main memory to handle billion-scale graphs. Meanwhile, GPUs could be a promising direction toward fast processing of large-scale graphs by exploiting thousands of GPU cores. All of the existing methods using GPUs, however, fail to process large-scale graphs that do not fit in the main memory of a single machine. Here, we propose a fast and scalable graph processing method, GTS, that handles even RMAT32 (64 billion edges) very efficiently using only a single machine. The proposed method stores graphs on PCI-E SSDs and executes a graph algorithm using thousands of GPU cores while streaming the topology data of graphs to GPUs via the PCI-E interface. GTS is fast due to no communication overhead and scalable due to no data duplication from graph partitioning among machines. Through extensive experiments, we show that GTS consistently and significantly outperforms the major distributed graph processing methods, GraphX, Giraph, and PowerGraph, as well as the state-of-the-art GPU-based method TOTEM.
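The essence of the streaming design is overlapping PCI-E transfers with kernel execution so the GPU cores never starve. Below is a Python sketch of such a double-buffered loop; copy_to_gpu and run_kernel stand in for asynchronous CUDA copies and kernel launches, and are assumptions of ours rather than GTS's API:

```python
from concurrent.futures import ThreadPoolExecutor

def stream_topology(chunks, copy_to_gpu, run_kernel):
    """Double-buffered streaming: while the kernel processes the
    current topology chunk, the next chunk is copied to the GPU."""
    with ThreadPoolExecutor(max_workers=1) as copier:
        it = iter(chunks)
        first = next(it, None)
        if first is None:
            return
        pending = copier.submit(copy_to_gpu, first)
        for nxt in it:
            current = pending.result()                 # transfer done
            pending = copier.submit(copy_to_gpu, nxt)  # prefetch next
            run_kernel(current)                        # overlaps the copy
        run_kernel(pending.result())                   # drain last chunk
```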
Citations: 69
Fast Multi-Column Sorting in Main-Memory Column-Stores
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2915205
Wenjian Xu, Ziqiang Feng, Eric Lo
Sorting is a crucial operation that could be used to implement SQL operators such as GROUP BY, ORDER BY, and SQL:2003 PARTITION BY. Queries with multiple attributes in those clauses are common in real workloads. When executing queries of that kind, state-of-the-art main-memory column-stores require one round of sorting per input column. With the advent of recent fast scans and denormalization techniques, that kind of multi-column sorting could become a bottleneck. In this paper, we propose a new technique called "code massaging", which manipulates the bits across the columns so that the overall sorting time can be reduced by eliminating some rounds of sorting and/or by improving the degree of SIMD data level parallelism. Empirical results show that a main-memory column-store with code massaging can achieve speedup of up to 4.7X, 4.7X, 4X, and 3.2X on TPC-H, TPC-H skew, TPC-DS, and real workload, respectively.
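While the paper's bit-level "code massaging" is considerably more sophisticated, the basic reason fewer sorting rounds help can be seen in this small sketch, which packs two columns into one integer key so a single sort realizes ORDER BY col_a, col_b (the names and the fixed-width assumption are ours):

```python
import numpy as np

def pack_sort_keys(col_a, col_b, bits_b):
    """Concatenate the bits of two columns into one 64-bit key so that
    one sort orders rows by (col_a, col_b), replacing two rounds."""
    a = np.asarray(col_a, dtype=np.uint64)
    b = np.asarray(col_b, dtype=np.uint64)
    assert int(b.max()) < (1 << bits_b), "col_b must fit in bits_b bits"
    return (a << np.uint64(bits_b)) | b

# One argsort over packed keys == ORDER BY col_a, col_b in one round.
order = np.argsort(pack_sort_keys([3, 1, 3], [7, 2, 0], bits_b=8))
```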
Citations: 11
Hybrid Pulling/Pushing for I/O-Efficient Distributed and Iterative Graph Computing
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2882938
Zhigang Wang, Yu Gu, Y. Bao, Ge Yu, J. Yu
Billion-node graphs are rapidly growing in size in many applications such as online social networks. Most graph algorithms generate a large number of messages during iterative computations. Vertex-centric distributed systems usually store graph data and message data on disk to improve scalability. Currently, these distributed systems with disk-resident data take a push-based approach to handle messages. This works well if few messages reside on disk. Otherwise, it is I/O-inefficient due to expensive random writes. By contrast, the existing memory-resident pull-based approach individually pulls messages for each vertex on demand. Although it can be used to avoid disk operations regarding messages, expensive I/O costs are incurred by random and frequent access to vertices. This paper proposes a hybrid solution to support switching between push and pull adaptively, to obtain optimal performance for distributed systems with disk-resident data in different scenarios. We first employ a new block-centric technique (b-pull) to improve the I/O-performance of pulling messages, although the iterative computation is vertex-centric. I/O costs of data accesses are shifted from the receiver side where messages are written/read by push to the sender side where graph data are read by b-pull. Graph data are organized by clustering vertices and edges to achieve high I/O-efficiency in b-pull. Second, we design a seamless switching mechanism and a prominent performance prediction method to guarantee efficiency when switching between push and b-pull. We conduct extensive performance studies to confirm the effectiveness of our proposals over existing up-to-date solutions using a broad spectrum of real-world graphs.
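As a crude illustration of the switching decision (not the paper's actual performance predictor, which relies on runtime measurements), a cost-based rule might compare the random-write penalty of push against the sequential-read cost of b-pull:

```python
def choose_mode(num_messages, msg_bytes, memory_budget,
                rand_write_cost, seq_read_cost):
    """Toy cost model: push spills messages with random writes on the
    receiver side once memory is exhausted, while b-pull reads
    sender-side data sequentially. Pick the cheaper mode."""
    total_bytes = num_messages * msg_bytes
    spill_bytes = max(0, total_bytes - memory_budget)
    push_cost = spill_bytes * rand_write_cost    # random I/O on receivers
    pull_cost = total_bytes * seq_read_cost      # sequential I/O on senders
    return "push" if push_cost <= pull_cost else "b-pull"
```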
Citations: 30
AT-GIS: Highly Parallel Spatial Query Processing with Associative Transducers
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2882962
Peter Ogden, David B. Thomas, P. Pietzuch
Users in many domains, including urban planning, transportation, and environmental science want to execute analytical queries over continuously updated spatial datasets. Current solutions for large-scale spatial query processing either rely on extensions to RDBMS, which entails expensive loading and indexing phases when the data changes, or distributed map/reduce frameworks, running on resource-hungry compute clusters. Both solutions struggle with the sequential bottleneck of parsing complex, hierarchical spatial data formats, which frequently dominates query execution time. Our goal is to fully exploit the parallelism offered by modern multi-core CPUs for parsing and query execution, thus providing the performance of a cluster with the resources of a single machine. We describe AT-GIS, a highly-parallel spatial query processing system that scales linearly to a large number of CPU cores. AT-GIS integrates the parsing and querying of spatial data using a new computational abstraction called associative transducers (ATs). ATs can form a single data-parallel pipeline for computation without requiring the spatial input data to be split into logically independent blocks. Using ATs, AT-GIS can execute, in parallel, spatial query operators on the raw input data in multiple formats, without any pre-processing. On a single 64-core machine, AT-GIS provides 3x the performance of an 8-node Hadoop cluster with 192 cores for containment queries, and 10x for aggregation queries.
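The key property that lets AT-style processing handle chunks independently is that a finite transducer's effect on a chunk can be summarized as a state-to-state mapping, and such mappings compose associatively. Here is a minimal Python sketch of that idea (our simplification; the paper's associative transducers also carry query outputs, not just states):

```python
def chunk_summary(dfa, chunk, states):
    """Run the automaton over `chunk` from every possible start state,
    yielding a mapping start_state -> end_state. Chunks can then be
    summarized in parallel without knowing their true start states.
    `dfa[state][symbol]` gives the successor state (illustrative)."""
    mapping = {}
    for s in states:
        cur = s
        for sym in chunk:
            cur = dfa[cur][sym]
        mapping[s] = cur
    return mapping

def compose(m1, m2):
    # Associative composition: apply m1's chunk, then m2's. Because it
    # is associative, summaries can be combined in any tree order.
    return {s: m2[m1[s]] for s in m1}
```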
Citations: 5
Shasta: Interactive Reporting At Scale
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2904444
G. Manoharan, Stephan Ellner, Karl Schnaitter, Sridatta Chegu, Alejandro Estrella-Balderrama, Stephan Gudmundson, Apurv Gupta, B. Handy, Bart Samwel, Chad Whipkey, Larysa Aharkava, Himani Apte, Nitin Gangahar, Jun Xu, S. Venkataraman, D. Agrawal, J. Ullman
We describe Shasta, a middleware system built at Google to support interactive reporting in complex user-facing applications related to Google's Internet advertising business. Shasta targets applications with challenging requirements: First, user query latencies must be low. Second, underlying transactional data stores have complex "read-unfriendly" schemas, placing significant transformation logic between stored data and the read-only views that Shasta exposes to its clients. This transformation logic must be expressed in a way that scales to large and agile engineering teams. Finally, Shasta targets applications with strong data freshness requirements, making it challenging to precompute query results using common techniques such as ETL pipelines or materialized views. Instead, online queries must go all the way from primary storage to user-facing views, resulting in complex queries joining 50 or more tables. Designed as a layer on top of Google's F1 RDBMS and Mesa data warehouse, Shasta combines language and system techniques to meet these requirements. To help with expressing complex view specifications, we developed a query language called RVL, with support for modularized view templates that can be dynamically compiled into SQL. To execute these SQL queries with low latency at scale, we leveraged and extended F1's distributed query engine with facilities such as safe execution of C++ and Java UDFs. To reduce latency and increase read parallelism, we extended F1 storage with a distributed read-only in-memory cache. The system we describe is in production at Google, powering critical applications used by advertisers and internal sales teams. Shasta has significantly improved system scalability and software engineering efficiency compared to the middleware solutions it replaced.
Citations: 5
Goods: Organizing Google's Datasets
Pub Date : 2016-06-14 DOI: 10.1145/2882903.2903730
A. Halevy, Flip Korn, Natasha Noy, Christopher Olston, N. Polyzotis, Sudip Roy, Steven Euijong Whang
Enterprises increasingly rely on structured datasets to run their businesses. These datasets take a variety of forms, such as structured files, databases, spreadsheets, or even services that provide access to the data. The datasets often reside in different storage systems, may vary in their formats, may change every day. In this paper, we present GOODS, a project to rethink how we organize structured datasets at scale, in a setting where teams use diverse and often idiosyncratic ways to produce the datasets and where there is no centralized system for storing and querying them. GOODS extracts metadata ranging from salient information about each dataset (owners, timestamps, schema) to relationships among datasets, such as similarity and provenance. It then exposes this metadata through services that allow engineers to find datasets within the company, to monitor datasets, to annotate them in order to enable others to use their datasets, and to analyze relationships between them. We discuss the technical challenges that we had to overcome in order to crawl and infer the metadata for billions of datasets, to maintain the consistency of our metadata catalog at scale, and to expose the metadata to users. We believe that many of the lessons that we learned are applicable to building large-scale enterprise-level data-management systems in general.
Citations: 175