Vectorized Bloom filters for advanced SIMD processors

International Workshop on Data Management on New Hardware Pub Date : 2014-06-23 DOI:10.1145/2619228.2619234

Orestis Polychroniou, K. A. Ross

引用次数: 61

Abstract

Analytics are at the core of many business intelligence tasks. Efficient query execution is facilitated by advanced hardware features, such as multi-core parallelism, shared-nothing low-latency caches, and SIMD vector instructions. Only recently, the SIMD capabilities of mainstream hardware have been augmented with wider vectors and non-contiguous loads termed gathers. While analytical DBMSs minimize the use of indexes in favor of scans based on sequential memory accesses, some data structures remain crucial. The Bloom filter, one such example, is the most efficient structure for filtering tuples based on their existence in a set and its performance is critical when joining tables with vastly different cardinalities. We introduce a vectorized implementation for probing Bloom filters based on gathers that eliminates conditional control flow and is independent of the SIMD length. Our techniques are generic and can be reused for accelerating other database operations. Our evaluation indicates a significant performance improvement over scalar code that can exceed 3X when the Bloom filter is cache-resident.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

矢量布隆过滤器先进的SIMD处理器

分析是许多商业智能任务的核心。高级硬件特性促进了高效的查询执行，例如多核并行性、无共享的低延迟缓存和SIMD矢量指令。直到最近，主流硬件的SIMD功能才通过更宽的矢量和称为集的非连续负载得到增强。虽然分析dbms最大限度地减少了索引的使用，而支持基于顺序内存访问的扫描，但一些数据结构仍然至关重要。Bloom过滤器就是这样一个例子，它是根据元组在集合中的存在性来过滤元组的最有效的结构，当连接基数差别很大的表时，它的性能至关重要。我们引入了一种矢量化实现，用于探测基于集合的布隆过滤器，该集合消除了条件控制流并且独立于SIMD长度。我们的技术是通用的，可以用于加速其他数据库操作。我们的评估表明，当Bloom过滤器驻留在缓存中时，性能比标量代码有显著的提高，可以超过3倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Workshop on Data Management on New Hardware

自引率

0.00%

发文量

期刊最新文献

On testing persistent-memory-based software SIMD-accelerated regular expression matching FPGA-accelerated group-by aggregation using synchronizing caches Customized OS support for data-processing Larger-than-memory data management on modern storage hardware for in-memory OLTP database systems