Join processing for flash SSDs: remembering past lessons

International Workshop on Data Management on New Hardware Pub Date : 2009-06-28 DOI:10.1145/1565694.1565696

Jaeyoung Do, J. Patel

{"title":"Join processing for flash SSDs: remembering past lessons","authors":"Jaeyoung Do, J. Patel","doi":"10.1145/1565694.1565696","DOIUrl":null,"url":null,"abstract":"Flash solid state drives (SSDs) provide an attractive alternative to traditional magnetic hard disk drives (HDDs) for DBMS applications. Naturally there is substantial interest in redesigning critical database internals, such as join algorithms, for flash SSDs. However, we must carefully consider the lessons that we have learnt from over three decades of designing and tuning algorithms for magnetic HDD-based systems, so that we continue to reuse techniques that worked for magnetic HDDs and also work with flash SSDs.\n The focus of this paper is on recalling some of these lessons in the context of ad hoc join algorithms. Based on an actual implementation of four common ad hoc join algorithms on both a magnetic HDD and a flash SSD, we show that many of the \"surprising\" results from magnetic HDD-based join methods also hold for flash SSDs. These results include the superiority of block nested loops join over sort-merge join and Grace hash join in many cases, and the benefits of blocked I/Os. In addition, we find that simply looking at the I/O costs when designing new flash SSD join algorithms can be problematic, as the CPU cost is often a bigger component of the total join cost with SSDs. We hope that these results provide insights and better starting points for researchers designing new join algorithms for flash SSDs.","PeriodicalId":298901,"journal":{"name":"International Workshop on Data Management on New Hardware","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"38","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Data Management on New Hardware","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1565694.1565696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 38

Abstract

Flash solid state drives (SSDs) provide an attractive alternative to traditional magnetic hard disk drives (HDDs) for DBMS applications. Naturally there is substantial interest in redesigning critical database internals, such as join algorithms, for flash SSDs. However, we must carefully consider the lessons that we have learnt from over three decades of designing and tuning algorithms for magnetic HDD-based systems, so that we continue to reuse techniques that worked for magnetic HDDs and also work with flash SSDs. The focus of this paper is on recalling some of these lessons in the context of ad hoc join algorithms. Based on an actual implementation of four common ad hoc join algorithms on both a magnetic HDD and a flash SSD, we show that many of the "surprising" results from magnetic HDD-based join methods also hold for flash SSDs. These results include the superiority of block nested loops join over sort-merge join and Grace hash join in many cases, and the benefits of blocked I/Os. In addition, we find that simply looking at the I/O costs when designing new flash SSD join algorithms can be problematic, as the CPU cost is often a bigger component of the total join cost with SSDs. We hope that these results provide insights and better starting points for researchers designing new join algorithms for flash SSDs.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

加入闪存ssd的处理:记住过去的教训

闪存固态驱动器(ssd)为DBMS应用程序提供了传统磁性硬盘驱动器(hdd)的有吸引力的替代方案。当然，对于为闪存ssd重新设计关键的数据库内部，例如连接算法，有很大的兴趣。然而，我们必须仔细考虑我们从30多年来为基于磁性hdd的系统设计和调整算法中学到的经验教训，以便我们继续重用适用于磁性hdd和闪存ssd的技术。本文的重点是回顾在特设连接算法上下文中的一些经验教训。通过在磁性HDD和闪存SSD上实际实现四种常见的临时连接算法，我们发现，基于磁性HDD的连接方法的许多“令人惊讶”的结果也适用于闪存SSD。这些结果包括在许多情况下块嵌套循环连接优于排序合并连接和Grace散列连接，以及阻塞I/ o的好处。此外，我们发现，在设计新的闪存SSD连接算法时，仅仅考虑I/O成本可能会有问题，因为CPU成本通常是SSD总连接成本中较大的组成部分。我们希望这些结果为研究人员设计闪存固态硬盘的新连接算法提供见解和更好的起点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Workshop on Data Management on New Hardware

自引率

0.00%

发文量

期刊最新文献

On testing persistent-memory-based software SIMD-accelerated regular expression matching FPGA-accelerated group-by aggregation using synchronizing caches Customized OS support for data-processing Larger-than-memory data management on modern storage hardware for in-memory OLTP database systems