Cost/Performance in Modern Data Stores: How Data Caching Systems Succeed

D. Lomet
{"title":"Cost/Performance in Modern Data Stores: How Data Caching Systems Succeed","authors":"D. Lomet","doi":"10.1145/3211922.3211927","DOIUrl":null,"url":null,"abstract":"Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Data in traditional \"caching\" data systems resides on secondary storage, and is read into main memory only when operated on. This limits system performance. Main memory data stores with data always in main memory are much faster. But this performance comes at a cost. In this paper, we analyze the costs of both in-memory operations and secondary storage operations where data is not \"in cache\". We study the performance impact of cache misses on caching system performance. The analysis considers both execution and storage costs. Based on our analysis, we derive cost/performance results for a data caching system [Deuteronomy and its Bw-tree] and a main memory system [MassTree] to understand where each demonstrates the best cost per operation, what is driving the cost differences, and the scale of the differences. This analysis (1) provides insight into why data caching systems continue to dominate the market; (2) points to higher performance that does not rely on simply increasing main memory cache size; and (3) suggests a path to lower costs and hence better cost/performance.","PeriodicalId":186190,"journal":{"name":"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3211922.3211927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

Abstract

Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Data in traditional "caching" data systems resides on secondary storage, and is read into main memory only when operated on. This limits system performance. Main memory data stores with data always in main memory are much faster. But this performance comes at a cost. In this paper, we analyze the costs of both in-memory operations and secondary storage operations where data is not "in cache". We study the performance impact of cache misses on caching system performance. The analysis considers both execution and storage costs. Based on our analysis, we derive cost/performance results for a data caching system [Deuteronomy and its Bw-tree] and a main memory system [MassTree] to understand where each demonstrates the best cost per operation, what is driving the cost differences, and the scale of the differences. This analysis (1) provides insight into why data caching systems continue to dominate the market; (2) points to higher performance that does not rely on simply increasing main memory cache size; and (3) suggests a path to lower costs and hence better cost/performance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
现代数据存储的成本/性能:数据缓存系统如何成功
仅给出摘要形式,如下。完整的报告没有作为会议记录的一部分提供出版。传统的“缓存”数据系统中的数据驻留在二级存储器上,只有在对其进行操作时才读入主存。这限制了系统性能。数据总是在主存中的主存数据存储要快得多。但这种表现是有代价的。在本文中,我们分析了内存操作和二级存储操作的成本,其中数据不在“缓存中”。我们研究了缓存缺失对缓存系统性能的影响。该分析同时考虑了执行和存储成本。根据我们的分析,我们得出了数据缓存系统(Deuteronomy及其Bw-tree)和主内存系统(masstreet)的成本/性能结果,以了解每个操作在哪些方面表现出最佳成本,是什么导致了成本差异,以及差异的规模。本分析(1)提供了数据缓存系统继续主导市场的原因;(2)指向更高的性能,而不是简单地依赖于增加主内存缓存大小;(3)提出了降低成本从而提高性价比的途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Triangle Counting on GPU Using Fine-Grained Task Distribution Distilling Knowledge from User Information for Document Level Sentiment Classification Reachability in Large Graphs Using Bloom Filters Food Image to Cooking Instructions Conversion Through Compressed Embeddings Using Deep Learning Predicting Online User Purchase Behavior Based on Browsing History
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1