A Penalty Aware Memory Allocation Scheme for Key-Value Cache

Jianqiang Ou, Marc Patton, M. D. Moore, Yuehai Xu, Song Jiang
{"title":"A Penalty Aware Memory Allocation Scheme for Key-Value Cache","authors":"Jianqiang Ou, Marc Patton, M. D. Moore, Yuehai Xu, Song Jiang","doi":"10.1109/ICPP.2015.62","DOIUrl":null,"url":null,"abstract":"Key-value caches, represented by Mem cached, play a critical role in data centers. Its efficacy can significantly impact users' perceived service time and back-end systems' workloads. A central issue in the in-memory cache's management is memory allocation, or how the limited space is distributed for storing key-value items of various sizes. When a cache is full, the allocation issue is how to conduct replacement operations on items of different sizes. To effectively address the issue, a practitioner must simultaneously consider three factors, which are access locality, item size, and miss penalty. Existing designs consider only one or two of the first two factors, and pay little attention on miss penalty. This inadequacy can substantially compromise utilization of cache space and request service time. In this paper we propose a Penalty Aware Memory Allocation scheme (PAMA) that takes all three factors into account. While the three different factors cannot be directly compared to each other in a quantitative manner, PAMA uses their impacts on service time to determine where a unit of memory space should be (de)allocated. The impacts are quantified as the decrease (or increase) of service time if a unit of space is allocated (or deal located). PAMA efficiently tracks access pattern and use of memory, and speculatively evaluates the impacts to enable penalty-aware memory allocation for KV caches. Our evaluation with real-world Mem cached workload traces demonstrates that PAMA can significantly reduce request service time compared to other representative KV cache management schemes.","PeriodicalId":423007,"journal":{"name":"2015 44th International Conference on Parallel Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 44th International Conference on Parallel Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPP.2015.62","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Key-value caches, represented by Memcached, play a critical role in data centers. Their efficacy can significantly impact users' perceived service time and the workloads of back-end systems. A central issue in managing an in-memory cache is memory allocation, or how the limited space is distributed for storing key-value items of various sizes. When a cache is full, the allocation issue becomes how to conduct replacement operations on items of different sizes. To effectively address the issue, a practitioner must simultaneously consider three factors: access locality, item size, and miss penalty. Existing designs consider only one or two of the first two factors, and pay little attention to miss penalty. This inadequacy can substantially compromise both utilization of cache space and request service time. In this paper we propose a Penalty Aware Memory Allocation scheme (PAMA) that takes all three factors into account. While the three factors cannot be directly compared to one another in a quantitative manner, PAMA uses their impacts on service time to determine where a unit of memory space should be allocated or deallocated. The impacts are quantified as the decrease (or increase) of service time if a unit of space is allocated (or deallocated). PAMA efficiently tracks access patterns and memory usage, and speculatively evaluates these impacts to enable penalty-aware memory allocation for KV caches. Our evaluation with real-world Memcached workload traces demonstrates that PAMA can significantly reduce request service time compared with other representative KV cache management schemes.
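
To make the abstract's central idea concrete, below is a minimal sketch in Python of how a penalty-aware allocator could score slab size classes by their estimated impact on service time. All names here (SizeClass, boundary_reuse_rate, service_time_impact, rebalance) and the specific scoring formula are illustrative assumptions, not PAMA's actual bookkeeping or algorithm; the paper's scheme tracks access patterns and speculatively evaluates impacts in a more refined way.

```python
from dataclasses import dataclass

@dataclass
class SizeClass:
    # Per-class statistics a penalty-aware allocator might track.
    # All fields are illustrative assumptions, not PAMA's real bookkeeping.
    items_per_slab: int         # KV items that fit in one slab of this class
    boundary_reuse_rate: float  # expected accesses/sec to each item closest to eviction
    miss_penalty_s: float       # average back-end service time per miss, in seconds

def service_time_impact(c: SizeClass) -> float:
    # Estimated back-end service time saved per second by granting this
    # class one more slab; symmetrically, the time lost if one is reclaimed.
    # One extra slab retains items_per_slab additional items, and each
    # avoided miss saves miss_penalty_s of back-end latency.
    return c.boundary_reuse_rate * c.items_per_slab * c.miss_penalty_s

def rebalance(classes: dict[str, SizeClass]):
    # Move one slab from the class that loses the least service time to
    # the class that gains the most, if the trade is a net win.
    donor = min(classes, key=lambda k: service_time_impact(classes[k]))
    recipient = max(classes, key=lambda k: service_time_impact(classes[k]))
    if donor != recipient and (
        service_time_impact(classes[recipient]) > service_time_impact(classes[donor])
    ):
        return donor, recipient  # reclaim a slab from donor, grant it to recipient
    return None

# Example: many small items with cheap misses vs. few large items whose
# misses are expensive at the back end.
classes = {
    "64B": SizeClass(items_per_slab=16384, boundary_reuse_rate=2.0, miss_penalty_s=0.001),
    "8KB": SizeClass(items_per_slab=128, boundary_reuse_rate=5.0, miss_penalty_s=0.100),
}
print(rebalance(classes))  # -> ('64B', '8KB'): the large class saves more time per slab
```

In this sketch, memory flows from the class whose marginal slab saves the least back-end time to the class whose marginal slab saves the most, which mirrors the abstract's point: locality, item size, and miss penalty are incommensurable on their own, but become comparable once each is expressed in the single currency of service time.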