Balanced Allocations with Heterogeneous Bins: The Power of Memory

Proceedings of the ... Annual ACM-SIAM Symposium on Discrete Algorithms. ACM-SIAM Symposium on Discrete Algorithms Pub Date : 2023-01-24 DOI:10.1137/1.9781611977554.ch169

Dimitrios Los, Thomas Sauerwald, John Sylvester

{"title":"Balanced Allocations with Heterogeneous Bins: The Power of Memory","authors":"Dimitrios Los, Thomas Sauerwald, John Sylvester","doi":"10.1137/1.9781611977554.ch169","DOIUrl":null,"url":null,"abstract":"We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the standard Two-Choice process, at each step $t=1,2,\\ldots,m$ we first sample two bins uniformly at random and place a ball in the least loaded bin. It is well-known that for any $m \\geq n$, this results in a gap (difference between the maximum and average load) of $\\log_2 \\log n + \\Theta(1)$ (with high probability). In this work, we consider the Memory process [Mitzenmacher, Prabhakar and Shah 2002] where instead of two choices, we only sample one bin per step but we have access to a cache which can store the location of one bin. Mitzenmacher, Prabhakar and Shah showed that in the lightly loaded case ($m = n$), the Memory process achieves a gap of $\\mathcal{O}(\\log \\log n)$. Extending the setting of Mitzenmacher et al. in two ways, we first allow the number of balls $m$ to be arbitrary, which includes the challenging heavily loaded case where $m \\geq n$. Secondly, we follow the heterogeneous bins model of Wieder [Wieder 2007], where the sampling distribution of bins can be biased up to some arbitrary multiplicative constant. Somewhat surprisingly, we prove that even in this setting, the Memory process still achieves an $\\mathcal{O}(\\log \\log n)$ gap bound. This is in stark contrast with the Two-Choice (or any $d$-Choice with $d=\\mathcal{O}(1)$) process, where it is known that the gap diverges as $m \\rightarrow \\infty$ [Wieder 2007]. Further, we show that for any sampling distribution independent of $m$ (but possibly dependent on $n$) the Memory process has a gap that can be bounded independently of $m$. Finally, we prove a tight gap bound of $\\mathcal{O}(\\log n)$ for Memory in another relaxed setting with heterogeneous (weighted) balls and a cache which can only be maintained for two steps.","PeriodicalId":92709,"journal":{"name":"Proceedings of the ... Annual ACM-SIAM Symposium on Discrete Algorithms. ACM-SIAM Symposium on Discrete Algorithms","volume":"45 1","pages":"4448-4477"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... Annual ACM-SIAM Symposium on Discrete Algorithms. ACM-SIAM Symposium on Discrete Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611977554.ch169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the standard Two-Choice process, at each step $t=1,2,\ldots,m$ we first sample two bins uniformly at random and place a ball in the least loaded bin. It is well-known that for any $m \geq n$, this results in a gap (difference between the maximum and average load) of $\log_2 \log n + \Theta(1)$ (with high probability). In this work, we consider the Memory process [Mitzenmacher, Prabhakar and Shah 2002] where instead of two choices, we only sample one bin per step but we have access to a cache which can store the location of one bin. Mitzenmacher, Prabhakar and Shah showed that in the lightly loaded case ($m = n$), the Memory process achieves a gap of $\mathcal{O}(\log \log n)$. Extending the setting of Mitzenmacher et al. in two ways, we first allow the number of balls $m$ to be arbitrary, which includes the challenging heavily loaded case where $m \geq n$. Secondly, we follow the heterogeneous bins model of Wieder [Wieder 2007], where the sampling distribution of bins can be biased up to some arbitrary multiplicative constant. Somewhat surprisingly, we prove that even in this setting, the Memory process still achieves an $\mathcal{O}(\log \log n)$ gap bound. This is in stark contrast with the Two-Choice (or any $d$-Choice with $d=\mathcal{O}(1)$) process, where it is known that the gap diverges as $m \rightarrow \infty$ [Wieder 2007]. Further, we show that for any sampling distribution independent of $m$ (but possibly dependent on $n$) the Memory process has a gap that can be bounded independently of $m$. Finally, we prove a tight gap bound of $\mathcal{O}(\log n)$ for Memory in another relaxed setting with heterogeneous (weighted) balls and a cache which can only be maintained for two steps.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

异构箱的均衡分配:内存的力量

我们考虑将$m$球(作业)分配到$n$箱(服务器)中。在标准的两选过程中，在每一步$t=1,2,\ldots,m$，我们首先均匀随机地对两个箱子进行抽样，并将一个球放入装载最少的箱子中。众所周知，对于任何$m \geq n$，这会导致(高概率)$\log_2 \log n + \Theta(1)$的差距(最大和平均负载之间的差异)。在这项工作中，我们考虑内存过程[Mitzenmacher, Prabhakar and Shah 2002]，其中我们每一步只采样一个bin，而不是两个选择，但我们可以访问可以存储一个bin位置的缓存。Mitzenmacher, Prabhakar和Shah表明，在轻负载情况下($m = n$)， Memory进程实现了$\mathcal{O}(\log \log n)$的间隙。以两种方式扩展Mitzenmacher等人的设置，我们首先允许球的数量$m$是任意的，其中包括具有挑战性的重载情况$m \geq n$。其次，我们遵循Wieder [Wieder 2007]的异构箱模型，其中箱的抽样分布可以偏置到一些任意的乘法常数。有些令人惊讶的是，我们证明即使在这种设置下，Memory进程仍然达到$\mathcal{O}(\log \log n)$间隙界限。这与两种选择(或任何$d$ -选择与$d=\mathcal{O}(1)$)过程形成鲜明对比，其中已知差距发散为$m \rightarrow \infty$ [Wieder 2007]。此外，我们表明，对于任何独立于$m$(但可能依赖于$n$)的抽样分布，Memory进程都有一个可以独立于$m$限定的间隙。最后，我们证明了在另一种具有异构(加权)球和只能维持两步的缓存的宽松设置下内存的紧密间隙界$\mathcal{O}(\log n)$。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊