On the Analysis and Evaluation of Proximity-based Load-balancing Policies

IF 1.6 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS ACM Transactions on Modeling and Performance Evaluation of Computing Systems Pub Date : 2022-07-21 DOI:10.1145/3549933

Nitish K. Panigrahy, Thirupathaiah Vasantam, P. Basu, D. Towsley, A. Swami, K. Leung

{"title":"On the Analysis and Evaluation of Proximity-based Load-balancing Policies","authors":"Nitish K. Panigrahy, Thirupathaiah Vasantam, P. Basu, D. Towsley, A. Swami, K. Leung","doi":"10.1145/3549933","DOIUrl":null,"url":null,"abstract":"Distributed load balancing is the act of allocating jobs among a set of servers as evenly as possible. The static interpretation of distributed load balancing leads to formulating the load-balancing problem as a classical balls-and-bins problem with jobs (balls) never leaving the system and accumulating at the servers (bins). While most of the previous work in the static setting focus on studying the maximum number of jobs allocated to a server or maximum load, little importance has been given to the implementation cost, or the cost of moving a job/data to/from its allocated server, for such policies. This article designs and evaluates server proximity aware static load-balancing policies with a goal to reduce the implementation cost. We consider a class of proximity aware Power of Two (POT) choice-based assignment policies for allocating jobs to servers, where both jobs and servers are located on a two-dimensional Euclidean plane. In this framework, we investigate the tradeoff between the implementation cost and load-balancing performance of different allocation policies. To this end, we first design and evaluate a Spatial Power of two (sPOT) policy in which each job is allocated to the least loaded server among its two geographically nearest servers. We provide expressions for the lower bound on the asymptotic expected maximum load on the servers and prove that sPOT does not achieve classical POT load-balancing benefits. However, experimental results suggest the efficacy of sPOT with respect to expected implementation cost. We also propose two non-uniform server sampling-based POT policies that achieve the best of both implementation cost and load-balancing performance. We then extend our analysis to the case where servers are interconnected as an n-vertex graph G(S, E). We assume each job arrives at one of the servers, u, chosen uniformly at random from the vertex set S. We then assign each job to the server with minimum load among servers u and v where v is chosen according to one of the following two policies: (i) Unif-POT(k): Sample a server v uniformly at random from k-hop neighborhood of u; (ii) InvSq-POT(k): Sample a server v from k-hop neighborhood of u with probability proportional to the inverse square of the distance between u and v. An extensive simulation over a wide range of topologies validates the efficacy of both the policies. Our simulation results show that both policies consistently produce a load distribution that is much similar to that of a classical POT. Depending on topology, we observe the total variation distance to be of the order of 0.002–0.08 for both the policies while achieving a 8%–99% decrease in implementation cost as compared to the classical POT.","PeriodicalId":56350,"journal":{"name":"ACM Transactions on Modeling and Performance Evaluation of Computing Systems","volume":"7 1","pages":"1 - 27"},"PeriodicalIF":1.6000,"publicationDate":"2022-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Modeling and Performance Evaluation of Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3549933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 1

Abstract

Distributed load balancing is the act of allocating jobs among a set of servers as evenly as possible. The static interpretation of distributed load balancing leads to formulating the load-balancing problem as a classical balls-and-bins problem with jobs (balls) never leaving the system and accumulating at the servers (bins). While most of the previous work in the static setting focus on studying the maximum number of jobs allocated to a server or maximum load, little importance has been given to the implementation cost, or the cost of moving a job/data to/from its allocated server, for such policies. This article designs and evaluates server proximity aware static load-balancing policies with a goal to reduce the implementation cost. We consider a class of proximity aware Power of Two (POT) choice-based assignment policies for allocating jobs to servers, where both jobs and servers are located on a two-dimensional Euclidean plane. In this framework, we investigate the tradeoff between the implementation cost and load-balancing performance of different allocation policies. To this end, we first design and evaluate a Spatial Power of two (sPOT) policy in which each job is allocated to the least loaded server among its two geographically nearest servers. We provide expressions for the lower bound on the asymptotic expected maximum load on the servers and prove that sPOT does not achieve classical POT load-balancing benefits. However, experimental results suggest the efficacy of sPOT with respect to expected implementation cost. We also propose two non-uniform server sampling-based POT policies that achieve the best of both implementation cost and load-balancing performance. We then extend our analysis to the case where servers are interconnected as an n-vertex graph G(S, E). We assume each job arrives at one of the servers, u, chosen uniformly at random from the vertex set S. We then assign each job to the server with minimum load among servers u and v where v is chosen according to one of the following two policies: (i) Unif-POT(k): Sample a server v uniformly at random from k-hop neighborhood of u; (ii) InvSq-POT(k): Sample a server v from k-hop neighborhood of u with probability proportional to the inverse square of the distance between u and v. An extensive simulation over a wide range of topologies validates the efficacy of both the policies. Our simulation results show that both policies consistently produce a load distribution that is much similar to that of a classical POT. Depending on topology, we observe the total variation distance to be of the order of 0.002–0.08 for both the policies while achieving a 8%–99% decrease in implementation cost as compared to the classical POT.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于邻近性的负载均衡策略分析与评价

分布式负载平衡是在一组服务器之间尽可能均匀地分配作业的行为。分布式负载平衡的静态解释导致将负载平衡问题公式化为经典的ball和bins问题，其中作业（ball）永远不会离开系统并在服务器（bins）处累积。虽然以前在静态设置中的大多数工作都集中在研究分配给服务器的最大作业数或最大负载，但对于此类策略，实现成本或将作业/数据移动到其分配的服务器/从其移动数据的成本几乎没有被重视。本文设计并评估了服务器邻近感知静态负载平衡策略，旨在降低实现成本。我们考虑了一类基于接近感知二次幂（POT）选择的分配策略，用于将作业分配给服务器，其中作业和服务器都位于二维欧几里得平面上。在这个框架中，我们研究了不同分配策略的实现成本和负载平衡性能之间的权衡。为此，我们首先设计并评估了一种空间二次方（sPOT）策略，在该策略中，每个作业都被分配给其两个地理位置最近的服务器中负载最小的服务器。我们提供了服务器上渐近预期最大负载的下界的表达式，并证明了sPOT没有实现经典的POT负载平衡优势。然而，实验结果表明了sPOT相对于预期实施成本的有效性。我们还提出了两种基于非均匀服务器采样的POT策略，实现了实现成本和负载平衡性能的最佳化。然后，我们将分析扩展到服务器作为n顶点图G（S，E）互连的情况。我们假设每个作业到达从顶点集S随机均匀选择的服务器u之一。然后，我们将每个作业分配给服务器u和v中负载最小的服务器，其中v是根据以下两个策略之一选择的：（i）Unif POT（k）：从u的k跳邻域随机均匀采样服务器v；（ii）InvSq POT（k）：从u的k跳邻域采样服务器v，其概率与u和v之间距离的平方反比。在广泛的拓扑结构上进行的广泛模拟验证了这两种策略的有效性。我们的模拟结果表明，这两种策略一致地产生了与经典POT非常相似的负载分布。根据拓扑结构，我们观察到两种策略的总变化距离在0.002–0.08的数量级，同时与经典POT相比，实现成本降低了8%–99%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊