
Performance Evaluation: Latest Publications

Job assignment in machine learning inference systems with accuracy constraints
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-12-12 | DOI: 10.1016/j.peva.2024.102463
Tuhinangshu Choudhury , Gauri Joshi , Weina Wang
Modern machine learning inference systems often host multiple models that can perform the same task with different levels of accuracy and latency. For example, a large model can be more accurate but slow, whereas a smaller, less accurate model can serve inference queries faster. Amidst the rapid advancements in Large Language Models (LLMs), it is paramount for such systems to strike the best trade-off between latency and accuracy. In this paper, we consider the problem of designing job assignment policies for a multi-server queueing system where servers have heterogeneous rates and accuracies, and our goal is to minimize the expected inference latency while meeting an average accuracy target. To the best of our knowledge, such constrained queueing systems have been sparsely studied in the prior literature. We first identify a lower bound on the minimum achievable latency under any policy that achieves the target accuracy a* using a linear programming (LP) formulation. Building on the LP solution, we introduce a Randomized Join-the-Idle-Queue (R-JIQ) policy, which consistently meets the accuracy target and asymptotically (as the system size increases) achieves the optimal latency T_LP-LB(λ). However, the R-JIQ policy relies on knowledge of the arrival rate λ to solve the LP. To address this limitation, we propose the Prioritize Ordered Pairs (POP) policy, which incorporates the concept of ordered pairs of servers into waterfilling to iteratively solve the LP. This allows the POP policy to function without relying on the arrival rate. Experiments suggest that POP performs robustly across different system sizes and load scenarios, achieving near-optimal performance.
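The latency-accuracy routing problem in this abstract can be made concrete with a toy example. The sketch below is illustrative only: it uses assumed rates and accuracies and a plain M/M/1 delay model rather than the paper's LP, and grid-searches the probabilistic split of queries between an accurate-but-slow server and a fast-but-less-accurate one, subject to an average accuracy target.

```python
# Hypothetical two-server illustration (not the paper's exact LP formulation):
# route each query to server i with probability p_i; each server is an M/M/1 queue.
lam = 0.8              # total arrival rate (assumed)
mu = [1.0, 2.0]        # service rates: accurate-but-slow, fast-but-less-accurate
acc = [0.95, 0.80]     # per-server accuracies (assumed)
a_star = 0.90          # average accuracy target

best = None
for k in range(1001):
    p1 = k / 1000.0                      # fraction of traffic sent to server 1
    p2 = 1.0 - p1
    if p1 * acc[0] + p2 * acc[1] < a_star:
        continue                         # accuracy constraint violated
    if lam * p1 >= mu[0] or lam * p2 >= mu[1]:
        continue                         # a server would be unstable
    # mean sojourn time of an M/M/1 queue with arrival rate lam * p_i
    t = p1 / (mu[0] - lam * p1) + p2 / (mu[1] - lam * p2)
    if best is None or t < best[0]:
        best = (t, p1)

print(f"min mean latency {best[0]:.3f} at p1 = {best[1]:.3f}")
```

With these assumed numbers the accuracy constraint forces roughly two thirds of the traffic onto the slow, accurate server, and the latency minimum sits on the constraint boundary, mirroring how the LP lower bound trades latency for accuracy.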
Citations: 0
Analysis of a queue-length-dependent vacation queue with bulk service, N-policy, set-up time and cost optimization
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-11-20 | DOI: 10.1016/j.peva.2024.102459
P. Karan, S. Pradhan
Owing to the extensive applications of bulk-service vacation queues in manufacturing industries, inventory systems, wireless sensor networks for reducing energy consumption, etc., in this article we analyze the steady-state behavior of an infinite-buffer, group-arrival, bulk-service queue with a vacation scenario, set-up time, and an N-threshold policy. Customers arrive according to a compound Poisson process, and the server initiates service with a minimum of 'a' customers and can serve a maximum of 'b' customers at a time. We adopt batch-size-dependent service times as well as queue-length-dependent vacation durations, which significantly improve the system's performance. The N-threshold policy is proposed to awaken the server from a vacation/dormant state: the service station starts the set-up procedure after a pre-decided number 'N' of customers has accumulated. Using the supplementary variable technique, we first derive the set of steady-state system equations. We then obtain the bivariate probability generating functions (pgfs) of the queue content and the size of the departing batch, and of the queue content and the type of vacation taken by the server at the vacation completion epoch, as well as the single pgf of the queue content at the end of the set-up time. We extract the joint distributions from these generating functions using the roots method and derive a simple algebraic relation between the probabilities at departure and arbitrary epochs. We also provide assorted numerical results to validate our proposed methodology and theoretical results. The impact of the system parameters on the performance measures is presented through tables and graphs. Finally, a cost optimization function is provided for the benefit of system designers.
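As a rough illustration of the system class analyzed above (and not of the paper's pgf-based solution), the following toy discrete-event simulation sketches a bulk-service queue with an N-policy and exponential set-up time; all parameters are assumed, and the batch is formed when set-up begins, a simplification.

```python
import random

# Toy discrete-event simulation of an N-policy bulk-service queue (batches of
# a..b customers, exponential set-up before reactivation). A sketch with
# assumed parameters, not the paper's analytical model.
random.seed(1)
lam, mu, setup_rate = 1.0, 0.5, 2.0   # group arrival, service, set-up rates (assumed)
a, b, N = 2, 5, 4                     # min/max batch size, wake-up threshold (assumed)

def start_service(t, q):
    """Take up to b waiting customers into service; return (new q, departure time)."""
    batch = min(q, b)
    return q - batch, t + random.expovariate(mu)

t, q = 0.0, 0
next_arr = random.expovariate(lam)
next_dep = float("inf")               # inf <=> server dormant
area, T_END = 0.0, 10_000.0
while True:
    t_next = min(next_arr, next_dep, T_END)
    area += q * (t_next - t)          # integrate the number waiting over time
    t = t_next
    if t >= T_END:
        break
    if t == next_arr:                 # group (bulk) arrival of 1-3 customers
        q += random.randint(1, 3)
        next_arr = t + random.expovariate(lam)
        if next_dep == float("inf") and q >= N:
            # N-policy: dormant server sets up, then serves a first batch
            q, next_dep = start_service(t + random.expovariate(setup_rate), q)
    else:                             # departure of the in-service batch
        if q >= a:
            q, next_dep = start_service(t, q)
        else:
            next_dep = float("inf")   # fewer than a waiting: go dormant

print(f"time-average number waiting ≈ {area / T_END:.2f}")
```

Varying N, a, b, or the set-up rate in such a simulation gives a quick sanity check on the trade-offs that the cost optimization in the paper formalizes.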
Citations: 0
Editorial: Special issue on Performance Analysis and Evaluation of Systems for Artificial Intelligence
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-12-13 | DOI: 10.1016/j.peva.2024.102465
Anshul Gandhi , Bo Jiang , Shaolei Ren
Citations: 0
Preface: Special issue on ITC 2023
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-12-04 | DOI: 10.1016/j.peva.2024.102462
Sara Alouf , Oliver Hohlfeld , Zhiyuan Jiang
Citations: 0
Foreword - Special Issue - MASCOTS 2023
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2025-01-03 | DOI: 10.1016/j.peva.2025.102467
Maria Carla Calzarossa , Anshul Gandhi
Citations: 0
Formal error bounds for the state space reduction of Markov chains
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-12-18 | DOI: 10.1016/j.peva.2024.102464
Fabian Michel, Markus Siegle
We study the approximation of a Markov chain on a reduced state space, for both discrete- and continuous-time Markov chains. In this context, we extend the existing theory of formal error bounds for the approximated transient distributions. In the discrete-time setting, we bound the stepwise increment of the error, and in the continuous-time setting, we bound the rate at which the error grows. In addition, the same error bounds can be applied to bound how far an approximated stationary distribution is from stationarity. As a special case, we consider aggregated (or lumped) Markov chains, where the state space reduction is achieved by partitioning the state space into macro states. Subsequently, we compare the error bounds with relevant concepts from the literature, such as exact and ordinary lumpability, as well as deflatability and aggregatability; these concepts impose stricter conditions than are necessary for the aggregation error to be zero. We also present possible algorithms for finding suitable aggregations for which the formal error bounds are low, and we analyze initial experiments with these algorithms on a range of different models.
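A minimal numerical sketch of the aggregation setting helps fix ideas. The 3-state chain, partition, and disaggregation weights below are assumptions for illustration, not the paper's bounds; the code tracks how far the transient distribution of the lumped chain drifts from the exact one.

```python
# Toy illustration of aggregation error for a 3-state DTMC: lump states {0, 1}
# into one macro state, keep state 2, and measure how far the lifted aggregated
# distribution drifts from the exact transient distribution (assumed example).
P = [[0.5, 0.3, 0.2],
     [0.4, 0.4, 0.2],
     [0.1, 0.1, 0.8]]
partition = [[0, 1], [2]]
w = [[0.5, 0.5], [1.0]]          # assumed weights used to disaggregate macro states

def step(dist, M):
    """One step of the chain: left-multiply the distribution by the matrix."""
    return [sum(dist[i] * M[i][j] for i in range(len(dist))) for j in range(len(M))]

# Aggregated matrix: P_hat[I][J] = sum_i w_I(i) * sum_{j in J} P[i][j]
P_hat = [[sum(w[I][k] * sum(P[i][j] for j in partition[J])
              for k, i in enumerate(partition[I]))
          for J in range(len(partition))] for I in range(len(partition))]

pi = [1.0, 0.0, 0.0]             # exact distribution, start in state 0
pi_hat = [1.0, 0.0]              # aggregated distribution, start in macro state 0
errs = []
for n in range(5):
    pi = step(pi, P)
    pi_hat = step(pi_hat, P_hat)
    lifted = [pi_hat[0] * 0.5, pi_hat[0] * 0.5, pi_hat[1]]   # disaggregate with w
    err = 0.5 * sum(abs(x - y) for x, y in zip(pi, lifted))  # total-variation distance
    errs.append(err)
    print(f"step {n + 1}: TV error = {err:.4f}")
```

Because this chain is not exactly lumpable for the chosen partition, the per-step error is nonzero, which is exactly the quantity the paper's formal bounds control.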
Citations: 0
Enabling scalable and adaptive machine learning training via serverless computing on public cloud
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-11-06 | DOI: 10.1016/j.peva.2024.102451
Ahsan Ali , Xiaolong Ma , Syed Zawad , Paarijaat Aditya , Istemi Ekin Akkus , Ruichuan Chen , Lei Yang , Feng Yan
In today’s production machine learning (ML) systems, models are continuously trained, improved, and deployed. ML design and training are becoming a continuous workflow of various tasks with dynamic resource demands. Serverless computing is an emerging cloud paradigm that provides transparent resource management and scaling for users and has the potential to revolutionize the routine of ML design and training. However, hosting modern ML workflows on existing serverless platforms poses non-trivial challenges due to their intrinsic design limitations, such as their stateless nature, limited communication support across function instances, and limited function execution duration. These limitations result in a lack of an overarching view and adaptation mechanism for training dynamics, and an amplification of existing problems in ML workflows.
To address the above challenges, we propose SMLT, an automated, scalable, and adaptive serverless framework on public cloud that enables efficient and user-centric ML design and training. SMLT employs an automated and adaptive scheduling mechanism to dynamically optimize deployment and resource scaling for ML tasks during training. SMLT further enables user-centric ML workflow execution by supporting user-specified training deadlines and budget limits. In addition, by providing an end-to-end design, SMLT solves intrinsic problems of public cloud serverless platforms, such as communication overhead, limited function execution duration, and the need for repeated initialization, and it also provides explicit fault tolerance for ML training. SMLT is open-sourced and compatible with all major ML frameworks. Our experimental evaluation with large, sophisticated modern ML models demonstrates that SMLT outperforms state-of-the-art VM-based systems and existing public cloud serverless ML training frameworks in both training speed (up to 8×) and monetary cost (up to 3×).
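To illustrate the kind of deadline- and budget-aware scaling decision described above, here is a hypothetical sketch. The speedup model, prices, and function name are assumptions for illustration only, not SMLT's actual scheduling algorithm.

```python
# Hypothetical sketch of deadline/budget-aware scaling (not SMLT's algorithm):
# pick the cheapest worker count whose estimated training time meets the
# deadline, assuming near-linear speedup with a per-worker communication cost.
def pick_workers(total_work, deadline, budget, price_per_sec, comm_overhead,
                 max_workers=256):
    best = None
    for n in range(1, max_workers + 1):
        t = total_work / n + comm_overhead * n      # simple speedup model (assumed)
        cost = t * n * price_per_sec                # pay for n workers for t seconds
        if t <= deadline and cost <= budget:
            if best is None or cost < best[1]:
                best = (n, cost, t)
    return best   # (workers, cost, time) or None if infeasible

print(pick_workers(total_work=3600, deadline=300, budget=50,
                   price_per_sec=0.01, comm_overhead=0.5))
```

Under this toy model the communication term makes total cost grow with the worker count, so the cheapest feasible choice is the smallest one that still meets the deadline.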
Citations: 0
FedCust: Offloading hyperparameter customization for federated learning
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-11-16 | DOI: 10.1016/j.peva.2024.102450
Syed Zawad , Xiaolong Ma , Jun Yi , Cheng Li , Minjia Zhang , Lei Yang , Feng Yan , Yuxiong He
Federated Learning (FL) is a new machine learning paradigm that enables training models collaboratively across clients without sharing private data. In FL, data is non-uniformly distributed among clients (i.e., data heterogeneity) and, due to privacy constraints, cannot be redistributed or monitored as in conventional machine learning. Such data heterogeneity and privacy requirements bring new challenges for hyperparameter optimization: the training dynamics vary across clients even within the same training round, and they are difficult to measure due to privacy. The state of the art in hyperparameter customization can greatly improve FL model accuracy but also incurs significant computing overhead and power consumption on client devices and slows down the training process. To address this prohibitively expensive cost, we explore the possibility of offloading hyperparameter customization to servers. We propose FedCust, a framework that offloads the expensive hyperparameter customization cost from client devices to the central server without violating privacy constraints. Our key discovery is that it is not necessary to customize hyperparameters for every client; clients with similar data heterogeneity can use the same hyperparameters and achieve good training performance. We propose heterogeneity measurement metrics for clustering clients into groups such that clients within the same group share hyperparameters. FedCust uses proxy data from the initial model design to emulate different heterogeneity groups and performs hyperparameter customization on the server side without accessing client data or information. To make hyperparameter customization scalable, FedCust further employs a Bayesian-strengthened tuner to significantly accelerate the customization speed. Extensive evaluation demonstrates that FedCust achieves up to 7/2/4/4/6% better accuracy than the widely adopted one-size-fits-all approach on the popular FL benchmarks FEMNIST, Shakespeare, Cifar100, Cifar10, and Fashion-MNIST, respectively, while being scalable and reducing computation, memory, and energy consumption on client devices, without compromising privacy constraints.
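The grouping idea can be illustrated with a stand-in heterogeneity metric. The label-distribution distance, threshold, and greedy grouping below are hypothetical choices for illustration, not the paper's actual metrics or clustering method.

```python
# Hypothetical sketch of the grouping idea: cluster clients by the distance
# between their label distributions (a stand-in heterogeneity metric).
def label_dist(labels, num_classes):
    """Empirical class distribution of a client's label set."""
    counts = [0] * num_classes
    for y in labels:
        counts[y] += 1
    return [c / len(labels) for c in counts]

def tv(p, q):
    """Total-variation distance between two distributions."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

clients = {                        # toy label sets for 4 clients, 3 classes (assumed)
    "c1": [0, 0, 0, 1], "c2": [0, 0, 1, 1],
    "c3": [2, 2, 2, 1], "c4": [2, 2, 1, 1],
}
dists = {k: label_dist(v, 3) for k, v in clients.items()}

# Greedy grouping: a client joins the first group whose representative is close.
groups, threshold = [], 0.3
for name, d in dists.items():
    for g in groups:
        if tv(dists[g[0]], d) <= threshold:
            g.append(name)
            break
    else:
        groups.append([name])
print(groups)   # clients in the same group would share tuned hyperparameters
```

Here clients c1/c2 (skewed toward class 0) land in one group and c3/c4 (skewed toward class 2) in another, so only two hyperparameter searches would be needed instead of four.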
Citations: 0
Enabling grant-free multiple access through Successive Interference Cancellation
IF 1.0 | CAS Tier 4 (Computer Science) | Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE | Pub Date: 2025-03-01 | Epub Date: 2024-12-01 | DOI: 10.1016/j.peva.2024.102460
Asmad Bin Abdul Razzaque, Andrea Baiocchi
The Internet of Things (IoT) is stirring up a surge of interest in effective methods for sharing communication channels among nodes that transmit sporadic, short messages. These messages are often related to control systems that collect sensor data to drive process actuation, as in industry, autonomous vehicles, and environmental control. The traditional approaches that dominate wireless and cellular communications prove most effective when dealing with a limited number of concurrently active nodes sending relatively large volumes of data. We address a different scenario, where numerous nodes generate and transmit short messages according to non-periodic schedules. In such cases, random multiple access becomes the typical approach for sharing the communication channel. We propose a general modeling framework that enables the investigation of the impact of Successive Interference Cancellation (SIC) on two of the main random access paradigms, namely Slotted ALOHA (SA) and Carrier-Sense Multiple Access (CSMA). The key varying parameter is the target Signal-to-Interference-plus-Noise Ratio (SINR) at the receiver, directly tied to the spectral efficiency of the adopted coding and modulation scheme. Two different regimes are highlighted that bring the system to work at relative maxima of the sum-rate. We further investigate the impact of different transmission power settings and imperfect interference cancellation. Leveraging the insight gained in the saturated-node scenario, an adaptive algorithm is defined for the dynamic case, where the number of backlogged nodes varies over time. The numerical results provide evidence of a significant potential for grant-free multiple access, calling for practical algorithms to translate this promise into feasible realizations.
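A slot-level toy simulation conveys the SIC decoding loop described above: in each slot the receiver repeatedly decodes the strongest remaining signal whenever its SINR clears the threshold, then cancels it. The traffic, fading, and SINR parameters below are assumed, and this is not the paper's analytical model.

```python
import random

# Toy sketch of Slotted ALOHA with Successive Interference Cancellation:
# active users' received powers are drawn from an exponential (Rayleigh-like
# fading) law; the receiver peels off signals strongest-first (assumed setup).
random.seed(7)
n_users, p_tx, theta, noise = 20, 0.1, 1.0, 0.01   # assumed parameters
SLOTS = 20_000

decoded_total = 0
for _ in range(SLOTS):
    # each user transmits with probability p_tx; draw powers for active users
    powers = [random.expovariate(1.0) for _ in range(n_users)
              if random.random() < p_tx]
    powers.sort(reverse=True)
    while powers:
        strongest = powers[0]
        interference = sum(powers[1:])
        if strongest / (interference + noise) >= theta:
            decoded_total += 1
            powers.pop(0)          # perfect cancellation of the decoded signal
        else:
            break                  # no further signal is decodable this slot

print(f"throughput ≈ {decoded_total / SLOTS:.3f} packets/slot")
```

Raising theta (a higher-rate coding scheme) or modeling residual power after cancellation makes decoding stop earlier in each slot, which is the imperfect-cancellation effect the abstract mentions.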
Performance Evaluation, Volume 167, Article 102460.
Citations: 0
Lure: A simulator for networks of batteryless intermittent nodes 诱惑无电池间歇节点网络模拟器
IF 1 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-11-01 Epub Date: 2024-08-23 DOI: 10.1016/j.peva.2024.102440
Mathew L. Wymore, Rohit Sahu, Thomas Ruminski, Vishal Deep, Morgan Ambourn, Gregory Ling, Vishak Narayanan, William Asiedu, Daji Qiao, Henry Duwe

The emerging paradigm of batteryless intermittent sensor networks (BISNs) presents new challenges for researchers of low-power wireless systems and protocols. The nature of these challenges exacerbates the difficulty of evaluating networks of physical sensor nodes, making simulation an even more important component in evaluating performance metrics, such as communication throughput and delay, for BISN designs. To our knowledge, existing simulators and analytical models do not meet the unique needs of BISN research; therefore, we have created a new open-source BISN simulator named Lure. Lure is designed from the ground up for simulation of batteryless intermittent systems and networks. Written in Python, Lure is powerful, flexible, highly configurable, and supports rapid prototyping of new protocols, systems, and applications, with a low learning curve. In this paper, we present Lure and validate it with experimental data to show that Lure can accurately reflect the reality of BISNs. We then demonstrate the process of applying Lure to research questions in select case studies.
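The charge–wake–transmit cycle that makes BISN behavior hard to evaluate can be illustrated with a toy discrete-event model in plain Python. This is not Lure's API — the class, its parameters, and the energy model below are all invented for illustration.

```python
import heapq

class IntermittentNode:
    """Toy batteryless node: a capacitor charges from a harvester and the
    node wakes to transmit only when stored energy crosses a threshold."""
    def __init__(self, harvest_rate, cap_size, on_threshold, tx_cost):
        self.harvest_rate = harvest_rate    # energy units per second
        self.cap_size = cap_size            # capacitor capacity
        self.on_threshold = on_threshold    # energy needed to wake
        self.tx_cost = tx_cost              # energy spent per transmission
        self.energy = 0.0

    def time_to_wake(self):
        deficit = max(0.0, self.on_threshold - self.energy)
        return deficit / self.harvest_rate

    def wake_and_transmit(self):
        self.energy = min(self.cap_size, self.on_threshold)
        self.energy -= self.tx_cost  # transmit, then go dark again

def simulate(nodes, horizon):
    """Event-driven loop: count packets each node emits within `horizon` s."""
    events = [(n.time_to_wake(), i) for i, n in enumerate(nodes)]
    heapq.heapify(events)
    counts = [0] * len(nodes)
    while events:
        t, i = heapq.heappop(events)
        if t > horizon:
            break  # heap is time-ordered, so all later events also exceed horizon
        nodes[i].wake_and_transmit()
        counts[i] += 1
        heapq.heappush(events, (t + nodes[i].time_to_wake(), i))
    return counts
```

Even this sketch shows why BISN evaluation differs from conventional sensor-network simulation: packet timing is driven by energy arrival, not by a protocol schedule, so throughput and delay depend on harvesting parameters.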

Performance Evaluation, Volume 166, Article 102440. Open-access PDF: https://www.sciencedirect.com/science/article/pii/S0166531624000452/pdfft?md5=1c6343234e3ac7dad5efd12075fa6bfd&pid=1-s2.0-S0166531624000452-main.pdf
Citations: 0