Performance Evaluation: Latest Articles (Page 5)

A dependence graph pattern mining method for processor performance analysis

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-02-28 DOI: 10.1016/j.peva.2024.102409

Yawen Zheng , Chenji Han , Tingting Zhang , Fuxin Zhang , Jian Wang

As the complexity of processor microarchitecture and applications increases, obtaining performance optimization knowledge, such as critical dependent chains, becomes more challenging. To tackle this issue, this paper employs pattern mining methods to analyze the critical path of processor micro-execution dependence graphs. We propose a high average utility pattern mining algorithm called Dependence Graph Miner (DG-Miner) based on the characteristics of dependence graphs. DG-Miner overcomes the limitations of current pattern mining algorithms for dependence graph pattern mining by offering support for variable utility, candidate generation using endpoint matching, the adjustable upper bound, and the concise pattern judgment mechanism. Experiments reveal that, compared with existing upper bound candidate generation methods, the adjustable upper bound reduces the number of candidate patterns by 28.14% and the running time by 27% on average. The concise pattern judgment mechanism enhances the conciseness of mining results by 16.31% and reduces the running time by 39.82%. Furthermore, DG-Miner aids in identifying critical dependent chains, critical program regions, and performance exceptions.

Volume 164, Article 102409.

Citations: 0

Accelerating multi-tier storage cache simulations using knee detection

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-02-27 DOI: 10.1016/j.peva.2024.102410

Tyler Estro , Mário Antunes , Pranav Bhandari , Anshul Gandhi , Geoff Kuenning , Yifei Liu , Carl Waldspurger , Avani Wildani , Erez Zadok

Storage cache hierarchies include diverse topologies, assorted parameters and policies, and devices with varied performance characteristics. Simulation enables efficient exploration of their configuration space while avoiding expensive physical experiments. Miss Ratio Curves (MRCs) efficiently characterize the performance of a cache over a range of cache sizes, revealing “key points” for cache simulation, such as knees in the curve that immediately follow sharp cliffs. Unfortunately, there are no automated techniques for efficiently finding key points in MRCs, and the cross-application of existing knee-detection algorithms yields inaccurate results.

We present a multi-stage framework that identifies key points in any MRC, for both stack-based (e.g., LRU) and more sophisticated eviction algorithms (e.g., ARC). Our approach quickly locates candidates using efficient hash-based sampling, curve simplification, knee detection, and novel post-processing filters. We introduce Z-Method, a new multi-knee detection algorithm that employs statistical outlier detection to choose promising points robustly and efficiently.

We evaluated our framework against seven other knee-detection algorithms, identifying key points in multi-tier MRCs with both ARC and LRU policies for 106 diverse real-world workloads. Compared to naïve approaches, our framework reduced the total number of points needed to accurately identify the best two-tier cache hierarchies by an average factor of approximately $5.5 \times$ for ARC and $7.7 \times$ for LRU.

We also show how our framework can be used to seed the initial population for evolutionary algorithms. We ran 32,616 experiments requiring over three million cache simulations, on 151 samples, from three datasets, using a diverse set of population initialization techniques, evolutionary algorithms, knee-detection algorithms, cache replacement algorithms, and stopping criteria. Our results showed an overall acceleration rate of 34% across all configurations.
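To make the notions of a miss ratio curve and a knee that follows a cliff concrete, here is a minimal Python sketch. It assumes an exact LRU curve built from Mattson stack distances (no sampling) and uses a plain z-score test on the drops between consecutive cache sizes. That test is only an illustrative stand-in and is not the authors' Z-Method; the function names and the threshold value are hypothetical.

```python
from collections import OrderedDict
from statistics import mean, pstdev


def lru_miss_ratio_curve(trace, max_size):
    """Miss ratio of an LRU cache for every size 1..max_size (exact, unsampled)."""
    stack = OrderedDict()                      # least recently used key comes first
    hits_at_distance = [0] * (max_size + 1)
    for key in trace:
        if key in stack:
            # Stack distance: position of the key counted from the most recent end.
            distance = len(stack) - list(stack).index(key)
            if distance <= max_size:
                hits_at_distance[distance] += 1
            stack.move_to_end(key)
        else:
            stack[key] = True                  # first access is always a miss
    curve, hits = [], 0
    for size in range(1, max_size + 1):
        hits += hits_at_distance[size]         # a cache of this size hits all shorter distances
        curve.append(1 - hits / len(trace))
    return curve


def knees_after_cliffs(curve, z_threshold=1.5):
    """Cache sizes whose drop from the previous size is a statistical outlier."""
    drops = [curve[i - 1] - curve[i] for i in range(1, len(curve))]
    mu, sigma = mean(drops), pstdev(drops)
    if sigma == 0:
        return []
    # drops[j] separates size j + 1 from size j + 2, so the knee sits at size j + 2.
    return [j + 2 for j, d in enumerate(drops) if (d - mu) / sigma > z_threshold]


trace = list("abcabcabcabc")                   # cyclic access to three objects
curve = lru_miss_ratio_curve(trace, max_size=5)
knees = knees_after_cliffs(curve)
```

For this cyclic trace the curve stays at 1.0 until the cache holds all three objects and then falls to 0.25, so the only size flagged is 3. A simulation campaign would then evaluate cache sizes near the flagged points rather than every size.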

Volume 164, Article 102410.

Citations: 0

Age of information minimization in hybrid cognitive radio networks under a timely throughput constraint

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-02-23 DOI: 10.1016/j.peva.2024.102407

Mohamed F. El-Sherif , Sherif I. Rabia , Ahmed H. Abd El-Malek , W.K. Zahra

Exchanging time-critical information is prevalent in various industrial applications where low latency and timely delivery are paramount. Through this work, we consider a cognitive radio network comprised of multiple secondary users with time-sensitive traffic, and they can access the licensed channel under the hybrid interweave/underlay scheme to enhance spectrum utilization. Traffic in the secondary system is divided into two distinct categories: deadline-constrained data and status updates. Quality of service of data with expiration time, such as multimedia streams, is assessed through the timely throughput metric. However, the age of information metric is used to characterize the freshness of the status update packets, which is vital in several emerging applications. Within an interference constraint imposed by the primary user, a dynamic scheduling policy is proposed to optimize the weighted sum of the average age of information of the status update users under a strict timely throughput requirement for each user with deadline-constrained traffic. We formulate the optimization problem as a constrained Markov decision process. Then, through the drift-plus-penalty method, the problem is reduced into a series of unconstrained Markov decision problems. Finally, each subproblem is tackled using the backward dynamic programming technique. Simulation results illustrate the effect of the main system parameters, such as the PU transmitted power and transmission rate level, on the performance of the secondary system. Moreover, the model feasibility regarding the fulfillment of the constraints against PU activity is experimentally investigated under the proposed hybrid mode and classical interweave mode. The performance of the proposed policy is compared to two other low-complexity scheduling schemes, which ensure the satisfaction of the constraints; results show the performance superiority of our proposed policy.
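The age of information metric in the abstract above can be illustrated with a small Python sketch. It assumes a slotted model in which the age grows by one per slot and, when an update is delivered, resets to the time elapsed since that update was generated. The slot convention, the function names, and the example numbers are assumptions for illustration; the scheduling policy itself is not reproduced here.

```python
def average_age(deliveries, horizon):
    """Time-average age over slots 1..horizon.

    deliveries maps a delivery slot to the generation slot of the update
    received in it (hypothetical input format).
    """
    age, total = 0, 0
    for slot in range(1, horizon + 1):
        if slot in deliveries:
            age = slot - deliveries[slot]      # reset to the age of the fresh update
        else:
            age += 1                           # no delivery: information gets one slot older
        total += age
    return total / horizon


def weighted_average_age(per_user_deliveries, weights, horizon):
    """Weighted sum of per-user average ages, the kind of objective named in the abstract."""
    return sum(w * average_age(d, horizon)
               for d, w in zip(per_user_deliveries, weights))


user_a = {2: 1, 5: 4}                          # delivered in slots 2 and 5, one slot after generation
user_b = {}                                    # never scheduled
objective = weighted_average_age([user_a, user_b], weights=[2, 1], horizon=6)
```

Here user A's age follows 1, 1, 2, 3, 1, 2 (average 10/6), while the unscheduled user B ages linearly (average 3.5), which shows why a scheduler has to balance the status-update users against each other.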

Volume 164, Article 102407.

Citations: 0

Average delay analysis of soft deadline messages scheduled in the dynamic segment of FlexRay protocol

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-01-30 DOI: 10.1016/j.peva.2024.102404

Sunil Kumar P.R. , Vinod V. , Manjunath A.S.

The FlexRay communication protocol provides high bandwidth for supporting both hard deadline and soft deadline traffic in in-vehicle communication networks. In this paper, we carry out delay analysis of soft deadline traffic which is handled by the dynamic segment of FlexRay. We model the arrival of these messages as Poisson processes, and use queuing theory to evaluate the average delay that they experience. Initially, we consider three nodes competing for service, assuming that two out of three can transmit messages in any FlexRay cycle and obtain expressions for the evolution of the corresponding queues. We also determine the range of message arrival rates for which the queues are stable. These results are then extended to the general case of N queues. The analytical results are compared with those obtained by simulation for a typical system.
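The three-node setting described above can be sketched as a queue recursion in Python. The sketch assumes one message per transmitting node per cycle, service in fixed priority order (lower index first), and arrivals that join at the end of the cycle. These modelling choices and all names are assumptions made for illustration, not the paper's exact model, and the arrival counts are supplied as a fixed trace rather than drawn from a Poisson process.

```python
def flexray_cycle(queues, arrivals, slots_per_cycle=2):
    """Advance the per-node queue lengths by one communication cycle."""
    served = 0
    next_queues = []
    for backlog, arrived in zip(queues, arrivals):
        # Nodes are visited in priority order; an empty node passes its turn on.
        if backlog > 0 and served < slots_per_cycle:
            backlog -= 1
            served += 1
        next_queues.append(backlog + arrived)
    return next_queues


def mean_backlog(arrival_trace, n_nodes=3, slots_per_cycle=2):
    """Average total number of queued messages at the end of each cycle."""
    queues = [0] * n_nodes
    total = 0
    for arrivals in arrival_trace:
        queues = flexray_cycle(queues, arrivals, slots_per_cycle)
        total += sum(queues)
    return total / len(arrival_trace)


trace = [[1, 1, 1], [0, 1, 0], [0, 0, 0]]      # arrivals per node in three consecutive cycles
backlog = mean_backlog(trace)
```

In a stable system, Little's law relates the mean backlog to the mean delay (mean delay equals mean backlog divided by the total arrival rate), which is one way to cross-check analytical delay expressions against a simulation of this kind.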

Citations: 0

On the sensitivity of stationary solutions of Markov regenerative processes

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-01-29 DOI: 10.1016/j.peva.2024.102397

Junjun Zheng , Hiroyuki Okamura , Tadashi Dohi

Markov regenerative process (MRGP) is favored for modeling and evaluating system dependability due to its high power and flexibility. However, its analysis presents challenges because of its inherent renewal nature. The embedded Markov chain (EMC) method offers a stationary solution to the MRGP, while the phase expansion approach delivers both stationary and transient solutions. From these solutions, one can derive performance or dependability measures as outputs from the MRGP model. It is crucial to conduct a sensitivity analysis on MRGP to understand the influence of input factor changes on model outputs, aiding efficient system optimization. Yet, a clear analytical method for sensitivity analysis of MRGP models is currently lacking. Filling this gap, this paper introduces an analytical approach to assess parametric sensitivity for steady-state MRGP, utilizing the EMC method for obtaining the stationary solution. Specifically, since system availability closely correlates with the average system available duration, this paper also shifts its focus from mere model parameters to representative values, like the average available time of a system.
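As a minimal illustration of sensitivity with respect to a representative value such as the average available time, consider the simplest dependability model, a system that alternates between up periods of mean length $U$ and down periods of mean length $D$. This two-state example and the symbols $U$, $D$, and $A$ are assumptions chosen for illustration; it is not the paper's MRGP derivation.

```latex
% Steady-state availability of an alternating up/down system
A = \frac{U}{U + D}

% Sensitivity to the mean up time and to the mean down time
\frac{\partial A}{\partial U} = \frac{D}{(U + D)^{2}}, \qquad
\frac{\partial A}{\partial D} = -\frac{U}{(U + D)^{2}}

% Worked numbers: U = 99 h, D = 1 h
A = 0.99, \qquad
\frac{\partial A}{\partial U} = 10^{-4}\ \mathrm{h}^{-1}, \qquad
\frac{\partial A}{\partial D} = -9.9 \times 10^{-3}\ \mathrm{h}^{-1}
```

In this toy case an extra hour of repair time costs about 99 times as much availability as an extra hour of up time gains, which is the kind of comparison a parametric sensitivity analysis is meant to expose.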

Volume 164, Article 102397.

Citations: 0

Towards the optimal dynamic size-aware dispatching

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2024-01-26 DOI: 10.1016/j.peva.2024.102396

Esa Hyytiä , Rhonda Righter

In this paper, we study dispatching systems that appear in manufacturing, service, healthcare systems, as well as, in various information, communication and computer systems. Such systems comprise a dispatcher and a pool of parallel servers, to which jobs are assigned upon arrival. A common objective is to minimize the mean waiting or response time. In large systems, due to the state-space explosion and scalability reasons, it is impossible to utilize full state information of the system. We therefore consider systems with a small number of servers, and assume that the job sizes become known upon arrival. In such settings, it is plausible to carefully evaluate each server for every new job. First we study a system with a Poisson arrival process, and derive Bellman equations. Then we generalize to the case with general i.i.d. inter-arrival times. The Bellman equations are essentially functional equations that can be solved numerically via value iteration. From their solutions, the optimal dispatching policy and corresponding mean performance can be determined. Our solution framework is illustrated with examples, which show that significant performance gains compared to popular heuristic policies are available in our setting.
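The dispatching setting above (parallel servers, job sizes known on arrival) can be sketched in Python to show what a heuristic baseline looks like. The sketch assumes first-come-first-served servers tracked by their unfinished work, and compares two common heuristics, least-work-left and round-robin. The choice of these two policies, the function names, and the job trace are assumptions for illustration; the optimal policy obtained from the Bellman equations is not reproduced here.

```python
def mean_waiting_time(jobs, n_servers, choose_server):
    """Mean waiting time for jobs given as (arrival_time, size), sorted by arrival."""
    backlog = [0.0] * n_servers                # unfinished work at each server
    clock, total_wait = 0.0, 0.0
    for index, (arrival, size) in enumerate(jobs):
        elapsed = arrival - clock
        backlog = [max(0.0, u - elapsed) for u in backlog]   # servers drain at unit rate
        clock = arrival
        server = choose_server(index, size, backlog)
        total_wait += backlog[server]          # FCFS: wait equals work already queued
        backlog[server] += size
    return total_wait / len(jobs)


def least_work_left(index, size, backlog):
    """State-aware heuristic: send the job to the server with the least unfinished work."""
    return min(range(len(backlog)), key=backlog.__getitem__)


def round_robin(index, size, backlog):
    """State-oblivious heuristic: cycle through the servers."""
    return index % len(backlog)


jobs = [(0.0, 5.0), (0.0, 1.0), (1.0, 1.0), (1.0, 1.0)]
wait_lwl = mean_waiting_time(jobs, 2, least_work_left)
wait_rr = mean_waiting_time(jobs, 2, round_robin)
```

On this trace round-robin places a short job behind the size-5 job and yields a mean wait of 1.0, while least-work-left yields 0.25. A policy derived by value iteration would be evaluated against baselines of this kind.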

Volume 164, Article 102396.

Citations: 0

Decentralized fault tolerant source localization without sensor parameters in wireless sensor networks

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2023-12-08 DOI: 10.1016/j.peva.2023.102395

Akram Hussain, Yuan Luo

In this paper, we study the source (event) localization problem in decentralized wireless sensor networks (WSNs) under faulty sensor nodes without knowledge of the sensor parameters. Source localization has many applications, such as localizing WiFi hotspots and mobile users. Some works in the literature localize the source by utilizing the knowledge or estimates of the fault probability of each sensor node or the region of influence of the source. However, this paper proposes two approaches: the hitting set and feature selection for estimating the source location without any knowledge of the sensor parameters under faulty sensor nodes in WSN. The proposed approaches provide better or comparable source localization performances. For the hitting set approach, we also derive a lower bound on the required number of samples. In addition, we extend the proposed methods for localizing multiple sources. Finally, we provide extensive simulations to illustrate the performances of the proposed methods against the centroid, maximum likelihood (ML), fault-tolerant ML (FTML), and subtract on negative add on positive (SNAP) estimators. The proposed approaches significantly outperform the centroid and maximum likelihood estimators for faulty sensor nodes while providing comparable or better performance to FTML or SNAP algorithm. In addition, we use real-world WiFi data set to localize the source in comparison to the support vector machine based estimator in the literature, where the proposed methods outperformed the estimator.
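The centroid estimator named above as a baseline is simple enough to sketch in Python, and doing so shows why faulty nodes are a problem. The sketch assumes binary sensors with known 2-D positions and estimates the source as the mean position of the sensors that raise an alarm. The sensor layout and function name are hypothetical, and the paper's hitting-set and feature-selection methods are not reproduced here.

```python
def centroid_estimate(sensor_positions, alarms):
    """Mean position of alarmed sensors, or None when no sensor reports the event."""
    active = [pos for pos, alarmed in zip(sensor_positions, alarms) if alarmed]
    if not active:
        return None
    n = len(active)
    return (sum(x for x, _ in active) / n, sum(y for _, y in active) / n)


# Four sensors around a source near (1, 1), plus one distant sensor.
positions = [(0, 0), (2, 0), (0, 2), (2, 2), (10, 10)]
healthy = centroid_estimate(positions, [1, 1, 1, 1, 0])
faulty = centroid_estimate(positions, [1, 1, 1, 1, 1])   # distant sensor raises a false alarm
```

A single false alarm from the distant sensor moves the estimate from (1.0, 1.0) to (2.8, 2.8), which is the sensitivity that fault-tolerant estimators aim to reduce.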

Volume 163, Article 102395. PDF: https://www.sciencedirect.com/science/article/pii/S0166531623000652/pdfft?md5=917546a42fc91c4d2235d2f09f3e4318&pid=1-s2.0-S0166531623000652-main.pdf

Citations: 0

Two families of indexable partially observable restless bandits and Whittle index computation

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2023-12-02 DOI: 10.1016/j.peva.2023.102394

Nima Akbarzadeh, Aditya Mahajan

We consider the restless bandits with general finite state space under partial observability with two observational models: first, the state of each bandit is not observable at all, and second, the state of each bandit is observable when it is selected. Under the assumption that the models satisfy a restart property, we prove that both models are indexable. For the first model, we derive a closed-form expression for the Whittle index. For the second model, we propose an efficient algorithm to compute the Whittle index by exploiting the qualitative properties of the optimal policy. We present detailed numerical experiments for multiple instances of machine maintenance problem. The result indicates that the Whittle index policy outperforms myopic policy and can be close to optimal in different setups.

Citations: 0

The effect of network delays on Distributed Ledgers based on Directed Acyclic Graphs: A mathematical model

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2023-11-25 DOI: 10.1016/j.peva.2023.102392

Navdeep Kumar , Alexandre Reiffers-Masson , Isabel Amigo , Santiago Ruano Rincón

We present a new stochastic model for the evolution of Directed Acyclic Graphs (DAG)-based distributed ledgers (DL), under the presence of heterogeneous delay. This model is used to analyse the performance metrics of the DL, showing in particular that the number of unapproved messages, in expectation, does not diverge to infinity, even under the presence of delay. We propose an analysis based on conveniently defined sets, as well as an alternative drift-based analysis. The former allows to get a bound on the average number of unapproved messages, while the latter, through a simpler analysis, allows to prove the existence of such bound. For particular scenarios, we are able to derive the expected value of the drift of unapproved messages, through a Markov process-based approach. State-of-the-art mathematical models trying to capture the impact of delays on the performance of such DLs rely on some particular simplifications. In contrast, through our model, we are able to analytically derive similar performance guarantees, in a more realistic setup. In particular, we focus on IOTA foundation’s tangle, while our results can be extended to other DAG-based distributed ledgers. We compare our results to results obtained in a real testbed, showing good accordance between them.

Volume 163, Article 102392. PDF: https://www.sciencedirect.com/science/article/pii/S0166531623000627/pdfft?md5=418849af0d02f48109fd2defe85bc15f&pid=1-s2.0-S0166531623000627-main.pdf

Citations: 0

Editorial: Special issue on IFIP performance 2023

IF 2.2, Zone 4 (Computer Science), Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Performance Evaluation

Pub Date : 2023-11-19 DOI: 10.1016/j.peva.2023.102393

Citations: 0