基于连接和基于负载的网络系统故障的弹性

2022 Resilience Week (RWS) Pub Date : 2022-09-26 DOI:10.1109/RWS55399.2022.9984022

W. Al-Aqqad, Hassan S. Hayajneh, Xuewei Zhang

{"title":"基于连接和基于负载的网络系统故障的弹性","authors":"W. Al-Aqqad, Hassan S. Hayajneh, Xuewei Zhang","doi":"10.1109/RWS55399.2022.9984022","DOIUrl":null,"url":null,"abstract":"A general modeling and simulation framework is developed in this work to quantitatively evaluate the resilience of networked systems under two types of failures: connectivity- and load-based. Two newly designed dynamic healing mechanisms are demonstrated. The model considers concurrent cascading failure and healing processes on networks. The discrete-time simulations generate system trajectories, i.e., number of failed nodes at each time step. The 95% recovery time is used as the resilience metric to evaluate and compare the healing performance. Based on two real-world networks, the dependence of system trajectories and resilience metric on various model parameters is explored. If the triggering level (fraction of inactive nodes when healing starts) is too high, the system would either undergo a very slow recovery or never recover to a satisfactory level at all. However, this work provides a counter example to the intuition that the smaller the triggering level, the shorter the recovery time. While low budgets (number of nodes allowed to recover at each time step) lead to prolonged or unsuccessful recovery, it appears that the resilience metric converges to a limit when budget is raised to high enough. This may have practical implications, as node recovery requires resources and a budget too high or too low would be wasteful. This works lays the foundation for subsequent studies on more complex mechanisms and processes on the networks, optimization of model parameters for maximum resilience, as well as applications to more real-world scenarios.","PeriodicalId":170769,"journal":{"name":"2022 Resilience Week (RWS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Resilience of Networked Systems under Connectivity-Based and Load-Based Failures\",\"authors\":\"W. Al-Aqqad, Hassan S. Hayajneh, Xuewei Zhang\",\"doi\":\"10.1109/RWS55399.2022.9984022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A general modeling and simulation framework is developed in this work to quantitatively evaluate the resilience of networked systems under two types of failures: connectivity- and load-based. Two newly designed dynamic healing mechanisms are demonstrated. The model considers concurrent cascading failure and healing processes on networks. The discrete-time simulations generate system trajectories, i.e., number of failed nodes at each time step. The 95% recovery time is used as the resilience metric to evaluate and compare the healing performance. Based on two real-world networks, the dependence of system trajectories and resilience metric on various model parameters is explored. If the triggering level (fraction of inactive nodes when healing starts) is too high, the system would either undergo a very slow recovery or never recover to a satisfactory level at all. However, this work provides a counter example to the intuition that the smaller the triggering level, the shorter the recovery time. While low budgets (number of nodes allowed to recover at each time step) lead to prolonged or unsuccessful recovery, it appears that the resilience metric converges to a limit when budget is raised to high enough. This may have practical implications, as node recovery requires resources and a budget too high or too low would be wasteful. This works lays the foundation for subsequent studies on more complex mechanisms and processes on the networks, optimization of model parameters for maximum resilience, as well as applications to more real-world scenarios.\",\"PeriodicalId\":170769,\"journal\":{\"name\":\"2022 Resilience Week (RWS)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Resilience Week (RWS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RWS55399.2022.9984022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Resilience Week (RWS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RWS55399.2022.9984022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在这项工作中，开发了一个通用的建模和仿真框架，以定量评估网络系统在两种类型故障下的弹性:基于连接和基于负载的故障。展示了两种新设计的动态愈合机制。该模型考虑了网络上并发级联故障和修复过程。离散时间模拟生成系统轨迹，即每个时间步长失效节点的数量。95%的恢复时间作为弹性指标来评估和比较愈合性能。基于两个真实网络，探讨了系统轨迹和弹性度量对不同模型参数的依赖关系。如果触发水平(恢复开始时不活动节点的比例)过高，系统要么会经历非常缓慢的恢复，要么根本无法恢复到令人满意的水平。然而，这项工作为直觉提供了一个反例，即触发电平越小，恢复时间越短。虽然低预算(每个时间步骤允许恢复的节点数量)会导致恢复时间延长或失败，但当预算提高到足够高时，弹性度量似乎会收敛到一个极限。这可能具有实际意义，因为节点恢复需要资源，预算过高或过低都会造成浪费。这项工作为后续研究更复杂的网络机制和过程、优化模型参数以获得最大弹性以及在更多现实场景中的应用奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Resilience of Networked Systems under Connectivity-Based and Load-Based Failures

A general modeling and simulation framework is developed in this work to quantitatively evaluate the resilience of networked systems under two types of failures: connectivity- and load-based. Two newly designed dynamic healing mechanisms are demonstrated. The model considers concurrent cascading failure and healing processes on networks. The discrete-time simulations generate system trajectories, i.e., number of failed nodes at each time step. The 95% recovery time is used as the resilience metric to evaluate and compare the healing performance. Based on two real-world networks, the dependence of system trajectories and resilience metric on various model parameters is explored. If the triggering level (fraction of inactive nodes when healing starts) is too high, the system would either undergo a very slow recovery or never recover to a satisfactory level at all. However, this work provides a counter example to the intuition that the smaller the triggering level, the shorter the recovery time. While low budgets (number of nodes allowed to recover at each time step) lead to prolonged or unsuccessful recovery, it appears that the resilience metric converges to a limit when budget is raised to high enough. This may have practical implications, as node recovery requires resources and a budget too high or too low would be wasteful. This works lays the foundation for subsequent studies on more complex mechanisms and processes on the networks, optimization of model parameters for maximum resilience, as well as applications to more real-world scenarios.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 Resilience Week (RWS)

自引率

0.00%

发文量