{"title":"The SoS conductor: Orchestrating resources with iterative agent‐based reinforcement learning","authors":"Qiliang Chen, Babak Heydari","doi":"10.1002/sys.21747","DOIUrl":null,"url":null,"abstract":"We introduce a novel resource management approach for Systems of Systems (SoS), utilizing hierarchical deep reinforcement learning, iterating with agent‐based simulation. A key innovation of this method is its ability to balance top‐down SoS management with the autonomy of individual systems. This is achieved by dynamically allocating resources to each system, thereby modifying the range of options they can autonomously choose from. This dynamic option adjustment is a powerful approach to managing the trade‐off between centralized efficiency and decentralized autonomous actions of the systems, enabling the SoS to maintain the systems' autonomy while ensuring efficient SoS governance. The method, validated through a case study, not only demonstrates the potential and efficacy of the learning framework but also reveals how, using this method, minor performance sacrifices can lead to substantial improvements in resource efficiency.","PeriodicalId":509213,"journal":{"name":"Systems Engineering","volume":"5 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/sys.21747","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We introduce a novel resource management approach for Systems of Systems (SoS), utilizing hierarchical deep reinforcement learning, iterating with agent‐based simulation. A key innovation of this method is its ability to balance top‐down SoS management with the autonomy of individual systems. This is achieved by dynamically allocating resources to each system, thereby modifying the range of options they can autonomously choose from. This dynamic option adjustment is a powerful approach to managing the trade‐off between centralized efficiency and decentralized autonomous actions of the systems, enabling the SoS to maintain the systems' autonomy while ensuring efficient SoS governance. The method, validated through a case study, not only demonstrates the potential and efficacy of the learning framework but also reveals how, using this method, minor performance sacrifices can lead to substantial improvements in resource efficiency.
我们利用分层深度强化学习和基于代理的模拟迭代,为系统的系统(SoS)引入了一种新的资源管理方法。这种方法的一个关键创新点是,它能够在自上而下的系统管理与单个系统的自主性之间取得平衡。这是通过为每个系统动态分配资源来实现的,从而修改了它们可以自主选择的选项范围。这种动态选项调整是管理集中式效率和分散式系统自主行动之间权衡的有力方法,使 SoS 既能保持系统的自主性,又能确保高效的 SoS 治理。该方法通过案例研究得到了验证,不仅展示了学习框架的潜力和功效,还揭示了使用这种方法,微小的性能牺牲如何能够带来资源效率的大幅提高。