首页 > 最新文献

ACM Transactions on Autonomous and Adaptive Systems最新文献

英文 中文
Dynamically Balancing Load with Overload Control for Microservices 利用微服务的过载控制动态平衡负载
IF 2.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-05 DOI: 10.1145/3676167
Ratnadeep Bhattacharya, Yuan Gao, Timothy Wood
The microservices architecture simplifies application development by breaking monolithic applications into manageable microservices. However, this distributed microservice “service mesh” leads to new challenges due to the more complex application topology. Particularly, each service component scales up and down independently creating load imbalance problems on shared backend services accessed by multiple components. Traditional load balancing algorithms do not port over well to a distributed microservice architecture where load balancers are deployed client-side. In this paper, we propose a self-managing load balancing system, BLOC, which provides consistent response times to users without using a centralized metadata store or explicit messaging between nodes. BLOC uses overload control approaches to provide feedback to the load balancers. We show that this performs significantly better in solving the incast problem in microservice architectures. A critical component of BLOC is the dynamic capacity estimation algorithm. We show that a well-tuned capacity estimate can outperform even Join-the-Shortest-Queue, a nearly optimal algorithm, while a reasonable dynamic estimate still outperforms Least Connection, a distributed implementation of Join-the-Shortest-Queue. Evaluating this framework, we found that BLOC improves the response time distribution range, between the 10th and 90th percentiles, by 2 to 4 times and the tail, 99th percentile, latency by 2 times.
微服务架构将单体应用程序分解为可管理的微服务,从而简化了应用程序的开发。然而,由于应用程序拓扑结构更为复杂,这种分布式微服务 "服务网 "带来了新的挑战。特别是,每个服务组件都会独立地向上和向下扩展,从而在多个组件访问的共享后端服务上造成负载不平衡问题。传统的负载平衡算法不能很好地移植到分布式微服务架构中,在这种架构中,负载平衡器部署在客户端。在本文中,我们提出了一种自管理负载平衡系统 BLOC,它无需使用集中式元数据存储或节点间的显式消息传递,就能为用户提供一致的响应时间。BLOC 使用过载控制方法向负载平衡器提供反馈。我们的研究表明,这种方法在解决微服务架构中的incast问题方面有明显的优势。BLOC 的一个关键组成部分是动态容量估算算法。我们的研究表明,经过良好调整的容量估计甚至可以优于Join-the-Shortest-Queue(一种近乎最优的算法),而合理的动态估计仍然优于Least Connection(一种Join-the-Shortest-Queue的分布式实现)。在对这一框架进行评估时,我们发现 BLOC 可将响应时间分布范围(第 10 个百分位数和第 90 个百分位数之间)提高 2 到 4 倍,将尾部(第 99 个百分位数)的延迟时间提高 2 倍。
{"title":"Dynamically Balancing Load with Overload Control for Microservices","authors":"Ratnadeep Bhattacharya, Yuan Gao, Timothy Wood","doi":"10.1145/3676167","DOIUrl":"https://doi.org/10.1145/3676167","url":null,"abstract":"The microservices architecture simplifies application development by breaking monolithic applications into manageable microservices. However, this distributed microservice “service mesh” leads to new challenges due to the more complex application topology. Particularly, each service component scales up and down independently creating load imbalance problems on shared backend services accessed by multiple components. Traditional load balancing algorithms do not port over well to a distributed microservice architecture where load balancers are deployed client-side. In this paper, we propose a self-managing load balancing system, BLOC, which provides consistent response times to users without using a centralized metadata store or explicit messaging between nodes. BLOC uses overload control approaches to provide feedback to the load balancers. We show that this performs significantly better in solving the incast problem in microservice architectures. A critical component of BLOC is the dynamic capacity estimation algorithm. We show that a well-tuned capacity estimate can outperform even Join-the-Shortest-Queue, a nearly optimal algorithm, while a reasonable dynamic estimate still outperforms Least Connection, a distributed implementation of Join-the-Shortest-Queue. Evaluating this framework, we found that BLOC improves the response time distribution range, between the 10th and 90th percentiles, by 2 to 4 times and the tail, 99th percentile, latency by 2 times.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141674272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introduction to ACSOS 2022 Special Issue ACSOS 2022 特刊简介
IF 2.2 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-02 DOI: 10.1145/3676168
E. Di Nitto, Ilias Gerostathopoulos, Kirstie Bellman
This special issue collects extended versions of four of the papers that have received the best scores during the review process of the 3rd IEEE International Conference on Autonomic Computing and Self- Organizing Systems (ACSOS 2022). In this introduction we are going to frame the papers in the general ACSOS context.
本特刊收集了在第三届电气和电子工程师学会自主计算与自组织系统国际会议(ACSOS 2022)评审过程中获得最佳评分的四篇论文的扩展版本。在导言中,我们将把这些论文放在 ACSOS 的大背景下进行介绍。
{"title":"Introduction to ACSOS 2022 Special Issue","authors":"E. Di Nitto, Ilias Gerostathopoulos, Kirstie Bellman","doi":"10.1145/3676168","DOIUrl":"https://doi.org/10.1145/3676168","url":null,"abstract":"This special issue collects extended versions of four of the papers that have received the best scores during the review process of the 3rd IEEE International Conference on Autonomic Computing and Self- Organizing Systems (ACSOS 2022). In this introduction we are going to frame the papers in the general ACSOS context.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.2,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141688118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IBAQ: Frequency-Domain Backdoor Attack Threatening Autonomous Driving via Quadratic Phase IBAQ:通过二次相位威胁自动驾驶的频域后门攻击
IF 2.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-06-19 DOI: 10.1145/3673904
Jinghan Qiu, Honglong Chen, Junjian Li, Yudong Gao, Junwei Li, Xingang Wang

The rapid evolution of backdoor attacks has emerged as a significant threat to the security of autonomous driving models. An attacker injects a backdoor into the model by adding triggers to the samples, which can be activated to manipulate the model’s inference. Backdoor attacks can lead to severe consequences, such as misidentifying traffic signs during autonomous driving, posing a risk of causing traffic accidents. Recently, there has been a gradual evolution of frequency-domain backdoor attacks. However, since the change of both amplitude and its corresponding phase will significantly affect image appearance, most of the existing frequency-domain backdoor attacks change only the amplitude, which results in a suboptimal efficacy of the attack. In this work, we propose an attack called IBAQ, to solve this problem by blurring semantic information of the trigger image through the quadratic phase. Initially, we convert the trigger and benign sample to YCrCb space. Then, we perform the fast Fourier transform on the Y channel, blending the trigger image’s amplitude and quadratic phase linearly with the benign sample’s amplitude and phase. IBAQ achieves covert injection of trigger information within amplitude and phase, enhancing the attack effect. We validate the effectiveness and stealthiness of IBAQ through comprehensive experiments.

后门攻击的快速发展已成为自动驾驶模型安全性的重大威胁。攻击者通过在样本中添加触发器向模型中注入后门,从而激活后门来操纵模型的推理。后门攻击可能会导致严重后果,例如在自动驾驶过程中错误识别交通标志,造成交通事故风险。最近,频域后门攻击逐渐发展起来。然而,由于振幅和相应相位的变化都会对图像外观产生重大影响,现有的频域后门攻击大多只改变振幅,导致攻击效果不理想。在这项工作中,我们提出了一种名为 IBAQ 的攻击,通过二次相位模糊触发图像的语义信息来解决这个问题。首先,我们将触发和良性样本转换到 YCrCb 空间。然后,我们对 Y 通道进行快速傅里叶变换,将触发图像的振幅和二次相位与良性样本的振幅和相位线性混合。IBAQ 实现了在振幅和相位内隐蔽注入触发信息,增强了攻击效果。我们通过综合实验验证了 IBAQ 的有效性和隐蔽性。
{"title":"IBAQ: Frequency-Domain Backdoor Attack Threatening Autonomous Driving via Quadratic Phase","authors":"Jinghan Qiu, Honglong Chen, Junjian Li, Yudong Gao, Junwei Li, Xingang Wang","doi":"10.1145/3673904","DOIUrl":"https://doi.org/10.1145/3673904","url":null,"abstract":"<p>The rapid evolution of backdoor attacks has emerged as a significant threat to the security of autonomous driving models. An attacker injects a backdoor into the model by adding triggers to the samples, which can be activated to manipulate the model’s inference. Backdoor attacks can lead to severe consequences, such as misidentifying traffic signs during autonomous driving, posing a risk of causing traffic accidents. Recently, there has been a gradual evolution of frequency-domain backdoor attacks. However, since the change of both amplitude and its corresponding phase will significantly affect image appearance, most of the existing frequency-domain backdoor attacks change only the amplitude, which results in a suboptimal efficacy of the attack. In this work, we propose an attack called IBAQ, to solve this problem by blurring semantic information of the trigger image through the quadratic phase. Initially, we convert the trigger and benign sample to YCrCb space. Then, we perform the fast Fourier transform on the Y channel, blending the trigger image’s amplitude and quadratic phase linearly with the benign sample’s amplitude and phase. IBAQ achieves covert injection of trigger information within amplitude and phase, enhancing the attack effect. We validate the effectiveness and stealthiness of IBAQ through comprehensive experiments.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141508517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Scheduling of High-Availability Drone Swarms for Congestion Alleviation in Connected Automated Vehicles 自适应调度高可用性无人机群,缓解联网自动驾驶汽车的拥堵状况
IF 2.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-06-19 DOI: 10.1145/3673905
Shengye Pang, Yi Li, Zhen Qin, Xinkui Zhao, Jintao Chen, Fan Wang, Jianwei Yin

The Intelligent Transportation System (ITS) serves as a pivotal element within urban networks, offering decision support to users and connected automated vehicles (CAVs) through comprehensive information gathering, sensing, device control, and data processing. Presently, ITS predominantly relies on sensors embedded in fixed infrastructure, notably Roadside Units (RSUs). However, RSUs are confined by coverage limitations and may encounter challenges in prompt emergency responses. On-demand resources, such as drones, present a viable option to supplement these deficiencies effectively. This paper introduces an approach where Software-Defined Networking (SDN) and Mobile Edge Computing (MEC) technologies are integrated to formulate a high-availability drone swarm control and communication infrastructure framework, comprising the cloud layer, edge layer, and device layer. Drones confront limitations in flight duration attributed to battery limitations, posing a challenge in sustaining continuous monitoring of road conditions over extended periods. Effective drone scheduling stands as a promising solution to overcome these constraints. To tackle this issue, we initially utilized Graph WaveNet, a specialized graph neural network structure tailored for spatial-temporal graph modeling, for training a congestion prediction model using real-world dataset inputs. Building upon this, we further propose an algorithm for drone scheduling based on congestion prediction. Our simulation experiments using real-world data demonstrate that, compared to the baseline method, the proposed scheduling algorithm not only yielded superior scheduling gains but also mitigated drone idle rates.

智能交通系统(ITS)是城市网络中的关键要素,通过全面的信息收集、传感、设备控制和数据处理,为用户和联网自动驾驶车辆(CAV)提供决策支持。目前,智能交通系统主要依靠嵌入在固定基础设施中的传感器,特别是路边装置(RSU)。然而,RSU 受到覆盖范围的限制,在迅速作出应急响应方面可能会遇到挑战。无人机等按需资源是有效补充这些不足的可行选择。本文介绍了一种将软件定义网络(SDN)和移动边缘计算(MEC)技术相结合的方法,以制定一个由云层、边缘层和设备层组成的高可用性无人机群控制和通信基础设施框架。由于电池的限制,无人机的飞行时间有限,这给长时间持续监控路况带来了挑战。有效的无人机调度是克服这些限制的可行解决方案。为了解决这个问题,我们首先利用了专门为时空图建模而定制的图神经网络结构 Graph WaveNet,使用真实世界数据集输入来训练拥堵预测模型。在此基础上,我们进一步提出了一种基于拥堵预测的无人机调度算法。我们使用真实世界数据进行的模拟实验表明,与基线方法相比,所提出的调度算法不仅获得了卓越的调度收益,还降低了无人机空闲率。
{"title":"Adaptive Scheduling of High-Availability Drone Swarms for Congestion Alleviation in Connected Automated Vehicles","authors":"Shengye Pang, Yi Li, Zhen Qin, Xinkui Zhao, Jintao Chen, Fan Wang, Jianwei Yin","doi":"10.1145/3673905","DOIUrl":"https://doi.org/10.1145/3673905","url":null,"abstract":"<p>The Intelligent Transportation System (ITS) serves as a pivotal element within urban networks, offering decision support to users and connected automated vehicles (CAVs) through comprehensive information gathering, sensing, device control, and data processing. Presently, ITS predominantly relies on sensors embedded in fixed infrastructure, notably Roadside Units (RSUs). However, RSUs are confined by coverage limitations and may encounter challenges in prompt emergency responses. On-demand resources, such as drones, present a viable option to supplement these deficiencies effectively. This paper introduces an approach where Software-Defined Networking (SDN) and Mobile Edge Computing (MEC) technologies are integrated to formulate a high-availability drone swarm control and communication infrastructure framework, comprising the cloud layer, edge layer, and device layer. Drones confront limitations in flight duration attributed to battery limitations, posing a challenge in sustaining continuous monitoring of road conditions over extended periods. Effective drone scheduling stands as a promising solution to overcome these constraints. To tackle this issue, we initially utilized Graph WaveNet, a specialized graph neural network structure tailored for spatial-temporal graph modeling, for training a congestion prediction model using real-world dataset inputs. Building upon this, we further propose an algorithm for drone scheduling based on congestion prediction. Our simulation experiments using real-world data demonstrate that, compared to the baseline method, the proposed scheduling algorithm not only yielded superior scheduling gains but also mitigated drone idle rates.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141508518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Supervised Machine Learning Framework for Online Container Security Attack Detection 用于在线容器安全攻击检测的自监督机器学习框架
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1145/3665795
Olufogorehan Tunde-Onadele, Yuhang Lin, Xiaohui Gu, Jingzhu He, Hugo Latapie

Container security has received much research attention recently. Previous work has proposed to apply various machine learning techniques to detect security attacks in containerized applications. On one hand, supervised machine learning schemes require sufficient labeled training data to achieve good attack detection accuracy. On the other hand, unsupervised machine learning methods are more practical by avoiding training data labeling requirements, but they often suffer from high false alarm rates. In this paper, we present a generic self-supervised hybrid learning (SHIL) framework for achieving efficient online security attack detection in containerized systems. SHIL can effectively combine both unsupervised and supervised learning algorithms but does not require any manual data labeling. We have implemented a prototype of SHIL and conducted experiments over 46 real world security attacks in 29 commonly used server applications. Our experimental results show that SHIL can reduce false alarms by 33-93% compared to existing supervised, unsupervised, or semi-supervised machine learning schemes while achieving a higher or similar detection rate.

最近,容器安全受到了很多研究的关注。以往的工作提出应用各种机器学习技术来检测容器化应用中的安全攻击。一方面,有监督机器学习方案需要足够多的标注训练数据,才能达到良好的攻击检测精度。另一方面,无监督机器学习方法避免了训练数据标记的要求,因而更加实用,但它们往往存在误报率高的问题。本文提出了一种通用的自监督混合学习(SHIL)框架,用于在容器化系统中实现高效的在线安全攻击检测。SHIL 可以有效结合无监督和有监督学习算法,但不需要任何人工数据标记。我们已经实现了SHIL的原型,并在29个常用服务器应用程序中对46个真实世界的安全攻击进行了实验。实验结果表明,与现有的有监督、无监督或半监督机器学习方案相比,SHIL 可将误报率降低 33-93%,同时实现更高或类似的检测率。
{"title":"Self-Supervised Machine Learning Framework for Online Container Security Attack Detection","authors":"Olufogorehan Tunde-Onadele, Yuhang Lin, Xiaohui Gu, Jingzhu He, Hugo Latapie","doi":"10.1145/3665795","DOIUrl":"https://doi.org/10.1145/3665795","url":null,"abstract":"<p>Container security has received much research attention recently. Previous work has proposed to apply various machine learning techniques to detect security attacks in containerized applications. On one hand, supervised machine learning schemes require sufficient labeled training data to achieve good attack detection accuracy. On the other hand, unsupervised machine learning methods are more practical by avoiding training data labeling requirements, but they often suffer from high false alarm rates. In this paper, we present a generic self-supervised hybrid learning (SHIL) framework for achieving efficient online security attack detection in containerized systems. SHIL can effectively combine both unsupervised and supervised learning algorithms but does not require any manual data labeling. We have implemented a prototype of SHIL and conducted experiments over 46 real world security attacks in 29 commonly used server applications. Our experimental results show that SHIL can reduce false alarms by 33-93% compared to existing supervised, unsupervised, or semi-supervised machine learning schemes while achieving a higher or similar detection rate.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141166090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Framework for Simultaneous Task Allocation and Planning under Uncertainty 不确定性条件下的任务分配与规划同步框架
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1145/3665499
Fatma Faruq, Bruno Lacerda, Nick Hawes, David Parker

We present novel techniques for simultaneous task allocation and planning in multi-robot systems operating under uncertainty. By performing task allocation and planning simultaneously, allocations are informed by individual robot behaviour, creating more efficient team behaviour. We go beyond existing work by planning for task reallocation across the team given a model of partial task satisfaction under potential robot failures and uncertain action outcomes. We model the problem using Markov decision processes, with tasks encoded in co-safe linear temporal logic, and optimise for the expected number of tasks completed by the team. To avoid the inherent complexity of joint models, we propose an alternative model that simultaneously considers task allocation and planning, but in a sequential fashion. We then build a joint policy from the sequential policy obtained from our model, thus allowing for concurrent policy execution. Furthermore, to enable adaptation in the case of robot failures, we consider replanning from failure states and propose an approach to preemptively replan in an anytime fashion, replanning for more probable failure states first. Our method also allows us to quantify the performance of the team by providing an analysis of properties such as the expected number of completed tasks under concurrent policy execution. We implement and extensively evaluate our approach on a range of scenarios. We compare its performance to a state-of-the-art baseline in decoupled task allocation and planning: sequential single-item auctions. Our approach outperforms the baseline in terms of computation time and the number of times replanning is required on robot failure.

我们介绍了在不确定条件下运行的多机器人系统中同时进行任务分配和规划的新技术。通过同时执行任务分配和规划,机器人个体的行为为任务分配提供了信息,从而提高了团队行为的效率。我们在现有工作的基础上,根据潜在机器人故障和不确定行动结果下的部分任务满意度模型,对整个团队的任务重新分配进行规划。我们使用马尔可夫决策过程对问题进行建模,任务以共安全线性时间逻辑进行编码,并对团队完成任务的预期数量进行优化。为了避免联合模型固有的复杂性,我们提出了一个替代模型,该模型同时考虑任务分配和规划,但以顺序的方式进行。然后,我们根据从模型中获得的顺序策略建立联合策略,从而实现策略的并行执行。此外,为了能够在机器人发生故障时进行调整,我们考虑了从故障状态重新规划的问题,并提出了一种以随时方式进行抢先重新规划的方法,首先对更有可能发生的故障状态进行重新规划。我们的方法还允许我们通过分析并发策略执行下完成任务的预期数量等属性来量化团队的性能。我们在一系列场景中实施并广泛评估了我们的方法。我们将其性能与任务分配和规划解耦的最先进基线(顺序单项拍卖)进行了比较。在计算时间和机器人故障时需要重新规划的次数方面,我们的方法优于基准方法。
{"title":"A Framework for Simultaneous Task Allocation and Planning under Uncertainty","authors":"Fatma Faruq, Bruno Lacerda, Nick Hawes, David Parker","doi":"10.1145/3665499","DOIUrl":"https://doi.org/10.1145/3665499","url":null,"abstract":"<p>We present novel techniques for simultaneous task allocation and planning in multi-robot systems operating under uncertainty. By performing task allocation and planning simultaneously, allocations are informed by individual robot behaviour, creating more efficient team behaviour. We go beyond existing work by planning for task reallocation across the team given a model of partial task satisfaction under potential robot failures and uncertain action outcomes. We model the problem using Markov decision processes, with tasks encoded in co-safe linear temporal logic, and optimise for the expected number of tasks completed by the team. To avoid the inherent complexity of joint models, we propose an alternative model that simultaneously considers task allocation and planning, but in a sequential fashion. We then build a joint policy from the sequential policy obtained from our model, thus allowing for concurrent policy execution. Furthermore, to enable adaptation in the case of robot failures, we consider replanning from failure states and propose an approach to preemptively replan in an anytime fashion, replanning for more probable failure states first. Our method also allows us to quantify the performance of the team by providing an analysis of properties such as the expected number of completed tasks under concurrent policy execution. We implement and extensively evaluate our approach on a range of scenarios. We compare its performance to a state-of-the-art baseline in decoupled task allocation and planning: sequential single-item auctions. Our approach outperforms the baseline in terms of computation time and the number of times replanning is required on robot failure.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141166539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptation in Edge Computing: A review on design principles and research challenges 边缘计算中的适应性:设计原则与研究挑战综述
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-09 DOI: 10.1145/3664200
Fatemeh Golpayegani, Nanxi Chen, Nima Afraz, Eric Gyamfi, Abdollah Malekjafarian, Dominik Schäfer, Christian Krupitzer

Edge Computing places the computational services and resources closer to the user proximity, to reduce latency, and ensure the quality of service and experience. Low latency, context awareness, and mobility support are the major contributors to edge-enabled smart systems. Such systems require handling new situations and change on the fly and ensuring the quality of service while only having access to constrained computation and communication resources and operating in mobile, dynamic, and ever-changing environments. Hence, adaptation and self-organisation are crucial for such systems to maintain their performance, and operability while accommodating new changes in their environment.

This paper reviews the current literature in the field of adaptive Edge Computing systems. We use a widely accepted taxonomy, which describes the important aspects of adaptive behaviour implementation in computing systems. This taxonomy discusses aspects such as adaptation reasons, the various levels an adaptation strategy can be implemented, the time of reaction to a change, categories of adaptation technique, and control of the adaptive behaviour. In this paper, we discuss how these aspects are addressed in the literature, and identify the open research challenges and future direction in adaptive Edge Computing systems.

The results of our analysis show that most of the identified approaches target adaptation at the application level, and only a few focus on middleware, communication infrastructure, and context. Adaptations that are required to address the changes in the context, changes caused by users or in the system itself are also less explored. Furthermore, most of the literature has opted for reactive adaptation, although proactive adaptation is essential to maintain the edge computing systems’ performance and interoperability by anticipating the required adaptations on the fly. Additionally, most approaches apply a centralised adaptation control, which does not perfectly fit the mostly decentralised/distributed Edge Computing settings.

边缘计算将计算服务和资源置于更接近用户的位置,以减少延迟,确保服务质量和体验。低延迟、情境感知和移动支持是边缘智能系统的主要优势。这类系统要求在移动、动态和不断变化的环境中运行时,仅能访问有限的计算和通信资源,并能即时处理新情况和新变化,确保服务质量。因此,自适应和自组织对于此类系统在适应环境新变化的同时保持性能和可操作性至关重要。我们采用了一种广为接受的分类法,该分类法描述了在计算系统中实施自适应行为的重要方面。该分类法讨论了适应原因、可实施适应策略的不同层次、对变化做出反应的时间、适应技术的类别以及适应行为的控制等方面。在本文中,我们将讨论文献中是如何论述这些方面的,并确定自适应边缘计算系统的公开研究挑战和未来方向。我们的分析结果表明,大多数已确定的方法都针对应用层面的自适应,只有少数方法关注中间件、通信基础设施和上下文。针对环境变化、用户变化或系统本身变化所需的适应性研究也较少。此外,大多数文献都选择了被动适应,尽管主动适应对于通过预测所需的即时适应来保持边缘计算系统的性能和互操作性至关重要。此外,大多数方法都采用集中式适应控制,这并不完全适合大部分分散式/分布式边缘计算设置。
{"title":"Adaptation in Edge Computing: A review on design principles and research challenges","authors":"Fatemeh Golpayegani, Nanxi Chen, Nima Afraz, Eric Gyamfi, Abdollah Malekjafarian, Dominik Schäfer, Christian Krupitzer","doi":"10.1145/3664200","DOIUrl":"https://doi.org/10.1145/3664200","url":null,"abstract":"<p>Edge Computing places the computational services and resources closer to the user proximity, to reduce latency, and ensure the quality of service and experience. Low latency, context awareness, and mobility support are the major contributors to edge-enabled smart systems. Such systems require handling new situations and change on the fly and ensuring the quality of service while only having access to constrained computation and communication resources and operating in mobile, dynamic, and ever-changing environments. Hence, adaptation and self-organisation are crucial for such systems to maintain their performance, and operability while accommodating new changes in their environment.</p><p>This paper reviews the current literature in the field of adaptive Edge Computing systems. We use a widely accepted taxonomy, which describes the important aspects of adaptive behaviour implementation in computing systems. This taxonomy discusses aspects such as adaptation reasons, the various levels an adaptation strategy can be implemented, the time of reaction to a change, categories of adaptation technique, and control of the adaptive behaviour. In this paper, we discuss how these aspects are addressed in the literature, and identify the open research challenges and future direction in adaptive Edge Computing systems.</p><p>The results of our analysis show that most of the identified approaches target adaptation at the application level, and only a few focus on middleware, communication infrastructure, and context. Adaptations that are required to address the changes in the context, changes caused by users or in the system itself are also less explored. Furthermore, most of the literature has opted for reactive adaptation, although proactive adaptation is essential to maintain the edge computing systems’ performance and interoperability by anticipating the required adaptations on the fly. Additionally, most approaches apply a centralised adaptation control, which does not perfectly fit the mostly decentralised/distributed Edge Computing settings.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140933501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OptimML: Joint Control of Inference Latency and Server Power Consumption for ML Performance Optimization OptimML:联合控制推理延迟和服务器功耗,优化 ML 性能
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-07 DOI: 10.1145/3661825
Guoyu Chen, Xiaorui Wang

Power capping is an important technique for high-density servers to safely oversubscribe the power infrastructure in a data center. However, power capping is commonly accomplished by dynamically lowering the server processors’ frequency levels, which can result in degraded application performance. For servers that run important machine learning (ML) applications with Service-Level Objective (SLO) requirements, inference performance such as recognition accuracy must be optimized within a certain latency constraint, which demands high server performance. In order to achieve the best inference accuracy under the desired latency and server power constraints, this paper proposes OptimML, a multi-input-multi-output (MIMO) control framework that jointly controls both inference latency and server power consumption, by flexibly adjusting the machine learning model size (and so its required computing resources) when server frequency needs to be lowered for power capping. Our results on a hardware testbed with widely adopted ML framework (including PyTorch, TensorFlow, and MXNet) show that OptimML achieves higher inference accuracy compared with several well-designed baselines, while respecting both latency and power constraints. Furthermore, an adaptive control scheme with online model switching and estimation is designed to achieve analytic assurance of control accuracy and system stability, even in the face of significant workload/hardware variations.

功率封顶是高密度服务器的一项重要技术,可安全地超额分配数据中心的电力基础设施。然而,功率封顶通常是通过动态降低服务器处理器的频率水平来实现的,这会导致应用性能下降。对于运行有服务级目标(SLO)要求的重要机器学习(ML)应用的服务器来说,必须在一定的延迟约束内优化推理性能(如识别准确率),这就要求服务器具有很高的性能。为了在所需的延迟和服务器功耗限制条件下实现最佳推理精度,本文提出了多输入多输出(MIMO)控制框架 OptimML,当服务器频率需要降低以达到功耗上限时,通过灵活调整机器学习模型的大小(因此也调整了所需的计算资源)来共同控制推理延迟和服务器功耗。我们在采用广泛应用的 ML 框架(包括 PyTorch、TensorFlow 和 MXNet)的硬件测试平台上取得的结果表明,与几种精心设计的基线相比,OptimML 实现了更高的推理精度,同时遵守了延迟和功耗限制。此外,我们还设计了一种具有在线模型切换和估计功能的自适应控制方案,以实现对控制精度和系统稳定性的分析保证,即使面对显著的工作负载/硬件变化也是如此。
{"title":"OptimML: Joint Control of Inference Latency and Server Power Consumption for ML Performance Optimization","authors":"Guoyu Chen, Xiaorui Wang","doi":"10.1145/3661825","DOIUrl":"https://doi.org/10.1145/3661825","url":null,"abstract":"<p>Power capping is an important technique for high-density servers to safely oversubscribe the power infrastructure in a data center. However, power capping is commonly accomplished by dynamically lowering the server processors’ frequency levels, which can result in degraded application performance. For servers that run important machine learning (ML) applications with Service-Level Objective (SLO) requirements, inference performance such as recognition accuracy must be optimized within a certain latency constraint, which demands high server performance. In order to achieve the best inference accuracy under the desired latency and server power constraints, this paper proposes OptimML, a multi-input-multi-output (MIMO) control framework that jointly controls both inference latency and server power consumption, by flexibly adjusting the machine learning model size (and so its required computing resources) when server frequency needs to be lowered for power capping. Our results on a hardware testbed with widely adopted ML framework (including PyTorch, TensorFlow, and MXNet) show that OptimML achieves higher inference accuracy compared with several well-designed baselines, while respecting both latency and power constraints. Furthermore, an adaptive control scheme with online model switching and estimation is designed to achieve analytic assurance of control accuracy and system stability, even in the face of significant workload/hardware variations.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140888046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ACM Transactions on Autonomous and Adaptive Systems (ACM TAAS): Editorial Welcome and Update on State of the Journal, Vision, and Ongoing Developments ACM Autonomous and Adaptive Systems (ACM TAAS):编辑部欢迎辞及期刊现状、愿景和当前发展的最新情况
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-04-24 DOI: 10.1145/3661314
Rami Bahsoon
{"title":"ACM Transactions on Autonomous and Adaptive Systems (ACM TAAS): Editorial Welcome and Update on State of the Journal, Vision, and Ongoing Developments","authors":"Rami Bahsoon","doi":"10.1145/3661314","DOIUrl":"https://doi.org/10.1145/3661314","url":null,"abstract":"","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140660723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying Trust for Operational States of ICT-Enabled Power Grid Services 将信任应用于信息和通信技术驱动的电网服务的运行状态
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2024-04-03 DOI: 10.1145/3654672
Michael Brand, Anand Narayan, Sebastian Lehnhoff

Digitalization enables the automation required to operate modern cyber-physical energy systems (CPESs), leading to a shift from hierarchical to organic systems. However, digitalization increases the number of factors affecting the state of a CPES (e.g., software bugs and cyber threats). In addition to established factors like functional correctness, others like security become relevant but are yet to be integrated into an operational viewpoint, i.e. a holistic perspective on the system state. Trust in organic computing is an approach to gain a holistic view of the state of systems. It consists of several facets (e.g., functional correctness, security, and reliability), which can be used to assess the state of CPES. Therefore, a trust assessment on all levels can contribute to a coherent state assessment. This paper focuses on the trust in ICT-enabled grid services in a CPES. These are essential for operating the CPES, and their performance relies on various data aspects like availability, timeliness, and correctness. This paper proposes to assess the trust in involved components and data to estimate data correctness, which is crucial for grid services. The assessment is presented considering two exemplary grid services, namely state estimation and coordinated voltage control. Furthermore, the interpretation of different trust facets is also discussed.

数字化实现了现代网络物理能源系统(CPES)运行所需的自动化,导致系统从分层向有机转变。然而,数字化增加了影响 CPES 状态的因素数量(如软件错误和网络威胁)。除了功能正确性等既有因素外,安全性等其他因素也变得十分重要,但这些因素尚未纳入运行视角,即系统状态的整体视角。有机计算中的信任是一种全面了解系统状态的方法。它包括几个方面(如功能正确性、安全性和可靠性),可用于评估 CPES 的状态。因此,对所有层面的信任评估都有助于进行一致的状态评估。本文重点关注 CPES 中由 ICT 支持的电网服务的信任度。这些服务对 CPES 的运行至关重要,其性能依赖于各种数据,如可用性、及时性和正确性。本文建议对相关组件和数据的信任度进行评估,以估计数据的正确性,这对电网服务至关重要。本文考虑了两种典型的电网服务,即状态估计和协调电压控制,提出了评估方法。此外,还讨论了不同信任面的解释。
{"title":"Applying Trust for Operational States of ICT-Enabled Power Grid Services","authors":"Michael Brand, Anand Narayan, Sebastian Lehnhoff","doi":"10.1145/3654672","DOIUrl":"https://doi.org/10.1145/3654672","url":null,"abstract":"<p>Digitalization enables the automation required to operate modern cyber-physical energy systems (CPESs), leading to a shift from hierarchical to organic systems. However, digitalization increases the number of factors affecting the state of a CPES (e.g., software bugs and cyber threats). In addition to established factors like functional correctness, others like security become relevant but are yet to be integrated into an operational viewpoint, i.e. a holistic perspective on the system state. Trust in organic computing is an approach to gain a holistic view of the state of systems. It consists of several facets (e.g., functional correctness, security, and reliability), which can be used to assess the state of CPES. Therefore, a trust assessment on all levels can contribute to a coherent state assessment. This paper focuses on the trust in ICT-enabled grid services in a CPES. These are essential for operating the CPES, and their performance relies on various data aspects like availability, timeliness, and correctness. This paper proposes to assess the trust in involved components and data to estimate data correctness, which is crucial for grid services. The assessment is presented considering two exemplary grid services, namely state estimation and coordinated voltage control. Furthermore, the interpretation of different trust facets is also discussed.</p>","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140560874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on Autonomous and Adaptive Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1