首页 > 最新文献

ACM Transactions on Autonomous and Adaptive Systems最新文献

英文 中文
Engineering Pervasive Service Ecosystems: The SAPERE Approach 工程普及服务生态系统:SAPERE方法
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-03-25 DOI: 10.1145/2700321
G. Castelli, M. Mamei, A. Rosi, F. Zambonelli
Emerging pervasive computing services will typically involve a large number of devices and service components cooperating together in an open and dynamic environment. This calls for suitable models and infrastructures promoting spontaneous, situated, and self-adaptive interactions between components. SAPERE (Self-Aware Pervasive Service Ecosystems) is a general coordination framework aimed at facilitating the decentralized and situated execution of self-organizing and self-adaptive pervasive computing services. SAPERE adopts a nature-inspired approach, in which pervasive services are modeled and deployed as autonomous individuals in an ecosystem of other services and devices, all of which interact in accord to a limited set of coordination laws, or eco-laws. In this article, we present the overall rationale underlying SAPERE and its reference architecture. We introduce the eco-laws--based coordination model and show how it can be used to express and easily enforce general-purpose self-organizing coordination patterns. The middleware infrastructure supporting the SAPERE model is presented and evaluated, and the overall advantages of SAPERE are discussed in the context of exemplary use cases.
新兴的普及计算服务通常涉及大量设备和服务组件在开放和动态的环境中协同工作。这需要合适的模型和基础设施来促进组件之间自发的、定位的和自适应的交互。SAPERE(自我意识的普适服务生态系统)是一个通用的协调框架,旨在促进自组织和自适应普适计算服务的分散和定位执行。SAPERE采用了一种受自然启发的方法,在这种方法中,普适性服务被建模并部署为其他服务和设备组成的生态系统中的自主个体,所有这些服务和设备都按照一组有限的协调规律或生态规律进行交互。在本文中,我们将介绍SAPERE及其参考体系结构的基本原理。我们介绍了基于生态规律的协调模型,并展示了如何使用它来表达和容易地执行通用的自组织协调模式。介绍并评估了支持SAPERE模型的中间件基础设施,并在示例用例的上下文中讨论了SAPERE的总体优势。
{"title":"Engineering Pervasive Service Ecosystems: The SAPERE Approach","authors":"G. Castelli, M. Mamei, A. Rosi, F. Zambonelli","doi":"10.1145/2700321","DOIUrl":"https://doi.org/10.1145/2700321","url":null,"abstract":"Emerging pervasive computing services will typically involve a large number of devices and service components cooperating together in an open and dynamic environment. This calls for suitable models and infrastructures promoting spontaneous, situated, and self-adaptive interactions between components. SAPERE (Self-Aware Pervasive Service Ecosystems) is a general coordination framework aimed at facilitating the decentralized and situated execution of self-organizing and self-adaptive pervasive computing services. SAPERE adopts a nature-inspired approach, in which pervasive services are modeled and deployed as autonomous individuals in an ecosystem of other services and devices, all of which interact in accord to a limited set of coordination laws, or eco-laws. In this article, we present the overall rationale underlying SAPERE and its reference architecture. We introduce the eco-laws--based coordination model and show how it can be used to express and easily enforce general-purpose self-organizing coordination patterns. The middleware infrastructure supporting the SAPERE model is presented and evaluated, and the overall advantages of SAPERE are discussed in the context of exemplary use cases.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85388145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
Reliable Task Allocation with Load Balancing in Multiplex Networks 多路复用网络中负载均衡的可靠任务分配
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-03-25 DOI: 10.1145/2700327
Yichuan Jiang, Yifeng Zhou, Yunpeng Li
In multiplex networks, agents are connected by multiple types of links; a multiplex network can be split into more than one network layer that is composed of the same type of links and involved agents. Each network link type has a bias for communicating different types of resources; thus, the task’s access to the required resources in multiplex networks is strongly related to the network link types. However, traditional task allocation and load balancing methods only considered the situations of agents themselves and did not address the effects of network link types in multiplex networks. To solve this problem, this article considers both link types and agents, and substantially extends the existing work by highlighting the effect of network layers on task allocation and load balancing. Two multiplex network-adapted models of task allocation with load balancing are presented: network layer-oriented allocation and agent-oriented allocation. This article also addresses the unreliability in multiplex networks, which includes the unreliable links and agents, and implements a reliable task allocation based on a negotiation reputation and reward mechanism. Our findings show that both of our presented models can effectively and robustly satisfy the task allocation objectives in unreliable multiplex networks; the experiments prove that they can significantly reduce the time costs and improve the success rate of tasks for multiplex networks over the traditional simplex network-adapted task allocation model. Lastly, we find that our presented network layer-oriented allocation performs much better in terms of reliability and allocation time compared to our presented agent-oriented allocation, which further explains the importance of network layers in multiplex networks.
在多路网络中,代理通过多种类型的链路连接;多路复用网络可以分为多个网络层,这些网络层由相同类型的链路和所涉及的代理组成。每一种网络链路类型对不同类型的资源具有通信偏差;因此,任务对多路网络中所需资源的访问与网络链路类型密切相关。然而,传统的任务分配和负载均衡方法只考虑了智能体自身的情况,没有解决多路网络中网络链路类型的影响。为了解决这个问题,本文同时考虑了链路类型和代理,并通过强调网络层对任务分配和负载平衡的影响,大大扩展了现有的工作。提出了两种多路网络负载均衡任务分配模型:面向网络层的任务分配模型和面向agent的任务分配模型。本文还解决了多路复用网络中的不可靠性问题,包括不可靠链路和不可靠代理,并实现了基于协商信誉和奖励机制的可靠任务分配。研究结果表明,两种模型都能有效地鲁棒地满足不可靠复用网络中的任务分配目标;实验证明,与传统的单形网络任务分配模型相比,该模型可以显著降低多路网络任务分配的时间成本,提高任务分配的成功率。最后,我们发现,与我们提出的面向代理的分配相比,我们提出的面向网络层的分配在可靠性和分配时间方面表现得更好,这进一步解释了网络层在多路网络中的重要性。
{"title":"Reliable Task Allocation with Load Balancing in Multiplex Networks","authors":"Yichuan Jiang, Yifeng Zhou, Yunpeng Li","doi":"10.1145/2700327","DOIUrl":"https://doi.org/10.1145/2700327","url":null,"abstract":"In multiplex networks, agents are connected by multiple types of links; a multiplex network can be split into more than one network layer that is composed of the same type of links and involved agents. Each network link type has a bias for communicating different types of resources; thus, the task’s access to the required resources in multiplex networks is strongly related to the network link types. However, traditional task allocation and load balancing methods only considered the situations of agents themselves and did not address the effects of network link types in multiplex networks. To solve this problem, this article considers both link types and agents, and substantially extends the existing work by highlighting the effect of network layers on task allocation and load balancing. Two multiplex network-adapted models of task allocation with load balancing are presented: network layer-oriented allocation and agent-oriented allocation. This article also addresses the unreliability in multiplex networks, which includes the unreliable links and agents, and implements a reliable task allocation based on a negotiation reputation and reward mechanism. Our findings show that both of our presented models can effectively and robustly satisfy the task allocation objectives in unreliable multiplex networks; the experiments prove that they can significantly reduce the time costs and improve the success rate of tasks for multiplex networks over the traditional simplex network-adapted task allocation model. Lastly, we find that our presented network layer-oriented allocation performs much better in terms of reliability and allocation time compared to our presented agent-oriented allocation, which further explains the importance of network layers in multiplex networks.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81463500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
A Reinforcement Learning Approach for Interdomain Routing with Link Prices 基于链路价格的域间路由强化学习方法
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-03-25 DOI: 10.1145/2719648
Peter Vrancx, Pasquale Gurzi, Abdel Rodríguez, K. Steenhaut, A. Nowé
In today’s Internet, the commercial aspects of routing are gaining importance. Current technology allows Internet Service Providers (ISPs) to renegotiate contracts online to maximize profits. Changing link prices will influence interdomain routing policies that are now driven by monetary aspects as well as global resource and performance optimization. In this article, we consider an interdomain routing game in which the ISP’s action is to set the price for its transit links. Assuming a cheapest path routing scheme, the optimal action is the price setting that yields the highest utility (i.e., profit) and depends both on the network load and the actions of other ISPs. We adapt a continuous and a discrete action learning automaton (LA) to operate in this framework as a tool that can be used by ISP operators to learn optimal price setting. In our model, agents representing different ISPs learn only on the basis of local information and do not need any central coordination or sensitive information exchange. Simulation results show that a single ISP employing LAs is able to learn the optimal price in a stationary environment. By introducing a selective exploration rule, LAs are also able to operate in nonstationary environments. When two ISPs employ LAs, we show that they converge to stable and fair equilibrium strategies.
在今天的互联网中,路由的商业方面变得越来越重要。目前的技术允许互联网服务提供商(isp)在线重新谈判合同以实现利润最大化。不断变化的链接价格将影响域间路由政策,这些政策现在是由货币方面以及全球资源和性能优化驱动的。在本文中,我们考虑一个域间路由博弈,其中ISP的行为是设置其传输链路的价格。假设一个最便宜的路径路由方案,最优行为是产生最高效用(即利润)的价格设置,并取决于网络负载和其他isp的行为。我们采用连续和离散动作学习自动机(LA)在此框架中运行,作为ISP运营商可以使用的工具来学习最优价格设置。在我们的模型中,代表不同isp的代理仅基于本地信息学习,不需要任何中心协调或敏感信息交换。仿真结果表明,在固定环境下,单个ISP能够学习到最优价格。通过引入选择性勘探规则,人工智能也能够在非平稳环境中工作。当两个isp采用LAs时,我们证明它们收敛于稳定和公平的均衡策略。
{"title":"A Reinforcement Learning Approach for Interdomain Routing with Link Prices","authors":"Peter Vrancx, Pasquale Gurzi, Abdel Rodríguez, K. Steenhaut, A. Nowé","doi":"10.1145/2719648","DOIUrl":"https://doi.org/10.1145/2719648","url":null,"abstract":"In today’s Internet, the commercial aspects of routing are gaining importance. Current technology allows Internet Service Providers (ISPs) to renegotiate contracts online to maximize profits. Changing link prices will influence interdomain routing policies that are now driven by monetary aspects as well as global resource and performance optimization. In this article, we consider an interdomain routing game in which the ISP’s action is to set the price for its transit links. Assuming a cheapest path routing scheme, the optimal action is the price setting that yields the highest utility (i.e., profit) and depends both on the network load and the actions of other ISPs. We adapt a continuous and a discrete action learning automaton (LA) to operate in this framework as a tool that can be used by ISP operators to learn optimal price setting. In our model, agents representing different ISPs learn only on the basis of local information and do not need any central coordination or sensitive information exchange. Simulation results show that a single ISP employing LAs is able to learn the optimal price in a stationary environment. By introducing a selective exploration rule, LAs are also able to operate in nonstationary environments. When two ISPs employ LAs, we show that they converge to stable and fair equilibrium strategies.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87242898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Self-Tuning Batching with DVFS for Performance Improvement and Energy Efficiency in Internet Servers 自调优批处理与DVFS在互联网服务器的性能改进和能源效率
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-03-25 DOI: 10.1145/2720023
Dazhao Cheng, Yanfei Guo, Changjun Jiang, Xiaobo Zhou
Performance improvement and energy efficiency are two important goals in provisioning Internet services in datacenter servers. In this article, we propose and develop a self-tuning request batching mechanism to simultaneously achieve the two correlated goals. The batching mechanism increases the cache hit rate at the front-tier Web server, which provides the opportunity to improve an application’s performance and the energy efficiency of the server system. The core of the batching mechanism is a novel and practical two-layer control system that adaptively adjusts the batching interval and frequency states of CPUs according to the service level agreement and the workload characteristics. The batching control adopts a self-tuning fuzzy model predictive control approach for application performance improvement. The power control dynamically adjusts the frequency of Central Processing Units (CPUs) with Dynamic Voltage and Frequency Scaling (DVFS) in response to workload fluctuations for energy efficiency. A coordinator between the two control loops achieves the desired performance and energy efficiency. We further extend the self-tuning batching with DVFS approach from a single-server system to a multiserver system. It relies on a MIMO expert fuzzy control to adjust the CPU frequencies of multiple servers and coordinate the frequency states of CPUs at different tiers. We implement the mechanism in a test bed. Experimental results demonstrate that the new approach significantly improves the application performance in terms of the system throughput and average response time. At the same time, the results also illustrate the mechanism can reduce the energy consumption of a single-server system by 13% and a multiserver system by 11%, respectively.
性能改进和能源效率是在数据中心服务器中提供Internet服务的两个重要目标。在本文中,我们提出并开发了一种自调优请求批处理机制,以同时实现两个相关的目标。批处理机制提高了前端Web服务器的缓存命中率,从而提供了改进应用程序性能和服务器系统能效的机会。该批处理机制的核心是一种新颖实用的双层控制系统,可根据服务水平协议和工作负载特性自适应调整cpu的批处理间隔和频率状态。批处理控制采用自整定模糊模型预测控制方法,提高应用性能。电源控制通过动态电压和频率缩放(DVFS)来动态调整中央处理器(cpu)的频率,以响应工作负载的波动,从而提高能源效率。两个控制回路之间的协调器实现了期望的性能和能源效率。我们进一步使用DVFS方法将自调优批处理从单服务器系统扩展到多服务器系统。它依靠MIMO专家模糊控制来调整多台服务器的CPU频率,并协调各层CPU的频率状态。我们在测试台上实现了该机制。实验结果表明,该方法在系统吞吐量和平均响应时间方面显著提高了应用性能。同时,结果还表明,该机制可以使单服务器系统的能耗降低13%,多服务器系统的能耗降低11%。
{"title":"Self-Tuning Batching with DVFS for Performance Improvement and Energy Efficiency in Internet Servers","authors":"Dazhao Cheng, Yanfei Guo, Changjun Jiang, Xiaobo Zhou","doi":"10.1145/2720023","DOIUrl":"https://doi.org/10.1145/2720023","url":null,"abstract":"Performance improvement and energy efficiency are two important goals in provisioning Internet services in datacenter servers. In this article, we propose and develop a self-tuning request batching mechanism to simultaneously achieve the two correlated goals. The batching mechanism increases the cache hit rate at the front-tier Web server, which provides the opportunity to improve an application’s performance and the energy efficiency of the server system. The core of the batching mechanism is a novel and practical two-layer control system that adaptively adjusts the batching interval and frequency states of CPUs according to the service level agreement and the workload characteristics. The batching control adopts a self-tuning fuzzy model predictive control approach for application performance improvement. The power control dynamically adjusts the frequency of Central Processing Units (CPUs) with Dynamic Voltage and Frequency Scaling (DVFS) in response to workload fluctuations for energy efficiency. A coordinator between the two control loops achieves the desired performance and energy efficiency. We further extend the self-tuning batching with DVFS approach from a single-server system to a multiserver system. It relies on a MIMO expert fuzzy control to adjust the CPU frequencies of multiple servers and coordinate the frequency states of CPUs at different tiers. We implement the mechanism in a test bed. Experimental results demonstrate that the new approach significantly improves the application performance in terms of the system throughput and average response time. At the same time, the results also illustrate the mechanism can reduce the energy consumption of a single-server system by 13% and a multiserver system by 11%, respectively.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84892693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
SHõWA: A Self-Healing Framework for Web-Based Applications SHõWA:一个基于web的应用程序的自修复框架
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-03-25 DOI: 10.1145/2700325
J. Magalhães, L. Silva
The complexity of systems is considered an obstacle to the progress of the IT industry. Autonomic computing is presented as the alternative to cope with the growing complexity. It is a holistic approach, in which the systems are able to configure, heal, optimize, and protect by themselves. Web-based applications are an example of systems where the complexity is high. The number of components, their interoperability, and workload variations are factors that may lead to performance failures or unavailability scenarios. The occurrence of these scenarios affects the revenue and reputation of businesses that rely on these types of applications. In this article, we present a self-healing framework for Web-based applications (SHõWA). SHõWA is composed by several modules, which monitor the application, analyze the data to detect and pinpoint anomalies, and execute recovery actions autonomously. The monitoring is done by a small aspect-oriented programming agent. This agent does not require changes to the application source code and includes adaptive and selective algorithms to regulate the level of monitoring. The anomalies are detected and pinpointed by means of statistical correlation. The data analysis detects changes in the server response time and analyzes if those changes are correlated with the workload or are due to a performance anomaly. In the presence of performance anomalies, the data analysis pinpoints the anomaly. Upon the pinpointing of anomalies, SHõWA executes a recovery procedure. We also present a study about the detection and localization of anomalies, the accuracy of the data analysis, and the performance impact induced by SHõWA. Two benchmarking applications, exercised through dynamic workloads, and different types of anomaly were considered in the study. The results reveal that (1) the capacity of SHõWA to detect and pinpoint anomalies while the number of end users affected is low; (2) SHõWA was able to detect anomalies without raising any false alarm; and (3) SHõWA does not induce a significant performance overhead (throughput was affected in less than 1%, and the response time delay was no more than 2 milliseconds).
系统的复杂性被认为是IT产业发展的障碍。自主计算是应对日益增长的复杂性的替代方案。它是一种整体方法,其中系统能够自行配置、修复、优化和保护。基于web的应用程序是复杂性很高的系统的一个例子。组件的数量、它们的互操作性和工作负载变化是可能导致性能故障或不可用场景的因素。这些场景的出现会影响依赖这些类型应用程序的企业的收入和声誉。在本文中,我们将介绍一个基于web的应用程序的自修复框架(SHõWA)。SHõWA由几个模块组成,这些模块监视应用程序,分析数据以检测和查明异常,并自主执行恢复操作。监视是由一个小型的面向方面编程代理完成的。该代理不需要更改应用程序源代码,并包括自适应和选择性算法来调节监视级别。用统计相关的方法检测和确定异常。数据分析检测服务器响应时间的变化,并分析这些变化是与工作负载相关还是由于性能异常引起的。在出现性能异常时,数据分析可以精确定位异常。在精确定位异常后,SHõWA执行恢复过程。我们还对异常的检测和定位、数据分析的准确性以及SHõWA对性能的影响进行了研究。研究中考虑了通过动态工作负载运行的两个基准测试应用程序和不同类型的异常。结果表明:(1)SHõWA在受影响的终端用户数量较少的情况下,检测和定位异常的能力较弱;(2) SHõWA能够检测异常而不产生任何虚警;(3) SHõWA不会引起显著的性能开销(吞吐量受到的影响小于1%,响应时间延迟不超过2毫秒)。
{"title":"SHõWA: A Self-Healing Framework for Web-Based Applications","authors":"J. Magalhães, L. Silva","doi":"10.1145/2700325","DOIUrl":"https://doi.org/10.1145/2700325","url":null,"abstract":"The complexity of systems is considered an obstacle to the progress of the IT industry. Autonomic computing is presented as the alternative to cope with the growing complexity. It is a holistic approach, in which the systems are able to configure, heal, optimize, and protect by themselves. Web-based applications are an example of systems where the complexity is high. The number of components, their interoperability, and workload variations are factors that may lead to performance failures or unavailability scenarios. The occurrence of these scenarios affects the revenue and reputation of businesses that rely on these types of applications.\u0000 In this article, we present a self-healing framework for Web-based applications (SHõWA). SHõWA is composed by several modules, which monitor the application, analyze the data to detect and pinpoint anomalies, and execute recovery actions autonomously. The monitoring is done by a small aspect-oriented programming agent. This agent does not require changes to the application source code and includes adaptive and selective algorithms to regulate the level of monitoring. The anomalies are detected and pinpointed by means of statistical correlation. The data analysis detects changes in the server response time and analyzes if those changes are correlated with the workload or are due to a performance anomaly. In the presence of performance anomalies, the data analysis pinpoints the anomaly. Upon the pinpointing of anomalies, SHõWA executes a recovery procedure. We also present a study about the detection and localization of anomalies, the accuracy of the data analysis, and the performance impact induced by SHõWA. Two benchmarking applications, exercised through dynamic workloads, and different types of anomaly were considered in the study. The results reveal that (1) the capacity of SHõWA to detect and pinpoint anomalies while the number of end users affected is low; (2) SHõWA was able to detect anomalies without raising any false alarm; and (3) SHõWA does not induce a significant performance overhead (throughput was affected in less than 1%, and the response time delay was no more than 2 milliseconds).","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88086653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Reinforcement Learning of Informed Initial Policies for Decentralized Planning 分散规划初始策略的强化学习
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-01-14 DOI: 10.1145/2668130
Landon Kraemer, Bikramjit Banerjee
Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a formal model for planning in cooperative multiagent systems where agents operate with noisy sensors and actuators, as well as local information. Prevalent solution techniques are centralized and model based—limitations that we address by distributed reinforcement learning (RL). We particularly favor alternate learning, where agents alternately learn best responses to each other, which appears to outperform concurrent RL. However, alternate learning requires an initial policy. We propose two principled approaches to generating informed initial policies: a naive approach that lays the foundation for a more sophisticated approach. We empirically demonstrate that the refined approach produces near-optimal solutions in many challenging benchmark settings, staking a claim to being an efficient (and realistic) approximate solver in its own right. Furthermore, alternate best response learning seeded with such policies quickly learns high-quality policies as well.
分散式部分可观察马尔可夫决策过程(deco - pomdp)为协作多智能体系统中的规划提供了一种形式化模型,其中智能体使用噪声传感器和执行器以及本地信息进行操作。流行的解决方案技术是集中式的和基于模型的限制,我们通过分布式强化学习(RL)来解决这些限制。我们特别喜欢交替学习,其中代理交替学习彼此的最佳反应,这似乎优于并发强化学习。然而,交替学习需要一个初始策略。我们提出了两种原则性的方法来生成知情的初始政策:一种幼稚的方法为更复杂的方法奠定基础。我们的经验证明,改进的方法在许多具有挑战性的基准设置中产生接近最优的解决方案,声称自己是一个有效的(和现实的)近似求解器。此外,以这些策略为种子的备选最佳响应学习也可以快速学习高质量的策略。
{"title":"Reinforcement Learning of Informed Initial Policies for Decentralized Planning","authors":"Landon Kraemer, Bikramjit Banerjee","doi":"10.1145/2668130","DOIUrl":"https://doi.org/10.1145/2668130","url":null,"abstract":"Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a formal model for planning in cooperative multiagent systems where agents operate with noisy sensors and actuators, as well as local information. Prevalent solution techniques are centralized and model based—limitations that we address by distributed reinforcement learning (RL). We particularly favor alternate learning, where agents alternately learn best responses to each other, which appears to outperform concurrent RL. However, alternate learning requires an initial policy. We propose two principled approaches to generating informed initial policies: a naive approach that lays the foundation for a more sophisticated approach. We empirically demonstrate that the refined approach produces near-optimal solutions in many challenging benchmark settings, staking a claim to being an efficient (and realistic) approximate solver in its own right. Furthermore, alternate best response learning seeded with such policies quickly learns high-quality policies as well.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90207190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Distributed Data-Centric Adaptive Sampling for Cyber-Physical Systems 分布式数据中心自适应采样网络物理系统
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-01-14 DOI: 10.1145/2644820
Eun Kyung Lee, H. Viswanathan, D. Pompili
A data-centric joint adaptive sampling and sleep scheduling solution, SILENCE, for autonomic sensor-based systems that monitor and reconstruct physical or environmental phenomena is proposed. Adaptive sampling and sleep scheduling can help realize the much needed resource efficiency by minimizing the communication and processing overhead in densely deployed autonomic sensor-based systems. The proposed solution exploits the spatiotemporal correlation in sensed data and eliminates redundancy in transmitted data through selective representation without compromising on accuracy of reconstruction of the monitored phenomenon at a remote monitor node. Differently from existing adaptive sampling solutions, SILENCE employs temporal causality analysis to not only track the variation in the underlying phenomenon but also its cause and direction of propagation in the field. The causality analysis and the same correlations are then leveraged for adaptive sleep scheduling aimed at saving energy in wireless sensor networks (WSNs). SILENCE outperforms traditional adaptive sampling solutions as well as the recently proposed compressive sampling techniques. Real experiments were performed on a WSN testbed monitoring temperature and humidity distribution in a rack of servers, and the simulations were performed on TOSSIM, the TinyOS simulator.
提出了一种以数据为中心的联合自适应采样和睡眠调度解决方案SILENCE,用于监测和重建物理或环境现象的基于自主传感器的系统。在密集部署的自主传感器系统中,自适应采样和睡眠调度可以通过最小化通信和处理开销来帮助实现急需的资源效率。该方案利用遥感数据的时空相关性,通过选择性表示消除传输数据中的冗余,同时不影响远程监测节点对监测现象的重建精度。与现有的自适应采样解决方案不同,SILENCE采用时间因果分析,不仅可以跟踪潜在现象的变化,还可以跟踪其在现场传播的原因和方向。然后利用因果分析和相同的相关性进行自适应睡眠调度,目的是在无线传感器网络(WSNs)中节省能量。SILENCE优于传统的自适应采样解决方案以及最近提出的压缩采样技术。在监测服务器机架温度和湿度分布的WSN试验台上进行了实际实验,并在TinyOS模拟器TOSSIM上进行了仿真。
{"title":"Distributed Data-Centric Adaptive Sampling for Cyber-Physical Systems","authors":"Eun Kyung Lee, H. Viswanathan, D. Pompili","doi":"10.1145/2644820","DOIUrl":"https://doi.org/10.1145/2644820","url":null,"abstract":"A data-centric joint adaptive sampling and sleep scheduling solution, SILENCE, for autonomic sensor-based systems that monitor and reconstruct physical or environmental phenomena is proposed. Adaptive sampling and sleep scheduling can help realize the much needed resource efficiency by minimizing the communication and processing overhead in densely deployed autonomic sensor-based systems. The proposed solution exploits the spatiotemporal correlation in sensed data and eliminates redundancy in transmitted data through selective representation without compromising on accuracy of reconstruction of the monitored phenomenon at a remote monitor node. Differently from existing adaptive sampling solutions, SILENCE employs temporal causality analysis to not only track the variation in the underlying phenomenon but also its cause and direction of propagation in the field. The causality analysis and the same correlations are then leveraged for adaptive sleep scheduling aimed at saving energy in wireless sensor networks (WSNs). SILENCE outperforms traditional adaptive sampling solutions as well as the recently proposed compressive sampling techniques. Real experiments were performed on a WSN testbed monitoring temperature and humidity distribution in a rack of servers, and the simulations were performed on TOSSIM, the TinyOS simulator.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82799988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Property-Driven Design for Robot Swarms: A Design Method Based on Prescriptive Modeling and Model Checking 机器人群体属性驱动设计:一种基于规定性建模和模型检验的设计方法
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-01-14 DOI: 10.1145/2700318
Manuele Brambilla, A. Brutschy, M. Dorigo, M. Birattari
In this article, we present property-driven design, a novel top-down design method for robot swarms based on prescriptive modeling and model checking. Traditionally, robot swarms have been developed using a code-and-fix approach: in a bottom-up iterative process, the developer tests and improves the individual behaviors of the robots until the desired collective behavior is obtained. The code-and-fix approach is unstructured, and the quality of the obtained swarm depends completely on the expertise and ingenuity of the developer who has little scientific or technical support in his activity. Property-driven design aims at providing such scientific and technical support, with many advantages compared to the traditional unstructured approach. Property-driven design is composed of four phases: first, the developer formally specifies the requirements of the robot swarm by stating its desired properties; second, the developer creates a prescriptive model of the swarm and uses model checking to verify that this prescriptive model satisfies the desired properties; third, using the prescriptive model as a blueprint, the developer implements a simulated version of the desired robot swarm and validates the prescriptive model developed in the previous step; fourth, the developer implements the desired robot swarm and validates the previous steps. We demonstrate property-driven design using two case studies: aggregation and foraging.
本文提出了一种基于规定性建模和模型检验的自顶向下机器人群体设计方法——属性驱动设计。传统上,机器人群是使用代码和修复方法开发的:在自下而上的迭代过程中,开发人员测试和改进机器人的个体行为,直到获得所需的集体行为。编码和修复方法是非结构化的,所获得的群集的质量完全取决于开发人员的专业知识和独创性,而开发人员在其活动中几乎没有科学或技术支持。属性驱动设计旨在提供这种科学和技术支持,与传统的非结构化方法相比具有许多优势。属性驱动设计由四个阶段组成:首先,开发者通过陈述其期望的属性来正式指定机器人群的需求;其次,开发人员创建群体的规定性模型,并使用模型检查来验证该规定性模型是否满足期望的属性;第三,以规定性模型为蓝图,开发人员实现了期望机器人群的仿真版本,并验证了上一步开发的规定性模型;第四,开发人员实现所需的机器人群并验证前面的步骤。我们使用两个案例研究来演示属性驱动设计:聚合和觅食。
{"title":"Property-Driven Design for Robot Swarms: A Design Method Based on Prescriptive Modeling and Model Checking","authors":"Manuele Brambilla, A. Brutschy, M. Dorigo, M. Birattari","doi":"10.1145/2700318","DOIUrl":"https://doi.org/10.1145/2700318","url":null,"abstract":"In this article, we present property-driven design, a novel top-down design method for robot swarms based on prescriptive modeling and model checking. Traditionally, robot swarms have been developed using a code-and-fix approach: in a bottom-up iterative process, the developer tests and improves the individual behaviors of the robots until the desired collective behavior is obtained. The code-and-fix approach is unstructured, and the quality of the obtained swarm depends completely on the expertise and ingenuity of the developer who has little scientific or technical support in his activity. Property-driven design aims at providing such scientific and technical support, with many advantages compared to the traditional unstructured approach. Property-driven design is composed of four phases: first, the developer formally specifies the requirements of the robot swarm by stating its desired properties; second, the developer creates a prescriptive model of the swarm and uses model checking to verify that this prescriptive model satisfies the desired properties; third, using the prescriptive model as a blueprint, the developer implements a simulated version of the desired robot swarm and validates the prescriptive model developed in the previous step; fourth, the developer implements the desired robot swarm and validates the previous steps. We demonstrate property-driven design using two case studies: aggregation and foraging.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88253096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 64
Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems 协同多智能体系统中面向协调的多智能体强化社会学习
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2015-01-14 DOI: 10.1145/2644819
Jianye Hao, Ho-fung Leung, Zhong Ming
Most previous works on coordination in cooperative multiagent systems study the problem of how two (or more) players can coordinate on Pareto-optimal Nash equilibrium(s) through fixed and repeated interactions in the context of cooperative games. However, in practical complex environments, the interactions between agents can be sparse, and each agent's interacting partners may change frequently and randomly. To this end, we investigate the multiagent coordination problems in cooperative environments under a social learning framework. We consider a large population of agents where each agent interacts with another agent randomly chosen from the population in each round. Each agent learns its policy through repeated interactions with the rest of the agents via social learning. It is not clear a priori if all agents can learn a consistent optimal coordination policy in such a situation. We distinguish two different types of learners depending on the amount of information each agent can perceive: individual action learner and joint action learner. The learning performance of both types of learners is evaluated under a number of challenging deterministic and stochastic cooperative games, and the influence of the information sharing degree on the learning performance also is investigated—a key difference from the learning framework involving repeated interactions among fixed agents.
大多数关于合作多智能体系统协调的研究都研究了在合作博弈的背景下,两个(或更多)参与者如何通过固定和重复的交互在帕累托最优纳什均衡上进行协调。然而,在实际的复杂环境中,智能体之间的交互可能是稀疏的,并且每个智能体的交互伙伴可能频繁且随机地变化。为此,我们研究了社会学习框架下合作环境下的多智能体协调问题。我们考虑一个大的智能体群体,其中每个智能体与每轮从群体中随机选择的另一个智能体相互作用。通过社会学习,每个智能体通过与其他智能体的重复交互来学习自己的策略。在这种情况下,是否所有智能体都能学习到一致的最优协调策略尚不清楚。我们根据每个智能体可以感知的信息量来区分两种不同类型的学习器:个体行动学习器和联合行动学习器。在一系列具有挑战性的确定性和随机合作博弈下,评估了这两种类型的学习者的学习绩效,并研究了信息共享程度对学习绩效的影响——这是与固定代理之间重复交互的学习框架的一个关键区别。
{"title":"Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems","authors":"Jianye Hao, Ho-fung Leung, Zhong Ming","doi":"10.1145/2644819","DOIUrl":"https://doi.org/10.1145/2644819","url":null,"abstract":"Most previous works on coordination in cooperative multiagent systems study the problem of how two (or more) players can coordinate on Pareto-optimal Nash equilibrium(s) through fixed and repeated interactions in the context of cooperative games. However, in practical complex environments, the interactions between agents can be sparse, and each agent's interacting partners may change frequently and randomly. To this end, we investigate the multiagent coordination problems in cooperative environments under a social learning framework. We consider a large population of agents where each agent interacts with another agent randomly chosen from the population in each round. Each agent learns its policy through repeated interactions with the rest of the agents via social learning. It is not clear a priori if all agents can learn a consistent optimal coordination policy in such a situation. We distinguish two different types of learners depending on the amount of information each agent can perceive: individual action learner and joint action learner. The learning performance of both types of learners is evaluated under a number of challenging deterministic and stochastic cooperative games, and the influence of the information sharing degree on the learning performance also is investigated—a key difference from the learning framework involving repeated interactions among fixed agents.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2015-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86153088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Multi-Cloud Provisioning and Load Distribution for Three-Tier Applications 面向三层应用的多云配置和负载分配
IF 2.7 4区 计算机科学 Q2 Computer Science Pub Date : 2014-10-07 DOI: 10.1145/2662112
N. Grozev, R. Buyya
Cloud data centers are becoming the preferred deployment environment for a wide range of business applications because they provide many benefits compared to private in-house infrastructure. However, the traditional approach of using a single cloud has several limitations in terms of availability, avoiding vendor lock-in, and providing legislation-compliant services with suitable Quality of Experience (QoE) to users worldwide. One way for cloud clients to mitigate these issues is to use multiple clouds (i.e., a Multi-Cloud). In this article, we introduce an approach for deploying three-tier applications across multiple clouds in order to satisfy their key nonfunctional requirements. We propose adaptive, dynamic, and reactive resource provisioning and load distribution algorithms that heuristically optimize overall cost and response delays without violating essential legislative and regulatory requirements. Our simulation with realistic workload, network, and cloud characteristics shows that our method improves the state of the art in terms of availability, regulatory compliance, and QoE with acceptable sacrifice in cost and latency.
云数据中心正在成为各种业务应用程序的首选部署环境,因为与私有内部基础设施相比,云数据中心提供了许多好处。然而,使用单一云的传统方法在可用性、避免供应商锁定以及向全球用户提供具有适当体验质量(QoE)的符合法规的服务方面存在一些限制。云客户端缓解这些问题的一种方法是使用多个云(即多云)。在本文中,我们将介绍一种跨多个云部署三层应用程序的方法,以满足其关键的非功能需求。我们提出自适应、动态和被动的资源配置和负载分配算法,在不违反基本立法和监管要求的情况下,启发式地优化总体成本和响应延迟。我们对实际工作负载、网络和云特征进行的模拟表明,我们的方法在可用性、法规遵从性和QoE方面提高了技术水平,并在成本和延迟方面做出了可接受的牺牲。
{"title":"Multi-Cloud Provisioning and Load Distribution for Three-Tier Applications","authors":"N. Grozev, R. Buyya","doi":"10.1145/2662112","DOIUrl":"https://doi.org/10.1145/2662112","url":null,"abstract":"Cloud data centers are becoming the preferred deployment environment for a wide range of business applications because they provide many benefits compared to private in-house infrastructure. However, the traditional approach of using a single cloud has several limitations in terms of availability, avoiding vendor lock-in, and providing legislation-compliant services with suitable Quality of Experience (QoE) to users worldwide. One way for cloud clients to mitigate these issues is to use multiple clouds (i.e., a Multi-Cloud). In this article, we introduce an approach for deploying three-tier applications across multiple clouds in order to satisfy their key nonfunctional requirements. We propose adaptive, dynamic, and reactive resource provisioning and load distribution algorithms that heuristically optimize overall cost and response delays without violating essential legislative and regulatory requirements. Our simulation with realistic workload, network, and cloud characteristics shows that our method improves the state of the art in terms of availability, regulatory compliance, and QoE with acceptable sacrifice in cost and latency.","PeriodicalId":50919,"journal":{"name":"ACM Transactions on Autonomous and Adaptive Systems","volume":null,"pages":null},"PeriodicalIF":2.7,"publicationDate":"2014-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82342934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
期刊
ACM Transactions on Autonomous and Adaptive Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1