ACM Transactions on Autonomous and Adaptive Systems: Latest Articles (Page 8)

Reinforcement Learning of Informed Initial Policies for Decentralized Planning

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2015-01-14 DOI: 10.1145/2668130

Landon Kraemer, Bikramjit Banerjee

Decentralized partially observable Markov decision processes (Dec-POMDPs) offer a formal model for planning in cooperative multiagent systems where agents operate with noisy sensors and actuators, as well as local information. Prevalent solution techniques are centralized and model based—limitations that we address by distributed reinforcement learning (RL). We particularly favor alternate learning, where agents alternately learn best responses to each other, which appears to outperform concurrent RL. However, alternate learning requires an initial policy. We propose two principled approaches to generating informed initial policies: a naive approach that lays the foundation for a more sophisticated approach. We empirically demonstrate that the refined approach produces near-optimal solutions in many challenging benchmark settings, staking a claim to being an efficient (and realistic) approximate solver in its own right. Furthermore, alternate best response learning seeded with such policies quickly learns high-quality policies as well.
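
To illustrate why alternate learning needs a good initial policy, here is a minimal sketch. It is not the authors' algorithm: it assumes a stateless, hypothetical 3x3 cooperative game (`PAYOFF`) instead of a full Dec-POMDP, and a simple epsilon-greedy value learner. The agents take turns learning a best response from noisy reward samples while the other agent's action is held fixed.

```python
import random

# Hypothetical shared-reward game (an assumption for illustration only).
# PAYOFF[a0][a1] is the team reward when agent 0 plays a0 and agent 1 plays a1.
PAYOFF = [
    [5.0, 0.0, 0.0],
    [0.0, 8.0, 0.0],
    [0.0, 0.0, 10.0],
]


def sample_reward(a0, a1, rng):
    """Noisy shared reward; learners only see samples, never PAYOFF itself."""
    return PAYOFF[a0][a1] + rng.gauss(0.0, 0.5)


def learn_best_response(learner, other_action, rng,
                        episodes=300, alpha=0.1, epsilon=0.2):
    """Epsilon-greedy value learning for one agent while the other is fixed."""
    n = len(PAYOFF)
    q = [0.0] * n
    for _ in range(episodes):
        if rng.random() < epsilon:
            a = rng.randrange(n)                      # explore
        else:
            a = max(range(n), key=lambda i: q[i])     # exploit
        joint = (a, other_action) if learner == 0 else (other_action, a)
        r = sample_reward(*joint, rng)
        q[a] += alpha * (r - q[a])                    # incremental value update
    return max(range(n), key=lambda i: q[i])


def alternate_learning(initial_policy, rounds=4, seed=0):
    """Agents alternately learn best responses, starting from initial_policy."""
    rng = random.Random(seed)
    policy = list(initial_policy)
    for k in range(rounds):
        learner = k % 2
        policy[learner] = learn_best_response(learner, policy[1 - learner], rng)
    return tuple(policy)
```

In this toy setting, starting from `(0, 0)` leaves both agents at the low-reward joint action, whereas starting from `(0, 2)` lets them reach the best joint action `(2, 2)`. This is the kind of sensitivity to the seed policy that motivates generating informed initial policies.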

Citations: 7

Distributed Data-Centric Adaptive Sampling for Cyber-Physical Systems

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2015-01-14 DOI: 10.1145/2644820

Eun Kyung Lee, H. Viswanathan, D. Pompili

A data-centric joint adaptive sampling and sleep scheduling solution, SILENCE, for autonomic sensor-based systems that monitor and reconstruct physical or environmental phenomena is proposed. Adaptive sampling and sleep scheduling can help realize the much needed resource efficiency by minimizing the communication and processing overhead in densely deployed autonomic sensor-based systems. The proposed solution exploits the spatiotemporal correlation in sensed data and eliminates redundancy in transmitted data through selective representation without compromising on accuracy of reconstruction of the monitored phenomenon at a remote monitor node. Differently from existing adaptive sampling solutions, SILENCE employs temporal causality analysis to not only track the variation in the underlying phenomenon but also its cause and direction of propagation in the field. The causality analysis and the same correlations are then leveraged for adaptive sleep scheduling aimed at saving energy in wireless sensor networks (WSNs). SILENCE outperforms traditional adaptive sampling solutions as well as the recently proposed compressive sampling techniques. Real experiments were performed on a WSN testbed monitoring temperature and humidity distribution in a rack of servers, and the simulations were performed on TOSSIM, the TinyOS simulator.

Citations: 6

Property-Driven Design for Robot Swarms: A Design Method Based on Prescriptive Modeling and Model Checking

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2015-01-14 DOI: 10.1145/2700318

Manuele Brambilla, A. Brutschy, M. Dorigo, M. Birattari

In this article, we present property-driven design, a novel top-down design method for robot swarms based on prescriptive modeling and model checking. Traditionally, robot swarms have been developed using a code-and-fix approach: in a bottom-up iterative process, the developer tests and improves the individual behaviors of the robots until the desired collective behavior is obtained. The code-and-fix approach is unstructured, and the quality of the obtained swarm depends completely on the expertise and ingenuity of the developer who has little scientific or technical support in his activity. Property-driven design aims at providing such scientific and technical support, with many advantages compared to the traditional unstructured approach. Property-driven design is composed of four phases: first, the developer formally specifies the requirements of the robot swarm by stating its desired properties; second, the developer creates a prescriptive model of the swarm and uses model checking to verify that this prescriptive model satisfies the desired properties; third, using the prescriptive model as a blueprint, the developer implements a simulated version of the desired robot swarm and validates the prescriptive model developed in the previous step; fourth, the developer implements the desired robot swarm and validates the previous steps. We demonstrate property-driven design using two case studies: aggregation and foraging.

Citations: 64

Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2015-01-14 DOI: 10.1145/2644819

Jianye Hao, Ho-fung Leung, Zhong Ming

Most previous works on coordination in cooperative multiagent systems study the problem of how two (or more) players can coordinate on Pareto-optimal Nash equilibrium(s) through fixed and repeated interactions in the context of cooperative games. However, in practical complex environments, the interactions between agents can be sparse, and each agent's interacting partners may change frequently and randomly. To this end, we investigate the multiagent coordination problems in cooperative environments under a social learning framework. We consider a large population of agents where each agent interacts with another agent randomly chosen from the population in each round. Each agent learns its policy through repeated interactions with the rest of the agents via social learning. It is not clear a priori if all agents can learn a consistent optimal coordination policy in such a situation. We distinguish two different types of learners depending on the amount of information each agent can perceive: individual action learner and joint action learner. The learning performance of both types of learners is evaluated under a number of challenging deterministic and stochastic cooperative games, and the influence of the information sharing degree on the learning performance also is investigated—a key difference from the learning framework involving repeated interactions among fixed agents.

Citations: 21

Multi-Cloud Provisioning and Load Distribution for Three-Tier Applications

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-10-07 DOI: 10.1145/2662112

N. Grozev, R. Buyya

Cloud data centers are becoming the preferred deployment environment for a wide range of business applications because they provide many benefits compared to private in-house infrastructure. However, the traditional approach of using a single cloud has several limitations in terms of availability, avoiding vendor lock-in, and providing legislation-compliant services with suitable Quality of Experience (QoE) to users worldwide. One way for cloud clients to mitigate these issues is to use multiple clouds (i.e., a Multi-Cloud). In this article, we introduce an approach for deploying three-tier applications across multiple clouds in order to satisfy their key nonfunctional requirements. We propose adaptive, dynamic, and reactive resource provisioning and load distribution algorithms that heuristically optimize overall cost and response delays without violating essential legislative and regulatory requirements. Our simulation with realistic workload, network, and cloud characteristics shows that our method improves the state of the art in terms of availability, regulatory compliance, and QoE with acceptable sacrifice in cost and latency.

Citations: 80

Distributive Justice for Self-Organised Common-Pool Resource Management

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-10-07 DOI: 10.1145/2629567

J. Pitt, D. Busquets, S. Macbeth

In this article, we complement Elinor Ostrom’s institutional design principles for enduring common-pool resource management with Nicholas Rescher’s theory of distributive justice based on the canon of legitimate claims. Two of Ostrom’s principles are that the resource allocation method should be congruent with the local environment, and that those affected by the allocation method (the appropriators) should participate in its selection. However, these principles do not say anything explicitly about the fairness of the allocation method or the outcomes it produces: for this, we need a mechanism for distributive justice. Rescher identified a number of different mechanisms, each of which had both its merits and demerits, and instead maintained that distributive justice consisted in identifying the legitimate claims in context, accommodating multiple claims in case of plurality, and reconciling them in case of conflict. Accordingly, we specify a logical axiomatisation of the principles with the canon of legitimate claims, whereby a set of claims is each represented as a voting function, which collectively determine the rank order in which resources are allocated. The appropriators vote on the weight attached to the scoring functions, and so self-organise the allocation method, taking into account both the plurality of and conflict between the claims. Therefore, the appropriators exercise collective choice over the method, and the method itself is congruent with the local environment, taking into account both the resources available and the relative claims of the appropriators. 
Experiments with a variant of the linear public good game show that this pluralistic self-organising approach produces a better balance of utility and fairness (for agents that comply with the rules of the game) compared to monistic or fixed approaches, provides “fairness over time” (a series of ostensibly unfair individual allocations is revealed to be cumulatively fair), and offers an intuition of how to resolve the free-rider phenomenon in provision and appropriation of common-pool resources.
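
The idea of claims acting as voting functions whose weights are chosen by the appropriators can be sketched as follows. This is an illustrative assumption, not the article's axiomatisation: the claim names (`need`, `equality`), the Borda-style scoring, and the example numbers are all hypothetical.

```python
def borda_points(values, higher_is_better=True):
    """Turn per-agent values into Borda points: best agent gets n-1, worst gets 0."""
    order = sorted(values, key=values.get, reverse=higher_is_better)
    n = len(order)
    return {agent: n - 1 - pos for pos, agent in enumerate(order)}


def rank_agents(claims, weights):
    """Combine the per-claim points with the voted weights into one rank order."""
    totals = {}
    for name, points in claims.items():
        for agent, p in points.items():
            totals[agent] = totals.get(agent, 0.0) + weights[name] * p
    # Highest weighted score first; ties are broken by agent name.
    return sorted(totals, key=lambda a: (-totals[a], a))


# Hypothetical data: current demand and resources received so far.
demand = {"a": 5, "b": 2, "c": 8}
received = {"a": 9, "b": 1, "c": 4}

claims = {
    "need": borda_points(demand),                               # larger demand ranks higher
    "equality": borda_points(received, higher_is_better=False), # less received ranks higher
}

need_heavy = rank_agents(claims, {"need": 0.7, "equality": 0.3})
equality_heavy = rank_agents(claims, {"need": 0.2, "equality": 0.8})
```

With these made-up numbers, weighting `need` heavily serves agent `c` first, while weighting `equality` heavily serves agent `b` first. Changing the voted weights is how, in this sketch, the group self-organises the allocation method.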

Citations: 45

An Intelligent Agent for Bilateral Negotiation with Unknown Opponents in Continuous-Time Domains

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-10-07 DOI: 10.1145/2629577

Siqi Chen, Gerhard Weiss

Automated negotiation among self-interested autonomous agents has gained tremendous attention due to its broad range of potential real-world applications. This article deals with a prominent type of such negotiations, namely, multi-issue negotiation that runs under continuous-time constraints and in which the negotiating agents have no prior knowledge about their opponents’ preferences and strategies. A negotiation strategy called Dragon is described that employs sparse pseudoinput Gaussian processes. Specifically, Dragon enables an agent (1) to precisely model the behavior of its opponents with comparably low computational load and (2) to make decisions effectively and adaptively in very complex negotiation settings. Extensive experimental results, based on a number of negotiation scenarios and state-of-the-art negotiating agents from Automated Negotiating Agents Competitions, are provided. Moreover, the robustness of our strategy is evaluated through both empirical game-theoretic and spatial evolutionary game-theoretic analysis.

Citations: 23

Utility-Based Mechanism for Structural Self-Organization in Service-Oriented MAS

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-10-07 DOI: 10.1145/2651423

E. Noguera, M. Rebollo, Matteo Vasirani, Alberto Fernández

Structural relations established among agents influence the performance of the decentralized service discovery process in multiagent systems. Moreover, distributed systems should be able to adapt their structural relations to changes in environmental conditions. In this article, we present a service-oriented multiagent system, where agents initially self-organize their structural relations based on the similarity of their services. During the service discovery process, agents integrate a mechanism that facilitates the self-organization of their structural relations to adapt the structure of the system to the service demand. This mechanism facilitates the task of decentralized service discovery and improves its performance. Each agent has local knowledge about its direct neighbors and the queries received during discovery processes. With this information, an agent is able to analyze its structural relations and decide when it is more appropriate to modify its direct neighbors and select the most suitable acquaintances to replace them. The experimental evaluation shows how this self-organization mechanism improves the overall performance of the service discovery process in the system when the service demand changes.

Citations: 8

The Complexity of Adding Multitolerance

IF 2.7, Zone 4 (Computer Science), Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-10-07 DOI: 10.1145/2629664

Jingshu Chen, Ali Ebnenasir, S. Kulkarni

We focus on the problem of adding multitolerance to an existing fault-intolerant program. A multitolerant program tolerates multiple classes of faults and provides a potentially different level of fault tolerance to each of them. We consider three levels of fault tolerance, namely failsafe (i.e., satisfy safety in the presence of faults), nonmasking (i.e., recover to legitimate states after the occurrence of faults), and masking (both). For the case where the program is subject to two classes of faults, we consider six categories of multitolerant programs—FF, FN, FM, MM, MN, and NN, where F, N, and M represent failsafe, nonmasking, and masking levels of tolerance provided to each class of fault. We show that the problem of adding FF, NN, and MN multitolerance can be solved in polynomial time (in the state space of the program). However, the problem is NP-complete for adding FN, MM, and FM multitolerance. We note that the hardness of adding MM and FM multitolerance is especially atypical given that MM and FM multitolerance can be added efficiently under more restricted scenarios where multiple faults occur simultaneously in the same computation. We also present heuristics for managing the complexity of MM multitolerance. Finally, we present real-world multitolerant programs and discuss the trade-off involved in design decisions while developing such programs.
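
The six categories follow from choosing an unordered pair of tolerance levels, one per fault class. The short sketch below enumerates them and records the complexity results exactly as stated in the abstract; the function and variable names are illustrative assumptions, not notation from the article.

```python
from itertools import combinations_with_replacement

# Tolerance levels as named in the abstract.
LEVELS = {"F": "failsafe", "M": "masking", "N": "nonmasking"}

# Unordered pairs of levels for two fault classes: FF, FM, FN, MM, MN, NN.
CATEGORIES = ["".join(pair) for pair in combinations_with_replacement("FMN", 2)]

# Complexity of adding each kind of multitolerance, as reported in the abstract
# (polynomial in the size of the program's state space).
POLYNOMIAL = {"FF", "NN", "MN"}
NP_COMPLETE = {"FN", "MM", "FM"}


def complexity(category):
    """Look up the reported complexity; letter order does not matter."""
    key = "".join(sorted(category.upper()))
    if key in POLYNOMIAL:
        return "polynomial"
    if key in NP_COMPLETE:
        return "NP-complete"
    raise ValueError(f"unknown category: {category!r}")


summary = {c: complexity(c) for c in CATEGORIES}
```

Here `summary` maps each of the six categories to its reported complexity, which makes the split into three tractable and three NP-complete cases easy to see.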

Citations: 2

Improving the Effectiveness of Testing Pervasive Software via Context Diversity

IF 2.7 Zone 4 Computer Science Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ACM Transactions on Autonomous and Adaptive Systems

Pub Date : 2014-07-01 DOI: 10.1145/2620000

Huai Wang, W. Chan, T. Tse

Context-aware pervasive software is responsive to various contexts and their changes. A faulty implementation of the context-aware features may lead to unpredictable behavior with adverse effects. In software testing, one of the most important research issues is to determine the sufficiency of a test suite to verify the software under test. Existing adequacy criteria for testing traditional software, however, have not explored the dimension of serial test inputs and have not considered context changes when constructing test suites. In this article, we define the concept of context diversity to capture the extent of context changes in serial inputs and propose three strategies to study how context diversity may improve the effectiveness of the data-flow testing criteria. Our case study shows that the strategy that uses test cases with higher context diversity can significantly improve the effectiveness of existing data-flow testing criteria for context-aware pervasive software. In addition, test suites with higher context diversity are found to execute significantly longer paths, which may provide a clue that reveals why context diversity can contribute to the improvement of effectiveness of test suites.
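The abstract does not state how context diversity is computed. As a minimal sketch, assume a test input is a sequence of context instances with the same attributes, and measure diversity as the sum of Hamming distances between consecutive instances. The names `hamming` and `context_diversity`, the tuple representation, and the sample data are all illustrative assumptions, not taken from the article.

```python
from typing import Sequence, Tuple

# Assumed representation: one context instance is a fixed-length tuple of attribute values.
Context = Tuple[str, ...]


def hamming(a: Context, b: Context) -> int:
    """Count the attribute positions at which two context instances differ."""
    if len(a) != len(b):
        raise ValueError("context instances must have the same number of attributes")
    return sum(x != y for x, y in zip(a, b))


def context_diversity(stream: Sequence[Context]) -> int:
    """Sum the Hamming distances between consecutive context instances (assumed measure)."""
    return sum(hamming(prev, cur) for prev, cur in zip(stream, stream[1:]))


# Hypothetical serial test inputs: (location, signal strength) readings over time.
steady = [("lobby", "strong"), ("lobby", "strong"), ("lobby", "weak")]
varied = [("lobby", "strong"), ("lift", "weak"), ("office", "strong")]

print(context_diversity(steady))  # 1
print(context_diversity(varied))  # 4
```

Under this assumed measure, a strategy that favors higher context diversity would pick `varied` over `steady` when both satisfy the same data-flow coverage requirement.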



Citations: 22