首页 > 最新文献

arXiv - CS - Multiagent Systems最新文献

英文 中文
Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts 基于多代理的分散式电动汽车充电策略及其影响模拟
Pub Date : 2024-08-20 DOI: arxiv-2408.10790
Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma
The growing shift towards a Smart Grid involves integrating numerous newdigital energy solutions into the energy ecosystems to address problems arisingfrom the transition to carbon neutrality, particularly in linking theelectricity and transportation sectors. Yet, this shift brings challenges dueto mass electric vehicle adoption and the lack of methods to adequately assessvarious EV charging algorithms and their ecosystem impacts. This paperintroduces a multi-agent based simulation model, validated through a case studyof a Danish radial distribution network serving 126 households. The studyreveals that traditional charging leads to grid overload by 2031 at 67% EVpenetration, while decentralized strategies like Real-Time Pricing could causeoverloads as early as 2028. The developed multi-agent based simulationdemonstrates its ability to offer detailed, hourly analysis of future loadprofiles in distribution grids, and therefore, can be applied to otherprospective scenarios in similar energy systems.
向智能电网的不断转变涉及将众多新型数字能源解决方案整合到能源生态系统中,以解决向碳中和过渡过程中出现的问题,特别是在连接电力和交通部门方面。然而,由于电动汽车的大规模应用以及缺乏充分评估各种电动汽车充电算法及其对生态系统影响的方法,这种转变带来了挑战。本文介绍了一种基于多代理的仿真模型,并通过对丹麦一个服务于 126 户家庭的放射状配电网络的案例研究进行了验证。研究结果表明,在电动汽车普及率达到 67% 的情况下,传统充电方式会导致电网在 2031 年出现过载,而像实时定价这样的分散式策略则可能在 2028 年就会造成过载。所开发的基于多代理的仿真证明了其对配电网未来负荷状况进行详细的每小时分析的能力,因此可应用于类似能源系统中的其他前瞻性情景。
{"title":"Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts","authors":"Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma","doi":"arxiv-2408.10790","DOIUrl":"https://doi.org/arxiv-2408.10790","url":null,"abstract":"The growing shift towards a Smart Grid involves integrating numerous new\u0000digital energy solutions into the energy ecosystems to address problems arising\u0000from the transition to carbon neutrality, particularly in linking the\u0000electricity and transportation sectors. Yet, this shift brings challenges due\u0000to mass electric vehicle adoption and the lack of methods to adequately assess\u0000various EV charging algorithms and their ecosystem impacts. This paper\u0000introduces a multi-agent based simulation model, validated through a case study\u0000of a Danish radial distribution network serving 126 households. The study\u0000reveals that traditional charging leads to grid overload by 2031 at 67% EV\u0000penetration, while decentralized strategies like Real-Time Pricing could cause\u0000overloads as early as 2028. The developed multi-agent based simulation\u0000demonstrates its ability to offer detailed, hourly analysis of future load\u0000profiles in distribution grids, and therefore, can be applied to other\u0000prospective scenarios in similar energy systems.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Athena: Safe Autonomous Agents with Verbal Contrastive Learning 雅典娜:具有语言对比学习能力的安全自主机器人
Pub Date : 2024-08-20 DOI: arxiv-2408.11021
Tanmana Sadhu, Ali Pesaranghader, Yanan Chen, Dong Hoon Yi
Due to emergent capabilities, large language models (LLMs) have been utilizedas language-based agents to perform a variety of tasks and make decisions withan increasing degree of autonomy. These autonomous agents can understandhigh-level instructions, interact with their environments, and execute complextasks using a selection of tools available to them. As the capabilities of theagents expand, ensuring their safety and trustworthiness becomes moreimperative. In this study, we introduce the Athena framework which leveragesthe concept of verbal contrastive learning where past safe and unsafetrajectories are used as in-context (contrastive) examples to guide the agenttowards safety while fulfilling a given task. The framework also incorporates acritiquing mechanism to guide the agent to prevent risky actions at every step.Furthermore, due to the lack of existing benchmarks on the safety reasoningability of LLM-based agents, we curate a set of 80 toolkits across 8 categorieswith 180 scenarios to provide a safety evaluation benchmark. Our experimentalevaluation, with both closed- and open-source LLMs, indicates verbalcontrastive learning and interaction-level critiquing improve the safety ratesignificantly.
由于大型语言模型(LLMs)具有新出现的能力,因此已被用作基于语言的代理来执行各种任务,并以越来越高的自主程度做出决策。这些自主代理可以理解高级指令,与环境交互,并使用可供选择的工具执行完整的任务。随着代理能力的扩展,确保其安全性和可信度变得更加重要。在本研究中,我们引入了雅典娜框架,该框架利用了言语对比学习的概念,将过去安全和不安全的轨迹作为情境(对比)示例,引导代理在完成给定任务时注意安全。此外,由于缺乏关于基于 LLM 的代理安全推理能力的现有基准,我们收集了一套横跨 8 个类别、包含 180 个场景的 80 个工具包,以提供安全评估基准。我们使用封闭式和开源 LLM 进行了实验评估,结果表明,口头对比学习和交互级批评显著提高了安全率。
{"title":"Athena: Safe Autonomous Agents with Verbal Contrastive Learning","authors":"Tanmana Sadhu, Ali Pesaranghader, Yanan Chen, Dong Hoon Yi","doi":"arxiv-2408.11021","DOIUrl":"https://doi.org/arxiv-2408.11021","url":null,"abstract":"Due to emergent capabilities, large language models (LLMs) have been utilized\u0000as language-based agents to perform a variety of tasks and make decisions with\u0000an increasing degree of autonomy. These autonomous agents can understand\u0000high-level instructions, interact with their environments, and execute complex\u0000tasks using a selection of tools available to them. As the capabilities of the\u0000agents expand, ensuring their safety and trustworthiness becomes more\u0000imperative. In this study, we introduce the Athena framework which leverages\u0000the concept of verbal contrastive learning where past safe and unsafe\u0000trajectories are used as in-context (contrastive) examples to guide the agent\u0000towards safety while fulfilling a given task. The framework also incorporates a\u0000critiquing mechanism to guide the agent to prevent risky actions at every step.\u0000Furthermore, due to the lack of existing benchmarks on the safety reasoning\u0000ability of LLM-based agents, we curate a set of 80 toolkits across 8 categories\u0000with 180 scenarios to provide a safety evaluation benchmark. Our experimental\u0000evaluation, with both closed- and open-source LLMs, indicates verbal\u0000contrastive learning and interaction-level critiquing improve the safety rate\u0000significantly.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem 基于多代理的模拟研究集中充电策略及其对电动汽车家庭充电生态系统的影响
Pub Date : 2024-08-20 DOI: arxiv-2408.10773
Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma
This paper addresses the critical integration of electric vehicles (EVs) intothe electricity grid, which is essential for achieving carbon neutrality by2050. The rapid increase in EV adoption poses significant challenges to theexisting grid infrastructure, particularly in managing the increasingelectricity demand and mitigating the risk of grid overloads. Centralized EVcharging strategies are investigated due to their potential to optimize gridstability and efficiency, compared to decentralized approaches that mayexacerbate grid stress. Utilizing a multi-agent based simulation model, thestudy provides a realistic representation of the electric vehicle home chargingecosystem in a case study of Strib, Denmark. The findings show that theEarliest-deadline-first and Round Robin perform best with 100% EV adoption interms of EV user satisfaction. The simulation considers a realistic adoptioncurve, EV charging strategies, EV models, and driving patterns to capture thefull ecosystem dynamics over a long-term period with high resolution (hourly).Additionally, the study offers detailed load profiles for future distributiongrids, demonstrating how centralized charging strategies can efficiently managegrid loads and prevent overloads.
本文探讨了将电动汽车(EV)并入电网的关键问题,这对于到 2050 年实现碳中和至关重要。电动汽车应用的快速增长对现有电网基础设施提出了巨大挑战,尤其是在管理日益增长的电力需求和降低电网过载风险方面。与可能加剧电网压力的分散式方法相比,集中式电动汽车充电策略具有优化电网稳定性和效率的潜力,因此对其进行了研究。该研究利用基于多代理的仿真模型,在丹麦斯特里布的案例研究中真实再现了电动汽车家庭充电生态系统。研究结果表明,在电动汽车用户满意度方面,"最早截止时间优先 "和 "循环罗宾 "在电动汽车100%采用的情况下表现最佳。该模拟考虑了现实的采用曲线、电动汽车充电策略、电动汽车模型和驾驶模式,以高分辨率(每小时)捕捉长期的完整生态系统动态。
{"title":"Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem","authors":"Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma","doi":"arxiv-2408.10773","DOIUrl":"https://doi.org/arxiv-2408.10773","url":null,"abstract":"This paper addresses the critical integration of electric vehicles (EVs) into\u0000the electricity grid, which is essential for achieving carbon neutrality by\u00002050. The rapid increase in EV adoption poses significant challenges to the\u0000existing grid infrastructure, particularly in managing the increasing\u0000electricity demand and mitigating the risk of grid overloads. Centralized EV\u0000charging strategies are investigated due to their potential to optimize grid\u0000stability and efficiency, compared to decentralized approaches that may\u0000exacerbate grid stress. Utilizing a multi-agent based simulation model, the\u0000study provides a realistic representation of the electric vehicle home charging\u0000ecosystem in a case study of Strib, Denmark. The findings show that the\u0000Earliest-deadline-first and Round Robin perform best with 100% EV adoption in\u0000terms of EV user satisfaction. The simulation considers a realistic adoption\u0000curve, EV charging strategies, EV models, and driving patterns to capture the\u0000full ecosystem dynamics over a long-term period with high resolution (hourly).\u0000Additionally, the study offers detailed load profiles for future distribution\u0000grids, demonstrating how centralized charging strategies can efficiently manage\u0000grid loads and prevent overloads.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"113 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction DBHP:利用基于派生的混合预测在多代理运动中进行轨迹推断
Pub Date : 2024-08-20 DOI: arxiv-2408.10878
Hanjun Choi, Hyunsung Kim, Minho Lee, Chang-Jo Kim, Jinsung Yoon, Sang-Ki Ko
Many spatiotemporal domains handle multi-agent trajectory data, but inreal-world scenarios, collected trajectory data are often partially missing dueto various reasons. While existing approaches demonstrate good performance intrajectory imputation, they face challenges in capturing the complex dynamicsand interactions between agents due to a lack of physical constraints thatgovern realistic trajectories, leading to suboptimal results. To address thisissue, the paper proposes a Derivative-Based Hybrid Prediction (DBHP) frameworkthat can effectively impute multiple agents' missing trajectories. First, aneural network equipped with Set Transformers produces a naive prediction ofmissing trajectories while satisfying the permutation-equivariance in terms ofthe order of input agents. Then, the framework makes alternative predictionsleveraging velocity and acceleration information and combines all thepredictions with properly determined weights to provide final imputedtrajectories. In this way, our proposed framework not only accurately predictsposition, velocity, and acceleration values but also enforces the physicalrelationship between them, eventually improving both the accuracy andnaturalness of the predicted trajectories. Accordingly, the experiment resultsabout imputing player trajectories in team sports show that our frameworksignificantly outperforms existing imputation baselines.
许多时空领域都要处理多代理轨迹数据,但在现实世界的场景中,由于各种原因,收集到的轨迹数据往往部分缺失。虽然现有方法在轨迹推算方面表现出良好的性能,但由于缺乏管理现实轨迹的物理约束,这些方法在捕捉复杂动态和代理之间的交互方面面临挑战,从而导致次优结果。为了解决这个问题,本文提出了一种基于衍生的混合预测(DBHP)框架,可以有效地估算多个代理的缺失轨迹。首先,配备了集合变换器的神经网络会对缺失轨迹进行天真预测,同时满足输入代理顺序方面的置换-方差。然后,该框架利用速度和加速度信息进行替代预测,并将所有预测与适当确定的权重相结合,以提供最终的估算轨迹。这样,我们提出的框架不仅能准确预测位置、速度和加速度值,还能加强它们之间的物理关系,最终提高预测轨迹的准确性和自然度。因此,有关团队运动中球员轨迹归因的实验结果表明,我们的框架明显优于现有的归因基线。
{"title":"DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction","authors":"Hanjun Choi, Hyunsung Kim, Minho Lee, Chang-Jo Kim, Jinsung Yoon, Sang-Ki Ko","doi":"arxiv-2408.10878","DOIUrl":"https://doi.org/arxiv-2408.10878","url":null,"abstract":"Many spatiotemporal domains handle multi-agent trajectory data, but in\u0000real-world scenarios, collected trajectory data are often partially missing due\u0000to various reasons. While existing approaches demonstrate good performance in\u0000trajectory imputation, they face challenges in capturing the complex dynamics\u0000and interactions between agents due to a lack of physical constraints that\u0000govern realistic trajectories, leading to suboptimal results. To address this\u0000issue, the paper proposes a Derivative-Based Hybrid Prediction (DBHP) framework\u0000that can effectively impute multiple agents' missing trajectories. First, a\u0000neural network equipped with Set Transformers produces a naive prediction of\u0000missing trajectories while satisfying the permutation-equivariance in terms of\u0000the order of input agents. Then, the framework makes alternative predictions\u0000leveraging velocity and acceleration information and combines all the\u0000predictions with properly determined weights to provide final imputed\u0000trajectories. In this way, our proposed framework not only accurately predicts\u0000position, velocity, and acceleration values but also enforces the physical\u0000relationship between them, eventually improving both the accuracy and\u0000naturalness of the predicted trajectories. Accordingly, the experiment results\u0000about imputing player trajectories in team sports show that our framework\u0000significantly outperforms existing imputation baselines.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network 基于多代理的建模,用于调查从电解槽生产到区域供热网络的多余热量利用情况
Pub Date : 2024-08-20 DOI: arxiv-2408.10783
Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma
Power-to-Hydrogen is crucial for the renewable energy transition, yetexisting literature lacks business models for the significant excess heat itgenerates. This study addresses this by evaluating three models for sellingelectrolyzer-generated heat to district heating grids: constant, flexible, andrenewable-source hydrogen production, with and without heat sales. Usingagent-based modeling and multi-criteria decision-making methods (VIKOR, TOPSIS,PROMETHEE), it finds that selling excess heat can cut hydrogen production costsby 5.6%. The optimal model operates flexibly with electricity spot prices,includes heat sales, and maintains a hydrogen price of 3.3 EUR/kg.Environmentally, hydrogen production from grid electricity could emit up to13,783.8 tons of CO2 over four years from 2023. The best economic andenvironmental model uses renewable sources and sells heat at 3.5 EUR/kg
电制氢对于可再生能源转型至关重要,但现有文献缺乏针对其产生的大量过剩热量的商业模式。为了解决这一问题,本研究评估了向区域供热电网出售电解槽产生的热量的三种模式:恒定、灵活和可再生能源制氢,有无热量销售。通过使用基于代理的建模和多标准决策方法(VIKOR、TOPSIS、PROMETHEE),研究发现出售多余热量可将制氢成本降低 5.6%。在环境方面,从 2023 年起的四年内,利用电网电力生产氢气可排放 13783.8 吨二氧化碳。最佳经济和环境模式使用可再生能源,并以 3.5 欧元/千克的价格出售热量。
{"title":"Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network","authors":"Kristoffer Christensen, Bo Nørregaard Jørgensen, Zheng Grace Ma","doi":"arxiv-2408.10783","DOIUrl":"https://doi.org/arxiv-2408.10783","url":null,"abstract":"Power-to-Hydrogen is crucial for the renewable energy transition, yet\u0000existing literature lacks business models for the significant excess heat it\u0000generates. This study addresses this by evaluating three models for selling\u0000electrolyzer-generated heat to district heating grids: constant, flexible, and\u0000renewable-source hydrogen production, with and without heat sales. Using\u0000agent-based modeling and multi-criteria decision-making methods (VIKOR, TOPSIS,\u0000PROMETHEE), it finds that selling excess heat can cut hydrogen production costs\u0000by 5.6%. The optimal model operates flexibly with electricity spot prices,\u0000includes heat sales, and maintains a hydrogen price of 3.3 EUR/kg.\u0000Environmentally, hydrogen production from grid electricity could emit up to\u000013,783.8 tons of CO2 over four years from 2023. The best economic and\u0000environmental model uses renewable sources and sells heat at 3.5 EUR/kg","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing the Impact of Electric Vehicles on Local Energy Systems using Digital Twins 利用数字孪生分析电动汽车对当地能源系统的影响
Pub Date : 2024-08-20 DOI: arxiv-2408.10763
Daniel René Bayer, Marco Pruckner
The electrification of the transportation and heating sector, the so-calledsector coupling, is one of the core elements to achieve independence fromfossil fuels. As it highly affects the electricity demand, especially on thelocal level, the integrated modeling and simulation of all sectors is apromising approach for analyzing design decisions or complex controlstrategies. This paper analyzes the increase in electricity demand resultingfrom sector coupling, mainly due to integrating electric vehicles into urbanenergy systems. Therefore, we utilize a digital twin of an existing localenergy system and extend it with a mobility simulation model to evaluate theimpact of electric vehicles on the distribution grid level. Our findingsindicate a significant rise in annual electricity consumption attributed toelectric vehicles, with home charging alone resulting in a 78% increase.However, we demonstrate that integrating photovoltaic and battery energystorage systems can effectively mitigate this rise.
交通和供热部门的电气化,即所谓的部门耦合,是实现独立于化石燃料的核心要素之一。由于它高度影响电力需求,尤其是地方层面的电力需求,因此对所有部门进行综合建模和仿真是分析设计决策或复杂控制策略的有效方法。本文分析了部门耦合导致的电力需求增长,这主要是由于将电动汽车纳入城市能源系统。因此,我们利用现有本地能源系统的数字孪生系统,并通过移动性仿真模型对其进行扩展,以评估电动汽车对配电网的影响。我们的研究结果表明,电动汽车导致年耗电量大幅增加,仅家庭充电就导致耗电量增加 78%。
{"title":"Analyzing the Impact of Electric Vehicles on Local Energy Systems using Digital Twins","authors":"Daniel René Bayer, Marco Pruckner","doi":"arxiv-2408.10763","DOIUrl":"https://doi.org/arxiv-2408.10763","url":null,"abstract":"The electrification of the transportation and heating sector, the so-called\u0000sector coupling, is one of the core elements to achieve independence from\u0000fossil fuels. As it highly affects the electricity demand, especially on the\u0000local level, the integrated modeling and simulation of all sectors is a\u0000promising approach for analyzing design decisions or complex control\u0000strategies. This paper analyzes the increase in electricity demand resulting\u0000from sector coupling, mainly due to integrating electric vehicles into urban\u0000energy systems. Therefore, we utilize a digital twin of an existing local\u0000energy system and extend it with a mobility simulation model to evaluate the\u0000impact of electric vehicles on the distribution grid level. Our findings\u0000indicate a significant rise in annual electricity consumption attributed to\u0000electric vehicles, with home charging alone resulting in a 78% increase.\u0000However, we demonstrate that integrating photovoltaic and battery energy\u0000storage systems can effectively mitigate this rise.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version) 多代理平衡设计奖励机制的合成(完整版)
Pub Date : 2024-08-19 DOI: arxiv-2408.10074
Muhammad Najib, Giuseppe Perelli
Mechanism design is a well-established game-theoretic paradigm for designinggames to achieve desired outcomes. This paper addresses a closely related butdistinct concept, equilibrium design. Unlike mechanism design, the designer'sauthority in equilibrium design is more constrained; she can only modify theincentive structures in a given game to achieve certain outcomes without theability to create the game from scratch. We study the problem of equilibriumdesign using dynamic incentive structures, known as reward machines. We useweighted concurrent game structures for the game model, with goals (for theplayers and the designer) defined as mean-payoff objectives. We show how rewardmachines can be used to represent dynamic incentives that allocate rewards in amanner that optimises the designer's goal. We also introduce the main decisionproblem within our framework, the payoff improvement problem. This problemessentially asks whether there exists a dynamic incentive (represented by somereward machine) that can improve the designer's payoff by more than a giventhreshold value. We present two variants of the problem: strong and weak. Wedemonstrate that both can be solved in polynomial time using a Turing machineequipped with an NP oracle. Furthermore, we also establish that these variantsare either NP-hard or coNP-hard. Finally, we show how to synthesise thecorresponding reward machine if it exists.
机制设计是一种成熟的博弈论范式,用于设计游戏以实现预期结果。本文讨论的是一个密切相关但又不同的概念--均衡设计。与机制设计不同,平衡设计中设计者的权力受到更多限制;她只能修改给定博弈中的激励结构来实现特定结果,而无法从头开始创建博弈。我们使用动态激励结构(即奖励机器)来研究均衡设计问题。我们使用加权并发博弈结构作为博弈模型,目标(对于玩家和设计者)定义为平均报酬目标。我们展示了如何使用奖励机来表示动态激励机制,以优化设计者目标的方式分配奖励。我们还介绍了我们框架中的主要决策问题--报酬改进问题。这个问题本质上是问,是否存在一种动态激励机制(由某个奖励机表示)能使设计者的报酬提高超过给定的阈值。我们提出了这个问题的两个变体:强激励和弱激励。我们证明,使用配备 NP 甲骨文的图灵机,可以在多项式时间内解决这两个问题。此外,我们还确定这些变体要么是 NP 难,要么是 coNP 难。最后,我们展示了如何合成相应的奖励机器(如果存在的话)。
{"title":"Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version)","authors":"Muhammad Najib, Giuseppe Perelli","doi":"arxiv-2408.10074","DOIUrl":"https://doi.org/arxiv-2408.10074","url":null,"abstract":"Mechanism design is a well-established game-theoretic paradigm for designing\u0000games to achieve desired outcomes. This paper addresses a closely related but\u0000distinct concept, equilibrium design. Unlike mechanism design, the designer's\u0000authority in equilibrium design is more constrained; she can only modify the\u0000incentive structures in a given game to achieve certain outcomes without the\u0000ability to create the game from scratch. We study the problem of equilibrium\u0000design using dynamic incentive structures, known as reward machines. We use\u0000weighted concurrent game structures for the game model, with goals (for the\u0000players and the designer) defined as mean-payoff objectives. We show how reward\u0000machines can be used to represent dynamic incentives that allocate rewards in a\u0000manner that optimises the designer's goal. We also introduce the main decision\u0000problem within our framework, the payoff improvement problem. This problem\u0000essentially asks whether there exists a dynamic incentive (represented by some\u0000reward machine) that can improve the designer's payoff by more than a given\u0000threshold value. We present two variants of the problem: strong and weak. We\u0000demonstrate that both can be solved in polynomial time using a Turing machine\u0000equipped with an NP oracle. Furthermore, we also establish that these variants\u0000are either NP-hard or coNP-hard. Finally, we show how to synthesise the\u0000corresponding reward machine if it exists.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"2011 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems MegaAgent:大型 LLM 代理系统中自主合作的实用框架
Pub Date : 2024-08-19 DOI: arxiv-2408.09955
Qian Wang, Tianyu Wang, Qinbin Li, Jingsheng Liang, Bingsheng He
With the emergence of large language models (LLMs), LLM-powered multi-agentsystems (LLM-MA systems) have been proposed to tackle real-world tasks.However, their agents mostly follow predefined Standard Operating Procedures(SOPs) that remain unchanged across the whole interaction, lacking autonomy andscalability. Additionally, current solutions often overlook the necessity foreffective agent cooperation. To address the above limitations, we proposeMegaAgent, a practical framework designed for autonomous cooperation inlarge-scale LLM Agent systems. MegaAgent leverages the autonomy of agents todynamically generate agents based on task requirements, incorporating featuressuch as automatically dividing tasks, systematic planning and monitoring ofagent activities, and managing concurrent operations. In addition, MegaAgent isdesigned with a hierarchical structure and employs system-level parallelism toenhance performance and boost communication. We demonstrate the effectivenessof MegaAgent through Gobang game development, showing that it outperformspopular LLM-MA systems; and national policy simulation, demonstrating its highautonomy and potential to rapidly scale up to 590 agents while ensuringeffective cooperation among them. Our results indicate that MegaAgent is thefirst autonomous large-scale LLM-MA system with no pre-defined SOPs, higheffectiveness and scalability, paving the way for further research in thisfield. Our code is at https://anonymous.4open.science/r/MegaAgent-81F3.
随着大型语言模型(LLM)的出现,人们提出了由 LLM 驱动的多代理系统(LLM-MA 系统)来解决现实世界中的任务。然而,这些系统中的代理大多遵循预定义的标准操作程序(SOP),在整个交互过程中保持不变,缺乏自主性和可扩展性。此外,当前的解决方案往往忽视了有效代理合作的必要性。为了解决上述局限性,我们提出了MegaAgent,这是一个专为大规模LLM Agent系统中的自主合作而设计的实用框架。MegaAgent 利用代理的自主性,根据任务需求动态生成代理,具有自动划分任务、系统规划和监控代理活动以及管理并发操作等功能。此外,MegaAgent 采用分层结构设计,并利用系统级并行性来提高性能和加强通信。我们通过Gobang游戏开发展示了MegaAgent的有效性,结果表明它优于流行的LLM-MA系统;通过国家政策模拟展示了它的高自主性和快速扩展到590个代理的潜力,同时确保了代理之间的有效合作。我们的研究结果表明,MegaAgent 是第一个没有预定义 SOP、具有高效性和可扩展性的自主大型 LLM-MA 系统,为该领域的进一步研究铺平了道路。我们的代码见 https://anonymous.4open.science/r/MegaAgent-81F3。
{"title":"MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems","authors":"Qian Wang, Tianyu Wang, Qinbin Li, Jingsheng Liang, Bingsheng He","doi":"arxiv-2408.09955","DOIUrl":"https://doi.org/arxiv-2408.09955","url":null,"abstract":"With the emergence of large language models (LLMs), LLM-powered multi-agent\u0000systems (LLM-MA systems) have been proposed to tackle real-world tasks.\u0000However, their agents mostly follow predefined Standard Operating Procedures\u0000(SOPs) that remain unchanged across the whole interaction, lacking autonomy and\u0000scalability. Additionally, current solutions often overlook the necessity for\u0000effective agent cooperation. To address the above limitations, we propose\u0000MegaAgent, a practical framework designed for autonomous cooperation in\u0000large-scale LLM Agent systems. MegaAgent leverages the autonomy of agents to\u0000dynamically generate agents based on task requirements, incorporating features\u0000such as automatically dividing tasks, systematic planning and monitoring of\u0000agent activities, and managing concurrent operations. In addition, MegaAgent is\u0000designed with a hierarchical structure and employs system-level parallelism to\u0000enhance performance and boost communication. We demonstrate the effectiveness\u0000of MegaAgent through Gobang game development, showing that it outperforms\u0000popular LLM-MA systems; and national policy simulation, demonstrating its high\u0000autonomy and potential to rapidly scale up to 590 agents while ensuring\u0000effective cooperation among them. Our results indicate that MegaAgent is the\u0000first autonomous large-scale LLM-MA system with no pre-defined SOPs, high\u0000effectiveness and scalability, paving the way for further research in this\u0000field. Our code is at https://anonymous.4open.science/r/MegaAgent-81F3.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"43 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Algorithmic Contract Design with Reinforcement Learning Agents 使用强化学习代理的算法合同设计
Pub Date : 2024-08-19 DOI: arxiv-2408.09686
David Molina Concha, Kyeonghyeon Park, Hyun-Rok Lee, Taesik Lee, Chi-Guhn Lee
We introduce a novel problem setting for algorithmic contract design, namedthe principal-MARL contract design problem. This setting extends traditionalcontract design to account for dynamic and stochastic environments using MarkovGames and Multi-Agent Reinforcement Learning. To tackle this problem, wepropose a Multi-Objective Bayesian Optimization (MOBO) framework namedConstrained Pareto Maximum Entropy Search (cPMES). Our approach integrates MOBOand MARL to explore the highly constrained contract design space, identifyingpromising incentive and recruitment decisions. cPMES transforms theprincipal-MARL contract design problem into an unconstrained multi-objectiveproblem, leveraging the probability of feasibility as part of the objectivesand ensuring promising designs predicted on the feasibility border are includedin the Pareto front. By focusing the entropy prediction on designs within thePareto set, cPMES mitigates the risk of the search strategy being overwhelmedby entropy from constraints. We demonstrate the effectiveness of cPMES throughextensive benchmark studies in synthetic and simulated environments, showingits ability to find feasible contract designs that maximize the principal'sobjectives. Additionally, we provide theoretical support with a sub-linearregret bound concerning the number of iterations.
我们为算法合约设计引入了一种新的问题设置,并将其命名为 principal-MARL 合约设计问题。该问题利用马尔可夫游戏和多代理强化学习(Multi-Agent Reinforcement Learning)扩展了传统的合同设计,以考虑动态和随机环境。为了解决这个问题,我们提出了一个多目标贝叶斯优化(MOBO)框架,名为有约束帕累托最大熵搜索(cPMES)。我们的方法整合了 MOBO 和 MARL,以探索高度受限的合同设计空间,识别有前途的激励和招聘决策。cPMES 将主要-MARL 合同设计问题转化为无约束多目标问题,利用可行性概率作为目标的一部分,确保在可行性边界上预测的有前途的设计被纳入帕累托前沿。通过将熵预测重点放在帕累托集内的设计上,cPMES 降低了搜索策略被约束熵淹没的风险。我们通过在合成和模拟环境中进行的大量基准研究证明了 cPMES 的有效性,表明它有能力找到可行的合同设计,使委托人的目标最大化。此外,我们还提供了有关迭代次数的亚线性遗憾约束的理论支持。
{"title":"Algorithmic Contract Design with Reinforcement Learning Agents","authors":"David Molina Concha, Kyeonghyeon Park, Hyun-Rok Lee, Taesik Lee, Chi-Guhn Lee","doi":"arxiv-2408.09686","DOIUrl":"https://doi.org/arxiv-2408.09686","url":null,"abstract":"We introduce a novel problem setting for algorithmic contract design, named\u0000the principal-MARL contract design problem. This setting extends traditional\u0000contract design to account for dynamic and stochastic environments using Markov\u0000Games and Multi-Agent Reinforcement Learning. To tackle this problem, we\u0000propose a Multi-Objective Bayesian Optimization (MOBO) framework named\u0000Constrained Pareto Maximum Entropy Search (cPMES). Our approach integrates MOBO\u0000and MARL to explore the highly constrained contract design space, identifying\u0000promising incentive and recruitment decisions. cPMES transforms the\u0000principal-MARL contract design problem into an unconstrained multi-objective\u0000problem, leveraging the probability of feasibility as part of the objectives\u0000and ensuring promising designs predicted on the feasibility border are included\u0000in the Pareto front. By focusing the entropy prediction on designs within the\u0000Pareto set, cPMES mitigates the risk of the search strategy being overwhelmed\u0000by entropy from constraints. We demonstrate the effectiveness of cPMES through\u0000extensive benchmark studies in synthetic and simulated environments, showing\u0000its ability to find feasible contract designs that maximize the principal's\u0000objectives. Additionally, we provide theoretical support with a sub-linear\u0000regret bound concerning the number of iterations.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"63-65 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Auctioning Escape Permits for Multiple Correlated Pollutants Using CMRA 利用 CMRA 拍卖多种相关污染物的逃逸许可证
Pub Date : 2024-08-19 DOI: arxiv-2408.10148
Keshav Goyal, Sooraj Sathish, Shrisha Rao
In the context of increasingly complex environmental challenges, effectivepollution control mechanisms are crucial. By extending the state of the artauction mechanisms, we aim to develop an efficient approach for allocatingpollution abatement resources in a multi-pollutant setting with pollutantsaffecting each other's reduction costs. We modify the Combinatorial Multi-RoundAscending Auction for the auction of escape permits of pollutants withco-dependent reduction processes, specifically, greenhouse gas emissions andnutrient runoff in Finnish agriculture. We show the significant advantages ofthis mechanism in pollution control through experiments on the bid prices andamount of escape permits sold in multiple auction simulations.
在环境挑战日益复杂的背景下,有效的污染控制机制至关重要。通过扩展现有的拍卖机制,我们旨在开发一种高效的方法,用于在污染物减排成本相互影响的多污染物环境中分配污染减排资源。我们修改了 "组合式多轮竞价拍卖",用于拍卖具有共同依赖减排过程的污染物的逃逸许可,特别是芬兰农业中的温室气体排放和营养物质径流。我们通过对多次拍卖模拟中的出价和逃逸许可成交量进行实验,证明了这一机制在污染控制方面的显著优势。
{"title":"Auctioning Escape Permits for Multiple Correlated Pollutants Using CMRA","authors":"Keshav Goyal, Sooraj Sathish, Shrisha Rao","doi":"arxiv-2408.10148","DOIUrl":"https://doi.org/arxiv-2408.10148","url":null,"abstract":"In the context of increasingly complex environmental challenges, effective\u0000pollution control mechanisms are crucial. By extending the state of the art\u0000auction mechanisms, we aim to develop an efficient approach for allocating\u0000pollution abatement resources in a multi-pollutant setting with pollutants\u0000affecting each other's reduction costs. We modify the Combinatorial Multi-Round\u0000Ascending Auction for the auction of escape permits of pollutants with\u0000co-dependent reduction processes, specifically, greenhouse gas emissions and\u0000nutrient runoff in Finnish agriculture. We show the significant advantages of\u0000this mechanism in pollution control through experiments on the bid prices and\u0000amount of escape permits sold in multiple auction simulations.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142190422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Multiagent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1