Semantic Communication for Partial Observation Multi-agent Reinforcement Learning

Hoang Khoi Do, Thi Quynh Khanh Dinh, Minh Duong Nguyen, Tien Hoa Nguyen
Published in: 2023 IEEE Statistical Signal Processing Workshop (SSP), 2023-07-02
DOI: 10.1109/SSP53291.2023.10207979 (https://doi.org/10.1109/SSP53291.2023.10207979)
Citations: 0

Abstract

Effective cooperation and coordination among agents are essential for success in many real-world scenarios, particularly in reinforcement learning settings. However, partial observation, where agents are not aware of all the observations made by other agents, creates a significant obstacle to coordination. To overcome this challenge, we propose the Shared Online Multi-agent Knowledge Exchange (SOME) framework, in which agents learn to anticipate each other’s observations to improve their local learning, allowing for better coordination and cooperation. Additionally, using knowledge generators instead of full observations reduces communication costs. Our experimental evaluation demonstrates that agents trained with SOME can not only predict the next observations and actions of opponents and collaborators but also take appropriate actions, making SOME a promising approach for overcoming the partial observation challenge in multi-agent reinforcement learning.
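The abstract does not specify the SOME architecture, so the following is only a toy sketch of the general idea it describes: a receiving agent learns to reconstruct another agent's observation from a message shorter than the observation itself. Everything here is an assumption made for illustration, including the names `make_observation`, `encode`, and `ObservationPredictor`, the linear model, the observation structure, and the choice of 2 message values out of 6.

```python
import random

OBS_DIM = 6  # size of a full observation (assumed for this toy example)
MSG_DIM = 2  # size of the compact message that is actually sent (assumed)


def make_observation(rng):
    """Toy observation: 6 values that all depend on 2 underlying factors."""
    a, b = rng.uniform(-1, 1), rng.uniform(-1, 1)
    return [a, b, a + b, a - b, 2 * a, 0.5 * b]


def encode(obs):
    """Hypothetical stand-in for a knowledge generator: send the first MSG_DIM values."""
    return obs[:MSG_DIM]


class ObservationPredictor:
    """Online linear model mapping a received message to a predicted observation."""

    def __init__(self, msg_dim, obs_dim, lr=0.05):
        self.w = [[0.0] * msg_dim for _ in range(obs_dim)]
        self.lr = lr

    def predict(self, msg):
        return [sum(wij * m for wij, m in zip(row, msg)) for row in self.w]

    def update(self, msg, target_obs):
        """One stochastic gradient step on squared error; returns the pre-update error."""
        pred = self.predict(msg)
        for i, row in enumerate(self.w):
            err = pred[i] - target_obs[i]
            for j in range(len(row)):
                row[j] -= self.lr * err * msg[j]
        return sum((p - t) ** 2 for p, t in zip(pred, target_obs)) / len(pred)


def train(steps=3000, seed=0):
    """Train the receiver on (message, full observation) pairs from the sender."""
    rng = random.Random(seed)
    model = ObservationPredictor(MSG_DIM, OBS_DIM)
    err = None
    for _ in range(steps):
        obs = make_observation(rng)
        err = model.update(encode(obs), obs)
    return model, err


model, final_err = train()
print(f"sent {MSG_DIM} of {OBS_DIM} values per step; final squared error {final_err:.2e}")
```

In this sketch the full observation is only needed while the predictor is being trained; afterwards the receiver works from the shorter message alone, which is the sense in which communication cost drops. The reconstruction is exact here only because the toy observations are deliberately redundant.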