Handling Realistic Noise in Multi-Agent Systems with Self-Supervised Learning and Curiosity

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Artificial Intelligence and Soft Computing Research Pub Date : 2021-04-01 DOI:10.2478/jaiscr-2022-0009

Marton Szemenyei, Patrik Reizinger

引用次数: 0

Abstract

Abstract 1Most reinforcement learning benchmarks – especially in multi-agent tasks – do not go beyond observations with simple noise; nonetheless, real scenarios induce more elaborate vision pipeline failures: false sightings, misclassifications or occlusion. In this work, we propose a lightweight, 2D environment for robot soccer and autonomous driving that can emulate the above discrepancies. Besides establishing a benchmark for accessible multi-agent reinforcement learning research, our work addresses the challenges the simulator imposes. For handling realistic noise, we use self-supervised learning to enhance scene reconstruction and extend curiosity-driven learning to model longer horizons. Our extensive experiments show that the proposed methods achieve state-of-the-art performance, compared against actor-critic methods, ICM, and PPO.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用自监督学习和好奇心处理多智能体系统中的真实噪声

摘要1大多数强化学习基准——尤其是在多智能体任务中——不会超出简单噪声的观察范围；尽管如此，真实的场景会引发更复杂的视觉管道故障：虚假视觉、错误分类或遮挡。在这项工作中，我们为机器人足球和自动驾驶提出了一个轻量级的2D环境，可以模拟上述差异。除了为可访问的多智能体强化学习研究建立基准外，我们的工作还解决了模拟器带来的挑战。为了处理逼真的噪声，我们使用自监督学习来增强场景重建，并将好奇心驱动的学习扩展到建模更长的视野。我们的大量实验表明，与演员-评论家方法、ICM和PPO相比，所提出的方法实现了最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Artificial Intelligence and Soft Computing Research COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

7.00

自引率

25.00%

发文量

审稿时长

24 weeks

期刊介绍： Journal of Artificial Intelligence and Soft Computing Research (available also at Sciendo (De Gruyter)) is a dynamically developing international journal focused on the latest scientific results and methods constituting traditional artificial intelligence methods and soft computing techniques. Our goal is to bring together scientists representing both approaches and various research communities.