Deep Reinforcement Learning with DQN vs. PPO in VizDoom

2021 IEEE 21st International Symposium on Computational Intelligence and Informatics (CINTI) Pub Date : 2021-11-18 DOI:10.1109/CINTI53070.2021.9668479

Anton Zakharenkov, Ilya Makarov

引用次数: 4

Abstract

VizDoom is a flexible and easy-to-use 3D reinforcement learning research platform based on the well-known Doom first-person shooter. The challenge is to create bots that compete in the DeathMatch track, making decisions based solely on visual in-formation from the screen. The paper offers a com-parison of different approaches with reinforcement learning: Q-learning and policy-gradient algorithms. We explore the distributed learning paradigm in re-inforcement learning, and also discuss the differences in speed and quality of convergence when adding an object detection module.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在VizDoom中使用DQN与PPO进行深度强化学习

VizDoom是一个灵活易用的3D强化学习研究平台，基于著名的第一人称射击游戏《毁灭战士》。我们面临的挑战是创造出能够在DeathMatch赛道上竞争的机器人，并基于屏幕上的视觉信息做出决定。本文对不同的强化学习方法进行了比较:q学习和策略梯度算法。我们探索了强化学习中的分布式学习范式，并讨论了添加目标检测模块时收敛速度和质量的差异。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 IEEE 21st International Symposium on Computational Intelligence and Informatics (CINTI)

自引率

0.00%

发文量