RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation

2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Pub Date : 2022-01-01 DOI:10.1109/WACV51458.2022.00278

Azimkhon Ostonov, Peter Wonka, D. Michels

引用次数: 3

Abstract

We present RLSS: a reinforcement learning algorithm for sequential scene generation. This is based on employing the proximal policy optimization (PPO) algorithm for generative problems. In particular, we consider how to effectively reduce the action space by including a greedy search algorithm in the learning process. Our experiments demonstrate that our method converges for a relatively large number of actions and learns to generate scenes with predefined design objectives. This approach is placing objects iteratively in the virtual scene. In each step, the network chooses which objects to place and selects positions which result in maximal reward. A high reward is assigned if the last action resulted in desired properties whereas the violation of constraints is penalized. We demonstrate the capability of our method to generate plausible and diverse scenes efficiently by solving indoor planning problems and generating Angry Birds levels.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

时序场景生成的深度强化学习算法

我们提出了RLSS:一种用于顺序场景生成的强化学习算法。这是基于采用近端策略优化(PPO)算法的生成问题。特别地，我们考虑了如何通过在学习过程中加入贪婪搜索算法来有效地减少动作空间。我们的实验表明，我们的方法收敛于相对大量的动作，并学习生成具有预定义设计目标的场景。这种方法是在虚拟场景中迭代地放置对象。在每一步中，网络选择放置哪些物体，并选择产生最大奖励的位置。如果最后一个行动产生了期望的属性，那么就会获得高奖励，而违反约束则会受到惩罚。通过解决室内规划问题和生成愤怒的小鸟关卡，我们证明了我们的方法能够有效地生成可信和多样化的场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

自引率

0.00%

发文量