A novel reinforcement learning architecture for continuous state and action spaces

Victor Uc-Cetina
{"title":"A novel reinforcement learning architecture for continuous state and action spaces","authors":"Victor Uc-Cetina","doi":"10.1155/2013/492852","DOIUrl":null,"url":null,"abstract":"We introduce a reinforcement learning architecture designed for problems with an infinite number of states, where each state can be seen as a vector of real numbers and with a finite number of actions, where each action requires a vector of real numbers as parameters. The main objective of this architecture is to distribute in two actors the work required to learn the final policy. One actor decideswhat actionmust be performed;meanwhile, a second actor determines the right parameters for the selected action. We tested our architecture and one algorithmbased on it solving the robot dribbling problem, a challenging robot control problem taken from the RoboCup competitions. Our experimental work with three different function approximators provides enough evidence to prove that the proposed architecture can be used to implement fast, robust, and reliable reinforcement learning algorithms.","PeriodicalId":91830,"journal":{"name":"Advances in artificial intelligence. Canadian Society for Computational Studies of Intelligence. Conference","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in artificial intelligence. Canadian Society for Computational Studies of Intelligence. Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2013/492852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

We introduce a reinforcement learning architecture designed for problems with an infinite number of states, where each state can be seen as a vector of real numbers, and with a finite number of actions, where each action requires a vector of real numbers as parameters. The main objective of this architecture is to distribute the work required to learn the final policy between two actors. One actor decides what action must be performed; meanwhile, a second actor determines the right parameters for the selected action. We tested our architecture and one algorithm based on it by solving the robot dribbling problem, a challenging robot control problem taken from the RoboCup competitions. Our experimental work with three different function approximators provides enough evidence to prove that the proposed architecture can be used to implement fast, robust, and reliable reinforcement learning algorithms.
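To picture the two-actor idea described in the abstract, the following Python snippet gives a minimal sketch, not the paper's algorithm or code: a discrete actor chooses which of the finitely many actions to perform, and a continuous actor supplies the real-valued parameter vector for the selected action. The class name `TwoActorAgent`, the linear function approximators, the softmax action selection, and the Gaussian exploration noise are all assumptions made purely for illustration.

```python
import numpy as np

class TwoActorAgent:
    """Illustrative two-actor agent (hypothetical sketch, not the paper's method):
    a discrete actor picks which action to take, and a continuous actor
    produces that action's real-valued parameter vector."""

    def __init__(self, state_dim, num_actions, param_dims):
        self.num_actions = num_actions
        # Discrete actor: linear action preferences, turned into a softmax policy.
        self.W_discrete = np.zeros((num_actions, state_dim))
        # Continuous actor: one linear parameter head per discrete action.
        self.W_params = [np.zeros((param_dims[a], state_dim))
                         for a in range(num_actions)]

    def act(self, state, noise_scale=0.1):
        # Actor 1: score each discrete action and sample from a softmax.
        scores = self.W_discrete @ state
        probs = np.exp(scores - scores.max())
        probs /= probs.sum()
        action = np.random.choice(self.num_actions, p=probs)
        # Actor 2: compute the parameters for the chosen action, with
        # Gaussian exploration noise added during learning.
        params = self.W_params[action] @ state
        params = params + noise_scale * np.random.randn(*params.shape)
        return action, params

# Toy usage: a 4-dimensional state, 3 discrete actions, 2 real parameters each.
agent = TwoActorAgent(state_dim=4, num_actions=3, param_dims=[2, 2, 2])
action, params = agent.act(np.random.randn(4))
```

The split mirrors the architecture's stated objective: the policy-learning work is divided so that each actor solves a simpler subproblem, one over a finite action set and one over a continuous parameter space.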