Analysis of Q-learning on ANNs for robot control using live video feed

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) Pub Date : 2017-09-01 DOI:10.1109/ICSIPA.2017.8120668

Nihal Murali, Kunal Gupta, S. Bhanot

引用次数: 1

Abstract

Training of artificial neural networks (ANNs) using reinforcement learning (RL) techniques is being widely discussed in the robot learning literature. The high model complexity of ANNs along with the model-free nature of RL algorithms provides a desirable combination for many robotics applications. There is a huge need for algorithms that generalize using raw sensory inputs, such as vision, without any hand-engineered features or domain heuristics. In this paper, the standard control problem of line following robot was used as a test-bed, and an ANN controller for the robot was trained on images from a live video feed using Q-learning. A virtual agent was first trained in simulation environment and then deployed onto a robot's hardware. The robot successfully learns to traverse a wide range of curves and displays excellent generalization ability. Qualitative analysis of the evolution of policies, performance and weights of the network provide insights into the nature and convergence of the learning algorithm.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于实时视频馈送的机器人控制人工神经网络q学习分析

使用强化学习技术训练人工神经网络在机器人学习文献中得到了广泛的讨论。人工神经网络的高模型复杂性以及强化学习算法的无模型特性为许多机器人应用提供了理想的组合。对使用原始感官输入(如视觉)进行泛化的算法有巨大的需求，而不需要任何手工设计的特征或领域启发式。本文以直线跟随机器人的标准控制问题为实验平台，采用Q-learning方法对机器人的人工神经网络控制器进行训练。首先在模拟环境中对虚拟代理进行训练，然后将其部署到机器人的硬件上。该机器人成功地学习了大范围的曲线，并表现出了出色的泛化能力。对网络的策略、性能和权重的演变进行定性分析，可以深入了解学习算法的性质和收敛性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)

自引率

0.00%

发文量