Studies on the less-used actions exploration problem of a rationing algorithm based on reinforcement learning

M. Stoica, G. Calangiu, F. Sisak
2011 15th IEEE International Conference on Intelligent Engineering Systems
Published: 2011-06-23
DOI: 10.1109/INES.2011.5954770 (https://doi.org/10.1109/INES.2011.5954770)
Citation count: 1

Abstract

Programming by demonstration is an active subject in robotics, and it is developing increasingly in the direction of service robots and humanoid robots. It is much less researched for industrial robots. One reason is that an industrial robot has to act in a precise and deterministic manner. However, extending research on programming by demonstration to industrial robots could lead to intelligent systems in which the industrial robot can be programmed more easily. The goal of our research is to develop an intelligent system for programming industrial robots by demonstration. The reasoning algorithms are the mechanisms that give the proposed system its flexibility. We first focused our research on a reasoning algorithm based on artificial neural networks [1, 2]. Because the results of this algorithm were not satisfactory, we switched our focus to a reasoning algorithm based on reinforcement learning [3]. The algorithm is based on the idea that marks can be assigned to each possible action whenever the robot is in an unknown state. The exploration of less-used actions also plays an important role when the robot must make a decision. Based on the marks and on the exploration feature of the algorithm, the robot updates its behaviour. This paper presents a description of the algorithm and some studies on its less-used actions exploration problem. Some sections of the paper deal with the problems of implementing the algorithm, the experiments conducted on its exploration feature, and the results obtained. The analysis of the results and the characteristics of the algorithm in terms of less-used actions exploration are also discussed.
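The abstract does not give the rule the authors use to combine marks with exploration, so the following is only a minimal sketch of the general idea, under stated assumptions: each (state, action) pair carries a mark, and actions that have been tried less often receive a count-based bonus when the robot chooses. The class name `MarkBasedSelector`, the parameters `exploration_weight` and `learning_rate`, the bonus formula, and the update rule are all hypothetical illustrations, not the method described in the paper.

```python
import math
from collections import defaultdict


class MarkBasedSelector:
    """Illustrative sketch: marks per (state, action) plus a bonus for
    rarely tried actions. Not the authors' algorithm."""

    def __init__(self, actions, exploration_weight=1.0, learning_rate=0.5):
        self.actions = list(actions)
        self.exploration_weight = exploration_weight  # hypothetical parameter
        self.learning_rate = learning_rate            # hypothetical parameter
        self.marks = defaultdict(float)   # (state, action) -> current mark
        self.counts = defaultdict(int)    # (state, action) -> times tried

    def score(self, state, action):
        # Assumed bonus: shrinks as the action is used more often in this state.
        uses = self.counts[(state, action)]
        bonus = self.exploration_weight / math.sqrt(1 + uses)
        return self.marks[(state, action)] + bonus

    def choose(self, state):
        # Pick the action with the best mark-plus-bonus score
        # (ties go to the earliest action in the list).
        return max(self.actions, key=lambda a: self.score(state, a))

    def update(self, state, action, reward):
        # Assumed update: move the mark a fraction of the way toward the reward.
        key = (state, action)
        self.counts[key] += 1
        self.marks[key] += self.learning_rate * (reward - self.marks[key])


if __name__ == "__main__":
    selector = MarkBasedSelector(["grip", "release"])
    first = selector.choose("unknown_state")
    selector.update("unknown_state", first, reward=0.0)
    # After an unrewarded try, the less-used action is preferred next.
    print(first, selector.choose("unknown_state"))
```

In this sketch a larger `exploration_weight` makes the selector favour less-used actions for longer, while a value of zero reduces it to choosing purely by mark.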