Sorting operation method of manipulator based on deep reinforcement learning

Int. J. Model. Simul. Sci. Comput. Pub Date : 2022-02-25 DOI:10.1142/s1793962323410076

Qing An, Yanhua Chen, Hui Zeng, J. Wang

{"title":"Sorting operation method of manipulator based on deep reinforcement learning","authors":"Qing An, Yanhua Chen, Hui Zeng, J. Wang","doi":"10.1142/s1793962323410076","DOIUrl":null,"url":null,"abstract":"Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.","PeriodicalId":13657,"journal":{"name":"Int. J. Model. Simul. Sci. Comput.","volume":"263 1","pages":"2341007:1-2341007:22"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Model. Simul. Sci. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s1793962323410076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度强化学习的机械手分拣操作方法

放射性废物分类往往面临非结构化和局部放射性的工作环境。目前，远程操作分拣存在分拣效率低、操作难度大、人员培训时间长、自主控制能力差等问题。本文以提高机器人在非结构化环境中的适应性和自主操作能力为前提，采用双深度Q学习算法对经典深度Q学习算法进行优化，提高训练速度，提高分拣效率和稳定性。其次，利用深度强化学习的排序算法模型确定该状态下的最优行为;建立多组模拟和物理实验来验证分选方法。结果表明，该机械臂能够在复杂条件下自主完成分拣任务，在推抓协同作业时能显著提高工作效率，并优先抓取放射性区域内的高放射性物体。该算法具有迁移能力和良好的泛化能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Int. J. Model. Simul. Sci. Comput.

自引率

0.00%

发文量