Sorting operation method of manipulator based on deep reinforcement learning

Qing An, Yanhua Chen, Hui Zeng, J. Wang
{"title":"Sorting operation method of manipulator based on deep reinforcement learning","authors":"Qing An, Yanhua Chen, Hui Zeng, J. Wang","doi":"10.1142/s1793962323410076","DOIUrl":null,"url":null,"abstract":"Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.","PeriodicalId":13657,"journal":{"name":"Int. J. Model. Simul. Sci. Comput.","volume":"263 1","pages":"2341007:1-2341007:22"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Model. Simul. Sci. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s1793962323410076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的机械手分拣操作方法
放射性废物分类往往面临非结构化和局部放射性的工作环境。目前,远程操作分拣存在分拣效率低、操作难度大、人员培训时间长、自主控制能力差等问题。本文以提高机器人在非结构化环境中的适应性和自主操作能力为前提,采用双深度Q学习算法对经典深度Q学习算法进行优化,提高训练速度,提高分拣效率和稳定性。其次,利用深度强化学习的排序算法模型确定该状态下的最优行为;建立多组模拟和物理实验来验证分选方法。结果表明,该机械臂能够在复杂条件下自主完成分拣任务,在推抓协同作业时能显著提高工作效率,并优先抓取放射性区域内的高放射性物体。该算法具有迁移能力和良好的泛化能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Simulation of the drift and diffusion of marine oil spill under the effect of wind, current and tides Detection and recognition of dim and small targets in sea clutter background based on polarization decomposition Intelligent generation of combat simulation scenarios based on UML diagram recognition Visualization of relevant parameter dependencies in a delay SEIQ epidemic model - A live script program for didactic and interactive demonstrations A study on shipowner's behavior intention to cross dam by integrating SEM-SD model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1