{"title":"Sorting operation method of manipulator based on deep reinforcement learning","authors":"Qing An, Yanhua Chen, Hui Zeng, J. Wang","doi":"10.1142/s1793962323410076","DOIUrl":null,"url":null,"abstract":"Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.","PeriodicalId":13657,"journal":{"name":"Int. J. Model. Simul. Sci. Comput.","volume":"263 1","pages":"2341007:1-2341007:22"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Model. Simul. Sci. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s1793962323410076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.