通过基于演示的方法使用深度 Q-learning 和人工神经网络优化机械臂控制：动态和静态条件案例研究

IF 4.3 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Robotics and Autonomous Systems Pub Date : 2024-08-03 DOI:10.1016/j.robot.2024.104771

Tianci Gao

{"title":"通过基于演示的方法使用深度 Q-learning 和人工神经网络优化机械臂控制：动态和静态条件案例研究","authors":"Tianci Gao","doi":"10.1016/j.robot.2024.104771","DOIUrl":null,"url":null,"abstract":"<div><p>This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"181 ","pages":"Article 104771"},"PeriodicalIF":4.3000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing robotic arm control using deep Q-learning and artificial neural networks through demonstration-based methodologies: A case study of dynamic and static conditions\",\"authors\":\"Tianci Gao\",\"doi\":\"10.1016/j.robot.2024.104771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.</p></div>\",\"PeriodicalId\":49592,\"journal\":{\"name\":\"Robotics and Autonomous Systems\",\"volume\":\"181 \",\"pages\":\"Article 104771\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Autonomous Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0921889024001556\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024001556","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文利用深度 Q 网络、人工神经网络和人工深度 Q 网络等机器人编程技术，通过演示学习来解决控制机械臂的相关难题。实验对象为静态和动态状态。实验评估了每种方法的分类准确性过程成功值和实验条件组合。DQN 方法取得了良好的分类准确度结果，固定骰子的准确度值为 0.64，移动骰子的准确度值为 0.52。固定骰子的响应值为 0.51，移动骰子的响应值为 0.41，显示出中等水平。ANN 方法的准确度较低，准确值分别为 0.59 和 0.56，响应值分别为 0.61 和 0.58。ADQN 方法的准确度分别为 0.66 和 0.59，响应值分别为 0.67 和 0.61，表现出较高的水平。在最初的学习迭代中，ADQN 的成功率最高，达到 33.67%，而 DQN 和 ANN 的成功率分别为 28.39% 和 20.13%。随着迭代次数的增加，所有方法的结果都有所改善。ADQN 保持了 97.59 % 的高成功率，而 DQN 和 ANN 分别达到了 82.16 % 和 88.66 %。随着迭代次数的增加，所有方法的结果都有所改善，但人工深度 Q 网络的成功率仍然很高。随着迭代次数的增加，深度 Q 网络和人工神经网络都显示出取得良好结果的潜力。总体而言，研究结果支持结合演示学习的机器人编程技术的有效性。人工深度 Q 网络是最成功的快速收敛方法，适用于各种机器人控制任务。这些发现为未来的研究和复杂旋转控制的大规模综合学习应用奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optimizing robotic arm control using deep Q-learning and artificial neural networks through demonstration-based methodologies: A case study of dynamic and static conditions

This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Robotics and Autonomous Systems 工程技术-机器人学

CiteScore

9.00

自引率

7.00%

发文量

164

审稿时长

4.5 months

期刊介绍： Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.