{"title":"通过基于演示的方法使用深度 Q-learning 和人工神经网络优化机械臂控制:动态和静态条件案例研究","authors":"Tianci Gao","doi":"10.1016/j.robot.2024.104771","DOIUrl":null,"url":null,"abstract":"<div><p>This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"181 ","pages":"Article 104771"},"PeriodicalIF":4.3000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimizing robotic arm control using deep Q-learning and artificial neural networks through demonstration-based methodologies: A case study of dynamic and static conditions\",\"authors\":\"Tianci Gao\",\"doi\":\"10.1016/j.robot.2024.104771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.</p></div>\",\"PeriodicalId\":49592,\"journal\":{\"name\":\"Robotics and Autonomous Systems\",\"volume\":\"181 \",\"pages\":\"Article 104771\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Autonomous Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0921889024001556\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024001556","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Optimizing robotic arm control using deep Q-learning and artificial neural networks through demonstration-based methodologies: A case study of dynamic and static conditions
This paper uses robot programming techniques, such as Deep Q Network, Artificial Neural Network, and Artificial Deep Q Network, to address challenges related to controlling robotic arms through demonstration learning. Static and dynamic states of the subjects were the subjects of experiments. Each method's classification accuracy process success values and experimental condition combination were evaluated. The DQN method demonstrated favourable classification accuracy outcomes, achieving an Accuracy value of 0.64 for the fixed dice and 0.52 for the moving dice. The Response value was 0.51 for the fixed dice and 0.41 for the moving dice, indicating a moderate level. The ANN method demonstrated lower accuracy, with Accuracy values of 0.59 and 0.56 and Response values of 0.61 and 0.58, respectively. The ADQN method demonstrated superior outcomes, with Accuracy values of 0.66 and 0.59 and Response values of 0.67 and 0.61. During the initial learning iterations, ADQN demonstrated the highest success rate at 33.67 %, whereas DQN and ANN achieved 28.39 % and 20.13 % success rates, respectively. As the number of iterations increased, all methods demonstrated improvement in their results. ADQN maintained a high success rate of 97.59 %, while DQN and ANN attained 82.16 % and 88.66 %, respectively. As the number of iterations increases, the results of all methods improve, but the success rate of the Artificial Deep Q Network remains high. As the number of iterations increases, both Deep Q Network and Artificial Neural Network demonstrate the potential to achieve good results. Overall, the findings support the efficacy of robot programming techniques that incorporate demonstration learning. The Artificial Deep Q Network is the most successful and fast-converging method suitable for various robot control tasks. These findings provide a foundation for future research and large-scale, comprehensive learning applications for complex rot control.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.