{"title":"基于深度强化学习和递归神经网络的多 AGV 路由规划方法","authors":"Yishuai Lin;Gang Hue;Liang Wang;Qingshan Li;Jiawei Zhu","doi":"10.1109/JAS.2023.123300","DOIUrl":null,"url":null,"abstract":"Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 7","pages":"1720-1722"},"PeriodicalIF":15.3000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10198708","citationCount":"0","resultStr":"{\"title\":\"A Multi-AGV Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network\",\"authors\":\"Yishuai Lin;Gang Hue;Liang Wang;Qingshan Li;Jiawei Zhu\",\"doi\":\"10.1109/JAS.2023.123300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.\",\"PeriodicalId\":54230,\"journal\":{\"name\":\"Ieee-Caa Journal of Automatica Sinica\",\"volume\":\"11 7\",\"pages\":\"1720-1722\"},\"PeriodicalIF\":15.3000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10198708\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ieee-Caa Journal of Automatica Sinica\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10198708/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ieee-Caa Journal of Automatica Sinica","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10198708/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A Multi-AGV Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network
Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.
期刊介绍:
The IEEE/CAA Journal of Automatica Sinica is a reputable journal that publishes high-quality papers in English on original theoretical/experimental research and development in the field of automation. The journal covers a wide range of topics including automatic control, artificial intelligence and intelligent control, systems theory and engineering, pattern recognition and intelligent systems, automation engineering and applications, information processing and information systems, network-based automation, robotics, sensing and measurement, and navigation, guidance, and control.
Additionally, the journal is abstracted/indexed in several prominent databases including SCIE (Science Citation Index Expanded), EI (Engineering Index), Inspec, Scopus, SCImago, DBLP, CNKI (China National Knowledge Infrastructure), CSCD (Chinese Science Citation Database), and IEEE Xplore.