{"title":"基于多智能体强化学习和高效动作解码的agv柔性作业车间实时调度","authors":"Yuxin Li;Qingzheng Wang;Xinyu Li;Liang Gao;Ling Fu;Yanbin Yu;Wei Zhou","doi":"10.1109/TSMC.2024.3520381","DOIUrl":null,"url":null,"abstract":"The application of automated guided vehicle (AGV) greatly improves the production efficiency of workshop. However, machine flexibility and limited logistics equipment increase the complexity of collaborative scheduling, and frequent dynamic events bring uncertainty. Therefore, this article proposes a real-time scheduling method for dynamic flexible job shop scheduling problem with AGVs using multiagent reinforcement learning (MARL). Specifically, a real-time scheduling framework is proposed in which a multiagent scheduling architecture is designed for achieving task selection, machine allocation and AGV allocation. Then, an action space and an efficient action decoding algorithm are proposed, which enable agents to explore in the high-quality solution space and improve the learning efficiency. In addition, a state space with generalization, a reward function considering machine idle time and a strategy for handling four disturbance events are designed to minimize the total tardiness cost. Comparison experiments show that the proposed method outperforms the priority dispatching rules, genetic programming and four popular reinforcement learning (RL)-based methods, with performance improvements mostly exceeding 10%. Furthermore, experiments considering four disturbance events demonstrate that the proposed method has strong robustness, and it can provide appropriate scheme for uncertain manufacturing system.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2120-2132"},"PeriodicalIF":8.7000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-Time Scheduling for Flexible Job Shop With AGVs Using Multiagent Reinforcement Learning and Efficient Action Decoding\",\"authors\":\"Yuxin Li;Qingzheng Wang;Xinyu Li;Liang Gao;Ling Fu;Yanbin Yu;Wei Zhou\",\"doi\":\"10.1109/TSMC.2024.3520381\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of automated guided vehicle (AGV) greatly improves the production efficiency of workshop. However, machine flexibility and limited logistics equipment increase the complexity of collaborative scheduling, and frequent dynamic events bring uncertainty. Therefore, this article proposes a real-time scheduling method for dynamic flexible job shop scheduling problem with AGVs using multiagent reinforcement learning (MARL). Specifically, a real-time scheduling framework is proposed in which a multiagent scheduling architecture is designed for achieving task selection, machine allocation and AGV allocation. Then, an action space and an efficient action decoding algorithm are proposed, which enable agents to explore in the high-quality solution space and improve the learning efficiency. In addition, a state space with generalization, a reward function considering machine idle time and a strategy for handling four disturbance events are designed to minimize the total tardiness cost. Comparison experiments show that the proposed method outperforms the priority dispatching rules, genetic programming and four popular reinforcement learning (RL)-based methods, with performance improvements mostly exceeding 10%. Furthermore, experiments considering four disturbance events demonstrate that the proposed method has strong robustness, and it can provide appropriate scheme for uncertain manufacturing system.\",\"PeriodicalId\":48915,\"journal\":{\"name\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"volume\":\"55 3\",\"pages\":\"2120-2132\"},\"PeriodicalIF\":8.7000,\"publicationDate\":\"2025-01-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10829984/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10829984/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Real-Time Scheduling for Flexible Job Shop With AGVs Using Multiagent Reinforcement Learning and Efficient Action Decoding
The application of automated guided vehicle (AGV) greatly improves the production efficiency of workshop. However, machine flexibility and limited logistics equipment increase the complexity of collaborative scheduling, and frequent dynamic events bring uncertainty. Therefore, this article proposes a real-time scheduling method for dynamic flexible job shop scheduling problem with AGVs using multiagent reinforcement learning (MARL). Specifically, a real-time scheduling framework is proposed in which a multiagent scheduling architecture is designed for achieving task selection, machine allocation and AGV allocation. Then, an action space and an efficient action decoding algorithm are proposed, which enable agents to explore in the high-quality solution space and improve the learning efficiency. In addition, a state space with generalization, a reward function considering machine idle time and a strategy for handling four disturbance events are designed to minimize the total tardiness cost. Comparison experiments show that the proposed method outperforms the priority dispatching rules, genetic programming and four popular reinforcement learning (RL)-based methods, with performance improvements mostly exceeding 10%. Furthermore, experiments considering four disturbance events demonstrate that the proposed method has strong robustness, and it can provide appropriate scheme for uncertain manufacturing system.
期刊介绍:
The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.