Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling

IF 2.1 3区工程技术 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Machines Pub Date : 2023-12-22 DOI:10.3390/machines12010008

Shaoming Peng Gang Xiong Jing Yang Zhen Shen Tariku Sinshaw Tamir Zhikun Tao Fei-Yue Wang Yunjun Han

{"title":"Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling","authors":"Shaoming Peng, Gang Xiong, Jing Yang, Zhen Shen, Tariku Sinshaw Tamir, Zhikun Tao, Yunjun Han, Fei-Yue Wang","doi":"10.3390/machines12010008","DOIUrl":null,"url":null,"abstract":"An extended flexible job scheduling problem is presented with characteristics of technology and path flexibility (dual flexibility), varied transportation time, and an uncertain environment. The scheduling can greatly increase efficiency and security in complex scenarios, e.g., distributed vehicle manufacturing, and multiple aircraft maintenance. However, optimizing the scheduling puts forward higher requirements on accuracy, real time, and generalization, while subject to the curse of dimension and usually incomplete information. Various coupling relations among operations, stations, and resources aggravate the problem. To deal with the above challenges, we propose a multi-agent reinforcement learning algorithm where the scheduling environment is modeled as a decentralized partially observable Markov decision process. Each job is regarded as an agent that decides the next triplet, i.e., operation, station, and employed resource. This paper is novel in addressing the flexible job shop scheduling problem with dual flexibility and varied transportation time in consideration and proposing a double Q-value mixing (DQMIX) optimization algorithm under a multi-agent reinforcement learning framework. The experiments of our case study show that the DQMIX algorithm outperforms existing multi-agent reinforcement learning algorithms in terms of solution accuracy, stability, and generalization. In addition, it achieves better solution quality for larger-scale cases than traditional intelligent optimization algorithms.","PeriodicalId":48519,"journal":{"name":"Machines","volume":"8 4","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machines","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/machines12010008","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

An extended flexible job scheduling problem is presented with characteristics of technology and path flexibility (dual flexibility), varied transportation time, and an uncertain environment. The scheduling can greatly increase efficiency and security in complex scenarios, e.g., distributed vehicle manufacturing, and multiple aircraft maintenance. However, optimizing the scheduling puts forward higher requirements on accuracy, real time, and generalization, while subject to the curse of dimension and usually incomplete information. Various coupling relations among operations, stations, and resources aggravate the problem. To deal with the above challenges, we propose a multi-agent reinforcement learning algorithm where the scheduling environment is modeled as a decentralized partially observable Markov decision process. Each job is regarded as an agent that decides the next triplet, i.e., operation, station, and employed resource. This paper is novel in addressing the flexible job shop scheduling problem with dual flexibility and varied transportation time in consideration and proposing a double Q-value mixing (DQMIX) optimization algorithm under a multi-agent reinforcement learning framework. The experiments of our case study show that the DQMIX algorithm outperforms existing multi-agent reinforcement learning algorithms in terms of solution accuracy, stability, and generalization. In addition, it achieves better solution quality for larger-scale cases than traditional intelligent optimization algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于扩展灵活作业车间调度的多代理强化学习

本文提出了一个扩展的柔性作业调度问题，该问题具有技术和路径柔性（双重柔性）、不同的运输时间和不确定的环境等特点。在复杂场景下，如分布式车辆制造和多架飞机维护，调度可以大大提高效率和安全性。然而，优化调度对准确性、实时性和概括性提出了更高的要求，同时还受到维度诅咒和通常不完整信息的影响。操作、站点和资源之间的各种耦合关系也加剧了问题的严重性。为应对上述挑战，我们提出了一种多代理强化学习算法，将调度环境建模为分散的部分可观测马尔可夫决策过程。每个作业都被视为一个代理，它决定下一个三元组，即操作、站点和使用的资源。本文在多代理强化学习框架下，针对考虑了双重灵活性和不同运输时间的柔性作业车间调度问题，提出了一种双 Q 值混合（DQMIX）优化算法，具有新颖性。我们的案例研究实验表明，DQMIX 算法在求解精度、稳定性和泛化方面优于现有的多代理强化学习算法。此外，与传统的智能优化算法相比，DQMIX 算法在更大规模的案例中能获得更好的解质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

IF 0 Procedia CIRPPub Date : 2022-01-01 DOI: 10.1016/j.procir.2022.09.024

Peter Burggräf , Johannes Wagner , Till Saßmannshausen , Dennis Ohrndorf , Karthik Subramani

Using Multi-Agent Deep Reinforcement Learning For Flexible Job Shop Scheduling Problems

IF 0 Procedia CIRPPub Date : 2022-01-01 DOI: 10.1016/j.procir.2022.09.039

Jens Popper , Martin Ruskowski

来源期刊

Machines Multiple-

CiteScore

3.00

自引率

26.90%

发文量

1012

审稿时长

11 weeks

期刊介绍： Machines (ISSN 2075-1702) is an international, peer-reviewed journal on machinery and engineering. It publishes research articles, reviews, short communications and letters. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. Full experimental and/or methodical details must be provided. There are, in addition, unique features of this journal: *manuscripts regarding research proposals and research ideas will be particularly welcomed *electronic files or software regarding the full details of the calculation and experimental procedure - if unable to be published in a normal way - can be deposited as supplementary material Subject Areas: applications of automation, systems and control engineering, electronic engineering, mechanical engineering, computer engineering, mechatronics, robotics, industrial design, human-machine-interfaces, mechanical systems, machines and related components, machine vision, history of technology and industrial revolution, turbo machinery, machine diagnostics and prognostics (condition monitoring), machine design.