Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling

IF 2.1 3区 工程技术 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Machines Pub Date : 2023-12-22 DOI:10.3390/machines12010008
Shaoming Peng, Gang Xiong, Jing Yang, Zhen Shen, Tariku Sinshaw Tamir, Zhikun Tao, Yunjun Han, Fei-Yue Wang
{"title":"Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling","authors":"Shaoming Peng, Gang Xiong, Jing Yang, Zhen Shen, Tariku Sinshaw Tamir, Zhikun Tao, Yunjun Han, Fei-Yue Wang","doi":"10.3390/machines12010008","DOIUrl":null,"url":null,"abstract":"An extended flexible job scheduling problem is presented with characteristics of technology and path flexibility (dual flexibility), varied transportation time, and an uncertain environment. The scheduling can greatly increase efficiency and security in complex scenarios, e.g., distributed vehicle manufacturing, and multiple aircraft maintenance. However, optimizing the scheduling puts forward higher requirements on accuracy, real time, and generalization, while subject to the curse of dimension and usually incomplete information. Various coupling relations among operations, stations, and resources aggravate the problem. To deal with the above challenges, we propose a multi-agent reinforcement learning algorithm where the scheduling environment is modeled as a decentralized partially observable Markov decision process. Each job is regarded as an agent that decides the next triplet, i.e., operation, station, and employed resource. This paper is novel in addressing the flexible job shop scheduling problem with dual flexibility and varied transportation time in consideration and proposing a double Q-value mixing (DQMIX) optimization algorithm under a multi-agent reinforcement learning framework. The experiments of our case study show that the DQMIX algorithm outperforms existing multi-agent reinforcement learning algorithms in terms of solution accuracy, stability, and generalization. In addition, it achieves better solution quality for larger-scale cases than traditional intelligent optimization algorithms.","PeriodicalId":48519,"journal":{"name":"Machines","volume":"8 4","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machines","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/machines12010008","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

An extended flexible job scheduling problem is presented with characteristics of technology and path flexibility (dual flexibility), varied transportation time, and an uncertain environment. The scheduling can greatly increase efficiency and security in complex scenarios, e.g., distributed vehicle manufacturing, and multiple aircraft maintenance. However, optimizing the scheduling puts forward higher requirements on accuracy, real time, and generalization, while subject to the curse of dimension and usually incomplete information. Various coupling relations among operations, stations, and resources aggravate the problem. To deal with the above challenges, we propose a multi-agent reinforcement learning algorithm where the scheduling environment is modeled as a decentralized partially observable Markov decision process. Each job is regarded as an agent that decides the next triplet, i.e., operation, station, and employed resource. This paper is novel in addressing the flexible job shop scheduling problem with dual flexibility and varied transportation time in consideration and proposing a double Q-value mixing (DQMIX) optimization algorithm under a multi-agent reinforcement learning framework. The experiments of our case study show that the DQMIX algorithm outperforms existing multi-agent reinforcement learning algorithms in terms of solution accuracy, stability, and generalization. In addition, it achieves better solution quality for larger-scale cases than traditional intelligent optimization algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于扩展灵活作业车间调度的多代理强化学习
本文提出了一个扩展的柔性作业调度问题,该问题具有技术和路径柔性(双重柔性)、不同的运输时间和不确定的环境等特点。在复杂场景下,如分布式车辆制造和多架飞机维护,调度可以大大提高效率和安全性。然而,优化调度对准确性、实时性和概括性提出了更高的要求,同时还受到维度诅咒和通常不完整信息的影响。操作、站点和资源之间的各种耦合关系也加剧了问题的严重性。为应对上述挑战,我们提出了一种多代理强化学习算法,将调度环境建模为分散的部分可观测马尔可夫决策过程。每个作业都被视为一个代理,它决定下一个三元组,即操作、站点和使用的资源。本文在多代理强化学习框架下,针对考虑了双重灵活性和不同运输时间的柔性作业车间调度问题,提出了一种双 Q 值混合(DQMIX)优化算法,具有新颖性。我们的案例研究实验表明,DQMIX 算法在求解精度、稳定性和泛化方面优于现有的多代理强化学习算法。此外,与传统的智能优化算法相比,DQMIX 算法在更大规模的案例中能获得更好的解质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Machines
Machines Multiple-
CiteScore
3.00
自引率
26.90%
发文量
1012
审稿时长
11 weeks
期刊介绍: Machines (ISSN 2075-1702) is an international, peer-reviewed journal on machinery and engineering. It publishes research articles, reviews, short communications and letters. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. Full experimental and/or methodical details must be provided. There are, in addition, unique features of this journal: *manuscripts regarding research proposals and research ideas will be particularly welcomed *electronic files or software regarding the full details of the calculation and experimental procedure - if unable to be published in a normal way - can be deposited as supplementary material Subject Areas: applications of automation, systems and control engineering, electronic engineering, mechanical engineering, computer engineering, mechatronics, robotics, industrial design, human-machine-interfaces, mechanical systems, machines and related components, machine vision, history of technology and industrial revolution, turbo machinery, machine diagnostics and prognostics (condition monitoring), machine design.
期刊最新文献
Investigative Study of the Effect of Damping and Stiffness Nonlinearities on an Electromagnetic Energy Harvester at Low-Frequency Excitations Vibration Research on Centrifugal Loop Dryer Machines Used in Plastic Recycling Processes Novel Design of Variable Stiffness Pneumatic Flexible Shaft Coupling: Determining the Mathematical-Physical Model and Potential Benefits Considerations on the Dynamics of Biofidelic Sensors in the Assessment of Human–Robot Impacts Structural Design with Self-Weight and Inertial Loading Using Simulated Annealing for Non-Gradient Topology Optimization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1