Enhancing Monte-Carlo Tree Search with Multi-Agent Deep Q-Network in Open Shop Scheduling

Oliver Lohse, Aaron Haag, Tizian Dagner
{"title":"Enhancing Monte-Carlo Tree Search with Multi-Agent Deep Q-Network in Open Shop Scheduling","authors":"Oliver Lohse, Aaron Haag, Tizian Dagner","doi":"10.1109/WCMEIM56910.2022.10021570","DOIUrl":null,"url":null,"abstract":"Production disruptions, e.g., machine breakdowns, cannot be predicted in any case. Such disruptions lead to a deviation of the planned and optimized production schedule, and the actual production process. Instead of manually re-routing products, an online scheduler can re-route products automatically and maintain the best possible production throughput. To establish such an online scheduler, a framework for combining Monte-Carlo Tree Search (MCTS) and a multi-agent Deep Q-Network (MADQN) to solve the Open Shop Scheduling Problem (OSSP) is developed. Similar to approaches of using some sort of single-agent to guide the MCTS during the exploration phase, this approach deploys a multi-agent. Although the combination of single agents and MCTS have shown promising results in relatively small environments, applications relying on this approach have a very limited number of use cases in a real production scenario due to the considerable number of machines and products [10]. However, for that particular use case, the multi-agents promise a scalable solution even for large environments [6]. To do so, the problem has to be formulated such that a multi-agent can solve it. In addition to that, a learning framework is presented, and the developed approach is compared to an MCTS and single-agent combination.","PeriodicalId":202270,"journal":{"name":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCMEIM56910.2022.10021570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Production disruptions, e.g., machine breakdowns, cannot be predicted in any case. Such disruptions lead to a deviation of the planned and optimized production schedule, and the actual production process. Instead of manually re-routing products, an online scheduler can re-route products automatically and maintain the best possible production throughput. To establish such an online scheduler, a framework for combining Monte-Carlo Tree Search (MCTS) and a multi-agent Deep Q-Network (MADQN) to solve the Open Shop Scheduling Problem (OSSP) is developed. Similar to approaches of using some sort of single-agent to guide the MCTS during the exploration phase, this approach deploys a multi-agent. Although the combination of single agents and MCTS have shown promising results in relatively small environments, applications relying on this approach have a very limited number of use cases in a real production scenario due to the considerable number of machines and products [10]. However, for that particular use case, the multi-agents promise a scalable solution even for large environments [6]. To do so, the problem has to be formulated such that a multi-agent can solve it. In addition to that, a learning framework is presented, and the developed approach is compared to an MCTS and single-agent combination.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
开放车间调度中基于多智能体深度q网络的蒙特卡罗树搜索增强
生产中断,如机器故障,在任何情况下都无法预测。这种中断导致计划和优化的生产计划和实际生产过程的偏差。在线调度程序可以自动重新路由产品,而不是手动重新路由产品,并保持最佳的生产吞吐量。为了建立这样一个在线调度程序,提出了一个将蒙特卡罗树搜索(MCTS)和多智能体深度q网络(MADQN)相结合的框架来解决开放式车间调度问题。与在探索阶段使用某种单一代理来指导MCTS的方法类似,这种方法部署了一个多代理。尽管单智能体和MCTS的结合在相对较小的环境中显示出了有希望的结果,但由于机器和产品的数量相当大,依赖于这种方法的应用在实际生产场景中的用例数量非常有限[10]。然而,对于那个特定的用例,多代理承诺了一个可扩展的解决方案,即使是在大型环境中[6]。要做到这一点,必须将问题公式化,使多代理能够解决它。除此之外,还提出了一个学习框架,并将所开发的方法与MCTS和单智能体组合进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design and Analysis of a Novel Soft Actuator with High Contraction Ratio Based on Nested Structure Design and Verification of Thermal Balance System for Electric Drive Transmission in Urban Public Transit Design and Experiment of a Novel Manipulator for Autonomous Harvesting Tomato Clusters Research on Young's Modulus Prediction Model of Particle Reinforced Composites The Liquid Rocket Engine Experiment Data Quality Improvement Based on 3σ-LMBP
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1