A multi-level action coupling reinforcement learning approach for online two-stage flexible assembly flow shop scheduling

IF 12.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Journal of Manufacturing Systems Pub Date : 2024-08-16 DOI:10.1016/j.jmsy.2024.08.006
Junhao Qiu, Jianjun Liu, Zhantao Li, Xinjun Lai
{"title":"A multi-level action coupling reinforcement learning approach for online two-stage flexible assembly flow shop scheduling","authors":"Junhao Qiu,&nbsp;Jianjun Liu,&nbsp;Zhantao Li,&nbsp;Xinjun Lai","doi":"10.1016/j.jmsy.2024.08.006","DOIUrl":null,"url":null,"abstract":"<div><p>Multi-product centralized delivery and kitting assembly present significant challenges to hierarchical co-processing in multi-stage manufacturing systems. The combinations of priority dispatching rules at each level are transiently adaptive, and the performance in online scheduling deteriorates rapidly with changing environment. This paper investigates the selection of rule combinations for sustained high-performance responsive scheduling in two-stage flexible assembly flow shop scheduling problem with asynchronous execution and complex decision correlation. A Multi-Level Action Coupling Deep Q-Network (MALC-DQN) approach is proposed for adaptive integrated scheduling in hybrid processing and assembly shops. Firstly, the problem is skillfully established as an event-triggered integrated decision markov decision process. The prioritized batch experience replay mechanism is employed to retain the complete correlation information of key decision sequences. Then, coupling and sequence feature extraction modules are developed to enhance the agent’s ability to perceive execution process and the environment. Furthermore, the multi-level wait-limit mechanism and efficient action filtering mechanism are designed to mitigate ineffective waiting waste and action space explosion during learning. Finally, a series of sophisticated experiments are conducted to validate the effectiveness of the proposed methodology. In 20 actual instances of different sizes, MLAC-DQN outperformed its closest competitor, with a 26.6% improvement in average tardiness. Moreover, extraordinary robustness is demonstrated in 16 sets of experiments involving different configurations of resources, orders, and arrival concentration levels.</p></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"76 ","pages":"Pages 351-370"},"PeriodicalIF":12.2000,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612524001699","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

Abstract

Multi-product centralized delivery and kitting assembly present significant challenges to hierarchical co-processing in multi-stage manufacturing systems. The combinations of priority dispatching rules at each level are transiently adaptive, and the performance in online scheduling deteriorates rapidly with changing environment. This paper investigates the selection of rule combinations for sustained high-performance responsive scheduling in two-stage flexible assembly flow shop scheduling problem with asynchronous execution and complex decision correlation. A Multi-Level Action Coupling Deep Q-Network (MALC-DQN) approach is proposed for adaptive integrated scheduling in hybrid processing and assembly shops. Firstly, the problem is skillfully established as an event-triggered integrated decision markov decision process. The prioritized batch experience replay mechanism is employed to retain the complete correlation information of key decision sequences. Then, coupling and sequence feature extraction modules are developed to enhance the agent’s ability to perceive execution process and the environment. Furthermore, the multi-level wait-limit mechanism and efficient action filtering mechanism are designed to mitigate ineffective waiting waste and action space explosion during learning. Finally, a series of sophisticated experiments are conducted to validate the effectiveness of the proposed methodology. In 20 actual instances of different sizes, MLAC-DQN outperformed its closest competitor, with a 26.6% improvement in average tardiness. Moreover, extraordinary robustness is demonstrated in 16 sets of experiments involving different configurations of resources, orders, and arrival concentration levels.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在线两阶段柔性装配流程车间调度的多级行动耦合强化学习方法
多产品集中交付和成套装配给多级制造系统中的分级协同处理带来了巨大挑战。各层次的优先调度规则组合是瞬时自适应的,在线调度的性能会随着环境的变化而迅速下降。本文研究了在具有异步执行和复杂决策相关性的两阶段柔性装配流动车间调度问题中,如何选择规则组合以实现持续的高性能响应调度。针对混合加工和装配车间的自适应综合调度,提出了一种多级动作耦合深度 Q 网络(MALC-DQN)方法。首先,将问题巧妙地建立为一个事件触发的综合决策马尔可夫决策过程。采用优先批次经验重放机制保留关键决策序列的完整相关信息。然后,开发了耦合和序列特征提取模块,以增强代理对执行过程和环境的感知能力。此外,还设计了多级等待限制机制和高效行动过滤机制,以减少学习过程中的无效等待浪费和行动空间爆炸。最后,我们进行了一系列复杂的实验来验证所提方法的有效性。在 20 个不同规模的实际实例中,MLAC-DQN 的表现优于其最接近的竞争对手,平均延迟时间提高了 26.6%。此外,在涉及不同资源配置、订单和到达集中程度的 16 组实验中,MLAC-DQN 也表现出了非凡的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Manufacturing Systems
Journal of Manufacturing Systems 工程技术-工程:工业
CiteScore
23.30
自引率
13.20%
发文量
216
审稿时长
25 days
期刊介绍: The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs. With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.
期刊最新文献
Material removal rate optimization with bayesian optimized differential evolution based on deep learning in robotic polishing Leveraging AI for energy-efficient manufacturing systems: Review and future prospectives Investigation of assistance systems in assembly in the context of digitalization: A systematic literature review Machining parameter optimization for a batch milling system using multi-task deep reinforcement learning A dynamic artificial bee colony for fuzzy distributed energy-efficient hybrid flow shop scheduling with batch processing machines
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1