Adaptive manufacturing control with Deep Reinforcement Learning for dynamic WIP management in industry 4.0

IF 6.5 1区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Industrial Engineering Pub Date : 2025-04-01 Epub Date: 2025-02-17 DOI:10.1016/j.cie.2025.110966
Silvestro Vespoli, Giulio Mattera, Maria Grazia Marchesano, Luigi Nele, Guido Guizzi
{"title":"Adaptive manufacturing control with Deep Reinforcement Learning for dynamic WIP management in industry 4.0","authors":"Silvestro Vespoli,&nbsp;Giulio Mattera,&nbsp;Maria Grazia Marchesano,&nbsp;Luigi Nele,&nbsp;Guido Guizzi","doi":"10.1016/j.cie.2025.110966","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of Industry 4.0, manufacturing systems face increased complexity and uncertainty due to elevated product customisation and demand variability. This paper presents a novel framework for adaptive Work-In-Progress (WIP) control in semi-heterarchical architectures, addressing the limitations of traditional analytical methods that rely on exponential processing time distributions. Integrating Deep Reinforcement Learning (DRL) with Discrete Event Simulation (DES) enables model-free control of flow-shop production systems under non-exponential, stochastic processing times. A Deep Q-Network (DQN) agent dynamically manages WIP levels in a CONstant Work In Progress (CONWIP) environment, learning optimal control policies directly from system interactions. The framework’s effectiveness is demonstrated through extensive experiments with varying machine numbers, processing times, and system variability. The results show robust performance in tracking the target throughput and adapting the processing time variability, achieving Mean Absolute Percentual Errors (MAPE) in the throughput – calculated as the percentage difference between the actual and the target throughput – ranging from 0.3% to 2.3% with standard deviations of 5. 5% to 8. 4%. Key contributions include the development of a data-driven WIP control approach to overcome analytical methods’ limitations in stochastic environments, validating DQN agent adaptability across varying production scenarios, and demonstrating framework scalability in realistic manufacturing settings. This research bridges the gap between conventional WIP control methods and Industry 4.0 requirements, offering manufacturers an adaptive solution for enhanced production efficiency.</div></div>","PeriodicalId":55220,"journal":{"name":"Computers & Industrial Engineering","volume":"202 ","pages":"Article 110966"},"PeriodicalIF":6.5000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Industrial Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360835225001123","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

In the context of Industry 4.0, manufacturing systems face increased complexity and uncertainty due to elevated product customisation and demand variability. This paper presents a novel framework for adaptive Work-In-Progress (WIP) control in semi-heterarchical architectures, addressing the limitations of traditional analytical methods that rely on exponential processing time distributions. Integrating Deep Reinforcement Learning (DRL) with Discrete Event Simulation (DES) enables model-free control of flow-shop production systems under non-exponential, stochastic processing times. A Deep Q-Network (DQN) agent dynamically manages WIP levels in a CONstant Work In Progress (CONWIP) environment, learning optimal control policies directly from system interactions. The framework’s effectiveness is demonstrated through extensive experiments with varying machine numbers, processing times, and system variability. The results show robust performance in tracking the target throughput and adapting the processing time variability, achieving Mean Absolute Percentual Errors (MAPE) in the throughput – calculated as the percentage difference between the actual and the target throughput – ranging from 0.3% to 2.3% with standard deviations of 5. 5% to 8. 4%. Key contributions include the development of a data-driven WIP control approach to overcome analytical methods’ limitations in stochastic environments, validating DQN agent adaptability across varying production scenarios, and demonstrating framework scalability in realistic manufacturing settings. This research bridges the gap between conventional WIP control methods and Industry 4.0 requirements, offering manufacturers an adaptive solution for enhanced production efficiency.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的工业4.0动态在制品管理自适应制造控制
在工业4.0的背景下,由于产品定制和需求变化的增加,制造系统面临着越来越多的复杂性和不确定性。本文提出了一个半层次结构中自适应在制品(WIP)控制的新框架,解决了依赖于指数处理时间分布的传统分析方法的局限性。将深度强化学习(DRL)与离散事件仿真(DES)相结合,可以在非指数、随机处理时间下实现流车间生产系统的无模型控制。深度Q-Network (DQN)智能体在CONWIP环境中动态管理WIP水平,直接从系统交互中学习最优控制策略。该框架的有效性通过不同机器数量、处理时间和系统可变性的大量实验得到了证明。结果显示,在跟踪目标吞吐量和适应处理时间可变性方面具有强大的性能,实现吞吐量中的平均绝对百分比误差(MAPE)——以实际吞吐量与目标吞吐量之间的百分比差计算——范围从0.3%到2.3%,标准差为5。5%到8%。4%。主要贡献包括开发了一种数据驱动的在制品控制方法,以克服分析方法在随机环境中的局限性,验证了DQN代理在不同生产场景中的适应性,并展示了框架在实际制造环境中的可扩展性。这项研究弥合了传统在制品控制方法与工业4.0要求之间的差距,为制造商提供了提高生产效率的自适应解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Industrial Engineering
Computers & Industrial Engineering 工程技术-工程:工业
CiteScore
12.70
自引率
12.70%
发文量
794
审稿时长
10.6 months
期刊介绍: Computers & Industrial Engineering (CAIE) is dedicated to researchers, educators, and practitioners in industrial engineering and related fields. Pioneering the integration of computers in research, education, and practice, industrial engineering has evolved to make computers and electronic communication integral to its domain. CAIE publishes original contributions focusing on the development of novel computerized methodologies to address industrial engineering problems. It also highlights the applications of these methodologies to issues within the broader industrial engineering and associated communities. The journal actively encourages submissions that push the boundaries of fundamental theories and concepts in industrial engineering techniques.
期刊最新文献
Dynamic scheduling of automated manufacturing systems under logical and temporal constrains using staged Q-learning with curriculum guidance A domain knowledge-enhanced MBSE framework for developing knowledge-based inspection systems: A case study on overhead crane inspection Coordinated planning of autonomous rail rapid transit trains with flexible coupling operations and demand-responsive high-speed rail shuttle buses with same-platform transfers Remaining useful life prediction of systems under time-varying conditions based on dynamic weighted information fusion and an adaptive UKF Research on emergency material warehouse location and inventory prepositioning planning for pre-disaster response
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1