Flow-Shop Scheduling Problem With Batch Processing Machines via Deep Reinforcement Learning for Industrial Internet of Things

IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Emerging Topics in Computational Intelligence Pub Date : 2024-03-28 DOI:10.1109/TETCI.2024.3402685
Zihui Luo;Chengling Jiang;Liang Liu;Xiaolong Zheng;Huadong Ma
{"title":"Flow-Shop Scheduling Problem With Batch Processing Machines via Deep Reinforcement Learning for Industrial Internet of Things","authors":"Zihui Luo;Chengling Jiang;Liang Liu;Xiaolong Zheng;Huadong Ma","doi":"10.1109/TETCI.2024.3402685","DOIUrl":null,"url":null,"abstract":"The rapidly evolving Industrial Internet of Things (IIoT) is driving the transition from conventional manufacturing to intelligent manufacturing. Intelligent shop scheduling, as one of the essential components of intelligent manufacturing in IIoT, is desired to allocate jobs on different machines to achieve specific production targets. The flow-shop scheduling problem with batch processing machines (FSSP-BPM), which widely exists in real-world manufacturing, requires two distinct but interdependent decisions: batch formation and job scheduling. Existing approaches rely on fixed search paradigms that utilize expert knowledge to find satisfactory solutions. However, these methods struggle to ensure solution quality under real-time constraints due to the varying data distribution and the complexity of large-scale practical problems. To address this challenge, we propose a deep reinforcement learning (DRL) based method. First, we formulate the FSSP-BPM decision process as a Markov Decision Process (MDP) and design the corresponding state, action, and reward. Second, we propose a basic scheduling framework based on an encoder-decoder model with the attention mechanism. Finally, we design a batch formation module and a scheduling module trained on unlabeled multi-dimensional data. Extensive experiments on public benchmark datasets and actual production data demonstrate that the proposed method outperforms baseline algorithms and improves makespan performance by an average of 8.33%.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3518-3533"},"PeriodicalIF":5.3000,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10540330/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The rapidly evolving Industrial Internet of Things (IIoT) is driving the transition from conventional manufacturing to intelligent manufacturing. Intelligent shop scheduling, as one of the essential components of intelligent manufacturing in IIoT, is desired to allocate jobs on different machines to achieve specific production targets. The flow-shop scheduling problem with batch processing machines (FSSP-BPM), which widely exists in real-world manufacturing, requires two distinct but interdependent decisions: batch formation and job scheduling. Existing approaches rely on fixed search paradigms that utilize expert knowledge to find satisfactory solutions. However, these methods struggle to ensure solution quality under real-time constraints due to the varying data distribution and the complexity of large-scale practical problems. To address this challenge, we propose a deep reinforcement learning (DRL) based method. First, we formulate the FSSP-BPM decision process as a Markov Decision Process (MDP) and design the corresponding state, action, and reward. Second, we propose a basic scheduling framework based on an encoder-decoder model with the attention mechanism. Finally, we design a batch formation module and a scheduling module trained on unlabeled multi-dimensional data. Extensive experiments on public benchmark datasets and actual production data demonstrate that the proposed method outperforms baseline algorithms and improves makespan performance by an average of 8.33%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过深度强化学习解决工业物联网批量处理机的流水线调度问题
快速发展的工业物联网(IIoT)正在推动传统制造向智能制造转型。智能车间调度作为 IIoT 智能制造的重要组成部分之一,旨在将作业分配到不同的机器上,以实现特定的生产目标。批量加工机器的流水车间调度问题(FSSP-BPM)广泛存在于现实世界的制造业中,它需要两个不同但相互依存的决策:批量形成和作业调度。现有方法依赖于固定搜索范式,利用专家知识找到令人满意的解决方案。然而,由于数据分布的变化和大规模实际问题的复杂性,这些方法很难在实时约束条件下确保解决方案的质量。为了应对这一挑战,我们提出了一种基于深度强化学习(DRL)的方法。首先,我们将 FSSP-BPM 决策过程表述为马尔可夫决策过程(MDP),并设计相应的状态、行动和奖励。其次,我们提出了一个基于编码器-解码器模型和注意力机制的基本调度框架。最后,我们设计了一个批次形成模块和一个在无标记多维数据上训练的调度模块。在公共基准数据集和实际生产数据上进行的大量实验表明,所提出的方法优于基准算法,平均提高了 8.33% 的时间跨度性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
10.30
自引率
7.50%
发文量
147
期刊介绍: The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys. TETCI is an electronics only publication. TETCI publishes six issues per year. Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.
期刊最新文献
Table of Contents IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors IEEE Computational Intelligence Society Information Decentralized Triggering and Event-Based Integral Reinforcement Learning for Multiplayer Differential Game Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1