Zimiao He, Kunlan Wang, Hanxiao Li, Hong Song, Zhongjie Lin, Kaizhou Gao, Ali Sadollah
{"title":"求解排列流车间调度问题的改进Q学习算法","authors":"Zimiao He, Kunlan Wang, Hanxiao Li, Hong Song, Zhongjie Lin, Kaizhou Gao, Ali Sadollah","doi":"10.1049/cim2.12042","DOIUrl":null,"url":null,"abstract":"<p>Generally, scheduling problems refer to allocation of available shared resources and the sorting of production tasks, in order to satisfy the specified performance target within a certain time. The fundamental scheduling problem is that all jobs need to be processed on the same route, which is called flow shop scheduling problems (FSSP). The goal of FSSP, proven as an NP-hard problem, is to find a job sequence that minimizes the makespan. In this paper, an improved <i>Q</i>-learning algorithm is proposed for solving the FSSP. Firstly, a problem model based on the basic <i>Q</i>-learning algorithm is constructed. The makespan is used as the feedback signal, and the process of environmental state change is defined as the process of job selection. <i>Q</i>-learning gives the expected utility of taking a given action in a given state. Afterwards, combined with the NEH heuristic, the algorithm efficiency is enhanced by changing the job inserting mode. In order to validate the proposed method, several simulation experiments are carried out on a set of test problems having different scales. The obtained optimization results of the proposed algorithm are compared to the standard <i>Q</i>-learning algorithm and a hybrid algorithm. The discussion and analysis show that the proposed algorithm performs better than the others in solving the permutation FSSP. As a future direction, in order to shorten the running time, further improvements will be studied to increase the performance of the proposed algorithm and make it applicable and efficient for solving multi-objective optimization problems.</p>","PeriodicalId":33286,"journal":{"name":"IET Collaborative Intelligent Manufacturing","volume":"4 1","pages":"35-44"},"PeriodicalIF":2.5000,"publicationDate":"2021-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cim2.12042","citationCount":"7","resultStr":"{\"title\":\"Improved Q-learning algorithm for solving permutation flow shop scheduling problems\",\"authors\":\"Zimiao He, Kunlan Wang, Hanxiao Li, Hong Song, Zhongjie Lin, Kaizhou Gao, Ali Sadollah\",\"doi\":\"10.1049/cim2.12042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Generally, scheduling problems refer to allocation of available shared resources and the sorting of production tasks, in order to satisfy the specified performance target within a certain time. The fundamental scheduling problem is that all jobs need to be processed on the same route, which is called flow shop scheduling problems (FSSP). The goal of FSSP, proven as an NP-hard problem, is to find a job sequence that minimizes the makespan. In this paper, an improved <i>Q</i>-learning algorithm is proposed for solving the FSSP. Firstly, a problem model based on the basic <i>Q</i>-learning algorithm is constructed. The makespan is used as the feedback signal, and the process of environmental state change is defined as the process of job selection. <i>Q</i>-learning gives the expected utility of taking a given action in a given state. Afterwards, combined with the NEH heuristic, the algorithm efficiency is enhanced by changing the job inserting mode. In order to validate the proposed method, several simulation experiments are carried out on a set of test problems having different scales. The obtained optimization results of the proposed algorithm are compared to the standard <i>Q</i>-learning algorithm and a hybrid algorithm. The discussion and analysis show that the proposed algorithm performs better than the others in solving the permutation FSSP. As a future direction, in order to shorten the running time, further improvements will be studied to increase the performance of the proposed algorithm and make it applicable and efficient for solving multi-objective optimization problems.</p>\",\"PeriodicalId\":33286,\"journal\":{\"name\":\"IET Collaborative Intelligent Manufacturing\",\"volume\":\"4 1\",\"pages\":\"35-44\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2021-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cim2.12042\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Collaborative Intelligent Manufacturing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cim2.12042\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Collaborative Intelligent Manufacturing","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cim2.12042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
Improved Q-learning algorithm for solving permutation flow shop scheduling problems
Generally, scheduling problems refer to allocation of available shared resources and the sorting of production tasks, in order to satisfy the specified performance target within a certain time. The fundamental scheduling problem is that all jobs need to be processed on the same route, which is called flow shop scheduling problems (FSSP). The goal of FSSP, proven as an NP-hard problem, is to find a job sequence that minimizes the makespan. In this paper, an improved Q-learning algorithm is proposed for solving the FSSP. Firstly, a problem model based on the basic Q-learning algorithm is constructed. The makespan is used as the feedback signal, and the process of environmental state change is defined as the process of job selection. Q-learning gives the expected utility of taking a given action in a given state. Afterwards, combined with the NEH heuristic, the algorithm efficiency is enhanced by changing the job inserting mode. In order to validate the proposed method, several simulation experiments are carried out on a set of test problems having different scales. The obtained optimization results of the proposed algorithm are compared to the standard Q-learning algorithm and a hybrid algorithm. The discussion and analysis show that the proposed algorithm performs better than the others in solving the permutation FSSP. As a future direction, in order to shorten the running time, further improvements will be studied to increase the performance of the proposed algorithm and make it applicable and efficient for solving multi-objective optimization problems.
期刊介绍:
IET Collaborative Intelligent Manufacturing is a Gold Open Access journal that focuses on the development of efficient and adaptive production and distribution systems. It aims to meet the ever-changing market demands by publishing original research on methodologies and techniques for the application of intelligence, data science, and emerging information and communication technologies in various aspects of manufacturing, such as design, modeling, simulation, planning, and optimization of products, processes, production, and assembly.
The journal is indexed in COMPENDEX (Elsevier), Directory of Open Access Journals (DOAJ), Emerging Sources Citation Index (Clarivate Analytics), INSPEC (IET), SCOPUS (Elsevier) and Web of Science (Clarivate Analytics).