Dynamic parallel machine scheduling with random breakdowns using the learning agent
Biao Yuan, Zhibin Jiang, Lei Wang
International Journal of Services Operations and Informatics, vol. 8, no. 1, p. 94. Published 2016-11-12. DOI: 10.1504/IJSOI.2016.10001001 (https://doi.org/10.1504/IJSOI.2016.10001001)
Agent technology has been widely applied in manufacturing because of its flexibility, autonomy, and scalability. This paper proposes a learning agent to solve a dynamic parallel machine scheduling problem with random machine breakdowns. The agent, which is based on the Q-learning algorithm, dynamically assigns arriving jobs to idle machines according to the current state of its environment. A state-action table that accounts for machine breakdowns is constructed to define the state of the agent's environment. Three dispatching rules, SPT (Shortest Processing Time), EDD (Earliest Due Date) and FCFS (First Come First Served), serve as the agent's actions, and the agent selects among them with an ε-greedy policy. In the simulation experiment, two objectives, minimising the maximum lateness and minimising the percentage of tardy jobs, are used to validate the learning agent. The results demonstrate that the proposed agent is suitable for the complex parallel machine environment.
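The mechanism described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the state encoding, the reward signal, or the learning parameters, so the `Job` fields, the dictionary-based Q-table, and the default `alpha` and `gamma` values below are all assumptions made for illustration. Only the three rules (SPT, EDD, FCFS), the ε-greedy selection, and the use of Q-learning come from the text.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical job record; field names are assumptions."""
    arrival: float     # time the job joined the queue
    processing: float  # processing time
    due: float         # due date


# The three dispatching rules named in the abstract, used as actions.
# Each rule picks one job from a non-empty queue.
RULES = {
    "SPT": lambda queue: min(queue, key=lambda j: j.processing),
    "EDD": lambda queue: min(queue, key=lambda j: j.due),
    "FCFS": lambda queue: min(queue, key=lambda j: j.arrival),
}
ACTIONS = list(RULES)


def select_action(q_table, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, otherwise
    take the action with the highest Q-value (unseen pairs count as 0)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_table.get((state, a), 0.0))


def q_update(q_table, state, action, reward, next_state,
             alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update; alpha and gamma are
    illustrative defaults, not values from the paper."""
    best_next = max(q_table.get((next_state, a), 0.0) for a in ACTIONS)
    old = q_table.get((state, action), 0.0)
    q_table[(state, action)] = old + alpha * (reward + gamma * best_next - old)


if __name__ == "__main__":
    rng = random.Random(0)
    queue = [Job(0.0, 5.0, 20.0), Job(1.0, 2.0, 30.0), Job(2.0, 4.0, 10.0)]
    q_table = {}
    state = ("one_machine_down", "queue_len_3")  # hypothetical state label
    action = select_action(q_table, state, epsilon=0.1, rng=rng)
    job = RULES[action](queue)
    # Hypothetical reward: penalise lateness if the job started at time 3.0.
    reward = -max(0.0, 3.0 + job.processing - job.due)
    q_update(q_table, state, action, reward, next_state=state)
    print(action, job, q_table)
```

In a full simulation the state would be recomputed after every arrival, completion, breakdown, or repair event, and the reward would be tied to whichever objective is being optimised (maximum lateness or the percentage of tardy jobs).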
About the journal:
The advances in distributed computing and networks make it possible to link people, heterogeneous service providers and physically isolated services efficiently and cost-effectively. As the economic dynamics and the complexity of service operations continue to increase, it becomes a critical challenge to leverage information technology in achieving world-class quality and productivity in the production and delivery of physical goods and services. The IJSOI, a fully refereed journal, provides the primary forum for both academic and industry researchers and practitioners to propose and foster discussion on state-of-the-art research and development in the areas of service operations and the role of informatics towards improving their efficiency and competitiveness.