Deep reinforcement learning based medical supplies dispatching model for major infectious diseases: Case study of COVID-19

IF 3.7 4区管理学 Q2 OPERATIONS RESEARCH & MANAGEMENT SCIENCE Operations Research Perspectives Pub Date : 2023-12-01 Epub Date: 2023-11-23 DOI:10.1016/j.orp.2023.100293

Jia-Ying Zeng , Ping Lu , Ying Wei , Xin Chen , Kai-Biao Lin

{"title":"Deep reinforcement learning based medical supplies dispatching model for major infectious diseases: Case study of COVID-19","authors":"Jia-Ying Zeng , Ping Lu , Ying Wei , Xin Chen , Kai-Biao Lin","doi":"10.1016/j.orp.2023.100293","DOIUrl":null,"url":null,"abstract":"<div><p>Stockpiling and scheduling plans for medical supplies represent essential preventive and control measures in major public health events. In the face of major infectious diseases, such as the novel coronavirus disease (COVID-19), the outbreak trend and variability of disease strains are often unpredictable. Hence, it is necessary to optimally adjust the prevention and control dispatching strategy according to the circumstances and outbreak locations to maintain economic development while ensuring the human health survival, however, many models in this scenario seldom consider the dynamic material prediction and the measurement of multiple costs at the same time. Taking the COVID-19 scenario as a case study, we establish a deep reinforcement learning (DRL)-based medical supplies dispatching (MSD) model for major infectious diseases, considering the volatility of the COVID-19 situation and the discrepancy between medical material demand and supply due to the high infectiousness of the Omicron series strains. The present model has three main components: 1) First, for the dynamic medical material prediction problem in complex infectious disease scenarios, taking the lifted COVID-19 lockdown scenario as an example, the modified susceptible-exposed-infected-recovered (SEIR) model was utilized to analyze the spread of the COVID-19, understand its characteristics, and map out the related medical supplies demand; 2) Second, to break away from the previous premise of only considering supply-demand, this study adds scheduling rules and cost function that weighs health and economic costs. An epidemic dispatching optimization model (Epi_DispatchOptim) was established using the OpenAI Gym toolkit to form an environment structure with virus transmission space, and emergency MSD while considering both human health and economic costs. This architecture interprets the balance between the supply-demand of medical supplies and reflects the importance of MSD in the balanced development of health and economy under the spread of infectious diseases; 3) Finally, the MSD strategy under the balance of health and economic cost is explored in Epi_DispatchOptim using reinforcement learning (RL) and the evolutionary algorithm (EA). Experiments conducted on two datasets indicate that the RL and EA reduce economic as well as health costs compared to the original environmental strategies. The above study illustrates how to use epidemiological models to predict the demand for healthcare supplies as the premise of scheduling models, and use Epi_DispatchOptim to explore the dynamic MSD decisions under mortality and economic equilibrium. In Shanghai, China, the economic cost of the exploration strategy is reduced by 27.36–27.07B compared to static scheduling, and deaths are reduced by 126–150 in 150 day compared to the no-intervention scenario. By integrating knowledge of epidemiology, optimal decision making, and economics, Epi_DispatchOptim further constructs epidemiological models, cost functions, state-action spaces, and other modules to assist public health decision makers in adopting appropriate MSD strategies for major public health event.</p></div>","PeriodicalId":38055,"journal":{"name":"Operations Research Perspectives","volume":"11 ","pages":"Article 100293"},"PeriodicalIF":3.7000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214716023000283/pdfft?md5=ea4f042b5fe351d77ed253105f2650f7&pid=1-s2.0-S2214716023000283-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Operations Research Perspectives","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214716023000283","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/11/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Stockpiling and scheduling plans for medical supplies represent essential preventive and control measures in major public health events. In the face of major infectious diseases, such as the novel coronavirus disease (COVID-19), the outbreak trend and variability of disease strains are often unpredictable. Hence, it is necessary to optimally adjust the prevention and control dispatching strategy according to the circumstances and outbreak locations to maintain economic development while ensuring the human health survival, however, many models in this scenario seldom consider the dynamic material prediction and the measurement of multiple costs at the same time. Taking the COVID-19 scenario as a case study, we establish a deep reinforcement learning (DRL)-based medical supplies dispatching (MSD) model for major infectious diseases, considering the volatility of the COVID-19 situation and the discrepancy between medical material demand and supply due to the high infectiousness of the Omicron series strains. The present model has three main components: 1) First, for the dynamic medical material prediction problem in complex infectious disease scenarios, taking the lifted COVID-19 lockdown scenario as an example, the modified susceptible-exposed-infected-recovered (SEIR) model was utilized to analyze the spread of the COVID-19, understand its characteristics, and map out the related medical supplies demand; 2) Second, to break away from the previous premise of only considering supply-demand, this study adds scheduling rules and cost function that weighs health and economic costs. An epidemic dispatching optimization model (Epi_DispatchOptim) was established using the OpenAI Gym toolkit to form an environment structure with virus transmission space, and emergency MSD while considering both human health and economic costs. This architecture interprets the balance between the supply-demand of medical supplies and reflects the importance of MSD in the balanced development of health and economy under the spread of infectious diseases; 3) Finally, the MSD strategy under the balance of health and economic cost is explored in Epi_DispatchOptim using reinforcement learning (RL) and the evolutionary algorithm (EA). Experiments conducted on two datasets indicate that the RL and EA reduce economic as well as health costs compared to the original environmental strategies. The above study illustrates how to use epidemiological models to predict the demand for healthcare supplies as the premise of scheduling models, and use Epi_DispatchOptim to explore the dynamic MSD decisions under mortality and economic equilibrium. In Shanghai, China, the economic cost of the exploration strategy is reduced by 27.36–27.07B compared to static scheduling, and deaths are reduced by 126–150 in 150 day compared to the no-intervention scenario. By integrating knowledge of epidemiology, optimal decision making, and economics, Epi_DispatchOptim further constructs epidemiological models, cost functions, state-action spaces, and other modules to assist public health decision makers in adopting appropriate MSD strategies for major public health event.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度强化学习的重大传染病医疗物资调度模型——以2019冠状病毒病为例

医疗用品的储存和调度计划是重大公共卫生事件中必不可少的预防和控制措施。面对新型冠状病毒病(COVID-19)等重大传染病，疾病毒株的爆发趋势和变异往往是不可预测的。因此，在保证人类健康生存的同时，需要根据具体情况和疫情发生地对防控调度策略进行优化调整，但这种情况下的许多模型很少同时考虑动态物质预测和多重成本的测量。以新冠肺炎疫情为例，考虑新冠肺炎疫情的波动性和欧米克隆系列菌株高传染性导致的医疗物资供需差异，建立了基于深度强化学习(DRL)的重大传染病医疗物资调度模型。该模型主要由三个部分组成:1)首先，针对复杂传染病场景下的动态物资预测问题，以新冠肺炎解除封锁场景为例，利用改进的易感暴露感染恢复(SEIR)模型分析新冠肺炎的传播情况，了解疫情特征，规划相关医疗物资需求;2)其次，打破了以往只考虑供需的前提，增加了调度规则和权衡健康成本和经济成本的成本函数。利用OpenAI Gym工具包建立疫情调度优化模型Epi_DispatchOptim，在考虑人类健康和经济成本的情况下，形成病毒传播空间和应急MSD的环境结构。这一体系结构诠释了医疗用品供需平衡，反映了传染病传播下MSD在卫生与经济平衡发展中的重要性;3)最后，利用强化学习(RL)和进化算法(EA)探讨了Epi_DispatchOptim在健康和经济成本平衡下的MSD策略。在两个数据集上进行的实验表明，与原始环境策略相比，RL和EA降低了经济和健康成本。本文以流行病学模型预测医疗物资需求为调度模型的前提，利用Epi_DispatchOptim研究死亡率和经济均衡下的动态MSD决策。在中国上海，与静态调度相比，该勘探策略的经济成本降低了27.36-27.07B，与不干预方案相比，150天内死亡人数减少了126-150人。Epi_DispatchOptim通过整合流行病学、最优决策和经济学知识，进一步构建流行病学模型、成本函数、状态-行动空间等模块，帮助公共卫生决策者在重大公共卫生事件中采取适当的MSD策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊