A novel ensemble ARIMA-LSTM approach for evaluating COVID-19 cases and future outbreak preparedness

IF 3.3 Health Care Science Pub Date : 2024-12-15 DOI:10.1002/hcs2.123

Somit Jain, Shobhit Agrawal, Eshaan Mohapatra, Kathiravan Srinivasan

{"title":"A novel ensemble ARIMA-LSTM approach for evaluating COVID-19 cases and future outbreak preparedness","authors":"Somit Jain, Shobhit Agrawal, Eshaan Mohapatra, Kathiravan Srinivasan","doi":"10.1002/hcs2.123","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>The global impact of the highly contagious COVID-19 virus has created unprecedented challenges, significantly impacting public health and economies worldwide. This research article conducts a time series analysis of COVID-19 data across various countries, including India, Brazil, Russia, and the United States, with a particular emphasis on total confirmed cases.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>The proposed approach combines auto-regressive integrated moving average (ARIMA)'s ability to capture linear trends and seasonality with long short-term memory (LSTM) networks, which are designed to learn complex nonlinear dependencies in the data. This hybrid approach surpasses both individual models and existing ARIMA-artificial neural network (ANN) hybrids, which often struggle with highly nonlinear time series like COVID-19 data. By integrating ARIMA and LSTM, the model aims to achieve superior forecasting accuracy compared to baseline models, including ARIMA, Gated Recurrent Unit (GRU), LSTM, and Prophet.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The hybrid ARIMA-LSTM model outperformed the benchmark models, achieving a mean absolute percentage error (MAPE) score of 2.4%. Among the benchmark models, GRU performed the best with a MAPE score of 2.9%, followed by LSTM with a score of 3.6%.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>The proposed ARIMA-LSTM hybrid model outperforms ARIMA, GRU, LSTM, Prophet, and the ARIMA-ANN hybrid model when evaluating using metrics like MAPE, symmetric mean absolute percentage error, and median absolute percentage error across all countries analyzed. These findings have the potential to significantly improve preparedness and response efforts by public health authorities, allowing for more efficient resource allocation and targeted interventions.</p>\n </section>\n </div>","PeriodicalId":100601,"journal":{"name":"Health Care Science","volume":"3 6","pages":"409-425"},"PeriodicalIF":3.3000,"publicationDate":"2024-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11671211/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Care Science","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hcs2.123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background

The global impact of the highly contagious COVID-19 virus has created unprecedented challenges, significantly impacting public health and economies worldwide. This research article conducts a time series analysis of COVID-19 data across various countries, including India, Brazil, Russia, and the United States, with a particular emphasis on total confirmed cases.

Methods

The proposed approach combines auto-regressive integrated moving average (ARIMA)'s ability to capture linear trends and seasonality with long short-term memory (LSTM) networks, which are designed to learn complex nonlinear dependencies in the data. This hybrid approach surpasses both individual models and existing ARIMA-artificial neural network (ANN) hybrids, which often struggle with highly nonlinear time series like COVID-19 data. By integrating ARIMA and LSTM, the model aims to achieve superior forecasting accuracy compared to baseline models, including ARIMA, Gated Recurrent Unit (GRU), LSTM, and Prophet.

Results

The hybrid ARIMA-LSTM model outperformed the benchmark models, achieving a mean absolute percentage error (MAPE) score of 2.4%. Among the benchmark models, GRU performed the best with a MAPE score of 2.9%, followed by LSTM with a score of 3.6%.

Conclusions

The proposed ARIMA-LSTM hybrid model outperforms ARIMA, GRU, LSTM, Prophet, and the ARIMA-ANN hybrid model when evaluating using metrics like MAPE, symmetric mean absolute percentage error, and median absolute percentage error across all countries analyzed. These findings have the potential to significantly improve preparedness and response efforts by public health authorities, allowing for more efficient resource allocation and targeted interventions.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于评估COVID-19病例和未来疫情准备的新型集成ARIMA-LSTM方法

背景：高传染性COVID-19病毒的全球影响带来了前所未有的挑战，严重影响了世界各地的公共卫生和经济。本研究文章对包括印度、巴西、俄罗斯和美国在内的各国的COVID-19数据进行了时间序列分析，特别强调了确诊病例总数。方法：该方法将自回归综合移动平均（ARIMA）捕捉线性趋势和季节性的能力与长短期记忆（LSTM）网络相结合，后者旨在学习数据中复杂的非线性依赖关系。这种混合方法超越了单个模型和现有的arima -人工神经网络（ANN）混合模型，后者通常难以处理COVID-19数据等高度非线性的时间序列。通过整合ARIMA和LSTM，该模型的目标是实现比基线模型（包括ARIMA、门控循环单元（GRU）、LSTM和Prophet）更高的预测精度。结果：ARIMA-LSTM混合模型优于基准模型，平均绝对百分比误差（MAPE）得分为2.4%。在基准模型中，GRU表现最好，MAPE得分为2.9%，其次是LSTM，得分为3.6%。结论：在所分析的所有国家使用MAPE、对称平均绝对百分比误差和中位数绝对百分比误差等指标进行评估时，所提出的ARIMA-LSTM混合模型优于ARIMA、GRU、LSTM、Prophet和ARIMA- ann混合模型。这些发现有可能大大改善公共卫生当局的防范和应对工作，从而实现更有效的资源分配和有针对性的干预措施。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Health Care Science

CiteScore

0.90

自引率

0.00%

发文量