Forecasting seasonal influenza activity in Canada—Comparing seasonal Auto-Regressive integrated moving average and artificial neural network approaches for public health preparedness
Armin Orang, Olaf Berke, Zvonimir Poljak, Amy L. Greer, Erin E. Rees, Victoria Ng
{"title":"Forecasting seasonal influenza activity in Canada—Comparing seasonal Auto-Regressive integrated moving average and artificial neural network approaches for public health preparedness","authors":"Armin Orang, Olaf Berke, Zvonimir Poljak, Amy L. Greer, Erin E. Rees, Victoria Ng","doi":"10.1111/zph.13114","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Introduction</h3>\n \n <p>Public health preparedness is based on timely and accurate information. Time series forecasting using disease surveillance data is an important aspect of preparedness. This study compared two approaches of time series forecasting: seasonal auto-regressive integrated moving average (SARIMA) modelling and the artificial neural network (ANN) algorithm. The goal was to model weekly seasonal influenza activity in Canada using SARIMA and compares its predictive accuracy, based on root mean square prediction error (RMSE) and mean absolute prediction error (MAE), to that of an ANN.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>An initial SARIMA model was fit using automated model selection by minimizing the Akaike information criterion (AIC). Further inspection of the autocorrelation function and partial autocorrelation function led to ‘manual’ model improvements. ANNs were trained iteratively, using an automated process to minimize the RMSE and MAE.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>A total of 378, 462 cases of influenza was reported in Canada from the 2010–2011 influenza season to the end of the 2019–2020 influenza season, with an average yearly incidence risk of 20.02 per 100,000 population. Automated SARIMA modelling was the better method in terms of forecasting accuracy (per RMSE and MAE). However, the ANN correctly predicted the peak week of disease incidence while the other models did not.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Both the ANN and SARIMA models have shown to be capable tools in forecasting seasonal influenza activity in Canada. It was shown that applying both in tandem is beneficial, SARIMA better forecasted overall incidence while ANN correctly predicted the peak week.</p>\n </section>\n </div>","PeriodicalId":24025,"journal":{"name":"Zoonoses and Public Health","volume":"71 3","pages":"304-313"},"PeriodicalIF":2.4000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/zph.13114","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zoonoses and Public Health","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/zph.13114","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Public health preparedness is based on timely and accurate information. Time series forecasting using disease surveillance data is an important aspect of preparedness. This study compared two approaches of time series forecasting: seasonal auto-regressive integrated moving average (SARIMA) modelling and the artificial neural network (ANN) algorithm. The goal was to model weekly seasonal influenza activity in Canada using SARIMA and compares its predictive accuracy, based on root mean square prediction error (RMSE) and mean absolute prediction error (MAE), to that of an ANN.
Methods
An initial SARIMA model was fit using automated model selection by minimizing the Akaike information criterion (AIC). Further inspection of the autocorrelation function and partial autocorrelation function led to ‘manual’ model improvements. ANNs were trained iteratively, using an automated process to minimize the RMSE and MAE.
Results
A total of 378, 462 cases of influenza was reported in Canada from the 2010–2011 influenza season to the end of the 2019–2020 influenza season, with an average yearly incidence risk of 20.02 per 100,000 population. Automated SARIMA modelling was the better method in terms of forecasting accuracy (per RMSE and MAE). However, the ANN correctly predicted the peak week of disease incidence while the other models did not.
Conclusion
Both the ANN and SARIMA models have shown to be capable tools in forecasting seasonal influenza activity in Canada. It was shown that applying both in tandem is beneficial, SARIMA better forecasted overall incidence while ANN correctly predicted the peak week.
期刊介绍:
Zoonoses and Public Health brings together veterinary and human health researchers and policy-makers by providing a venue for publishing integrated and global approaches to zoonoses and public health. The Editors will consider papers that focus on timely collaborative and multi-disciplinary research in zoonoses and public health. This journal provides rapid publication of original papers, reviews, and potential discussion papers embracing this collaborative spirit. Papers should advance the scientific knowledge of the sources, transmission, prevention and control of zoonoses and be authored by scientists with expertise in areas such as microbiology, virology, parasitology and epidemiology. Articles that incorporate recent data into new methods, applications, or approaches (e.g. statistical modeling) which enhance public health are strongly encouraged.