{"title":"Forecasting infectious disease prevalence with associated uncertainty using neural networks","authors":"Michael Morris","doi":"arxiv-2409.01154","DOIUrl":null,"url":null,"abstract":"Infectious diseases pose significant human and economic burdens. Accurately\nforecasting disease incidence can enable public health agencies to respond\neffectively to existing or emerging diseases. Despite progress in the field,\ndeveloping accurate forecasting models remains a significant challenge. This\nthesis proposes two methodological frameworks using neural networks (NNs) with\nassociated uncertainty estimates - a critical component limiting the\napplication of NNs to epidemic forecasting thus far. We develop our frameworks\nby forecasting influenza-like illness (ILI) in the United States. Our first\nproposed method uses Web search activity data in conjunction with historical\nILI rates as observations for training NN architectures. Our models incorporate\nBayesian layers to produce uncertainty intervals, positioning themselves as\nlegitimate alternatives to more conventional approaches. The best performing\narchitecture: iterative recurrent neural network (IRNN), reduces mean absolute\nerror by 10.3% and improves Skill by 17.1% on average in forecasting tasks\nacross four flu seasons compared to the state-of-the-art. We build on this\nmethod by introducing IRNNs, an architecture which changes the sampling\nprocedure in the IRNN to improve the uncertainty estimation. Our second\nframework uses neural ordinary differential equations to bridge the gap between\nmechanistic compartmental models and NNs; benefiting from the physical\nconstraints that compartmental models provide. We evaluate eight neural ODE\nmodels utilising a mixture of ILI rates and Web search activity data to provide\nforecasts. These are compared with the IRNN and IRNN0 - the IRNN using only ILI\nrates. Models trained without Web search activity data outperform the IRNN0 by\n16% in terms of Skill. Future work should focus on more effectively using\nneural ODEs with Web search data to compete with the best performing IRNN.","PeriodicalId":501044,"journal":{"name":"arXiv - QuanBio - Populations and Evolution","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Populations and Evolution","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Infectious diseases pose significant human and economic burdens. Accurately
forecasting disease incidence can enable public health agencies to respond
effectively to existing or emerging diseases. Despite progress in the field,
developing accurate forecasting models remains a significant challenge. This
thesis proposes two methodological frameworks using neural networks (NNs) with
associated uncertainty estimates - a critical component limiting the
application of NNs to epidemic forecasting thus far. We develop our frameworks
by forecasting influenza-like illness (ILI) in the United States. Our first
proposed method uses Web search activity data in conjunction with historical
ILI rates as observations for training NN architectures. Our models incorporate
Bayesian layers to produce uncertainty intervals, positioning themselves as
legitimate alternatives to more conventional approaches. The best performing
architecture: iterative recurrent neural network (IRNN), reduces mean absolute
error by 10.3% and improves Skill by 17.1% on average in forecasting tasks
across four flu seasons compared to the state-of-the-art. We build on this
method by introducing IRNNs, an architecture which changes the sampling
procedure in the IRNN to improve the uncertainty estimation. Our second
framework uses neural ordinary differential equations to bridge the gap between
mechanistic compartmental models and NNs; benefiting from the physical
constraints that compartmental models provide. We evaluate eight neural ODE
models utilising a mixture of ILI rates and Web search activity data to provide
forecasts. These are compared with the IRNN and IRNN0 - the IRNN using only ILI
rates. Models trained without Web search activity data outperform the IRNN0 by
16% in terms of Skill. Future work should focus on more effectively using
neural ODEs with Web search data to compete with the best performing IRNN.