On the reliability of predictions on Covid-19 dynamics: A systematic and critical review of modelling techniques

IF 2.5 Q2 INFECTIOUS DISEASES 传染病建模(英文) Pub Date : 2020-09-11 DOI:10.1101/2020.09.10.20192328

J. Gnanvi, K. V. Salako, Gaëtan Brezesky Kotanmi, R. G. Glèlè Kakaï

{"title":"On the reliability of predictions on Covid-19 dynamics: A systematic and critical review of modelling techniques","authors":"J. Gnanvi, K. V. Salako, Gaëtan Brezesky Kotanmi, R. G. Glèlè Kakaï","doi":"10.1101/2020.09.10.20192328","DOIUrl":null,"url":null,"abstract":"\n Since the emergence of the novel 2019 coronavirus pandemic in December 2019 (COVID-19), numerous modellers have used diverse techniques to assess the dynamics of transmission of the disease, predict its future course and determine the impact of different control measures. In this study, we conducted a global systematic literature review to summarize trends in the modelling techniques used for Covid-19 from January 1st, 2020 to October 30th, 2020. We further examined the reliability and correctness of predictions by comparing predicted and observed values for cumulative cases and deaths. From an initial 4311 peer-reviewed articles and preprints found with our defined keywords, 242 were fully analysed. Most studies were done on Asian (46.52%) and European (27.39%) countries. Most of them used compartmental models (namely SIR and SEIR) (46.1%) and statistical models (growth models and time series) (31.8%) while few used artificial intelligence (6.7%), Bayesian approach (4.7%), Network models (2.3%) and Agent-based models (1.3%). For the number of cumulative cases, the ratio of the predicted over the observed values and the ratio of the amplitude of confidence interval (CI) or credibility interval (CrI) of predictions and the central value were on average larger than 1 indicating cases of inaccurate and imprecise predictions, and large variation across predictions. There was no clear difference among models used for these two ratios. In 75% of predictions that provided CI or CrI, observed values fall within the 95% CI or CrI of the cumulative cases predicted. Only 3.7% of the studies predicted the cumulative number of deaths. For 70% of the predictions, the ratio of predicted over observed cumulative deaths was less or close to 1. Also, the Bayesian model made predictions closer to reality than classical statistical models, although these differences are only suggestive due to the small number of predictions within our dataset (9 in total). In addition, we found a significant negative correlation (rho = - 0.56, p = 0.021) between this ratio and the length (in days) of the period covered by the modelling, suggesting that the longer the period covered by the model the likely more accurate the estimates tend to be. Our findings suggest that while predictions made by the different models are useful to understand the pandemic course and guide policy-making, some were relatively accurate and precise while other not.\n","PeriodicalId":64814,"journal":{"name":"传染病建模(英文)","volume":"6 1","pages":"258 - 272"},"PeriodicalIF":2.5000,"publicationDate":"2020-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"69","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"传染病建模(英文)","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1101/2020.09.10.20192328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}

引用次数: 69

Abstract

Since the emergence of the novel 2019 coronavirus pandemic in December 2019 (COVID-19), numerous modellers have used diverse techniques to assess the dynamics of transmission of the disease, predict its future course and determine the impact of different control measures. In this study, we conducted a global systematic literature review to summarize trends in the modelling techniques used for Covid-19 from January 1st, 2020 to October 30th, 2020. We further examined the reliability and correctness of predictions by comparing predicted and observed values for cumulative cases and deaths. From an initial 4311 peer-reviewed articles and preprints found with our defined keywords, 242 were fully analysed. Most studies were done on Asian (46.52%) and European (27.39%) countries. Most of them used compartmental models (namely SIR and SEIR) (46.1%) and statistical models (growth models and time series) (31.8%) while few used artificial intelligence (6.7%), Bayesian approach (4.7%), Network models (2.3%) and Agent-based models (1.3%). For the number of cumulative cases, the ratio of the predicted over the observed values and the ratio of the amplitude of confidence interval (CI) or credibility interval (CrI) of predictions and the central value were on average larger than 1 indicating cases of inaccurate and imprecise predictions, and large variation across predictions. There was no clear difference among models used for these two ratios. In 75% of predictions that provided CI or CrI, observed values fall within the 95% CI or CrI of the cumulative cases predicted. Only 3.7% of the studies predicted the cumulative number of deaths. For 70% of the predictions, the ratio of predicted over observed cumulative deaths was less or close to 1. Also, the Bayesian model made predictions closer to reality than classical statistical models, although these differences are only suggestive due to the small number of predictions within our dataset (9 in total). In addition, we found a significant negative correlation (rho = - 0.56, p = 0.021) between this ratio and the length (in days) of the period covered by the modelling, suggesting that the longer the period covered by the model the likely more accurate the estimates tend to be. Our findings suggest that while predictions made by the different models are useful to understand the pandemic course and guide policy-making, some were relatively accurate and precise while other not.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

关于新冠肺炎动态预测的可靠性：对建模技术的系统和批判性回顾

自2019年12月新型冠状病毒大流行（新冠肺炎）出现以来，许多建模者使用了不同的技术来评估疾病的传播动态，预测其未来的过程，并确定不同控制措施的影响。在这项研究中，我们进行了一项全球系统文献综述，总结了2020年1月1日至2020年10月30日新冠肺炎建模技术的趋势。我们通过比较累计病例和死亡的预测值和观察值，进一步检验了预测的可靠性和正确性。从最初发现的4311篇同行评审文章和预印本中，我们定义了关键词，其中242篇进行了全面分析。大多数研究是在亚洲（46.52%）和欧洲（27.39%）国家进行的。他们中的大多数人使用分区模型（即SIR和SEIR）（46.1%）和统计模型（增长模型和时间序列）（31.8%），而很少使用人工智能（6.7%）、贝叶斯方法（4.7%）、网络模型（2.3%）和基于Agent的模型（1.3%），预测值与观测值的比率以及预测的置信区间（CI）或可信度区间（CrI）的幅度与中心值的比率平均大于1，表明预测不准确和不精确以及预测之间的大变化的情况。用于这两个比率的模型之间没有明显的差异。在75%提供CI或CrI的预测中，观察值落在预测的累积病例的95%CI或CrI。只有3.7%的研究预测了累计死亡人数。对于70%的预测，预测的累计死亡人数与观察到的累计死亡数之比小于或接近1。此外，贝叶斯模型使预测比经典统计模型更接近现实，尽管这些差异只是由于我们数据集中的预测数量很少（总共9个）而引起的。此外，我们发现了显著的负相关（rho = - 0.56，p = 0.021），这表明模型覆盖的时间越长，估计往往越准确。我们的研究结果表明，虽然不同模型的预测有助于了解疫情进程和指导决策，有些是相对准确和精确的，而另一些则不然。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

传染病建模(英文)

CiteScore

18.30

自引率

0.00%

发文量