Statistical modeling of COVID-19 deaths with excess zero counts

Q3 Mathematics Epidemiologic Methods Pub Date : 2021-02-01 DOI:10.1515/em-2021-0007
S. Khedhiri
{"title":"Statistical modeling of COVID-19 deaths with excess zero counts","authors":"S. Khedhiri","doi":"10.1515/em-2021-0007","DOIUrl":null,"url":null,"abstract":"Abstract Objectives Modeling and forecasting possible trajectories of COVID-19 infections and deaths using statistical methods is one of the most important topics in present time. However, statistical models use different assumptions and methods and thus yield different results. One issue in monitoring disease progression over time is how to handle excess zeros counts. In this research, we assess the statistical empirical performance of these models in terms of their fit and forecast accuracy of COVID-19 deaths. Methods Two types of models are suggested in the literature to study count time series data. The first type of models is based on Poisson and negative binomial conditional probability distributions to account for data over dispersion and using auto regression to account for dependence of the responses. The second type of models is based on zero-inflated mixed auto regression and also uses exponential family conditional distributions. We study the goodness of fit and forecast accuracy of these count time series models based on autoregressive conditional count distributions with and without zero inflation. Results We illustrate these methods using a recently published online COVID-19 data for Tunisia, which reports daily death counts from March 2020 to February 2021. We perform an empirical analysis and we compare the fit and the forecast performance of these models for death counts in presence of an intervention policy. Our statistical findings show that models that account for zero inflation produce better fit and have more accurate forecast of the pandemic deaths. Conclusions This paper shows that infectious disease data with excess zero counts are better modelled with zero-inflated models. These models yield more accurate predictions of deaths related to the pandemic than the generalized count data models. In addition, our statistical results find that the lift of travel restrictions has a significant impact on the surge of COVID-19 deaths. One plausible explanation of the outperformance of zero-inflated models is that the zero values are related to an intervention policy and therefore they are structural.","PeriodicalId":37999,"journal":{"name":"Epidemiologic Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epidemiologic Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/em-2021-0007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 9

Abstract

Abstract Objectives Modeling and forecasting possible trajectories of COVID-19 infections and deaths using statistical methods is one of the most important topics in present time. However, statistical models use different assumptions and methods and thus yield different results. One issue in monitoring disease progression over time is how to handle excess zeros counts. In this research, we assess the statistical empirical performance of these models in terms of their fit and forecast accuracy of COVID-19 deaths. Methods Two types of models are suggested in the literature to study count time series data. The first type of models is based on Poisson and negative binomial conditional probability distributions to account for data over dispersion and using auto regression to account for dependence of the responses. The second type of models is based on zero-inflated mixed auto regression and also uses exponential family conditional distributions. We study the goodness of fit and forecast accuracy of these count time series models based on autoregressive conditional count distributions with and without zero inflation. Results We illustrate these methods using a recently published online COVID-19 data for Tunisia, which reports daily death counts from March 2020 to February 2021. We perform an empirical analysis and we compare the fit and the forecast performance of these models for death counts in presence of an intervention policy. Our statistical findings show that models that account for zero inflation produce better fit and have more accurate forecast of the pandemic deaths. Conclusions This paper shows that infectious disease data with excess zero counts are better modelled with zero-inflated models. These models yield more accurate predictions of deaths related to the pandemic than the generalized count data models. In addition, our statistical results find that the lift of travel restrictions has a significant impact on the surge of COVID-19 deaths. One plausible explanation of the outperformance of zero-inflated models is that the zero values are related to an intervention policy and therefore they are structural.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
超零计数COVID-19死亡的统计建模
摘要目的利用统计方法对COVID-19感染和死亡的可能轨迹进行建模和预测是当前最重要的课题之一。然而,统计模型使用不同的假设和方法,从而产生不同的结果。监测疾病进展的一个问题是如何处理多余的零计数。在本研究中,我们从拟合和预测COVID-19死亡的准确性方面评估了这些模型的统计经验性能。方法文献中提出了两种模型来研究计数时间序列数据。第一类模型基于泊松和负二项条件概率分布来解释数据的分散,并使用自动回归来解释响应的依赖性。第二类模型基于零膨胀混合自回归,也使用指数族条件分布。我们研究了这些基于自回归条件计数分布的计数时间序列模型的拟合优度和预测精度。我们使用突尼斯最近发布的在线COVID-19数据来说明这些方法,该数据报告了2020年3月至2021年2月的每日死亡人数。我们进行了实证分析,并比较了这些模型在存在干预政策的情况下对死亡人数的拟合和预测性能。我们的统计结果表明,考虑零通货膨胀的模型具有更好的拟合性,并且对大流行死亡的预测更准确。结论用零膨胀模型可以较好地模拟具有超零计数的传染病数据。这些模型对与大流行有关的死亡人数的预测比广义计数数据模型更准确。此外,我们的统计结果发现,取消旅行限制对COVID-19死亡人数激增产生了重大影响。对于零膨胀模型的优异表现,一个合理的解释是,零值与干预政策有关,因此它们是结构性的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Epidemiologic Methods
Epidemiologic Methods Mathematics-Applied Mathematics
CiteScore
2.10
自引率
0.00%
发文量
7
期刊介绍: Epidemiologic Methods (EM) seeks contributions comparable to those of the leading epidemiologic journals, but also invites papers that may be more technical or of greater length than what has traditionally been allowed by journals in epidemiology. Applications and examples with real data to illustrate methodology are strongly encouraged but not required. Topics. genetic epidemiology, infectious disease, pharmaco-epidemiology, ecologic studies, environmental exposures, screening, surveillance, social networks, comparative effectiveness, statistical modeling, causal inference, measurement error, study design, meta-analysis
期刊最新文献
Linked shrinkage to improve estimation of interaction effects in regression models. Bounds for selection bias using outcome probabilities Population dynamic study of two prey one predator system with disease in first prey using fuzzy impulsive control Development and application of an evidence-based directed acyclic graph to evaluate the associations between metal mixtures and cardiometabolic outcomes. Performance evaluation of ResNet model for classification of tomato plant disease
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1