利用异方差高斯过程对疾病发病率进行现象学预测:一个登革热病例研究

L. Johnson, R. Gramacy, Jeremy M. Cohen, E. Mordecai, C. Murdock, Jason Rohr, S. Ryan, Anna M. Stewart-Ibarra, Daniel P. Weikel
{"title":"利用异方差高斯过程对疾病发病率进行现象学预测:一个登革热病例研究","authors":"L. Johnson, R. Gramacy, Jeremy M. Cohen, E. Mordecai, C. Murdock, Jason Rohr, S. Ryan, Anna M. Stewart-Ibarra, Daniel P. Weikel","doi":"10.1214/17-AOAS1090","DOIUrl":null,"url":null,"abstract":"In 2015 the US federal government sponsored a dengue forecasting competition using historical case data from Iquitos, Peru and San Juan, Puerto Rico. Competitors were evaluated on several aspects of out-of-sample forecasts including the targets of peak week, peak incidence during that week and total season incidence across each of several seasons. Our team was one of the top performers of that competition, outperforming all other teams in multiple targets/locals. In this paper we report on our methodology, a large component of which, surprisingly, ignores the known biology of epidemics at large---in particular relationships between dengue transmission and environmental factors---and instead relies on flexible nonparametric nonlinear Gaussian process (GP) regression fits that \"memorize\" the trajectories of past seasons, and then \"match\" the dynamics of the unfolding season to past ones in real-time. Our phenomenological approach has advantages in situations where disease dynamics are less well understood, e.g., at sites with shorter histories of disease (such as Iquitos), or where measurements and forecasts of ancillary covariates like precipitation are unavailable and/or where the strength of association with cases are as yet unknown. In particular, we show that the GP approach generally outperforms a more classical generalized linear (autoregressive) model (GLM) that we developed to utilize abundant covariate information. We illustrate variations of our method(s) on the two benchmark locales alongside a full summary of results submitted by other contest competitors.","PeriodicalId":409996,"journal":{"name":"arXiv: Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":"{\"title\":\"Phenomenological forecasting of disease incidence using heteroskedastic Gaussian processes: a dengue case study\",\"authors\":\"L. Johnson, R. Gramacy, Jeremy M. Cohen, E. Mordecai, C. Murdock, Jason Rohr, S. Ryan, Anna M. Stewart-Ibarra, Daniel P. Weikel\",\"doi\":\"10.1214/17-AOAS1090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In 2015 the US federal government sponsored a dengue forecasting competition using historical case data from Iquitos, Peru and San Juan, Puerto Rico. Competitors were evaluated on several aspects of out-of-sample forecasts including the targets of peak week, peak incidence during that week and total season incidence across each of several seasons. Our team was one of the top performers of that competition, outperforming all other teams in multiple targets/locals. In this paper we report on our methodology, a large component of which, surprisingly, ignores the known biology of epidemics at large---in particular relationships between dengue transmission and environmental factors---and instead relies on flexible nonparametric nonlinear Gaussian process (GP) regression fits that \\\"memorize\\\" the trajectories of past seasons, and then \\\"match\\\" the dynamics of the unfolding season to past ones in real-time. Our phenomenological approach has advantages in situations where disease dynamics are less well understood, e.g., at sites with shorter histories of disease (such as Iquitos), or where measurements and forecasts of ancillary covariates like precipitation are unavailable and/or where the strength of association with cases are as yet unknown. In particular, we show that the GP approach generally outperforms a more classical generalized linear (autoregressive) model (GLM) that we developed to utilize abundant covariate information. We illustrate variations of our method(s) on the two benchmark locales alongside a full summary of results submitted by other contest competitors.\",\"PeriodicalId\":409996,\"journal\":{\"name\":\"arXiv: Applications\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"34\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv: Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1214/17-AOAS1090\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/17-AOAS1090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 34

摘要

2015年,美国联邦政府利用秘鲁伊基托斯和波多黎各圣胡安的历史病例数据赞助了一项登革热预测竞赛。竞争对手在样本外预测的几个方面进行了评估,包括高峰周的目标、该周的高峰发病率和几个季节中每个季节的总发病率。我们的队伍是那次比赛中表现最好的队伍之一,在多个目标/地区的比赛中都胜过其他所有队伍。在本文中,我们报告了我们的方法,令人惊讶的是,其中很大一部分忽略了已知的流行病生物学-特别是登革热传播与环境因素之间的关系-而是依赖于灵活的非参数非线性高斯过程(GP)回归拟合,“记住”过去季节的轨迹,然后实时“匹配”正在展开的季节与过去的动态。我们的现象学方法在疾病动力学不太清楚的情况下具有优势,例如,在疾病历史较短的地点(如伊基托斯),或在降雨量等辅助协变量的测量和预测不可用和/或与病例的关联强度尚不清楚的情况下。特别是,我们表明GP方法通常优于我们为利用丰富的协变量信息而开发的更经典的广义线性(自回归)模型(GLM)。我们在两个基准语言环境中演示了我们的方法的变化,以及其他竞赛对手提交的结果的完整摘要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Phenomenological forecasting of disease incidence using heteroskedastic Gaussian processes: a dengue case study
In 2015 the US federal government sponsored a dengue forecasting competition using historical case data from Iquitos, Peru and San Juan, Puerto Rico. Competitors were evaluated on several aspects of out-of-sample forecasts including the targets of peak week, peak incidence during that week and total season incidence across each of several seasons. Our team was one of the top performers of that competition, outperforming all other teams in multiple targets/locals. In this paper we report on our methodology, a large component of which, surprisingly, ignores the known biology of epidemics at large---in particular relationships between dengue transmission and environmental factors---and instead relies on flexible nonparametric nonlinear Gaussian process (GP) regression fits that "memorize" the trajectories of past seasons, and then "match" the dynamics of the unfolding season to past ones in real-time. Our phenomenological approach has advantages in situations where disease dynamics are less well understood, e.g., at sites with shorter histories of disease (such as Iquitos), or where measurements and forecasts of ancillary covariates like precipitation are unavailable and/or where the strength of association with cases are as yet unknown. In particular, we show that the GP approach generally outperforms a more classical generalized linear (autoregressive) model (GLM) that we developed to utilize abundant covariate information. We illustrate variations of our method(s) on the two benchmark locales alongside a full summary of results submitted by other contest competitors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Weekly Bayesian Modelling Strategy to Predict Deaths by COVID-19: a Model and Case Study for the State of Santa Catarina, Brazil Selecting the Most Effective Nudge: Evidence from a Large-Scale Experiment on Immunization Revealing the Transmission Dynamics of COVID-19: A Bayesian Framework for Rt Estimation Improving living biomass C-stock loss estimates by combining optical satellite, airborne laser scanning, and NFI data Bayesian classification for dating archaeological sites via projectile points.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1