用知情随机森林来模拟流行病学先验、政府政策和公众流动性之间的关联。

IF 1.9 Q3 HEALTH CARE SCIENCES & SERVICES MDM Policy and Practice Pub Date : 2023-12-26 eCollection Date: 2023-07-01 DOI:10.1177/23814683231218716
Tsaone Swaabow Thapelo, Dimane Mpoeleng, Gregory Hillhouse
{"title":"用知情随机森林来模拟流行病学先验、政府政策和公众流动性之间的关联。","authors":"Tsaone Swaabow Thapelo, Dimane Mpoeleng, Gregory Hillhouse","doi":"10.1177/23814683231218716","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background.</b> Infectious diseases constitute a significant concern worldwide due to their increasing prevalence, associated health risks, and the socioeconomic costs. Machine learning (ML) models and epidemic models formulated using deterministic differential equations are the most dominant tools for analyzing and modeling the transmission of infectious diseases. However, ML models can be inconsistent in extracting the dynamics of a disease in the presence of data drifts. Likewise, the capability of epidemic models is constrained to parameter dimensions and estimation. We aimed at creating a framework of informed ML that integrates a random forest (RF) with an adapted susceptible infectious recovered (SIR) model to account for accuracy and consistency in stochasticity within the dynamics of coronavirus disease 2019 (COVID-19). <b>Methods.</b> An adapted SIR model was used to inform a default RF on predicting new COVID-19 cases (NCCs) at given intervals. We validated the performance of the informed RF (IRF) using real data. We used Botswana's pharmaceutical interventions (PIs) and non-PIs (NPIs) adopted between February 2020 and August 2022. The discrepancy between predictions and observations is modeled using loss functions, which are minimized, interpreted, and used to assess the IRF. <b>Results.</b> The findings on the real data have revealed the effectiveness of the default RF in modeling and predicting NCCs. The use of the effective reproductive rate to inform the RF yielded an excellent predictive power (84%) compared with 75% by the default RF. <b>Conclusion.</b> This research has potential to inform policy and decision makers in developing systems to evaluate interventions for infectious diseases.</p><p><strong>Highlights: </strong>This framework is initiated by incorporating model outputs from an epidemic model to a machine learning model.An informed random forest (RF) is instantiated to model government and public responses to the COVID-19 pandemic.This framework does not require data transformations, and the epidemic model is shown to boost the RF's performance.This is a baseline knowledge-informed learning framework for assessing public health interventions in Botswana.</p>","PeriodicalId":36567,"journal":{"name":"MDM Policy and Practice","volume":"8 2","pages":"23814683231218716"},"PeriodicalIF":1.9000,"publicationDate":"2023-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10752195/pdf/","citationCount":"0","resultStr":"{\"title\":\"Informed Random Forest to Model Associations of Epidemiological Priors, Government Policies, and Public Mobility.\",\"authors\":\"Tsaone Swaabow Thapelo, Dimane Mpoeleng, Gregory Hillhouse\",\"doi\":\"10.1177/23814683231218716\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><b>Background.</b> Infectious diseases constitute a significant concern worldwide due to their increasing prevalence, associated health risks, and the socioeconomic costs. Machine learning (ML) models and epidemic models formulated using deterministic differential equations are the most dominant tools for analyzing and modeling the transmission of infectious diseases. However, ML models can be inconsistent in extracting the dynamics of a disease in the presence of data drifts. Likewise, the capability of epidemic models is constrained to parameter dimensions and estimation. We aimed at creating a framework of informed ML that integrates a random forest (RF) with an adapted susceptible infectious recovered (SIR) model to account for accuracy and consistency in stochasticity within the dynamics of coronavirus disease 2019 (COVID-19). <b>Methods.</b> An adapted SIR model was used to inform a default RF on predicting new COVID-19 cases (NCCs) at given intervals. We validated the performance of the informed RF (IRF) using real data. We used Botswana's pharmaceutical interventions (PIs) and non-PIs (NPIs) adopted between February 2020 and August 2022. The discrepancy between predictions and observations is modeled using loss functions, which are minimized, interpreted, and used to assess the IRF. <b>Results.</b> The findings on the real data have revealed the effectiveness of the default RF in modeling and predicting NCCs. The use of the effective reproductive rate to inform the RF yielded an excellent predictive power (84%) compared with 75% by the default RF. <b>Conclusion.</b> This research has potential to inform policy and decision makers in developing systems to evaluate interventions for infectious diseases.</p><p><strong>Highlights: </strong>This framework is initiated by incorporating model outputs from an epidemic model to a machine learning model.An informed random forest (RF) is instantiated to model government and public responses to the COVID-19 pandemic.This framework does not require data transformations, and the epidemic model is shown to boost the RF's performance.This is a baseline knowledge-informed learning framework for assessing public health interventions in Botswana.</p>\",\"PeriodicalId\":36567,\"journal\":{\"name\":\"MDM Policy and Practice\",\"volume\":\"8 2\",\"pages\":\"23814683231218716\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2023-12-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10752195/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MDM Policy and Practice\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/23814683231218716\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/7/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MDM Policy and Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/23814683231218716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/7/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

背景。传染病因其日益增长的流行率、相关的健康风险和社会经济成本而成为全球关注的焦点。使用确定性微分方程建立的机器学习(ML)模型和流行病模型是分析和模拟传染病传播的最主要工具。然而,在存在数据漂移的情况下,ML 模型在提取疾病的动态变化方面可能不一致。同样,流行病模型的能力也受限于参数维度和估计。我们的目标是创建一个知情 ML 框架,该框架将随机森林(RF)与经调整的易感传染性恢复(SIR)模型整合在一起,以考虑 2019 年冠状病毒疾病(COVID-19)动态中随机性的准确性和一致性。方法。我们使用经过调整的 SIR 模型为默认 RF 提供信息,以预测特定时间间隔内的 COVID-19 新病例 (NCC)。我们使用真实数据验证了知情 RF (IRF) 的性能。我们使用了博茨瓦纳在 2020 年 2 月至 2022 年 8 月期间采用的药物干预 (PI) 和非药物干预 (NPI)。预测与观察之间的差异通过损失函数来建模,损失函数被最小化、解释并用于评估 IRF。结果。对真实数据的研究结果表明,默认 RF 在模拟和预测 NCC 方面非常有效。使用有效生殖率为射频提供信息产生了出色的预测能力(84%),而默认射频的预测能力为 75%。结论。这项研究有望为政策制定者和决策者提供信息,帮助他们建立传染病干预评估系统:该框架通过将流行病模型的输出结果整合到机器学习模型中而启动,并将知情随机森林(RF)实例化,以模拟政府和公众对 COVID-19 大流行病的反应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Informed Random Forest to Model Associations of Epidemiological Priors, Government Policies, and Public Mobility.

Background. Infectious diseases constitute a significant concern worldwide due to their increasing prevalence, associated health risks, and the socioeconomic costs. Machine learning (ML) models and epidemic models formulated using deterministic differential equations are the most dominant tools for analyzing and modeling the transmission of infectious diseases. However, ML models can be inconsistent in extracting the dynamics of a disease in the presence of data drifts. Likewise, the capability of epidemic models is constrained to parameter dimensions and estimation. We aimed at creating a framework of informed ML that integrates a random forest (RF) with an adapted susceptible infectious recovered (SIR) model to account for accuracy and consistency in stochasticity within the dynamics of coronavirus disease 2019 (COVID-19). Methods. An adapted SIR model was used to inform a default RF on predicting new COVID-19 cases (NCCs) at given intervals. We validated the performance of the informed RF (IRF) using real data. We used Botswana's pharmaceutical interventions (PIs) and non-PIs (NPIs) adopted between February 2020 and August 2022. The discrepancy between predictions and observations is modeled using loss functions, which are minimized, interpreted, and used to assess the IRF. Results. The findings on the real data have revealed the effectiveness of the default RF in modeling and predicting NCCs. The use of the effective reproductive rate to inform the RF yielded an excellent predictive power (84%) compared with 75% by the default RF. Conclusion. This research has potential to inform policy and decision makers in developing systems to evaluate interventions for infectious diseases.

Highlights: This framework is initiated by incorporating model outputs from an epidemic model to a machine learning model.An informed random forest (RF) is instantiated to model government and public responses to the COVID-19 pandemic.This framework does not require data transformations, and the epidemic model is shown to boost the RF's performance.This is a baseline knowledge-informed learning framework for assessing public health interventions in Botswana.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
MDM Policy and Practice
MDM Policy and Practice Medicine-Health Policy
CiteScore
2.50
自引率
0.00%
发文量
28
审稿时长
15 weeks
期刊最新文献
Prioritizing Patients from the Most Deprived Areas on Elective Waiting Lists in the NHS in England: Estimating the Health and Health Inequality Impact. Optimizing Masks and Random Screening Test Usage within K-12 Schools. Associations of Concordant and Shared Lung Cancer Screening Decision Making with Decisional Conflict: A Multi-Institution Cross-Sectional Analysis. Implications of Diminishing Lifespan Marginal Utility for Valuing Equity in Cost-Effectiveness Analysis. Evaluation of the Soda Tax on Obesity and Diabetes in California: A Cost-Effectiveness Analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1