首页 > 最新文献

Epidemics最新文献

英文 中文
Projecting the population-level impact of norovirus vaccines 预测诺如病毒疫苗对人群的影响
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-07-18 DOI: 10.1016/j.epidem.2025.100842
Katia Koelle , Brooke Lappe , Benjamin A. Lopman , Max S.Y. Lau , Emma Viscidi , Katherine B. Carlson
Norovirus diversity has major implications for vaccine design. The number of circulating genogroups and genotypes, and the way this viral diversity interacts at the population level, will factor into how many and which genotypes should be included in an effective vaccine. Here, we develop an age-stratified, multi-strain model for norovirus to project potential population-level impacts of different vaccine formulations on genotype-specific and overall annual attack rates. Our model assumes that vaccination impacts susceptibility to infection but not infectiousness or the risk of developing disease. We parameterize the baseline model (without vaccination) based on literature estimates and the ability to recover observed epidemiological patterns. We then simulate this model under seven different potential vaccine formulations, initially assuming only pediatric vaccination. While we find that increases in coverage result in declines in annual norovirus attack rates for all formulations considered, we also find that vaccine formulations that include genotype GII.4 would be most effective at lowering overall norovirus attack rates. Inclusion of additional genotypes in a vaccine would further lower attack rates but more incrementally, with the addition of GI.3, GII.2, GII.3, and GII.6 together having a similar impact to that of GII.4 alone on reducing overall norovirus incidence. We further find that transient dynamics are expected for 10-20 years following roll-out with any pediatric vaccine. During this time, there may be unanticipated changes in genotype circulation patterns, although long-term increases in non-vaccine genotype attack rates above baseline levels are not expected. Finally, we anticipate that annual vaccination of older-aged individuals with a GII.4-containing vaccine can, under certain conditions but not others, provide appreciable direct benefits to individuals in this age group beyond what pediatric vaccination affords. Together, our results indicate that there is a clear population-level benefit of primary pediatric vaccination with a GII.4-inclusive norovirus vaccine plus incremental value of other genotypes, with additional direct benefits of annual vaccination to older adults provided that vaccination results in a considerable (multi-month) duration of broadly protective immunity to infection. More empirical studies are needed to validate the structure of the model and refine its parameterization, both of which affect projections of vaccine impact.
诺如病毒多样性对疫苗设计具有重要意义。流行基因组和基因型的数量,以及这种病毒多样性在人群水平上相互作用的方式,将影响有效疫苗中应包括多少基因型和哪些基因型。在这里,我们开发了一个年龄分层的诺如病毒多毒株模型,以预测不同疫苗配方对基因型特异性和总体年发病率的潜在人群水平影响。我们的模型假设疫苗接种会影响对感染的易感性,但不会影响传染性或发展疾病的风险。我们根据文献估计和恢复观察到的流行病学模式的能力对基线模型(不接种疫苗)进行参数化。然后,我们在7种不同的潜在疫苗配方下模拟该模型,最初假设仅为儿科疫苗接种。虽然我们发现覆盖率的增加导致所考虑的所有配方的诺如病毒年攻击率下降,但我们还发现,包含基因型GII.4的疫苗配方在降低总体诺如病毒攻击率方面最有效。在疫苗中加入额外的基因型将进一步降低发病率,但这是渐进的,加入GII.3、GII.2、GII.3和GII.6在降低诺如病毒总体发病率方面的作用与单独加入GII.4类似。我们进一步发现,在任何儿科疫苗推出后,预计10-20年的短暂动态。在此期间,基因型循环模式可能会发生意想不到的变化,尽管预计非疫苗基因型发病率不会长期增加到基线水平以上。最后,我们预计,在某些条件下(但在其他条件下),老年人每年接种含有gii .4的疫苗可以为该年龄组的个人提供明显的直接益处,而不是儿科疫苗所能提供的益处。总之,我们的研究结果表明,儿童接种含gii .4的诺如病毒疫苗,加上其他基因型的增加值,具有明显的人群水平益处,如果接种疫苗能产生相当长(数月)时间的广泛保护性免疫,则每年接种一次对老年人有额外的直接益处。需要更多的实证研究来验证模型的结构并完善其参数化,这两者都会影响疫苗影响的预测。
{"title":"Projecting the population-level impact of norovirus vaccines","authors":"Katia Koelle ,&nbsp;Brooke Lappe ,&nbsp;Benjamin A. Lopman ,&nbsp;Max S.Y. Lau ,&nbsp;Emma Viscidi ,&nbsp;Katherine B. Carlson","doi":"10.1016/j.epidem.2025.100842","DOIUrl":"10.1016/j.epidem.2025.100842","url":null,"abstract":"<div><div>Norovirus diversity has major implications for vaccine design. The number of circulating genogroups and genotypes, and the way this viral diversity interacts at the population level, will factor into how many and which genotypes should be included in an effective vaccine. Here, we develop an age-stratified, multi-strain model for norovirus to project potential population-level impacts of different vaccine formulations on genotype-specific and overall annual attack rates. Our model assumes that vaccination impacts susceptibility to infection but not infectiousness or the risk of developing disease. We parameterize the baseline model (without vaccination) based on literature estimates and the ability to recover observed epidemiological patterns. We then simulate this model under seven different potential vaccine formulations, initially assuming only pediatric vaccination. While we find that increases in coverage result in declines in annual norovirus attack rates for all formulations considered, we also find that vaccine formulations that include genotype GII.4 would be most effective at lowering overall norovirus attack rates. Inclusion of additional genotypes in a vaccine would further lower attack rates but more incrementally, with the addition of GI.3, GII.2, GII.3, and GII.6 together having a similar impact to that of GII.4 alone on reducing overall norovirus incidence. We further find that transient dynamics are expected for 10-20 years following roll-out with any pediatric vaccine. During this time, there may be unanticipated changes in genotype circulation patterns, although long-term increases in non-vaccine genotype attack rates above baseline levels are not expected. Finally, we anticipate that annual vaccination of older-aged individuals with a GII.4-containing vaccine can, under certain conditions but not others, provide appreciable direct benefits to individuals in this age group beyond what pediatric vaccination affords. Together, our results indicate that there is a clear population-level benefit of primary pediatric vaccination with a GII.4-inclusive norovirus vaccine plus incremental value of other genotypes, with additional direct benefits of annual vaccination to older adults provided that vaccination results in a considerable (multi-month) duration of broadly protective immunity to infection. More empirical studies are needed to validate the structure of the model and refine its parameterization, both of which affect projections of vaccine impact.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100842"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144672569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Work absenteeism across economic activity sectors and its association with COVID-19-like illness prevalence in the Netherlands, 2020–2023 2020-2023年荷兰经济活动部门的缺勤情况及其与covid -19样疾病流行的关系
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-07-01 DOI: 10.1016/j.epidem.2025.100841
Hester Korthals Altes , Jan Van De Kassteele , Bram Wisse , Maria Xiridou , Albert Jan Van Hoek , Jacco Wallinga
The monitoring of work absenteeism can inform pandemic decision making, besides the surveillance of disease end-points like mortality and intensive care bed occupancy. For instance, high disease prevalence accompanied by elevated levels of absenteeism in the healthcare sector will increase the strain on the health care system, and may necessitate adaptation of the control measures. This highlights the need to assess the association between COVID-19 disease prevalence and absenteeism in relevant economic sectors. We initiated the comprehensive monitoring and analysis of work absenteeism and developed an autoregressive time series model which combined COVID-19 prevalence as measured through syndromic surveillance, with absenteeism across various economic activity sectors in the Netherlands. The analysis was updated regularly and shared with policy makers. Overall, prevalence of COVID-19-like illnesses was the most important contributor to variation in absenteeism over the period November 2020-May 2023, with absenteeism rates varying markedly between activity sectors. Of the sectors well-covered by the absenteeism database, the Education and Logistics sectors showed the greatest contribution of a seasonal pattern independent of COVID-19 to absenteeism.
除了监测死亡率和重症监护病床占用率等疾病终点外,对缺勤情况的监测还可以为大流行疫情的决策提供信息。例如,在卫生保健部门,高发病率伴随着高缺勤率将增加对卫生保健系统的压力,并可能需要调整控制措施。这突出表明有必要评估相关经济部门COVID-19患病率与缺勤之间的关系。我们启动了对旷工的全面监测和分析,并开发了一个自回归时间序列模型,该模型将通过综合征监测测量的COVID-19流行率与荷兰各经济活动部门的旷工率结合起来。该分析定期更新,并与政策制定者分享。总体而言,2019冠状病毒样疾病的流行是2020年11月至2023年5月期间缺勤率变化的最重要因素,各活动部门之间的缺勤率差异显著。在缺勤数据库全面覆盖的部门中,教育和物流部门显示出独立于COVID-19的季节性模式对缺勤的贡献最大。
{"title":"Work absenteeism across economic activity sectors and its association with COVID-19-like illness prevalence in the Netherlands, 2020–2023","authors":"Hester Korthals Altes ,&nbsp;Jan Van De Kassteele ,&nbsp;Bram Wisse ,&nbsp;Maria Xiridou ,&nbsp;Albert Jan Van Hoek ,&nbsp;Jacco Wallinga","doi":"10.1016/j.epidem.2025.100841","DOIUrl":"10.1016/j.epidem.2025.100841","url":null,"abstract":"<div><div>The monitoring of work absenteeism can inform pandemic decision making, besides the surveillance of disease end-points like mortality and intensive care bed occupancy. For instance, high disease prevalence accompanied by elevated levels of absenteeism in the healthcare sector will increase the strain on the health care system, and may necessitate adaptation of the control measures. This highlights the need to assess the association between COVID-19 disease prevalence and absenteeism in relevant economic sectors. We initiated the comprehensive monitoring and analysis of work absenteeism and developed an autoregressive time series model which combined COVID-19 prevalence as measured through syndromic surveillance, with absenteeism across various economic activity sectors in the Netherlands. The analysis was updated regularly and shared with policy makers. Overall, prevalence of COVID-19-like illnesses was the most important contributor to variation in absenteeism over the period November 2020-May 2023, with absenteeism rates varying markedly between activity sectors. Of the sectors well-covered by the absenteeism database, the Education and Logistics sectors showed the greatest contribution of a seasonal pattern independent of COVID-19 to absenteeism.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100841"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144572646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Epidemiology and environmental risks of antibiotic resistant Enterobacterales isolates in different aquatic matrices from North-Western Romania 罗马尼亚西北部不同水生基质中抗生素耐药肠杆菌分离株的流行病学和环境风险
IF 2.4 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-08-22 DOI: 10.1016/j.epidem.2025.100852
Anca Farkas , Rahela Carpa , Edina Szekeres , Adela Teban-Man , Cristian Coman , Anca Butiuc-Keul
The most menacing sources of environmental contamination with antibiotic resistant bacteria are effluents derived from anthropic activities. Even when wastewater treatment processes are implemented, conventional methods are not able to completely retain the antibiotic resistance determinants. We propose an antibiotic resistance risk assessment, incorporating the characterisation of ARB, ARGs and MGEs in different environmental compartments.
Antibiotic susceptibility testing of 678 Enterobacterales isolates revealed an increased degree of intrinsic resistance to erythromycin (77.9 %), high level of resistance to ampicillin (39.7 %), low frequency of carbapenem resistance (2.36 %), and a percentage of 34.4 % MDR strains. The most frequent resistance determinants were blaTEM-1 (26.5 %) and tetA (8.26 %), while the intI1 gene was found in 7.37 % of isolates. Resistant Enterobacterales from aquatic matrices with different degrees of contamination were identified as Citrobacter spp. (n = 46), Enterobacter spp. (n = 35), Klebsiella spp. (n = 54) and Escherichia coli (n = 107). A strong statistical correlation was observed between the presence of intI1 and the ARG index (0.768) in resistant Enterobacter spp.
Distinct clustering of strains was not observed across different environmental matrices, especially in those directly impacted by human-derived bacteria. Also, distribution of ARB patterns and diversity of ARGs was stable from the taxonomic perspective. Dendrogram analysis based on ERIC-PCR profiles confirmed the presence of strains with identical DNA fingerprints in non-related aquatic ecosystems. The epidemiology of resistant Citrobacter, Enterobacter, Klebsiella and Escherichia isolates confirmed an extensive migration and environmental dispersion of strains with human health significance, particularly important for water resources.
具有抗生素抗性细菌的最具威胁性的环境污染源是人类活动产生的污水。即使实施了废水处理过程,传统方法也不能完全保留抗生素耐药性决定因素。我们建议进行抗生素耐药风险评估,包括不同环境区室中ARB、ARGs和MGEs的特征。对678株肠杆菌进行药敏试验,发现对红霉素耐药程度增加(77.9% %),对氨苄西林耐药程度高(39.7% %),对碳青霉烯类耐药频率低(2.36 %),耐多药菌株比例为34.4% %。最常见的耐药决定因素是blatem1(26.5% %)和tetA(8.26% %),而intI1基因在7.37% %的分离株中发现。从不同污染程度的水生基质中鉴定出耐药肠杆菌为Citrobacter spp (n = 46)、Enterobacter spp (n = 35)、Klebsiella spp (n = 54)和Escherichia coli (n = 107)。耐药肠杆菌中intI1的存在与ARG指数(0.768)有较强的统计学相关性,但在不同的环境基质中,特别是在直接受人源性细菌影响的环境基质中,未观察到明显的菌群聚集性。从分类上看,arg的分布格局和多样性是稳定的。基于ERIC-PCR图谱的树形图分析证实,在非亲缘关系的水生生态系统中存在具有相同DNA指纹的菌株。耐药柠檬酸杆菌、肠杆菌、克雷伯氏菌和埃希氏菌分离株的流行病学证实了具有人类健康意义的菌株的广泛迁移和环境分散,对水资源尤其重要。
{"title":"Epidemiology and environmental risks of antibiotic resistant Enterobacterales isolates in different aquatic matrices from North-Western Romania","authors":"Anca Farkas ,&nbsp;Rahela Carpa ,&nbsp;Edina Szekeres ,&nbsp;Adela Teban-Man ,&nbsp;Cristian Coman ,&nbsp;Anca Butiuc-Keul","doi":"10.1016/j.epidem.2025.100852","DOIUrl":"10.1016/j.epidem.2025.100852","url":null,"abstract":"<div><div>The most menacing sources of environmental contamination with antibiotic resistant bacteria are effluents derived from anthropic activities. Even when wastewater treatment processes are implemented, conventional methods are not able to completely retain the antibiotic resistance determinants. We propose an antibiotic resistance risk assessment, incorporating the characterisation of ARB, ARGs and MGEs in different environmental compartments.</div><div>Antibiotic susceptibility testing of 678 <em>Enterobacterales</em> isolates revealed an increased degree of intrinsic resistance to erythromycin (77.9 %), high level of resistance to ampicillin (39.7 %), low frequency of carbapenem resistance (2.36 %), and a percentage of 34.4 % MDR strains. The most frequent resistance determinants were <em>bla</em><sub><em>TEM-1</em></sub> (26.5 %) and <em>tetA</em> (8.26 %), while the <em>intI1</em> gene was found in 7.37 % of isolates. Resistant <em>Enterobacterales</em> from aquatic matrices with different degrees of contamination were identified as <em>Citrobacter</em> spp. (n = 46), <em>Enterobacter</em> spp. (n = 35), <em>Klebsiella</em> spp. (n = 54) and <em>Escherichia coli</em> (n = 107). A strong statistical correlation was observed between the presence of <em>intI1</em> and the ARG index (0.768) in resistant <em>Enterobacter</em> spp.</div><div>Distinct clustering of strains was not observed across different environmental matrices, especially in those directly impacted by human-derived bacteria. Also, distribution of ARB patterns and diversity of ARGs was stable from the taxonomic perspective. Dendrogram analysis based on ERIC-PCR profiles confirmed the presence of strains with identical DNA fingerprints in non-related aquatic ecosystems. The epidemiology of resistant <em>Citrobacter, Enterobacter, Klebsiella</em> and <em>Escherichia</em> isolates confirmed an extensive migration and environmental dispersion of strains with human health significance, particularly important for water resources.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100852"},"PeriodicalIF":2.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144911852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning associations of COVID-19 hospitalizations with wastewater viral signals by Markov modulated models 利用马尔可夫调制模型学习废水病毒信号与COVID-19住院的关联
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-07-14 DOI: 10.1016/j.epidem.2025.100840
K. Ken Peng , Charmaine B. Dean , X. Joan Hu , Robert Delatolla
Recent research highlights a strong correlation between COVID-19 hospitalizations and wastewater viral signals. Increases in wastewater viral signals may be early warnings of increases in hospital admissions. That indicates a promising opportunity to assess and predict the burden of infectious diseases and has driven the widespread adoption and development of wastewater monitoring tools by public health organizations. Previous studies utilize distributed lag models to explore associations of COVID-19 hospitalizations with lagged SARS-CoV-2 wastewater viral signals. However, the conventional distributed lag models assume the duration time of the lag to be fixed, which is not always plausible. This paper presents Markov-modulated models with distributed lasting time, treating the duration of the lag as a random variable defined by a hidden process. We evaluate exposure effects over the duration time and estimate the distribution of the lasting time using the wastewater data and COVID-19 hospitalization records from Ottawa, Canada during June 2020 to November 2022. The different COVID-19 pandemic waves are accommodated in the statistical learning. Moreover, two strategies for comparing the associations over different time intervals are exemplified using the Ottawa data. Of note, the proposed Markov modulated models, an extension of distributed lag models, are potentially applicable to many different problems where the lag time is not fixed.
最近的研究强调,COVID-19住院与废水病毒信号之间存在很强的相关性。废水中病毒信号的增加可能是入院人数增加的早期预警。这表明有机会评估和预测传染病的负担,并促使公共卫生组织广泛采用和开发废水监测工具。以往的研究利用分布式滞后模型来探索COVID-19住院与滞后的SARS-CoV-2废水病毒信号的关系。然而,传统的分布式滞后模型假设滞后的持续时间是固定的,这并不总是可信的。本文提出了具有分布持续时间的马尔可夫调制模型,将滞后的持续时间视为一个由隐藏过程定义的随机变量。我们利用2020年6月至2022年11月期间加拿大渥太华的废水数据和COVID-19住院记录,评估了持续时间内的暴露效应,并估计了持续时间的分布。统计学习中容纳了不同的COVID-19大流行波。此外,使用渥太华数据举例说明了在不同时间间隔内比较关联的两种策略。值得注意的是,所提出的马尔可夫调制模型是分布式滞后模型的扩展,它潜在地适用于延迟时间不固定的许多不同问题。
{"title":"Learning associations of COVID-19 hospitalizations with wastewater viral signals by Markov modulated models","authors":"K. Ken Peng ,&nbsp;Charmaine B. Dean ,&nbsp;X. Joan Hu ,&nbsp;Robert Delatolla","doi":"10.1016/j.epidem.2025.100840","DOIUrl":"10.1016/j.epidem.2025.100840","url":null,"abstract":"<div><div>Recent research highlights a strong correlation between COVID-19 hospitalizations and wastewater viral signals. Increases in wastewater viral signals may be early warnings of increases in hospital admissions. That indicates a promising opportunity to assess and predict the burden of infectious diseases and has driven the widespread adoption and development of wastewater monitoring tools by public health organizations. Previous studies utilize distributed lag models to explore associations of COVID-19 hospitalizations with lagged SARS-CoV-2 wastewater viral signals. However, the conventional distributed lag models assume the duration time of the lag to be fixed, which is not always plausible. This paper presents Markov-modulated models with distributed lasting time, treating the duration of the lag as a random variable defined by a hidden process. We evaluate exposure effects over the duration time and estimate the distribution of the lasting time using the wastewater data and COVID-19 hospitalization records from Ottawa, Canada during June 2020 to November 2022. The different COVID-19 pandemic waves are accommodated in the statistical learning. Moreover, two strategies for comparing the associations over different time intervals are exemplified using the Ottawa data. Of note, the proposed Markov modulated models, an extension of distributed lag models, are potentially applicable to many different problems where the lag time is not fixed.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100840"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144666047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining the stable coexistence of drug-resistant and -susceptible pathogens: the resistance acquisition purifying selection model 解释耐药和敏感病原体稳定共存:耐药性获得纯化选择模型
IF 2.4 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-08-06 DOI: 10.1016/j.epidem.2025.100848
Pleuni S. Pennings
Drug resistance is a problem in many pathogens. While overall, levels of resistance have risen in recent decades, there are many examples where after an initial rise, levels of resistance have stabilized. The stable coexistence of resistance and susceptibility has proven hard to explain – in most evolutionary models, either resistance or susceptibility ultimately “wins” and takes over the population. Here, we show that a simple model, mathematically akin to mutation-selection balance theory, can explain several key observations about drug resistance: (1) the stable coexistence of resistant and susceptible strains (2) at levels that depend on population-level drug usage and (3) with resistance often due to many different strains (resistance is present on many different genetic backgrounds). The model is applicable to resistance due to both mutations and horizontal gene transfer (HGT). It predicts that new resistant strains should continuously appear (through mutation or HGT and positive selection within treated hosts) and disappear (due to a fitness cost of resistance). The result is that while resistance is stable, which strains carry resistance is constantly changing. We used data from a longitudinal genomic study on E. coli in Norway to test this prediction for resistance to five different drugs and found that, consistent with the model, most resistant strains indeed disappear quickly after they appear in the dataset. Having a model that explains the dynamics of drug resistance will allow us to plan science-backed interventions to reduce the burden of drug resistance.
耐药性是许多病原体存在的问题。虽然总体而言,近几十年来耐药性水平有所上升,但有许多例子表明,在最初的上升之后,耐药性水平已经稳定下来。抗药性和易感染性的稳定共存已被证明是难以解释的——在大多数进化模型中,抗药性或易感性最终“胜出”并接管种群。在这里,我们展示了一个简单的模型,在数学上类似于突变选择平衡理论,可以解释关于耐药性的几个关键观察:(1)耐药菌株和敏感菌株的稳定共存(2)依赖于人群水平的药物使用水平;(3)通常由许多不同的菌株引起的耐药性(耐药性存在于许多不同的遗传背景)。该模型适用于突变和水平基因转移(HGT)引起的抗性。它预测新的耐药菌株将不断出现(通过突变或HGT和处理宿主内的阳性选择)并消失(由于抗性的适应度成本)。结果是,虽然抗性是稳定的,但哪些菌株携带抗性是不断变化的。我们使用来自挪威大肠杆菌纵向基因组研究的数据来测试对五种不同药物耐药性的预测,并发现,与模型一致,大多数耐药菌株在出现在数据集中后确实很快消失了。拥有一个解释耐药性动态的模型将使我们能够计划有科学依据的干预措施,以减轻耐药性的负担。
{"title":"Explaining the stable coexistence of drug-resistant and -susceptible pathogens: the resistance acquisition purifying selection model","authors":"Pleuni S. Pennings","doi":"10.1016/j.epidem.2025.100848","DOIUrl":"10.1016/j.epidem.2025.100848","url":null,"abstract":"<div><div>Drug resistance is a problem in many pathogens. While overall, levels of resistance have risen in recent decades, there are many examples where after an initial rise, levels of resistance have stabilized. The stable coexistence of resistance and susceptibility has proven hard to explain – in most evolutionary models, either resistance or susceptibility ultimately “wins” and takes over the population. Here, we show that a simple model, mathematically akin to mutation-selection balance theory, can explain several key observations about drug resistance: (1) the stable coexistence of resistant and susceptible strains (2) at levels that depend on population-level drug usage and (3) with resistance often due to many different strains (resistance is present on many different genetic backgrounds). The model is applicable to resistance due to both mutations and horizontal gene transfer (HGT). It predicts that new resistant strains should continuously appear (through mutation or HGT and positive selection within treated hosts) and disappear (due to a fitness cost of resistance). The result is that while resistance is stable, <em>which</em> strains carry resistance is constantly changing. We used data from a longitudinal genomic study on <em>E. coli</em> in Norway to test this prediction for resistance to five different drugs and found that, consistent with the model, most resistant strains indeed disappear quickly after they appear in the dataset. Having a model that explains the dynamics of drug resistance will allow us to plan science-backed interventions to reduce the burden of drug resistance.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100848"},"PeriodicalIF":2.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144907266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust phylodynamic inference and model specification for HIV transmission dynamics HIV传播动力学的鲁棒系统动力学推断和模型规范
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-07-16 DOI: 10.1016/j.epidem.2025.100846
Fabrícia F. Nascimento , Sanjay R. Mehta , Susan J. Little , Erik M. Volz
The robustness and statistical efficiency of phylodynamic models have been tested by many investigators. However, little attention has been given to model specification and inductive bias that can occur if the model is misspecified or provides an overly simplistic representation of the evolutionary process. Here, we carried out a study involving the simulation of HIV epidemics using a complex model and calibrated to men who have sex with men from San Diego, USA. We then used this epidemic trajectory to simulate genealogies, sequence alignments equivalent to HIV partial pol gene and the complete genome. We proceeded to estimate migration rates using a simplistic representation of the epidemiological model by testing model-based phylodynamics and phylogeographic methods. We observed that even though there were some biases on the estimates using a simplistic representation of the epidemiological model, we were still able to estimate the migration rates depending on the method and sample size used in the analyses.
系统动力学模型的稳健性和统计效率已经被许多研究者检验过。然而,很少有人关注模型规范和归纳偏差,如果模型被错误地指定或提供了一个过于简单的进化过程的表示,则可能发生归纳偏差。在这里,我们进行了一项研究,使用一个复杂的模型来模拟艾滋病毒的流行,并校准了来自美国圣地亚哥的男男性行为者。然后,我们使用这种流行轨迹来模拟家谱,序列比对相当于HIV部分pol基因和完整基因组。我们通过测试基于模型的系统动力学和系统地理学方法,使用流行病学模型的简化表示来估计迁移率。我们观察到,尽管使用流行病学模型的简单表示估计存在一些偏差,但我们仍然能够根据分析中使用的方法和样本量估计迁移率。
{"title":"Robust phylodynamic inference and model specification for HIV transmission dynamics","authors":"Fabrícia F. Nascimento ,&nbsp;Sanjay R. Mehta ,&nbsp;Susan J. Little ,&nbsp;Erik M. Volz","doi":"10.1016/j.epidem.2025.100846","DOIUrl":"10.1016/j.epidem.2025.100846","url":null,"abstract":"<div><div>The robustness and statistical efficiency of phylodynamic models have been tested by many investigators. However, little attention has been given to model specification and inductive bias that can occur if the model is misspecified or provides an overly simplistic representation of the evolutionary process. Here, we carried out a study involving the simulation of HIV epidemics using a complex model and calibrated to men who have sex with men from San Diego, USA. We then used this epidemic trajectory to simulate genealogies, sequence alignments equivalent to HIV partial <em>pol</em> gene and the complete genome. We proceeded to estimate migration rates using a simplistic representation of the epidemiological model by testing model-based phylodynamics and phylogeographic methods. We observed that even though there were some biases on the estimates using a simplistic representation of the epidemiological model, we were still able to estimate the migration rates depending on the method and sample size used in the analyses.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100846"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144655353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving policy-oriented agent-based modeling with history matching: A case study 使用历史匹配改进面向策略的基于代理的建模:一个案例研究
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-07-16 DOI: 10.1016/j.epidem.2025.100845
David O’Gara , Cliff C. Kerr , Daniel J. Klein , Mickaël Binois , Roman Garnett , Ross A. Hammond
Advances in computing power and data availability have led to growing sophistication in mechanistic mathematical models of social dynamics. Increasingly these models are used to inform real-world policy decision-making, often with significant time sensitivity. One such modeling approach is agent-based modeling, which offers particular strengths for capturing spatial and behavioral realism, and for in-silico experiments (varying input parameters and assumptions to explore their downstream impact on key outcomes). To be useful in the real-world, these models must be able to qualitatively or quantitatively capture observed empirical phenomena, forming the starting point for subsequent experimentation. One recent example is the COVID-19 pandemic, where epidemiological agent-based models informed policy and response planning worldwide. Throughout, modeling teams often had to spend valuable time and effort aligning their models to data, also known as calibration. Since many agent-based models are computationally intensive, the calibration process constrains the questions and scenarios policymakers may explore in time-sensitive situations. In this paper, we combine history matching, heteroskedastic Gaussian process modeling, and approximate Bayesian computation to address this bottleneck, substantially increasing efficiency and thus widening the range of utility for policy models. We illustrate our approach with a case study using a previously published and widely used epidemiological model, the Covasim model.
计算能力和数据可用性的进步导致社会动态的机械数学模型越来越复杂。这些模型越来越多地用于为现实世界的政策决策提供信息,通常具有显著的时间敏感性。其中一种建模方法是基于代理的建模,它为捕获空间和行为现实性以及计算机实验(改变输入参数和假设以探索其对关键结果的下游影响)提供了特别的优势。为了在现实世界中发挥作用,这些模型必须能够定性或定量地捕捉到观察到的经验现象,形成后续实验的起点。最近的一个例子是COVID-19大流行,基于流行病学主体的模型为全球的政策和应对规划提供了信息。在整个过程中,建模团队经常不得不花费宝贵的时间和精力将他们的模型与数据对齐,也称为校准。由于许多基于智能体的模型是计算密集型的,校准过程限制了决策者在时间敏感的情况下可能探索的问题和场景。在本文中,我们结合历史匹配、异方差高斯过程建模和近似贝叶斯计算来解决这一瓶颈,大大提高了效率,从而扩大了政策模型的实用范围。我们通过一个案例研究来说明我们的方法,该案例研究使用了先前发表并广泛使用的流行病学模型Covasim模型。
{"title":"Improving policy-oriented agent-based modeling with history matching: A case study","authors":"David O’Gara ,&nbsp;Cliff C. Kerr ,&nbsp;Daniel J. Klein ,&nbsp;Mickaël Binois ,&nbsp;Roman Garnett ,&nbsp;Ross A. Hammond","doi":"10.1016/j.epidem.2025.100845","DOIUrl":"10.1016/j.epidem.2025.100845","url":null,"abstract":"<div><div>Advances in computing power and data availability have led to growing sophistication in mechanistic mathematical models of social dynamics. Increasingly these models are used to inform real-world policy decision-making, often with significant time sensitivity. One such modeling approach is agent-based modeling, which offers particular strengths for capturing spatial and behavioral realism, and for <em>in-silico</em> experiments (varying input parameters and assumptions to explore their downstream impact on key outcomes). To be useful in the real-world, these models must be able to qualitatively or quantitatively capture observed empirical phenomena, forming the starting point for subsequent experimentation. One recent example is the COVID-19 pandemic, where epidemiological agent-based models informed policy and response planning worldwide. Throughout, modeling teams often had to spend valuable time and effort aligning their models to data, also known as calibration. Since many agent-based models are computationally intensive, the calibration process constrains the questions and scenarios policymakers may explore in time-sensitive situations. In this paper, we combine history matching, heteroskedastic Gaussian process modeling, and approximate Bayesian computation to address this bottleneck, substantially increasing efficiency and thus widening the range of utility for policy models. We illustrate our approach with a case study using a previously published and widely used epidemiological model, the Covasim model.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100845"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144672567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the effective reproduction number from wastewater (Rt): A methods comparison 估计废水的有效再生数(Rt):一种方法的比较
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-06-18 DOI: 10.1016/j.epidem.2025.100839
Dustin T. Hill , Yifan Zhu , Christopher Dunham , E. Joe Moran , Yiquan Zhou , Mary B. Collins , Brittany L. Kmush , David A. Larsen

Background

The effective reproduction number (Rt) is a dynamic indicator of current disease spread risk. Wastewater measurements of viral concentrations are known to correlate with clinical measures of diseases and have been incorporated into methods for estimating the Rt.

Methods

We review wastewater-based methods to estimate the Rt for SARS-CoV-2 based on similarity to the reference case-based Rt, ease of use, and computational requirements. Using wastewater data collected between August 1, 2022, and February 20, 2024, from 205 wastewater treatment plants across New York State, we fit eight wastewater Rt models identified from the literature. Each model is compared to the Rt estimated from case data for New York at the sewershed (wastewater treatment plant catchment area), county, and state levels.

Results

We find a high degree of similarity across all eight methods despite differences in model parameters and approach. Further, two methods based on the common measures of percent change and linear fit reproduced the Rt from case data very well and a GLM accurately predicted case data. Model output varied between spatial scales with some models more closely estimating sewershed Rt values than county Rt values. Similarity to clinical models was also highly correlated with the proportion of the population served by sewer in the surveilled communities (r = 0.77).

Conclusions

While not all methods that estimate Rt from wastewater produce the same results, they all provide a way to incorporate wastewater concentration data into epidemic modeling. Our results show that straightforward measures like the percent change can produce similar results of more complex models. Based on the results, researchers and public health officials can select the method that is best for their situation.
有效繁殖数(Rt)是当前疾病传播风险的动态指标。已知废水中病毒浓度的测量与疾病的临床测量相关,并已被纳入估计Rt的方法中。方法基于与参考病例Rt的相似性、易用性和计算要求,我们综述了基于废水的Rt估计方法。利用2022年8月1日至2024年2月20日期间从纽约州205家污水处理厂收集的废水数据,我们拟合了从文献中确定的8个废水Rt模型。每个模型都与纽约下水道(污水处理厂集水区)、县和州一级的病例数据估计的Rt进行比较。结果尽管在模型参数和方法上存在差异,但我们发现所有八种方法都具有高度的相似性。此外,基于变化百分比和线性拟合的两种常用测量方法可以很好地再现病例数据的Rt,并且GLM可以准确地预测病例数据。模型的输出在不同的空间尺度上存在差异,一些模型对下水道Rt值的估计比县Rt值更接近。与临床模型的相似性也与监测社区下水道服务的人口比例高度相关(r = 0.77)。虽然并非所有从废水中估计Rt的方法都产生相同的结果,但它们都提供了一种将废水浓度数据纳入流行病建模的方法。我们的结果表明,像百分比变化这样简单的度量可以在更复杂的模型中产生类似的结果。根据结果,研究人员和公共卫生官员可以选择最适合他们情况的方法。
{"title":"Estimating the effective reproduction number from wastewater (Rt): A methods comparison","authors":"Dustin T. Hill ,&nbsp;Yifan Zhu ,&nbsp;Christopher Dunham ,&nbsp;E. Joe Moran ,&nbsp;Yiquan Zhou ,&nbsp;Mary B. Collins ,&nbsp;Brittany L. Kmush ,&nbsp;David A. Larsen","doi":"10.1016/j.epidem.2025.100839","DOIUrl":"10.1016/j.epidem.2025.100839","url":null,"abstract":"<div><h3>Background</h3><div>The effective reproduction number (R<sub>t</sub>) is a dynamic indicator of current disease spread risk. Wastewater measurements of viral concentrations are known to correlate with clinical measures of diseases and have been incorporated into methods for estimating the R<sub>t</sub>.</div></div><div><h3>Methods</h3><div>We review wastewater-based methods to estimate the R<sub>t</sub> for SARS-CoV-2 based on similarity to the reference case-based R<sub>t</sub>, ease of use, and computational requirements. Using wastewater data collected between August 1, 2022, and February 20, 2024, from 205 wastewater treatment plants across New York State, we fit eight wastewater R<sub>t</sub> models identified from the literature. Each model is compared to the R<sub>t</sub> estimated from case data for New York at the sewershed (wastewater treatment plant catchment area), county, and state levels.</div></div><div><h3>Results</h3><div>We find a high degree of similarity across all eight methods despite differences in model parameters and approach. Further, two methods based on the common measures of percent change and linear fit reproduced the R<sub>t</sub> from case data very well and a GLM accurately predicted case data. Model output varied between spatial scales with some models more closely estimating sewershed R<sub>t</sub> values than county R<sub>t</sub> values. Similarity to clinical models was also highly correlated with the proportion of the population served by sewer in the surveilled communities (r = 0.77).</div></div><div><h3>Conclusions</h3><div>While not all methods that estimate R<sub>t</sub> from wastewater produce the same results, they all provide a way to incorporate wastewater concentration data into epidemic modeling. Our results show that straightforward measures like the percent change can produce similar results of more complex models. Based on the results, researchers and public health officials can select the method that is best for their situation.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100839"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144338640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incident COVID-19 infections before Omicron in the U.S. 在美国欧米克隆之前发生的COVID-19感染事件
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-09-01 Epub Date: 2025-06-09 DOI: 10.1016/j.epidem.2025.100838
Rachel Lobay , Ajitesh Srivastava , Ryan J. Tibshirani , Daniel J. McDonald
The timing and magnitude of COVID-19 infections are of interest to the public and to public health, but these are challenging to ascertain due to the volume of undetected asymptomatic cases and reporting delays. Accurate estimates of COVID-19 infections based on finalized data can improve understanding of the pandemic and provide more meaningful quantification of disease patterns and burden. Therefore, we retrospectively estimate daily incident infections for each U.S. state prior to Omicron. To this end, reported COVID-19 cases are deconvolved to their likely date of infection onset using delay distributions estimated from the CDC line list. Then, a novel serology-driven model is used to scale these deconvolved cases to account for the unreported infections. The resulting infection estimates incorporate variant-specific incubation periods, reinfections, and waning antigenic immunity. They clearly demonstrate that reported cases failed to reflect the full extent of disease burden in all states. Most notably, infections were severely underreported during the Delta wave, with an estimated reporting rate as low as 6.3% in New Jersey, 7.3% in Maryland, and 8.4% in Nevada. Moreover, in 44 states, fewer than 1/3 of infections eventually appeared as case reports, and there were sustained periods where surges in infections were virtually undetectable through reported cases. This pattern was clearly illustrated by North and South Dakota during the spring of 2021, as well as by several Northeastern states during the Delta wave of late summer that year. While reported cases offered a convenient proxy of disease burden, they failed to capture the full extent of infections and severely underestimated the true disease burden. Our retrospective analysis also estimates other important quantities for every state, including variant-specific deconvolved cases, time-varying case ascertainment ratios, as well as infection-hospitalization and infection-fatality ratios.
COVID-19感染的时间和规模是公众和公共卫生关注的问题,但由于大量未发现的无症状病例和报告延误,这些问题很难确定。根据最终数据对COVID-19感染进行准确估计,可以增进对大流行的了解,并提供更有意义的疾病模式和负担量化。因此,我们回顾性地估计了美国各州在欧米克隆之前的每日感染事件。为此,根据CDC清单估计的延迟分布,将报告的COVID-19病例解卷积到其可能的感染发病日期。然后,使用一种新的血清学驱动模型对这些解卷积病例进行缩放,以解释未报告的感染。由此产生的感染估计包括变异特异性潜伏期、再感染和抗原免疫减弱。它们清楚地表明,报告的病例未能反映所有州疾病负担的全部程度。最值得注意的是,在三角洲波期间,感染严重低估,估计报告率在新泽西州低至6.3%,马里兰州为7.3%,内华达州为8.4%。此外,在44个州,不到三分之一的感染最终以病例报告的形式出现,并且在持续的一段时间里,感染的激增几乎无法通过报告的病例检测到。这种模式在2021年春季的北达科他州和南达科他州以及当年夏末的三角洲波期间的几个东北部州都得到了清楚的说明。虽然报告的病例提供了疾病负担的方便代表,但它们未能捕捉到感染的全部程度,并严重低估了真正的疾病负担。我们的回顾性分析还估计了每个州的其他重要数量,包括变异特异性反卷积病例,随时间变化的病例确定比率,以及感染住院率和感染病死率。
{"title":"Incident COVID-19 infections before Omicron in the U.S.","authors":"Rachel Lobay ,&nbsp;Ajitesh Srivastava ,&nbsp;Ryan J. Tibshirani ,&nbsp;Daniel J. McDonald","doi":"10.1016/j.epidem.2025.100838","DOIUrl":"10.1016/j.epidem.2025.100838","url":null,"abstract":"<div><div>The timing and magnitude of COVID-19 infections are of interest to the public and to public health, but these are challenging to ascertain due to the volume of undetected asymptomatic cases and reporting delays. Accurate estimates of COVID-19 infections based on finalized data can improve understanding of the pandemic and provide more meaningful quantification of disease patterns and burden. Therefore, we retrospectively estimate daily incident infections for each U.S. state prior to Omicron. To this end, reported COVID-19 cases are deconvolved to their likely date of infection onset using delay distributions estimated from the CDC line list. Then, a novel serology-driven model is used to scale these deconvolved cases to account for the unreported infections. The resulting infection estimates incorporate variant-specific incubation periods, reinfections, and waning antigenic immunity. They clearly demonstrate that reported cases failed to reflect the full extent of disease burden in all states. Most notably, infections were severely underreported during the Delta wave, with an estimated reporting rate as low as 6.3% in New Jersey, 7.3% in Maryland, and 8.4% in Nevada. Moreover, in 44 states, fewer than 1/3 of infections eventually appeared as case reports, and there were sustained periods where surges in infections were virtually undetectable through reported cases. This pattern was clearly illustrated by North and South Dakota during the spring of 2021, as well as by several Northeastern states during the Delta wave of late summer that year. While reported cases offered a convenient proxy of disease burden, they failed to capture the full extent of infections and severely underestimated the true disease burden. Our retrospective analysis also estimates other important quantities for every state, including variant-specific deconvolved cases, time-varying case ascertainment ratios, as well as infection-hospitalization and infection-fatality ratios.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"52 ","pages":"Article 100838"},"PeriodicalIF":3.0,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144289026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning approaches for real-time ZIP code and county-level estimation of state-wide infectious disease hospitalizations using local health system data 机器学习方法实时邮政编码和县级估计全州传染病住院使用当地卫生系统数据
IF 3 3区 医学 Q2 INFECTIOUS DISEASES Pub Date : 2025-06-01 Epub Date: 2025-04-03 DOI: 10.1016/j.epidem.2025.100823
Tanvir Ahammed , Md Sakhawat Hossain , Christopher McMahan , Lior Rennert
The lack of conventional methods of estimating real-time infectious disease burden in granular regions inhibits timely and efficient public health response. Comprehensive data sources (e.g., state health department data) typically needed for such estimation are often limited due to 1) substantial delays in data reporting and 2) lack of geographic granularity in data provided to researchers. Leveraging real-time local health system data presents an opportunity to overcome these challenges. This study evaluates the effectiveness of machine learning and statistical approaches using local health system data to estimate current and previous COVID-19 hospitalizations in South Carolina. Random Forest models demonstrated consistently higher average median percent agreement accuracy compared to generalized linear mixed models for current weekly hospitalizations across 123 ZIP codes (72.29 %, IQR: 63.20–75.62 %) and 28 counties (76.43 %, IQR: 70.33–81.16 %) with sufficient health system coverage. To account for underrepresented populations in health systems, we combined Random Forest models with Classification and Regression Trees (CART) for imputation. The average median percent agreement was 61.02 % (IQR: 51.17–72.29 %) for all ZIP codes and 72.64 % (IQR: 66.13–77.69 %) for all counties. Median percent agreement for cumulative hospitalizations over the previous 6 months was 80.98 % (IQR: 68.99–89.66 %) for all ZIP codes and 81.17 % (IQR: 68.55–91.33 %) for all counties. These findings emphasize the effectiveness of utilizing real-time health system data to estimate infectious disease burden. Moreover, the methodologies developed in this study can be adapted to estimate hospitalizations for other diseases, offering a valuable tool for public health officials to respond swiftly and effectively to various health crises.
由于缺乏估算细粒度地区实时传染病负担的常规方法,无法及时有效地采取公共卫生应对措施。由于 1) 数据报告严重滞后,2) 提供给研究人员的数据缺乏地理粒度,此类估算通常所需的综合数据源(如州卫生部门数据)往往受到限制。利用当地卫生系统的实时数据为克服这些挑战提供了机会。本研究评估了机器学习和统计方法的有效性,这些方法使用当地卫生系统数据来估算南卡罗来纳州当前和以往的 COVID-19 住院情况。在 123 个邮政编码(72.29%,IQR:63.20-75.62%)和 28 个有足够医疗系统覆盖范围的县(76.43%,IQR:70.33-81.16%)中,随机森林模型与广义线性混合模型相比,在当前每周住院情况方面显示出更高的平均中位数百分比一致性准确率。为了考虑到医疗系统中代表性不足的人群,我们将随机森林模型与分类和回归树 (CART) 结合起来进行估算。所有邮政编码和所有县的平均一致率中位数分别为 61.02 %(IQR:51.17-72.29 %)和 72.64 %(IQR:66.13-77.69 %)。在所有邮政编码中,前 6 个月累计住院治疗的中位同意率为 80.98 %(IQR:68.99-89.66 %),在所有县中为 81.17 %(IQR:68.55-91.33 %)。这些发现强调了利用实时卫生系统数据估算传染病负担的有效性。此外,本研究开发的方法还可用于估算其他疾病的住院人数,为公共卫生官员迅速有效地应对各种卫生危机提供了宝贵的工具。
{"title":"Machine learning approaches for real-time ZIP code and county-level estimation of state-wide infectious disease hospitalizations using local health system data","authors":"Tanvir Ahammed ,&nbsp;Md Sakhawat Hossain ,&nbsp;Christopher McMahan ,&nbsp;Lior Rennert","doi":"10.1016/j.epidem.2025.100823","DOIUrl":"10.1016/j.epidem.2025.100823","url":null,"abstract":"<div><div>The lack of conventional methods of estimating real-time infectious disease burden in granular regions inhibits timely and efficient public health response. Comprehensive data sources (e.g., state health department data) typically needed for such estimation are often limited due to 1) substantial delays in data reporting and 2) lack of geographic granularity in data provided to researchers. Leveraging real-time local health system data presents an opportunity to overcome these challenges. This study evaluates the effectiveness of machine learning and statistical approaches using local health system data to estimate current and previous COVID-19 hospitalizations in South Carolina. Random Forest models demonstrated consistently higher average median percent agreement accuracy compared to generalized linear mixed models for current weekly hospitalizations across 123 ZIP codes (72.29 %, IQR: 63.20–75.62 %) and 28 counties (76.43 %, IQR: 70.33–81.16 %) with sufficient health system coverage. To account for underrepresented populations in health systems, we combined Random Forest models with Classification and Regression Trees (CART) for imputation. The average median percent agreement was 61.02 % (IQR: 51.17–72.29 %) for all ZIP codes and 72.64 % (IQR: 66.13–77.69 %) for all counties. Median percent agreement for cumulative hospitalizations over the previous 6 months was 80.98 % (IQR: 68.99–89.66 %) for all ZIP codes and 81.17 % (IQR: 68.55–91.33 %) for all counties. These findings emphasize the effectiveness of utilizing real-time health system data to estimate infectious disease burden. Moreover, the methodologies developed in this study can be adapted to estimate hospitalizations for other diseases, offering a valuable tool for public health officials to respond swiftly and effectively to various health crises.</div></div>","PeriodicalId":49206,"journal":{"name":"Epidemics","volume":"51 ","pages":"Article 100823"},"PeriodicalIF":3.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143815115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Epidemics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1