Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors.

Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman
{"title":"Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors.","authors":"Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman","doi":"10.1101/2025.02.12.25322164","DOIUrl":null,"url":null,"abstract":"<p><p>The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844586/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.02.12.25322164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习模型基于基线临床和免疫因素预测长期COVID结果。
SARS-CoV-2 (PASC)的急性后后遗症,也被称为长冠状病毒,仍然是一个尚未完全了解的重大健康问题。由于缺乏既定的生物标志物、明确的疾病机制或明确的亚表型,预测哪些急性感染者将继续发展长期COVID是具有挑战性的。机器学习(ML)模型通过利用临床数据来提高诊断精度,提供了解决这一问题的潜力。我们利用住院时收集的临床数据,包括抗体滴度和病毒载量测量,来预测急性COVID-19发展为长期COVID-19的可能性。我们的机器学习模型实现了中位AUROC值在0.64到0.66之间,AUPRC值在0.51到0.54之间,证明了它们的预测能力。特征重要性分析显示,入院时低抗体滴度和高病毒载量是长期COVID结果的最强预测因子。包括慢性呼吸系统、心脏和神经系统疾病以及女性在内的合并症也被确定为长期COVID的重要危险因素。我们的研究结果表明,ML模型有可能根据基线临床特征识别出有发展长期COVID风险的患者。这些模型可以帮助指导早期干预措施,改善患者预后,减轻SARS-CoV-2对公共卫生的长期影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
US Overdose Mortality Saw First Drop Below the Jalal-Burke Exponential Growth Curve in 2024. Suicidal thoughts and behaviours in Cape Town: a cross-sectional study of prevalence, social, contextual, and clinical correlates. Melanocyte loss dominates the vitiligo transcriptome: a rank-based meta-analysis. Fatigue Links Sociodemographic Risk to Pain Intensity and Spread in Two Surgical Cohorts. The Neuroendocrine Profile During the Trier Social Stress Test in College Freshmen Offers Insights into the Emergence of Anxiety and Depression Symptoms.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1