机器学习模型基于基线临床和免疫因素预测长期COVID结果。

medRxiv : the preprint server for health sciences Pub Date : 2025-02-13 DOI:10.1101/2025.02.12.25322164

Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman

{"title":"机器学习模型基于基线临床和免疫因素预测长期COVID结果。","authors":"Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman","doi":"10.1101/2025.02.12.25322164","DOIUrl":null,"url":null,"abstract":"The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844586/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors.\",\"authors\":\"Naresh Doni Jayavelu, Hady Samaha, Sonia Tandon Wimalasena, Annmarie Hoch, Jeremy P Gygi, Gisela Gabernet, Al Ozonoff, Shanshan Liu, Carly E Milliren, Ofer Levy, Lindsey R Baden, Esther Melamed, Lauren I R Ehrlich, Grace A McComsey, Rafick P Sekaly, Charles B Cairns, Elias K Haddad, Joanna Schaenman, Albert C Shaw, David A Hafler, Ruth R Montgomery, David B Corry, Farrah Kheradmand, Mark A Atkinson, Scott C Brakenridge, Nelson I Agudelo Higuita, Jordan P Metcalf, Catherine L Hough, William B Messer, Bali Pulendran, Kari C Nadeau, Mark M Davis, Linda N Geng, Ana Fernandez Sesma, Viviana Simon, Florian Krammer, Monica Kraft, Chris Bime, Carolyn S Calfee, David J Erle, Charles R Langelier, Leying Guan, Holden T Maecker, Bjoern Peters, Steven H Kleinstein, Elaine F Reed, Joann Diray-Arce, Nadine Rouphael, Matthew C Altman\",\"doi\":\"10.1101/2025.02.12.25322164\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11844586/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.02.12.25322164\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.02.12.25322164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

SARS-CoV-2 （PASC）的急性后后遗症，也被称为长冠状病毒，仍然是一个尚未完全了解的重大健康问题。由于缺乏既定的生物标志物、明确的疾病机制或明确的亚表型，预测哪些急性感染者将继续发展长期COVID是具有挑战性的。机器学习（ML）模型通过利用临床数据来提高诊断精度，提供了解决这一问题的潜力。我们利用住院时收集的临床数据，包括抗体滴度和病毒载量测量，来预测急性COVID-19发展为长期COVID-19的可能性。我们的机器学习模型实现了中位AUROC值在0.64到0.66之间，AUPRC值在0.51到0.54之间，证明了它们的预测能力。特征重要性分析显示，入院时低抗体滴度和高病毒载量是长期COVID结果的最强预测因子。包括慢性呼吸系统、心脏和神经系统疾病以及女性在内的合并症也被确定为长期COVID的重要危险因素。我们的研究结果表明，ML模型有可能根据基线临床特征识别出有发展长期COVID风险的患者。这些模型可以帮助指导早期干预措施，改善患者预后，减轻SARS-CoV-2对公共卫生的长期影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors.

The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision. We utilized clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, to predict the likelihood of acute COVID-19 progressing to long COVID. Our machine learning models achieved median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating their predictive capabilities. Feature importance analysis revealed that low antibody titers and high viral loads at hospital admission were the strongest predictors of long COVID outcomes. Comorbidities, including chronic respiratory, cardiac, and neurologic diseases, as well as female sex, were also identified as significant risk factors for long COVID. Our findings suggest that ML models have the potential to identify patients at risk for developing long COVID based on baseline clinical characteristics. These models can help guide early interventions, improving patient outcomes and mitigating the long-term public health impacts of SARS-CoV-2.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

medRxiv : the preprint server for health sciences

自引率

0.00%

发文量