Mehmet Ulvi Saygi Ayvaci, Varghese S Jacobi, Young Ryu, Saikrishna Pannaga Srikar Gundreddy, Bekir Tanriover
{"title":"Clinically Guided Adaptive Machine Learning Update Strategies for Predicting Severe COVID-19 Outcomes.","authors":"Mehmet Ulvi Saygi Ayvaci, Varghese S Jacobi, Young Ryu, Saikrishna Pannaga Srikar Gundreddy, Bekir Tanriover","doi":"10.1016/j.amjmed.2024.10.011","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Machine learning algorithms are essential for predicting severe outcomes during public health crises like COVID-19. However, the dynamic nature of diseases requires continual evaluation and updating of these algorithms. This study aims to compare three update strategies for predicting severe COVID-19 outcomes postdiagnosis: \"naive\" (a single initial model), \"frequent\" (periodic retraining), and \"context-driven\" (retraining informed by clinical insights). The goal is to determine the most effective timing and approach for adapting algorithms to evolving disease dynamics and emerging data.</p><p><strong>Methods: </strong>A dataset of 1.11 million COVID-19 patients from diverse U.S. regions was used to develop and validate an XGBoost algorithm for predicting severe outcomes upon diagnosis. Data included patient demographics, vital signs, comorbidities, and immunity-related factors (prior infection and vaccination status) from January 2007 to November 2021. The study analyzed the performance of the three update strategies from March 2020 to November 2021.</p><p><strong>Results: </strong>Predictive features changed over the pandemic, with comorbidities and vitals being significant initially, and geography, demographics, and immunity-related variables gaining importance later. The \"naive\" strategy had an average area under the curve (AUC) of 0.77, the \"frequent\" strategy maintained stability with an average AUC of 0.81, and the \"context-driven\" strategy averaged an AUC of 0.80, outperforming the \"naive\" strategy and aligning closely with the \"frequent\" strategy.</p><p><strong>Conclusions: </strong>A context-driven approach, guided by clinical insights, can enhance predictive performance and offer cost-effective solutions for dynamic public health challenges. These findings have significant implications for efficiently managing healthcare resources during evolving disease outbreaks.</p>","PeriodicalId":50807,"journal":{"name":"American Journal of Medicine","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.amjmed.2024.10.011","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Machine learning algorithms are essential for predicting severe outcomes during public health crises like COVID-19. However, the dynamic nature of diseases requires continual evaluation and updating of these algorithms. This study aims to compare three update strategies for predicting severe COVID-19 outcomes postdiagnosis: "naive" (a single initial model), "frequent" (periodic retraining), and "context-driven" (retraining informed by clinical insights). The goal is to determine the most effective timing and approach for adapting algorithms to evolving disease dynamics and emerging data.
Methods: A dataset of 1.11 million COVID-19 patients from diverse U.S. regions was used to develop and validate an XGBoost algorithm for predicting severe outcomes upon diagnosis. Data included patient demographics, vital signs, comorbidities, and immunity-related factors (prior infection and vaccination status) from January 2007 to November 2021. The study analyzed the performance of the three update strategies from March 2020 to November 2021.
Results: Predictive features changed over the pandemic, with comorbidities and vitals being significant initially, and geography, demographics, and immunity-related variables gaining importance later. The "naive" strategy had an average area under the curve (AUC) of 0.77, the "frequent" strategy maintained stability with an average AUC of 0.81, and the "context-driven" strategy averaged an AUC of 0.80, outperforming the "naive" strategy and aligning closely with the "frequent" strategy.
Conclusions: A context-driven approach, guided by clinical insights, can enhance predictive performance and offer cost-effective solutions for dynamic public health challenges. These findings have significant implications for efficiently managing healthcare resources during evolving disease outbreaks.
期刊介绍:
The American Journal of Medicine - "The Green Journal" - publishes original clinical research of interest to physicians in internal medicine, both in academia and community-based practice. AJM is the official journal of the Alliance for Academic Internal Medicine, a prestigious group comprising internal medicine department chairs at more than 125 medical schools across the U.S. Each issue carries useful reviews as well as seminal articles of immediate interest to the practicing physician, including peer-reviewed, original scientific studies that have direct clinical significance and position papers on health care issues, medical education, and public policy.