Objectives: Early prediction of critical COVID-19 disease is crucial for an optimal clinical management. The objective of this study was to optimize predictive models for critical COVID-19 disease. Clinical data, laboratory data and genetic polymorphisms were integrated into AI models to compare the performance of different machine learning algorithms.
Methods: Data from 155 inpatients were analyzed, 23 of whom developed critical disease. A univariate analysis was performed to assess potential correlations between seven SNPs, nine clinical variables and 10 laboratory parameters at admission.
Results: Of the 7 SNPs, only three SNPs demonstrated a significant association with critical disase, namely: rs77534576, rs10774671 and rs10490770. The ensemble models exhibited the best performance: Random Forest (AUC=0.989), XGBoost (AUC=0.954) and AdaBoost (AUC=0.927). Variable importance varied across models, with age, C-reactive protein, heart diseases and the three SNPs being the most influential features. The predictive power of models improved with the integration of the three SNPs, as compared to previous studies where genetic data were not included. Internal validation confirmed the superiority and stability of the ensemble models.
Conclusions: Machine learning models may help predict progression into critical COVID-19-disease. The predictive power of models improves when SNPs associated with COVID-19 severity are integrated with laboratory and clinical data. Prior to implementation in clinical practice, larger studies in different populations are needed to validate and support the generalization of these results.
扫码关注我们
求助内容:
应助结果提醒方式:
