Zeynab Salehnasab, A. Mousavizadeh, Ghasem Ghalamfarsa, A. Garavand, C. Salehnasab
{"title":"Predictive Modeling of COVID-19 Hospitalization Using Twenty Machine Learning Classification Algorithms on Cohort Data","authors":"Zeynab Salehnasab, A. Mousavizadeh, Ghasem Ghalamfarsa, A. Garavand, C. Salehnasab","doi":"10.30699/fhi.v12i0.473","DOIUrl":null,"url":null,"abstract":"Introduction: The global COVID-19 pandemic has led to a health crisis, emphasizing the need to identify high-risk patients for effective resource allocation and prioritized hospitalization. Previous studies have been limited in their use of algorithms and variables, while this research expands to include lifestyle factors and optimizes hyperparameters for twenty machine learning algorithms, enhancing prediction accuracy and identifying key predictors.Material and Methods: In this cross-sectional study, we analyzed data from 207 COVID-19 patients. The Boruta algorithm was used to select the best features for twenty classification algorithms, and RandomizedSearchCV was utilized to optimize hyperparameters. The models were evaluated using performance metrics such as accuracy, f-measure, and area under the curve (AUC).Results: The study identified eight key predictors of COVID-19 hospitalization, which include gamma-glutamyl transpeptidase, alkaline phosphatase, diagnosis by CT scan, mean platelet volume, mean corpuscular volume, fasting blood sugar, red blood cell count, and mean corpuscular hemoglobin concentration. By optimizing the hyperparameters of twenty machine learning algorithms, the accuracy and AUC were improved. With an outstanding AUC of 81.25, the XGBClassifier model exhibited superior performance.Conclusion: The findings of this study can assist clinicians in allocating resources effectively and improving patient care. Additionally, this approach can aid healthcare researchers in leveraging artificial intelligence to manage diseases.","PeriodicalId":154611,"journal":{"name":"Frontiers in Health Informatics","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30699/fhi.v12i0.473","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: The global COVID-19 pandemic has led to a health crisis, emphasizing the need to identify high-risk patients for effective resource allocation and prioritized hospitalization. Previous studies have been limited in their use of algorithms and variables, while this research expands to include lifestyle factors and optimizes hyperparameters for twenty machine learning algorithms, enhancing prediction accuracy and identifying key predictors.Material and Methods: In this cross-sectional study, we analyzed data from 207 COVID-19 patients. The Boruta algorithm was used to select the best features for twenty classification algorithms, and RandomizedSearchCV was utilized to optimize hyperparameters. The models were evaluated using performance metrics such as accuracy, f-measure, and area under the curve (AUC).Results: The study identified eight key predictors of COVID-19 hospitalization, which include gamma-glutamyl transpeptidase, alkaline phosphatase, diagnosis by CT scan, mean platelet volume, mean corpuscular volume, fasting blood sugar, red blood cell count, and mean corpuscular hemoglobin concentration. By optimizing the hyperparameters of twenty machine learning algorithms, the accuracy and AUC were improved. With an outstanding AUC of 81.25, the XGBClassifier model exhibited superior performance.Conclusion: The findings of this study can assist clinicians in allocating resources effectively and improving patient care. Additionally, this approach can aid healthcare researchers in leveraging artificial intelligence to manage diseases.