Karen E Villagrana-Bañuelos, Valeria Maeda-Gutiérrez, Vanessa Alcalá-Rmz, Juan J Oropeza-Valdez, Ana S Herrera-Van Oostdam, Julio E Castañeda-Delgado, Jesús Adrián López, Juan C Borrego Moreno, Carlos E Galván-Tejada, Jorge I Galván-Tejeda, Hamurabi Gamboa-Rosales, Huizilopoztli Luna-García, José M Celaya-Padilla, Yamilé López-Hernández
{"title":"COVID-19 Outcome Prediction by Integrating Clinical and Metabolic Data using Machine Learning Algorithms.","authors":"Karen E Villagrana-Bañuelos, Valeria Maeda-Gutiérrez, Vanessa Alcalá-Rmz, Juan J Oropeza-Valdez, Ana S Herrera-Van Oostdam, Julio E Castañeda-Delgado, Jesús Adrián López, Juan C Borrego Moreno, Carlos E Galván-Tejada, Jorge I Galván-Tejeda, Hamurabi Gamboa-Rosales, Huizilopoztli Luna-García, José M Celaya-Padilla, Yamilé López-Hernández","doi":"10.24875/RIC.22000182","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The coronavirus disease (COVID-19) is an infectious disease caused by the SARS-CoV-2 virus and is responsible for nearly 6 million deaths worldwide in the past 2 years. Machine learning (ML) models could help physicians in identifying high-risk individuals.</p><p><strong>Objectives: </strong>To study the use of ML models for COVID-19 prediction outcomes using clinical data and a combination of clinical and metabolic data, measured in a metabolomics facility from a public university.</p><p><strong>Methods: </strong>A total of 154 patients were included in the study. \"Basic profile\" was considered with clinical and demographic variables (33 variables), whereas in the \"extended profile,\" metabolomic and immunological variables were also considered (156 characteristics). A selection of features was carried out for each of the profiles with a genetic algorithm (GA) and random forest models were trained and tested to predict each of the stages of COVID-19.</p><p><strong>Results: </strong>The model based on extended profile was more useful in early stages of the disease. Models based on clinical data were preferred for predicting severe and critical illness and death. ML detected trimethylamine N-oxide, lipid mediators, and neutrophil/lymphocyte ratio as important variables.</p><p><strong>Conclusions: </strong>ML and GAs provided adequate models to predict COVID-19 outcomes in patients with different severity grades.</p>","PeriodicalId":49612,"journal":{"name":"Revista De Investigacion Clinica-Clinical and Translational Investigation","volume":"74 6","pages":"314-327"},"PeriodicalIF":1.4000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista De Investigacion Clinica-Clinical and Translational Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.24875/RIC.22000182","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 1
Abstract
Background: The coronavirus disease (COVID-19) is an infectious disease caused by the SARS-CoV-2 virus and is responsible for nearly 6 million deaths worldwide in the past 2 years. Machine learning (ML) models could help physicians in identifying high-risk individuals.
Objectives: To study the use of ML models for COVID-19 prediction outcomes using clinical data and a combination of clinical and metabolic data, measured in a metabolomics facility from a public university.
Methods: A total of 154 patients were included in the study. "Basic profile" was considered with clinical and demographic variables (33 variables), whereas in the "extended profile," metabolomic and immunological variables were also considered (156 characteristics). A selection of features was carried out for each of the profiles with a genetic algorithm (GA) and random forest models were trained and tested to predict each of the stages of COVID-19.
Results: The model based on extended profile was more useful in early stages of the disease. Models based on clinical data were preferred for predicting severe and critical illness and death. ML detected trimethylamine N-oxide, lipid mediators, and neutrophil/lymphocyte ratio as important variables.
Conclusions: ML and GAs provided adequate models to predict COVID-19 outcomes in patients with different severity grades.
期刊介绍:
The Revista de Investigación Clínica – Clinical and Translational Investigation (RIC-C&TI), publishes original clinical and biomedical research of interest to physicians in internal medicine, surgery, and any of their specialties. The Revista de Investigación Clínica – Clinical and Translational Investigation is the official journal of the National Institutes of Health of Mexico, which comprises a group of Institutes and High Specialty Hospitals belonging to the Ministery of Health. The journal is published both on-line and in printed version, appears bimonthly and publishes peer-reviewed original research articles as well as brief and in-depth reviews. All articles published are open access and can be immediately and permanently free for everyone to read and download. The journal accepts clinical and molecular research articles, short reports and reviews.
Types of manuscripts:
– Brief Communications
– Research Letters
– Original Articles
– Brief Reviews
– In-depth Reviews
– Perspectives
– Letters to the Editor