An individualized algorithm to predict mortality in COVID-19 pneumonia: a machine learning based study

Archives of Medical Science : AMS Pub Date : 2022-01-14 DOI:10.5114/aoms/144980

M. Laino, E. Generali, T. Tommasini, G. Angelotti, A. Aghemo, A. Desai, Pierandrea Morandini, G. Stefanini, A. Lleo, A. Voza, V. Savevski

{"title":"An individualized algorithm to predict mortality in COVID-19 pneumonia: a machine learning based study","authors":"M. Laino, E. Generali, T. Tommasini, G. Angelotti, A. Aghemo, A. Desai, Pierandrea Morandini, G. Stefanini, A. Lleo, A. Voza, V. Savevski","doi":"10.5114/aoms/144980","DOIUrl":null,"url":null,"abstract":"Introduction Identifying SARS-CoV-2 patients at higher risk of mortality is crucial in the management of a pandemic. Artificial intelligence techniques allow one to analyze large amounts of data to find hidden patterns. We aimed to develop and validate a mortality score at admission for COVID-19 based on high-level machine learning. Material and methods We conducted a retrospective cohort study on hospitalized adult COVID-19 patients between March and December 2020. The primary outcome was in-hospital mortality. A machine learning approach based on vital parameters, laboratory values and demographic features was applied to develop different models. Then, a feature importance analysis was performed to reduce the number of variables included in the model, to develop a risk score with good overall performance, that was finally evaluated in terms of discrimination and calibration capabilities. All results underwent cross-validation. Results 1,135 consecutive patients (median age 70 years, 64% male) were enrolled, 48 patients were excluded, and the cohort was randomly divided into training (760) and test (327) groups. During hospitalization, 251 (22%) patients died. After feature selection, the best performing classifier was random forest (AUC 0.88 ±0.03). Based on the relative importance of each variable, a pragmatic score was developed, showing good performances (AUC 0.85 ±0.025), and three levels were defined that correlated well with in-hospital mortality. Conclusions Machine learning techniques were applied in order to develop an accurate in-hospital mortality risk score for COVID-19 based on ten variables. The application of the proposed score has utility in clinical settings to guide the management and prognostication of COVID-19 patients.","PeriodicalId":190584,"journal":{"name":"Archives of Medical Science : AMS","volume":"438 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Medical Science : AMS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5114/aoms/144980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Introduction Identifying SARS-CoV-2 patients at higher risk of mortality is crucial in the management of a pandemic. Artificial intelligence techniques allow one to analyze large amounts of data to find hidden patterns. We aimed to develop and validate a mortality score at admission for COVID-19 based on high-level machine learning. Material and methods We conducted a retrospective cohort study on hospitalized adult COVID-19 patients between March and December 2020. The primary outcome was in-hospital mortality. A machine learning approach based on vital parameters, laboratory values and demographic features was applied to develop different models. Then, a feature importance analysis was performed to reduce the number of variables included in the model, to develop a risk score with good overall performance, that was finally evaluated in terms of discrimination and calibration capabilities. All results underwent cross-validation. Results 1,135 consecutive patients (median age 70 years, 64% male) were enrolled, 48 patients were excluded, and the cohort was randomly divided into training (760) and test (327) groups. During hospitalization, 251 (22%) patients died. After feature selection, the best performing classifier was random forest (AUC 0.88 ±0.03). Based on the relative importance of each variable, a pragmatic score was developed, showing good performances (AUC 0.85 ±0.025), and three levels were defined that correlated well with in-hospital mortality. Conclusions Machine learning techniques were applied in order to develop an accurate in-hospital mortality risk score for COVID-19 based on ten variables. The application of the proposed score has utility in clinical settings to guide the management and prognostication of COVID-19 patients.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预测COVID-19肺炎死亡率的个性化算法:一项基于机器学习的研究

识别死亡风险较高的SARS-CoV-2患者对于大流行管理至关重要。人工智能技术允许人们分析大量数据以发现隐藏的模式。我们的目标是基于高级机器学习开发和验证COVID-19入院时的死亡率评分。材料与方法对2020年3月至12月住院的成人COVID-19患者进行回顾性队列研究。主要终点是住院死亡率。基于重要参数、实验室值和人口特征的机器学习方法被应用于开发不同的模型。然后，进行特征重要性分析，以减少模型中包含的变量数量，形成一个整体性能良好的风险评分，最后根据区分和校准能力对其进行评估。所有结果进行交叉验证。结果连续入组1135例患者(中位年龄70岁，男性64%)，排除48例患者，随机分为训练组(760例)和试验组(327例)。在住院期间，251例(22%)患者死亡。经过特征选择后，表现最好的分类器为随机森林(AUC为0.88±0.03)。根据每个变量的相对重要性，制定了实用评分，表现良好(AUC为0.85±0.025)，并定义了三个与住院死亡率相关的水平。结论应用机器学习技术基于10个变量建立准确的COVID-19住院死亡风险评分。该评分的应用在临床环境中具有指导COVID-19患者管理和预后的效用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Archives of Medical Science : AMS

自引率

0.00%

发文量