基于机器学习模型集合的Covid-19患者早期死亡风险预测

Harsh Walia, J. S.
{"title":"基于机器学习模型集合的Covid-19患者早期死亡风险预测","authors":"Harsh Walia, J. S.","doi":"10.1109/ComPE53109.2021.9751945","DOIUrl":null,"url":null,"abstract":"COVID-19, which is subsequently named as SARS-CoV-2, First Human case was found in the City of Wuhan, from China, in Dec 2019. After that, the World health organization (WHO) has declared Coronavirus as a Pandemic on 11th March 2020. In this study, our primary aim is to Detect the Severe Covid-19 patient in the Early Stages by looking at the information on admission laboratory values, demographics, comorbidities, admission medications, admission supplementary oxygen orders, discharge, and mortality. 4711 patient’s dataset with confirmed SARS-CoV-2 infections are included in the study. Each Patient has total of 85 Features in the Dataset. So, we have Filtered the Top Best 35 features out of 85 features from the Dataset using the seven different feature Selection algorithm and taken the most common features out from the different feature Selection algorithm. After selecting the top most essential features, we have applied around 17 different kinds of ML models like Linear Regression, Logistic regression, SVM, LinearSVC, MLP-Classifier, Decision Tree Classifier, Gradient Boosting Classifier, AdaBoost, Random Forest, XGBoost, LightGBM Classifier, Ridge Classifier, Bagging Classifier, ExtraTreeClassifier, KNN, Naive Bayes, Neural network with Keras, and finally, a Voting Classifier which is the ensemble of all the Top Models from the above-mentioned Models. Finally, all Models are Compared on the basis of Area under the receiver operating characteristic (AUC) & get the best AUC as 0.89.","PeriodicalId":211704,"journal":{"name":"2021 International Conference on Computational Performance Evaluation (ComPE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Early Mortality Risk Prediction in Covid-19 Patients Using an Ensemble of Machine Learning Models\",\"authors\":\"Harsh Walia, J. S.\",\"doi\":\"10.1109/ComPE53109.2021.9751945\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"COVID-19, which is subsequently named as SARS-CoV-2, First Human case was found in the City of Wuhan, from China, in Dec 2019. After that, the World health organization (WHO) has declared Coronavirus as a Pandemic on 11th March 2020. In this study, our primary aim is to Detect the Severe Covid-19 patient in the Early Stages by looking at the information on admission laboratory values, demographics, comorbidities, admission medications, admission supplementary oxygen orders, discharge, and mortality. 4711 patient’s dataset with confirmed SARS-CoV-2 infections are included in the study. Each Patient has total of 85 Features in the Dataset. So, we have Filtered the Top Best 35 features out of 85 features from the Dataset using the seven different feature Selection algorithm and taken the most common features out from the different feature Selection algorithm. After selecting the top most essential features, we have applied around 17 different kinds of ML models like Linear Regression, Logistic regression, SVM, LinearSVC, MLP-Classifier, Decision Tree Classifier, Gradient Boosting Classifier, AdaBoost, Random Forest, XGBoost, LightGBM Classifier, Ridge Classifier, Bagging Classifier, ExtraTreeClassifier, KNN, Naive Bayes, Neural network with Keras, and finally, a Voting Classifier which is the ensemble of all the Top Models from the above-mentioned Models. Finally, all Models are Compared on the basis of Area under the receiver operating characteristic (AUC) & get the best AUC as 0.89.\",\"PeriodicalId\":211704,\"journal\":{\"name\":\"2021 International Conference on Computational Performance Evaluation (ComPE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computational Performance Evaluation (ComPE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ComPE53109.2021.9751945\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Performance Evaluation (ComPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComPE53109.2021.9751945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

COVID-19,后来被命名为SARS-CoV-2,于2019年12月在中国武汉市发现了首例人类病例。此后,世界卫生组织(世卫组织)于2020年3月11日宣布冠状病毒为大流行。在这项研究中,我们的主要目的是通过查看入院实验室值、人口统计学、合并症、入院药物、入院补充氧单、出院和死亡率等信息,在早期发现重症Covid-19患者。4711例确诊的SARS-CoV-2感染患者的数据集被纳入研究。每个患者在数据集中共有85个特征。因此,我们使用七种不同的特征选择算法从数据集中的85个特征中筛选出了最好的35个特征,并从不同的特征选择算法中提取了最常见的特征。在选择了最重要的特征后,我们应用了大约17种不同的ML模型,如线性回归,逻辑回归,SVM,线性svc, mlp分类器,决策树分类器,梯度增强分类器,AdaBoost,随机森林,XGBoost, LightGBM分类器,Ridge分类器,Bagging分类器,extratreecclassifier, KNN,朴素贝叶斯,神经网络与Keras,最后,一个投票分类器,它是上述模型中所有顶级模型的集合。最后,根据接收机工作特性下面积(Area under the receiver operating characteristic, AUC)对各模型进行比较,得到最佳AUC为0.89。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Early Mortality Risk Prediction in Covid-19 Patients Using an Ensemble of Machine Learning Models
COVID-19, which is subsequently named as SARS-CoV-2, First Human case was found in the City of Wuhan, from China, in Dec 2019. After that, the World health organization (WHO) has declared Coronavirus as a Pandemic on 11th March 2020. In this study, our primary aim is to Detect the Severe Covid-19 patient in the Early Stages by looking at the information on admission laboratory values, demographics, comorbidities, admission medications, admission supplementary oxygen orders, discharge, and mortality. 4711 patient’s dataset with confirmed SARS-CoV-2 infections are included in the study. Each Patient has total of 85 Features in the Dataset. So, we have Filtered the Top Best 35 features out of 85 features from the Dataset using the seven different feature Selection algorithm and taken the most common features out from the different feature Selection algorithm. After selecting the top most essential features, we have applied around 17 different kinds of ML models like Linear Regression, Logistic regression, SVM, LinearSVC, MLP-Classifier, Decision Tree Classifier, Gradient Boosting Classifier, AdaBoost, Random Forest, XGBoost, LightGBM Classifier, Ridge Classifier, Bagging Classifier, ExtraTreeClassifier, KNN, Naive Bayes, Neural network with Keras, and finally, a Voting Classifier which is the ensemble of all the Top Models from the above-mentioned Models. Finally, all Models are Compared on the basis of Area under the receiver operating characteristic (AUC) & get the best AUC as 0.89.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
iSIMP with Integrity Validation using MD5 Hash A Fault Detection Scheme for IoT-enabled WSNs YOLOv3 based Real Time Social Distance Violation Detection in Public Places Finite Element Analysis of Femur Bone under Different Loading Conditions An Efficient and Anonymous Authentication Key Agreement Protocol for Smart Transportation System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1