{"title":"利用重症监护中的电子健康记录建立 COVID-19 预测模型的综合基准","authors":"Junyi Gao, Yinghao Zhu, Wenqing Wang, Zixiang Wang, Guiying Dong, Wen Tang, Hao Wang, Yasha Wang, Ewen M. Harrison, Liantao Ma","doi":"10.1016/j.patter.2024.100951","DOIUrl":null,"url":null,"abstract":"The COVID-19 pandemic highlighted the need for predictive deep-learning models in health care. However, practical prediction task design, fair comparison, and model selection for clinical applications remain a challenge. To address this, we introduce and evaluate two new prediction tasks—outcome-specific length-of-stay and early-mortality prediction for COVID-19 patients in intensive care—which better reflect clinical realities. We developed evaluation metrics, model adaptation designs, and open-source data preprocessing pipelines for these tasks while also evaluating 18 predictive models, including clinical scoring methods and traditional machine-learning, basic deep-learning, and advanced deep-learning models, tailored for electronic health record (EHR) data. Benchmarking results from two real-world COVID-19 EHR datasets are provided, and all results and trained models have been released on an online platform for use by clinicians and researchers. Our efforts contribute to the advancement of deep-learning and machine-learning research in pandemic predictive modeling.","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"65 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comprehensive benchmark for COVID-19 predictive modeling using electronic health records in intensive care\",\"authors\":\"Junyi Gao, Yinghao Zhu, Wenqing Wang, Zixiang Wang, Guiying Dong, Wen Tang, Hao Wang, Yasha Wang, Ewen M. Harrison, Liantao Ma\",\"doi\":\"10.1016/j.patter.2024.100951\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The COVID-19 pandemic highlighted the need for predictive deep-learning models in health care. However, practical prediction task design, fair comparison, and model selection for clinical applications remain a challenge. To address this, we introduce and evaluate two new prediction tasks—outcome-specific length-of-stay and early-mortality prediction for COVID-19 patients in intensive care—which better reflect clinical realities. We developed evaluation metrics, model adaptation designs, and open-source data preprocessing pipelines for these tasks while also evaluating 18 predictive models, including clinical scoring methods and traditional machine-learning, basic deep-learning, and advanced deep-learning models, tailored for electronic health record (EHR) data. Benchmarking results from two real-world COVID-19 EHR datasets are provided, and all results and trained models have been released on an online platform for use by clinicians and researchers. Our efforts contribute to the advancement of deep-learning and machine-learning research in pandemic predictive modeling.\",\"PeriodicalId\":36242,\"journal\":{\"name\":\"Patterns\",\"volume\":\"65 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Patterns\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.patter.2024.100951\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Patterns","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.patter.2024.100951","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
COVID-19 大流行凸显了医疗保健领域对预测性深度学习模型的需求。然而,临床应用中的实际预测任务设计、公平比较和模型选择仍然是一项挑战。为了解决这个问题,我们引入并评估了两个新的预测任务--COVID-19 重症监护患者的特定住院时间和早期死亡预测--这两个任务更好地反映了临床实际情况。我们为这些任务开发了评估指标、模型适配设计和开源数据预处理管道,同时还评估了 18 种预测模型,包括临床评分方法、传统机器学习模型、基础深度学习模型和高级深度学习模型,这些模型都是为电子健康记录(EHR)数据量身定制的。我们提供了两个真实世界 COVID-19 EHR 数据集的基准测试结果,并在一个在线平台上发布了所有结果和训练模型,供临床医生和研究人员使用。我们的努力有助于推动大流行病预测建模方面的深度学习和机器学习研究。
A comprehensive benchmark for COVID-19 predictive modeling using electronic health records in intensive care
The COVID-19 pandemic highlighted the need for predictive deep-learning models in health care. However, practical prediction task design, fair comparison, and model selection for clinical applications remain a challenge. To address this, we introduce and evaluate two new prediction tasks—outcome-specific length-of-stay and early-mortality prediction for COVID-19 patients in intensive care—which better reflect clinical realities. We developed evaluation metrics, model adaptation designs, and open-source data preprocessing pipelines for these tasks while also evaluating 18 predictive models, including clinical scoring methods and traditional machine-learning, basic deep-learning, and advanced deep-learning models, tailored for electronic health record (EHR) data. Benchmarking results from two real-world COVID-19 EHR datasets are provided, and all results and trained models have been released on an online platform for use by clinicians and researchers. Our efforts contribute to the advancement of deep-learning and machine-learning research in pandemic predictive modeling.