Syed Aqif Mukhtar, Benjamin R McFadden, Md Tauhidul Islam, Qiu Yue Zhang, Ehsan Alvandi, Philippa Blatchford, Samantha Maybury, John Blakey, Pammy Yeoh, Brendon C McMullen
{"title":"Predictive analytics for early detection of hospital-acquired complications: An artificial intelligence approach.","authors":"Syed Aqif Mukhtar, Benjamin R McFadden, Md Tauhidul Islam, Qiu Yue Zhang, Ehsan Alvandi, Philippa Blatchford, Samantha Maybury, John Blakey, Pammy Yeoh, Brendon C McMullen","doi":"10.1177/18333583241256048","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Hospital-acquired complications (HACs) have an adverse impact on patient recovery by impeding their path to full recovery and increasing healthcare costs.</p><p><strong>Objective: </strong>The aim of this study was to create a HAC risk prediction machine learning (ML) framework using hospital administrative data collections within North Metropolitan Health Service (NMHS), Western Australia.</p><p><strong>Method: </strong>A retrospective cohort study was performed among 64,315 patients between July 2020 to June 2022 to develop an automated ML framework by inputting HAC and the healthcare site to obtain site-specific predictive algorithms for patients admitted to the hospital in NMHS. Univariate analysis was used for initial feature screening for 270 variables. Of these, 77 variables had significant relationship with any HAC. After excluding non-contemporaneous data, 37 variables were included in developing the ML framework based on logistic regression (LR), decision tree (DT) and random forest (RF) models to predict occurrence of four specific HACs: delirium, aspiration pneumonia, pneumonia and urinary tract infection.</p><p><strong>Results: </strong>All models exhibited similar performance with area under the curve scores around 0.90 for both training and testing datasets. For sensitivity, DT and RF exceeded LR performance while on average, false positives were lowest for LR-based models. Patient's length of stay, Charlson Index, operation length and intensive care unit stay were common predictors.</p><p><strong>Conclusion: </strong>Integrating ML-based risk detection systems into clinical workflows can potentially enhance patient safety and optimise resource allocation. LR-based models exhibited best performance.</p><p><strong>Implications: </strong>We have successfully developed a \"real-time\" risk prediction model, where patient risk scores are calculated and reviewed daily.</p>","PeriodicalId":73210,"journal":{"name":"Health information management : journal of the Health Information Management Association of Australia","volume":" ","pages":"18333583241256048"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health information management : journal of the Health Information Management Association of Australia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/18333583241256048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Hospital-acquired complications (HACs) have an adverse impact on patient recovery by impeding their path to full recovery and increasing healthcare costs.
Objective: The aim of this study was to create a HAC risk prediction machine learning (ML) framework using hospital administrative data collections within North Metropolitan Health Service (NMHS), Western Australia.
Method: A retrospective cohort study was performed among 64,315 patients between July 2020 to June 2022 to develop an automated ML framework by inputting HAC and the healthcare site to obtain site-specific predictive algorithms for patients admitted to the hospital in NMHS. Univariate analysis was used for initial feature screening for 270 variables. Of these, 77 variables had significant relationship with any HAC. After excluding non-contemporaneous data, 37 variables were included in developing the ML framework based on logistic regression (LR), decision tree (DT) and random forest (RF) models to predict occurrence of four specific HACs: delirium, aspiration pneumonia, pneumonia and urinary tract infection.
Results: All models exhibited similar performance with area under the curve scores around 0.90 for both training and testing datasets. For sensitivity, DT and RF exceeded LR performance while on average, false positives were lowest for LR-based models. Patient's length of stay, Charlson Index, operation length and intensive care unit stay were common predictors.
Conclusion: Integrating ML-based risk detection systems into clinical workflows can potentially enhance patient safety and optimise resource allocation. LR-based models exhibited best performance.
Implications: We have successfully developed a "real-time" risk prediction model, where patient risk scores are calculated and reviewed daily.
背景:医院获得性并发症(HAC医院获得性并发症(HAC)会阻碍患者完全康复并增加医疗成本,从而对患者的康复产生不利影响:本研究旨在利用西澳大利亚北都会卫生服务(NMHS)收集的医院管理数据,创建一个HAC风险预测机器学习(ML)框架:在2020年7月至2022年6月期间,对64,315名患者进行了回顾性队列研究,通过输入HAC和医疗机构来开发自动ML框架,从而获得NMHS医院住院患者的特定地点预测算法。采用单变量分析对 270 个变量进行初步特征筛选。其中,77 个变量与任何 HAC 都有显著关系。在排除非同期数据后,37 个变量被纳入基于逻辑回归(LR)、决策树(DT)和随机森林(RF)模型的 ML 框架,以预测四种特定 HAC 的发生:谵妄、吸入性肺炎、肺炎和尿路感染:所有模型都表现出相似的性能,训练和测试数据集的曲线下面积均在 0.90 左右。就灵敏度而言,DT 和 RF 超过了 LR,而平均而言,基于 LR 的模型的误报率最低。患者的住院时间、查尔森指数、手术时间和重症监护室住院时间是常见的预测因素:结论:将基于 ML 的风险检测系统集成到临床工作流程中可能会提高患者的安全性并优化资源分配。基于 LR 的模型表现出最佳性能:我们已成功开发出一种 "实时 "风险预测模型,每天计算并审核患者的风险评分。