Niklas Giesa, Stefan Haufe, Mario Menk, Björn Weiß, Claudia D Spies, Sophie K Piper, Felix Balzer, Sebastian D Boie
{"title":"预测无开颅手术的非心脏手术恢复室中通过护理筛选谵妄量表评估的术后谵妄:使用机器学习方法的回顾性研究。","authors":"Niklas Giesa, Stefan Haufe, Mario Menk, Björn Weiß, Claudia D Spies, Sophie K Piper, Felix Balzer, Sebastian D Boie","doi":"10.1371/journal.pdig.0000414","DOIUrl":null,"url":null,"abstract":"<p><p>Postoperative delirium (POD) contributes to severe outcomes such as death or development of dementia. Thus, it is desirable to identify vulnerable patients in advance during the perioperative phase. Previous studies mainly investigated risk factors for delirium during hospitalization and further used a linear logistic regression (LR) approach with time-invariant data. Studies have not investigated patients' fluctuating conditions to support POD precautions. In this single-center study, we aimed to predict POD in a recovery room setting with a non-linear machine learning (ML) technique using pre-, intra-, and postoperative data. The target variable POD was defined with the Nursing Screening Delirium Scale (Nu-DESC) ≥ 1. Feature selection was conducted based on robust univariate test statistics and L1 regularization. Non-linear multi-layer perceptron (MLP) as well as tree-based models were trained and evaluated-with the receiver operating characteristics curve (AUROC), the area under precision recall curve (AUPRC), and additional metrics-against LR and published models on bootstrapped testing data. The prevalence of POD was 8.2% in a sample of 73,181 surgeries performed between 2017 and 2020. Significant univariate impact factors were the preoperative ASA status (American Society of Anesthesiologists physical status classification system), the intraoperative amount of given remifentanil, and the postoperative Aldrete score. The best model used pre-, intra-, and postoperative data. The non-linear boosted trees model achieved a mean AUROC of 0.854 and a mean AUPRC of 0.418 outperforming linear LR, well as best applied and retrained baseline models. Overall, non-linear machine learning models using data from multiple perioperative time phases were superior to traditional ones in predicting POD in the recovery room. Class imbalance was seen as a main impediment for model application in clinical practice.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 8","pages":"e0000414"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11324157/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach.\",\"authors\":\"Niklas Giesa, Stefan Haufe, Mario Menk, Björn Weiß, Claudia D Spies, Sophie K Piper, Felix Balzer, Sebastian D Boie\",\"doi\":\"10.1371/journal.pdig.0000414\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Postoperative delirium (POD) contributes to severe outcomes such as death or development of dementia. Thus, it is desirable to identify vulnerable patients in advance during the perioperative phase. Previous studies mainly investigated risk factors for delirium during hospitalization and further used a linear logistic regression (LR) approach with time-invariant data. Studies have not investigated patients' fluctuating conditions to support POD precautions. In this single-center study, we aimed to predict POD in a recovery room setting with a non-linear machine learning (ML) technique using pre-, intra-, and postoperative data. The target variable POD was defined with the Nursing Screening Delirium Scale (Nu-DESC) ≥ 1. Feature selection was conducted based on robust univariate test statistics and L1 regularization. Non-linear multi-layer perceptron (MLP) as well as tree-based models were trained and evaluated-with the receiver operating characteristics curve (AUROC), the area under precision recall curve (AUPRC), and additional metrics-against LR and published models on bootstrapped testing data. The prevalence of POD was 8.2% in a sample of 73,181 surgeries performed between 2017 and 2020. Significant univariate impact factors were the preoperative ASA status (American Society of Anesthesiologists physical status classification system), the intraoperative amount of given remifentanil, and the postoperative Aldrete score. The best model used pre-, intra-, and postoperative data. The non-linear boosted trees model achieved a mean AUROC of 0.854 and a mean AUPRC of 0.418 outperforming linear LR, well as best applied and retrained baseline models. Overall, non-linear machine learning models using data from multiple perioperative time phases were superior to traditional ones in predicting POD in the recovery room. Class imbalance was seen as a main impediment for model application in clinical practice.</p>\",\"PeriodicalId\":74465,\"journal\":{\"name\":\"PLOS digital health\",\"volume\":\"3 8\",\"pages\":\"e0000414\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11324157/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLOS digital health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pdig.0000414\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
术后谵妄(POD)会导致死亡或痴呆等严重后果。因此,最好能在围手术期提前发现易感患者。以往的研究主要调查住院期间谵妄的风险因素,并进一步使用线性逻辑回归(LR)方法和时间不变数据。研究并未调查患者的波动情况,以支持 POD 预防措施。在这项单中心研究中,我们旨在利用术前、术中和术后数据,通过非线性机器学习(ML)技术预测恢复室环境中的 POD。目标变量 POD 的定义是护理筛选谵妄量表(Nu-DESC)≥ 1。特征选择基于稳健的单变量测试统计和 L1 正则化。对非线性多层感知器(MLP)和基于树的模型进行了训练和评估--用接收者操作特性曲线(AUROC)、精确召回曲线下面积(AUPRC)和其他指标--与自引导测试数据上的LR和已发表模型进行对比。在2017年至2020年间进行的73181例手术样本中,POD的患病率为8.2%。重要的单变量影响因素是术前ASA状态(美国麻醉医师协会身体状况分类系统)、术中瑞芬太尼给药量和术后Aldrete评分。最佳模型使用了术前、术中和术后数据。非线性提升树模型的平均 AUROC 为 0.854,平均 AUPRC 为 0.418,优于线性 LR 以及最佳应用和再训练基线模型。总体而言,在预测恢复室的 POD 方面,使用围手术期多个时间阶段数据的非线性机器学习模型优于传统模型。类不平衡被认为是模型应用于临床实践的主要障碍。
Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach.
Postoperative delirium (POD) contributes to severe outcomes such as death or development of dementia. Thus, it is desirable to identify vulnerable patients in advance during the perioperative phase. Previous studies mainly investigated risk factors for delirium during hospitalization and further used a linear logistic regression (LR) approach with time-invariant data. Studies have not investigated patients' fluctuating conditions to support POD precautions. In this single-center study, we aimed to predict POD in a recovery room setting with a non-linear machine learning (ML) technique using pre-, intra-, and postoperative data. The target variable POD was defined with the Nursing Screening Delirium Scale (Nu-DESC) ≥ 1. Feature selection was conducted based on robust univariate test statistics and L1 regularization. Non-linear multi-layer perceptron (MLP) as well as tree-based models were trained and evaluated-with the receiver operating characteristics curve (AUROC), the area under precision recall curve (AUPRC), and additional metrics-against LR and published models on bootstrapped testing data. The prevalence of POD was 8.2% in a sample of 73,181 surgeries performed between 2017 and 2020. Significant univariate impact factors were the preoperative ASA status (American Society of Anesthesiologists physical status classification system), the intraoperative amount of given remifentanil, and the postoperative Aldrete score. The best model used pre-, intra-, and postoperative data. The non-linear boosted trees model achieved a mean AUROC of 0.854 and a mean AUPRC of 0.418 outperforming linear LR, well as best applied and retrained baseline models. Overall, non-linear machine learning models using data from multiple perioperative time phases were superior to traditional ones in predicting POD in the recovery room. Class imbalance was seen as a main impediment for model application in clinical practice.