Performance of machine learning models in predicting difficult laryngoscopy in the emergency department: a single-centre retrospective study comparing with conventional regression method.

IF 2.3 3区医学 Q1 EMERGENCY MEDICINE BMC Emergency Medicine Pub Date : 2025-02-21 DOI:10.1186/s12873-025-01185-0

Winchana Srivilaithon, Pichamon Thanasarnpaiboon

{"title":"Performance of machine learning models in predicting difficult laryngoscopy in the emergency department: a single-centre retrospective study comparing with conventional regression method.","authors":"Winchana Srivilaithon, Pichamon Thanasarnpaiboon","doi":"10.1186/s12873-025-01185-0","DOIUrl":null,"url":null,"abstract":"Background: Emergency endotracheal intubation is a critical skill for managing airway emergencies in the emergency department (ED). Accurate prediction of difficult laryngoscopy is essential for improving first-attempt success, minimizing complications, optimizing resource utilization, and enhancing patient outcomes. Traditional methods, such as the LEMON criteria, have limited predictive accuracy. Machine learning (ML) offers advanced predictive capabilities by analyzing large datasets and identifying complex variable interactions. This study aimed to develop and validate the performance of ML models for predicting difficult laryngoscopy in the ED, comparing it with a conventional regression model.Methods: A retrospective cohort study was conducted on 4,370 adult patients who underwent intubation in the ED at Thammasat University Hospital. Difficult laryngoscopy was defined as a Cormack-Lehane grade III or IV. Patients were divided into development (training, 70%) and validation (testing, 30%) cohorts. Predictors of difficult laryngoscopy were identified using multivariable stepwise backward elimination logistic regression and were used to develop ML models, including Logistic Regression, Decision Tree, Random Forest, and XGBoost. Model performance was evaluated using the area under the receiver operating characteristic curve (AuROC), accuracy, precision, recall, and F1-score. Validation was performed on the validation cohort to confirm model accuracy.Results: Nine significant predictors were identified: male sex, trauma, absence of neuromuscular blocking agents, large incisors, large tongue, limited mouth opening, short thyrohyoid distance, obstructed airway, and poor neck mobility. The Random Forest model demonstrated the highest predictive performance, with an AuROC of 0.82 (95% CI: 0.78-0.85), accuracy of 0.89, recall of 0.89, and F1-score of 0.87, outperforming conventional regression (AuROC 0.76, 95% CI: 0.73-0.78) and other ML models. DeLong's test confirmed a statistically significant difference in AuROC between the two models (p = 0.002). The Decision Tree showed limited performance due to overfitting, while XGBoost demonstrated strong precision. No significant differences were found when comparing the two models with conventional regression (p = 0.498 and 0.496, respectively).Conclusion: The Random Forest model provides the most robust prediction of difficult laryngoscopy, outperforming both conventional and other ML methods. While ML models improve predictive accuracy, logistic regression remains a practical option in resource-limited settings. Integrating ML into clinical workflows could enhance decision-making, resource allocation, and patient safety in emergency airway management. Future research should prioritize external validation and real-world implementation.","PeriodicalId":9002,"journal":{"name":"BMC Emergency Medicine","volume":"25 1","pages":"28"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846364/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Emergency Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12873-025-01185-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Emergency endotracheal intubation is a critical skill for managing airway emergencies in the emergency department (ED). Accurate prediction of difficult laryngoscopy is essential for improving first-attempt success, minimizing complications, optimizing resource utilization, and enhancing patient outcomes. Traditional methods, such as the LEMON criteria, have limited predictive accuracy. Machine learning (ML) offers advanced predictive capabilities by analyzing large datasets and identifying complex variable interactions. This study aimed to develop and validate the performance of ML models for predicting difficult laryngoscopy in the ED, comparing it with a conventional regression model.

Methods: A retrospective cohort study was conducted on 4,370 adult patients who underwent intubation in the ED at Thammasat University Hospital. Difficult laryngoscopy was defined as a Cormack-Lehane grade III or IV. Patients were divided into development (training, 70%) and validation (testing, 30%) cohorts. Predictors of difficult laryngoscopy were identified using multivariable stepwise backward elimination logistic regression and were used to develop ML models, including Logistic Regression, Decision Tree, Random Forest, and XGBoost. Model performance was evaluated using the area under the receiver operating characteristic curve (AuROC), accuracy, precision, recall, and F1-score. Validation was performed on the validation cohort to confirm model accuracy.

Results: Nine significant predictors were identified: male sex, trauma, absence of neuromuscular blocking agents, large incisors, large tongue, limited mouth opening, short thyrohyoid distance, obstructed airway, and poor neck mobility. The Random Forest model demonstrated the highest predictive performance, with an AuROC of 0.82 (95% CI: 0.78-0.85), accuracy of 0.89, recall of 0.89, and F1-score of 0.87, outperforming conventional regression (AuROC 0.76, 95% CI: 0.73-0.78) and other ML models. DeLong's test confirmed a statistically significant difference in AuROC between the two models (p = 0.002). The Decision Tree showed limited performance due to overfitting, while XGBoost demonstrated strong precision. No significant differences were found when comparing the two models with conventional regression (p = 0.498 and 0.496, respectively).

Conclusion: The Random Forest model provides the most robust prediction of difficult laryngoscopy, outperforming both conventional and other ML methods. While ML models improve predictive accuracy, logistic regression remains a practical option in resource-limited settings. Integrating ML into clinical workflows could enhance decision-making, resource allocation, and patient safety in emergency airway management. Future research should prioritize external validation and real-world implementation.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

机器学习模型在预测急诊科喉镜困难中的表现：一项与传统回归方法比较的单中心回顾性研究

背景：紧急气管插管是急诊科（ED）处理气道紧急情况的一项关键技能。准确预测困难喉镜检查对于提高首次尝试成功率、减少并发症、优化资源利用和提高患者预后至关重要。传统的方法，如柠檬标准，具有有限的预测准确性。机器学习（ML）通过分析大型数据集和识别复杂的变量相互作用提供了先进的预测能力。本研究旨在开发和验证ML模型在预测急诊科困难喉镜检查中的性能，并将其与传统的回归模型进行比较。方法：对4,370名在法政大学医院急诊科插管的成年患者进行回顾性队列研究。困难喉镜检查定义为Cormack-Lehane III级或IV级。患者分为发展（培训，70%）和验证（测试，30%）队列。使用多变量逐步反向消除逻辑回归确定困难喉镜的预测因子，并用于开发ML模型，包括逻辑回归，决策树，随机森林和XGBoost。使用受试者工作特征曲线下面积（AuROC）、准确度、精密度、召回率和f1评分来评估模型的性能。对验证队列进行验证以确认模型的准确性。结果：确定了9个显著的预测因素：男性、外伤、缺乏神经肌肉阻断剂、大门牙、大舌头、张嘴受限、甲状舌骨距离短、气道阻塞和颈部活动能力差。随机森林模型表现出最高的预测性能，AuROC为0.82 (95% CI: 0.78-0.85)，准确率为0.89，召回率为0.89，f1得分为0.87，优于传统回归（AuROC 0.76, 95% CI: 0.73-0.78）和其他ML模型。DeLong检验证实两种模型的AuROC差异有统计学意义（p = 0.002）。由于过度拟合，决策树表现出有限的性能，而XGBoost表现出很强的精度。两种模型与常规回归比较无显著差异（p分别= 0.498和0.496）。结论：随机森林模型提供了最稳健的困难喉镜预测，优于传统和其他ML方法。虽然ML模型提高了预测准确性，但在资源有限的情况下，逻辑回归仍然是一个实用的选择。将机器学习整合到临床工作流程中可以提高急诊气道管理的决策、资源分配和患者安全。未来的研究应该优先考虑外部验证和现实世界的实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMC Emergency Medicine Medicine-Emergency Medicine

CiteScore

3.50

自引率

8.00%

发文量

178

审稿时长

29 weeks

期刊介绍： BMC Emergency Medicine is an open access, peer-reviewed journal that considers articles on all urgent and emergency aspects of medicine, in both practice and basic research. In addition, the journal covers aspects of disaster medicine and medicine in special locations, such as conflict areas and military medicine, together with articles concerning healthcare services in the emergency departments.