Performance of machine learning models in predicting difficult laryngoscopy in the emergency department: a single-centre retrospective study comparing with conventional regression method.

IF 2.3 3区 医学 Q1 EMERGENCY MEDICINE BMC Emergency Medicine Pub Date : 2025-02-21 DOI:10.1186/s12873-025-01185-0
Winchana Srivilaithon, Pichamon Thanasarnpaiboon
{"title":"Performance of machine learning models in predicting difficult laryngoscopy in the emergency department: a single-centre retrospective study comparing with conventional regression method.","authors":"Winchana Srivilaithon, Pichamon Thanasarnpaiboon","doi":"10.1186/s12873-025-01185-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Emergency endotracheal intubation is a critical skill for managing airway emergencies in the emergency department (ED). Accurate prediction of difficult laryngoscopy is essential for improving first-attempt success, minimizing complications, optimizing resource utilization, and enhancing patient outcomes. Traditional methods, such as the LEMON criteria, have limited predictive accuracy. Machine learning (ML) offers advanced predictive capabilities by analyzing large datasets and identifying complex variable interactions. This study aimed to develop and validate the performance of ML models for predicting difficult laryngoscopy in the ED, comparing it with a conventional regression model.</p><p><strong>Methods: </strong>A retrospective cohort study was conducted on 4,370 adult patients who underwent intubation in the ED at Thammasat University Hospital. Difficult laryngoscopy was defined as a Cormack-Lehane grade III or IV. Patients were divided into development (training, 70%) and validation (testing, 30%) cohorts. Predictors of difficult laryngoscopy were identified using multivariable stepwise backward elimination logistic regression and were used to develop ML models, including Logistic Regression, Decision Tree, Random Forest, and XGBoost. Model performance was evaluated using the area under the receiver operating characteristic curve (AuROC), accuracy, precision, recall, and F1-score. Validation was performed on the validation cohort to confirm model accuracy.</p><p><strong>Results: </strong>Nine significant predictors were identified: male sex, trauma, absence of neuromuscular blocking agents, large incisors, large tongue, limited mouth opening, short thyrohyoid distance, obstructed airway, and poor neck mobility. The Random Forest model demonstrated the highest predictive performance, with an AuROC of 0.82 (95% CI: 0.78-0.85), accuracy of 0.89, recall of 0.89, and F1-score of 0.87, outperforming conventional regression (AuROC 0.76, 95% CI: 0.73-0.78) and other ML models. DeLong's test confirmed a statistically significant difference in AuROC between the two models (p = 0.002). The Decision Tree showed limited performance due to overfitting, while XGBoost demonstrated strong precision. No significant differences were found when comparing the two models with conventional regression (p = 0.498 and 0.496, respectively).</p><p><strong>Conclusion: </strong>The Random Forest model provides the most robust prediction of difficult laryngoscopy, outperforming both conventional and other ML methods. While ML models improve predictive accuracy, logistic regression remains a practical option in resource-limited settings. Integrating ML into clinical workflows could enhance decision-making, resource allocation, and patient safety in emergency airway management. Future research should prioritize external validation and real-world implementation.</p>","PeriodicalId":9002,"journal":{"name":"BMC Emergency Medicine","volume":"25 1","pages":"28"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846364/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Emergency Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12873-025-01185-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Emergency endotracheal intubation is a critical skill for managing airway emergencies in the emergency department (ED). Accurate prediction of difficult laryngoscopy is essential for improving first-attempt success, minimizing complications, optimizing resource utilization, and enhancing patient outcomes. Traditional methods, such as the LEMON criteria, have limited predictive accuracy. Machine learning (ML) offers advanced predictive capabilities by analyzing large datasets and identifying complex variable interactions. This study aimed to develop and validate the performance of ML models for predicting difficult laryngoscopy in the ED, comparing it with a conventional regression model.

Methods: A retrospective cohort study was conducted on 4,370 adult patients who underwent intubation in the ED at Thammasat University Hospital. Difficult laryngoscopy was defined as a Cormack-Lehane grade III or IV. Patients were divided into development (training, 70%) and validation (testing, 30%) cohorts. Predictors of difficult laryngoscopy were identified using multivariable stepwise backward elimination logistic regression and were used to develop ML models, including Logistic Regression, Decision Tree, Random Forest, and XGBoost. Model performance was evaluated using the area under the receiver operating characteristic curve (AuROC), accuracy, precision, recall, and F1-score. Validation was performed on the validation cohort to confirm model accuracy.

Results: Nine significant predictors were identified: male sex, trauma, absence of neuromuscular blocking agents, large incisors, large tongue, limited mouth opening, short thyrohyoid distance, obstructed airway, and poor neck mobility. The Random Forest model demonstrated the highest predictive performance, with an AuROC of 0.82 (95% CI: 0.78-0.85), accuracy of 0.89, recall of 0.89, and F1-score of 0.87, outperforming conventional regression (AuROC 0.76, 95% CI: 0.73-0.78) and other ML models. DeLong's test confirmed a statistically significant difference in AuROC between the two models (p = 0.002). The Decision Tree showed limited performance due to overfitting, while XGBoost demonstrated strong precision. No significant differences were found when comparing the two models with conventional regression (p = 0.498 and 0.496, respectively).

Conclusion: The Random Forest model provides the most robust prediction of difficult laryngoscopy, outperforming both conventional and other ML methods. While ML models improve predictive accuracy, logistic regression remains a practical option in resource-limited settings. Integrating ML into clinical workflows could enhance decision-making, resource allocation, and patient safety in emergency airway management. Future research should prioritize external validation and real-world implementation.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习模型在预测急诊科喉镜困难中的表现:一项与传统回归方法比较的单中心回顾性研究
背景:紧急气管插管是急诊科(ED)处理气道紧急情况的一项关键技能。准确预测困难喉镜检查对于提高首次尝试成功率、减少并发症、优化资源利用和提高患者预后至关重要。传统的方法,如柠檬标准,具有有限的预测准确性。机器学习(ML)通过分析大型数据集和识别复杂的变量相互作用提供了先进的预测能力。本研究旨在开发和验证ML模型在预测急诊科困难喉镜检查中的性能,并将其与传统的回归模型进行比较。方法:对4,370名在法政大学医院急诊科插管的成年患者进行回顾性队列研究。困难喉镜检查定义为Cormack-Lehane III级或IV级。患者分为发展(培训,70%)和验证(测试,30%)队列。使用多变量逐步反向消除逻辑回归确定困难喉镜的预测因子,并用于开发ML模型,包括逻辑回归,决策树,随机森林和XGBoost。使用受试者工作特征曲线下面积(AuROC)、准确度、精密度、召回率和f1评分来评估模型的性能。对验证队列进行验证以确认模型的准确性。结果:确定了9个显著的预测因素:男性、外伤、缺乏神经肌肉阻断剂、大门牙、大舌头、张嘴受限、甲状舌骨距离短、气道阻塞和颈部活动能力差。随机森林模型表现出最高的预测性能,AuROC为0.82 (95% CI: 0.78-0.85),准确率为0.89,召回率为0.89,f1得分为0.87,优于传统回归(AuROC 0.76, 95% CI: 0.73-0.78)和其他ML模型。DeLong检验证实两种模型的AuROC差异有统计学意义(p = 0.002)。由于过度拟合,决策树表现出有限的性能,而XGBoost表现出很强的精度。两种模型与常规回归比较无显著差异(p分别= 0.498和0.496)。结论:随机森林模型提供了最稳健的困难喉镜预测,优于传统和其他ML方法。虽然ML模型提高了预测准确性,但在资源有限的情况下,逻辑回归仍然是一个实用的选择。将机器学习整合到临床工作流程中可以提高急诊气道管理的决策、资源分配和患者安全。未来的研究应该优先考虑外部验证和现实世界的实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Emergency Medicine
BMC Emergency Medicine Medicine-Emergency Medicine
CiteScore
3.50
自引率
8.00%
发文量
178
审稿时长
29 weeks
期刊介绍: BMC Emergency Medicine is an open access, peer-reviewed journal that considers articles on all urgent and emergency aspects of medicine, in both practice and basic research. In addition, the journal covers aspects of disaster medicine and medicine in special locations, such as conflict areas and military medicine, together with articles concerning healthcare services in the emergency departments.
期刊最新文献
Comprehensive geriatric assessment measures and subsequent EMS-transported emergency department use in adults aged ≥ 80 years: a retrospective cohort study. Delirium in older adults in the emergency department: a scoping review of current research and identified knowledge gaps. Physiotherapy in Nigerian emergency departments: a national survey of physiotherapists' perceptions of practice. Lived experiences of prehospital care among road traffic accident survivors in Rwanda: a qualitative descriptive study. Generational differences in perceptions and acceptance of violence in emergency rooms: a cross-sectional study in Thailand.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1