Background: Acute care surgery (ACS) involves rapid, high-stakes decisions with limited opportunity for preoperative planning. While machine learning (ML) may improve risk prediction and decision making in this setting, its development, validation, and implementation in ACS remain understudied. We therefore evaluated the techniques, predictor features, and outcomes used in ML-driven risk prediction models in ACS and generated recommendations to inform future research and support clinically meaningful implementation.
Methods: A systematic review of ML-driven predictive models in ACS (emergency general surgery, surgical critical care, trauma) was conducted. Models were analyzed by predictor features, outcomes, algorithms, and performance. The best-performing models for the most commonly predicted outcome were identified.
Results: Of 52 studies, 57.7% focused on trauma populations. Most models used registry data (76.8%), fewer used electronic health records (28.8%), and only five studies performed external validation after model development. Common algorithms included logistic regression (44.2%), random forest (34.6%), and decision trees (26.9%). Mortality (59.6%), complications (30.8%), and triage/severity (15.4%) were the most frequent outcomes; patient-centered/reported outcomes were absent. Features commonly included demographics, physiologic scores, and vital signs, while imaging and intraoperative data were underused. Natural language processing was used in four studies. Model performance was typically assessed using area under the receiver operating characteristic curve (88.5%), with support vector machines demonstrating the highest performance. Machine learning models generally outperformed conventional risk scores among 11 comparative studies.
Conclusion: Machine learning-driven predictive models in ACS show promising performance but are constrained by limited methodological rigor, real-world validation, and substantial heterogeneity in features, outcomes, and algorithms, challenging systematic adoption and oversight. A grounded understanding of ACS decision making workflows and their postimplementation impact may ensure clinically relevant, seamless, and safe integration of ML-based risk prediction.
Level of evidence: Systematic Review Without Meta-analysis; Level IV.
扫码关注我们
求助内容:
应助结果提醒方式:
