Aylin Erman, Julia Ferreira, Waseem Abu Ashour, Elena Guadagno, Etienne St-Louis, Sherif Emil, Jackie Cheung, Dan Poenaru
{"title":"Machine-learning-assisted Preoperative Prediction of Pediatric Appendicitis Severity.","authors":"Aylin Erman, Julia Ferreira, Waseem Abu Ashour, Elena Guadagno, Etienne St-Louis, Sherif Emil, Jackie Cheung, Dan Poenaru","doi":"10.1016/j.jpedsurg.2024.162151","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study evaluates the effectiveness of machine learning (ML) algorithms for improving the preoperative diagnosis of acute appendicitis in children, focusing on the accurate prediction of the severity of disease.</p><p><strong>Methods: </strong>An anonymized clinical and operative dataset was retrieved from the medical records of children undergoing emergency appendectomy between 2014 and 2021. We developed an ML pipeline that pre-processed the dataset and developed algorithms to predict 5 appendicitis grades (1 - non-perforated, 2 - localized perforation, 3 - abscess, 4 - generalized peritonitis, and 5 - generalized peritonitis with abscess). Imputation strategies were used for missing values and upsampling techniques for infrequent classes. Standard classifier models were tested. The best combination of imputation strategy, class balancing technique and classification model was chosen based on validation performance. Model explainability was verified by a pediatric surgeon. Our model's performance was compared to another pediatric appendicitis severity prediction tool.</p><p><strong>Results: </strong>The study used a retrospective cohort including 1980 patients (60.6 % males, average age 10.7 years). Grade of appendicitis in the cohort was as follows: grade 1-70 %; grade 2-8 %; grade 3-7 %; grade 4-7 %; grade 5-8 %. Every combination of 6 imputation strategies, 7 class-balancing techniques, and 5 classification models was tested. The best-performing combined ML pipeline distinguished non-perforated from perforated appendicitis with 82.8 ± 0.2 % NPV and 56.4 ± 0.4 % PPV, and differentiated between severity grades with 70.1 ± 0.2 % accuracy and 0.77 ± 0.00 AUROC. The other pediatric appendicitis severity prediction tool gave an accuracy of 71.4 %, AUROC of 0.54 and NPV/PPV of 71.8/64.7.</p><p><strong>Conclusion: </strong>Prediction of appendiceal perforation outperforms prediction of the continuum of appendicitis grades. The variables our models primarily rely on to make predictions are consistent with clinical experience and the literature, suggesting that the ML models uncovered useful patterns in the dataset. Our model outperforms the other pediatric appendicitis prediction tools. The ML model developed for grade prediction is the first of this type, offering a novel approach for assessing appendicitis severity in children preoperatively. Following external validation and silent clinical testing, this ML model has the potential to enable personalized severity-based treatment of pediatric appendicitis and optimize resource allocation for its management.</p><p><strong>Level of evidence: 3: </strong></p>","PeriodicalId":16733,"journal":{"name":"Journal of pediatric surgery","volume":" ","pages":"162151"},"PeriodicalIF":2.4000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of pediatric surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jpedsurg.2024.162151","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: This study evaluates the effectiveness of machine learning (ML) algorithms for improving the preoperative diagnosis of acute appendicitis in children, focusing on the accurate prediction of the severity of disease.
Methods: An anonymized clinical and operative dataset was retrieved from the medical records of children undergoing emergency appendectomy between 2014 and 2021. We developed an ML pipeline that pre-processed the dataset and developed algorithms to predict 5 appendicitis grades (1 - non-perforated, 2 - localized perforation, 3 - abscess, 4 - generalized peritonitis, and 5 - generalized peritonitis with abscess). Imputation strategies were used for missing values and upsampling techniques for infrequent classes. Standard classifier models were tested. The best combination of imputation strategy, class balancing technique and classification model was chosen based on validation performance. Model explainability was verified by a pediatric surgeon. Our model's performance was compared to another pediatric appendicitis severity prediction tool.
Results: The study used a retrospective cohort including 1980 patients (60.6 % males, average age 10.7 years). Grade of appendicitis in the cohort was as follows: grade 1-70 %; grade 2-8 %; grade 3-7 %; grade 4-7 %; grade 5-8 %. Every combination of 6 imputation strategies, 7 class-balancing techniques, and 5 classification models was tested. The best-performing combined ML pipeline distinguished non-perforated from perforated appendicitis with 82.8 ± 0.2 % NPV and 56.4 ± 0.4 % PPV, and differentiated between severity grades with 70.1 ± 0.2 % accuracy and 0.77 ± 0.00 AUROC. The other pediatric appendicitis severity prediction tool gave an accuracy of 71.4 %, AUROC of 0.54 and NPV/PPV of 71.8/64.7.
Conclusion: Prediction of appendiceal perforation outperforms prediction of the continuum of appendicitis grades. The variables our models primarily rely on to make predictions are consistent with clinical experience and the literature, suggesting that the ML models uncovered useful patterns in the dataset. Our model outperforms the other pediatric appendicitis prediction tools. The ML model developed for grade prediction is the first of this type, offering a novel approach for assessing appendicitis severity in children preoperatively. Following external validation and silent clinical testing, this ML model has the potential to enable personalized severity-based treatment of pediatric appendicitis and optimize resource allocation for its management.
期刊介绍:
The journal presents original contributions as well as a complete international abstracts section and other special departments to provide the most current source of information and references in pediatric surgery. The journal is based on the need to improve the surgical care of infants and children, not only through advances in physiology, pathology and surgical techniques, but also by attention to the unique emotional and physical needs of the young patient.