Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models.
{"title":"Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models.","authors":"Jie Liu, Xia Li, Yanting Wang, Zhenzhen Xu, Yong Lv, Yuyao He, Lu Chen, Yiqi Feng, Guoyang Liu, Yunxiao Bai, Wanli Xie, Qingping Wu","doi":"10.1186/s12890-025-03582-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Postoperative pulmonary infection (POI) is strongly associated with a poor prognosis and has a high incidence in elderly patients undergoing major surgery. Machine learning (ML) algorithms are increasingly being used in medicine, but the predictive role of logistic regression (LR) and ML algorithms for POI in high-risk populations remains unclear.</p><p><strong>Methods: </strong>We conducted a retrospective cohort study of older adults undergoing major surgery over a period of six years. The included patients were randomly divided into training and validation sets at a ratio of 7:3. The features selected by the least absolute shrinkage and selection operator regression algorithm were used as the input variables of the ML and LR models. The random forest of multiple interpretable methods was used to interpret the ML models.</p><p><strong>Results: </strong>Of the 9481 older adults in our study, 951 developed POI. Among the different algorithms, LR performed the best with an AUC of 0.80, whereas the decision tree performed the worst with an AUC of 0.75. Furthermore, the LR model outperformed the other ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%). Despite employing four interpretable methods for RF analysis, there existed a certain degree of inconsistency in the results. Finally, to facilitate clinical application, we established a web-friendly version of the nomogram based on the LR algorithm; In addition, patients were divided into three significantly distinct risk intervals in predicting POI.</p><p><strong>Conclusions: </strong>Compared with popular ML algorithms, LR was more effective at predicting POI in older patients undergoing major surgery. The constructed nomogram could identify high-risk elderly patients and facilitate perioperative management planning.</p><p><strong>Trial registration: </strong>The study was retrospectively registered (NCT06491459).</p>","PeriodicalId":9148,"journal":{"name":"BMC Pulmonary Medicine","volume":"25 1","pages":"128"},"PeriodicalIF":2.6000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pulmonary Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12890-025-03582-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Postoperative pulmonary infection (POI) is strongly associated with a poor prognosis and has a high incidence in elderly patients undergoing major surgery. Machine learning (ML) algorithms are increasingly being used in medicine, but the predictive role of logistic regression (LR) and ML algorithms for POI in high-risk populations remains unclear.
Methods: We conducted a retrospective cohort study of older adults undergoing major surgery over a period of six years. The included patients were randomly divided into training and validation sets at a ratio of 7:3. The features selected by the least absolute shrinkage and selection operator regression algorithm were used as the input variables of the ML and LR models. The random forest of multiple interpretable methods was used to interpret the ML models.
Results: Of the 9481 older adults in our study, 951 developed POI. Among the different algorithms, LR performed the best with an AUC of 0.80, whereas the decision tree performed the worst with an AUC of 0.75. Furthermore, the LR model outperformed the other ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%). Despite employing four interpretable methods for RF analysis, there existed a certain degree of inconsistency in the results. Finally, to facilitate clinical application, we established a web-friendly version of the nomogram based on the LR algorithm; In addition, patients were divided into three significantly distinct risk intervals in predicting POI.
Conclusions: Compared with popular ML algorithms, LR was more effective at predicting POI in older patients undergoing major surgery. The constructed nomogram could identify high-risk elderly patients and facilitate perioperative management planning.
Trial registration: The study was retrospectively registered (NCT06491459).
期刊介绍:
BMC Pulmonary Medicine is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of pulmonary and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.