{"title":"Machine learning-based prediction of 5-year survival in elderly NSCLC patients using oxidative stress markers.","authors":"Hao Chen, Jiangjiang Xu, Qiang Zhang, Pengfei Chen, Qiuxia Liu, Lianyi Guo, Bindong Xu","doi":"10.3389/fonc.2024.1482374","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Oxidative stress plays a significant role in aging and cancer, yet there is currently a lack of research utilizing machine learning models to examine the relationship between oxidative stress and prognosis in elderly non-small cell lung cancer (NSCLC) patients.</p><p><strong>Methods: </strong>This study included elderly NSCLC patients who underwent radical lung cancer resection from January 2012 to April 2018, exploring the relationship between Oxidative Stress Score (OSS) and prognosis. Machine learning techniques, including Decision Trees (DT), Random Forest (RF), and Support Vector Machine (SVM), were employed to develop predictive models for 5-year overall survival (OS).</p><p><strong>Results: </strong>The datasets consisted of 1647 patients in the training set, 705 in the internal validation set, and 516 in the external validation set. An OSS was formulated from six systemic oxidative stress biomarkers, such as albumin, total bilirubin, and blood urea nitrogen, among others. Boruta variable importance analysis identified low OSS as a key indicator of poor prognosis. The OSS was subsequently integrated into the DT, RF, and SVM models for training. These models, optimized through hyperparameter tuning on the training set, were then evaluated on the internal and external validation sets. The RF model demonstrated the highest predictive performance, with an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.794 in the internal validation set, compared to AUCs of 0.711 and 0.760 for the DT and SVM models, respectively. Similarly, in the external validation set, the RF model achieved an AUC of 0.784, outperforming the DT and SVM models, which had AUCs of 0.699 and 0.730, respectively. Calibration plots confirmed the RF model's superior calibration, followed by the SVM model, with the DT model performing the poorest.</p><p><strong>Conclusion: </strong>The OSS-based clinical prediction model, constructed using machine learning methodologies, effectively predicts the prognosis of elderly NSCLC patients post-radical surgery.</p>","PeriodicalId":12482,"journal":{"name":"Frontiers in Oncology","volume":null,"pages":null},"PeriodicalIF":3.5000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11540553/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fonc.2024.1482374","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Oxidative stress plays a significant role in aging and cancer, yet there is currently a lack of research utilizing machine learning models to examine the relationship between oxidative stress and prognosis in elderly non-small cell lung cancer (NSCLC) patients.
Methods: This study included elderly NSCLC patients who underwent radical lung cancer resection from January 2012 to April 2018, exploring the relationship between Oxidative Stress Score (OSS) and prognosis. Machine learning techniques, including Decision Trees (DT), Random Forest (RF), and Support Vector Machine (SVM), were employed to develop predictive models for 5-year overall survival (OS).
Results: The datasets consisted of 1647 patients in the training set, 705 in the internal validation set, and 516 in the external validation set. An OSS was formulated from six systemic oxidative stress biomarkers, such as albumin, total bilirubin, and blood urea nitrogen, among others. Boruta variable importance analysis identified low OSS as a key indicator of poor prognosis. The OSS was subsequently integrated into the DT, RF, and SVM models for training. These models, optimized through hyperparameter tuning on the training set, were then evaluated on the internal and external validation sets. The RF model demonstrated the highest predictive performance, with an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.794 in the internal validation set, compared to AUCs of 0.711 and 0.760 for the DT and SVM models, respectively. Similarly, in the external validation set, the RF model achieved an AUC of 0.784, outperforming the DT and SVM models, which had AUCs of 0.699 and 0.730, respectively. Calibration plots confirmed the RF model's superior calibration, followed by the SVM model, with the DT model performing the poorest.
Conclusion: The OSS-based clinical prediction model, constructed using machine learning methodologies, effectively predicts the prognosis of elderly NSCLC patients post-radical surgery.
期刊介绍:
Cancer Imaging and Diagnosis is dedicated to the publication of results from clinical and research studies applied to cancer diagnosis and treatment. The section aims to publish studies from the entire field of cancer imaging: results from routine use of clinical imaging in both radiology and nuclear medicine, results from clinical trials, experimental molecular imaging in humans and small animals, research on new contrast agents in CT, MRI, ultrasound, publication of new technical applications and processing algorithms to improve the standardization of quantitative imaging and image guided interventions for the diagnosis and treatment of cancer.