Linxia Wu, Lei Chen, Lijie Zhang, Yiming Liu, Die Ouyang, Wenlong Wu, Yu Lei, Ping Han, Huangxuan Zhao, Chuansheng Zheng
{"title":"A Machine Learning Model for Predicting Prognosis in HCC Patients With Diabetes After TACE.","authors":"Linxia Wu, Lei Chen, Lijie Zhang, Yiming Liu, Die Ouyang, Wenlong Wu, Yu Lei, Ping Han, Huangxuan Zhao, Chuansheng Zheng","doi":"10.2147/JHC.S496481","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Type II diabetes mellitus (T2DM) has been found to increase the mortality of patients with hepatocellular carcinoma (HCC). Therefore, this study aimed to establish and validate a machine learning-based explainable prediction model of prognosis in patients with HCC and T2DM undergoing transarterial chemoembolization (TACE).</p><p><strong>Patients and methods: </strong>The prediction model was developed using data from the derivation cohort comprising patients from three medical centers, followed by external validation utilizing patient data extracted from another center. Further, five predictive models were employed to establish prognosis models for 1-, 2-, and 3-year survival, respectively. Prediction performance was assessed by the receiver operating characteristic (ROC), calibration, and decision curve analysis curves. Lastly, the SHapley Additive exPlanations (SHAP) method was used to interpret the final ML model.</p><p><strong>Results: </strong>A total of 636 patients were included. Thirteen variables were selected for the model development. The final random survival forest (RSF) model exhibited excellent performance in the internal validation cohort, with areas under the ROC curve (AUCs) of 0.824, 0.853, and 0.810 in the 1-, 2-, and 3-year survival groups, respectively. This model also demonstrated remarkable discrimination in the external validation cohort, achieving AUCs of 0.862, 0.815, and 0.798 in the 1-, 2-, and 3-year survival groups, respectively. SHAP summary plots were also created to interpret the RSF model.</p><p><strong>Conclusion: </strong>An RSF model with good predictive performance was developed by incorporating simple parameters. This prognostic prediction model may assist physicians in early clinical intervention and improve prognosis outcomes in patients with HCC and comorbid T2DM after TACE.</p>","PeriodicalId":15906,"journal":{"name":"Journal of Hepatocellular Carcinoma","volume":"12 ","pages":"77-91"},"PeriodicalIF":4.2000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11762020/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hepatocellular Carcinoma","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JHC.S496481","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Type II diabetes mellitus (T2DM) has been found to increase the mortality of patients with hepatocellular carcinoma (HCC). Therefore, this study aimed to establish and validate a machine learning-based explainable prediction model of prognosis in patients with HCC and T2DM undergoing transarterial chemoembolization (TACE).
Patients and methods: The prediction model was developed using data from the derivation cohort comprising patients from three medical centers, followed by external validation utilizing patient data extracted from another center. Further, five predictive models were employed to establish prognosis models for 1-, 2-, and 3-year survival, respectively. Prediction performance was assessed by the receiver operating characteristic (ROC), calibration, and decision curve analysis curves. Lastly, the SHapley Additive exPlanations (SHAP) method was used to interpret the final ML model.
Results: A total of 636 patients were included. Thirteen variables were selected for the model development. The final random survival forest (RSF) model exhibited excellent performance in the internal validation cohort, with areas under the ROC curve (AUCs) of 0.824, 0.853, and 0.810 in the 1-, 2-, and 3-year survival groups, respectively. This model also demonstrated remarkable discrimination in the external validation cohort, achieving AUCs of 0.862, 0.815, and 0.798 in the 1-, 2-, and 3-year survival groups, respectively. SHAP summary plots were also created to interpret the RSF model.
Conclusion: An RSF model with good predictive performance was developed by incorporating simple parameters. This prognostic prediction model may assist physicians in early clinical intervention and improve prognosis outcomes in patients with HCC and comorbid T2DM after TACE.