{"title":"An interpretable clinical ultrasound-radiomics combined model for diagnosis of stage I cervical cancer","authors":"Xianyue Yang, Chuanfen Gao, Nian Sun, X. Qin, Xiaoling Liu, Chaoxue Zhang","doi":"10.3389/fonc.2024.1353780","DOIUrl":null,"url":null,"abstract":"The purpose of this retrospective study was to establish a combined model based on ultrasound (US)-radiomics and clinical factors to predict patients with stage I cervical cancer (CC) before surgery.A total of 209 CC patients who had cervical lesions found by transvaginal sonography (TVS) from the First Affiliated Hospital of Anhui Medical University were retrospectively reviewed, patients were divided into the training set (n = 146) and internal validation set (n = 63), and 52 CC patients from Anhui Provincial Maternity and Child Health Hospital and Nanchong Central Hospital were taken as the external validation set. The clinical independent predictors were selected by univariate and multivariate logistic regression analyses. US-radiomics features were extracted from US images. After selecting the most significant features by univariate analysis, Spearman’s correlation analysis, and the least absolute shrinkage and selection operator (LASSO) algorithm, six machine learning (ML) algorithms were used to build the radiomics model. Next, the ability of the clinical, US-radiomics, and clinical US-radiomics combined model was compared to diagnose stage I CC. Finally, the Shapley additive explanations (SHAP) method was used to explain the contribution of each feature.Long diameter of the cervical lesion (L) and squamous cell carcinoma-associated antigen (SCCa) were independent clinical predictors of stage I CC. The eXtreme Gradient Boosting (Xgboost) model performed the best among the six ML radiomics models, with area under the curve (AUC) values in the training, internal validation, and external validation sets being 0.778, 0.751, and 0.751, respectively. In the final three models, the combined model based on clinical features and rad-score showed good discriminative power, with AUC values in the training, internal validation, and external validation sets being 0.837, 0.828, and 0.839, respectively. The decision curve analysis validated the clinical utility of the combined nomogram. The SHAP algorithm illustrates the contribution of each feature in the combined model.We established an interpretable combined model to predict stage I CC. This non-invasive prediction method may be used for the preoperative identification of patients with stage I CC.","PeriodicalId":507440,"journal":{"name":"Frontiers in Oncology","volume":"120 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Oncology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fonc.2024.1353780","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The purpose of this retrospective study was to establish a combined model based on ultrasound (US)-radiomics and clinical factors to predict patients with stage I cervical cancer (CC) before surgery.A total of 209 CC patients who had cervical lesions found by transvaginal sonography (TVS) from the First Affiliated Hospital of Anhui Medical University were retrospectively reviewed, patients were divided into the training set (n = 146) and internal validation set (n = 63), and 52 CC patients from Anhui Provincial Maternity and Child Health Hospital and Nanchong Central Hospital were taken as the external validation set. The clinical independent predictors were selected by univariate and multivariate logistic regression analyses. US-radiomics features were extracted from US images. After selecting the most significant features by univariate analysis, Spearman’s correlation analysis, and the least absolute shrinkage and selection operator (LASSO) algorithm, six machine learning (ML) algorithms were used to build the radiomics model. Next, the ability of the clinical, US-radiomics, and clinical US-radiomics combined model was compared to diagnose stage I CC. Finally, the Shapley additive explanations (SHAP) method was used to explain the contribution of each feature.Long diameter of the cervical lesion (L) and squamous cell carcinoma-associated antigen (SCCa) were independent clinical predictors of stage I CC. The eXtreme Gradient Boosting (Xgboost) model performed the best among the six ML radiomics models, with area under the curve (AUC) values in the training, internal validation, and external validation sets being 0.778, 0.751, and 0.751, respectively. In the final three models, the combined model based on clinical features and rad-score showed good discriminative power, with AUC values in the training, internal validation, and external validation sets being 0.837, 0.828, and 0.839, respectively. The decision curve analysis validated the clinical utility of the combined nomogram. The SHAP algorithm illustrates the contribution of each feature in the combined model.We established an interpretable combined model to predict stage I CC. This non-invasive prediction method may be used for the preoperative identification of patients with stage I CC.
这项回顾性研究的目的是建立一个基于超声(US)放射组学和临床因素的联合模型,以预测宫颈癌(CC)I期患者的术前情况。研究回顾性分析了安徽医科大学第一附属医院经阴道超声检查(TVS)发现宫颈病变的209例CC患者,将患者分为训练集(146例)和内部验证集(63例),并将安徽省妇幼保健院和南充市中心医院的52例CC患者作为外部验证集。通过单变量和多变量逻辑回归分析筛选出临床独立预测因子。从 US 图像中提取 US 放射组学特征。通过单变量分析、斯皮尔曼相关性分析和最小绝对收缩和选择算子(LASSO)算法筛选出最重要的特征后,使用六种机器学习(ML)算法建立放射组学模型。接下来,比较了临床模型、US-放射组学模型和临床US-放射组学组合模型诊断I期CC的能力。宫颈病变长径(L)和鳞状细胞癌相关抗原(SCCa)是 I 期 CC 的独立临床预测指标。在六个ML放射组学模型中,eXtreme Gradient Boosting(Xgboost)模型表现最佳,其训练集、内部验证集和外部验证集的曲线下面积(AUC)值分别为0.778、0.751和0.751。在最后的三个模型中,基于临床特征和 rad-score 的组合模型显示出良好的判别能力,训练集、内部验证集和外部验证集的 AUC 值分别为 0.837、0.828 和 0.839。决策曲线分析验证了组合提名图的临床实用性。我们建立了一个可解释的综合模型来预测 I 期 CC。这种无创预测方法可用于术前识别 I 期 CC 患者。