{"title":"基于放射组学的可解释机器学习模型预测膀胱癌的 HER2 状态:一项多中心研究。","authors":"Zongjie Wei, Xuesong Bai, Yingjie Xv, Shao-Hao Chen, Siwen Yin, Yang Li, Fajin Lv, Mingzhao Xiao, Yongpeng Xie","doi":"10.1186/s13244-024-01840-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop a computed tomography (CT) radiomics-based interpretable machine learning (ML) model to preoperatively predict human epidermal growth factor receptor 2 (HER2) status in bladder cancer (BCa) with multicenter validation.</p><p><strong>Methods: </strong>In this retrospective study, 207 patients with pathologically confirmed BCa were enrolled and divided into the training set (n = 154) and test set (n = 53). Least absolute shrinkage and selection operator (LASSO) regression was used to identify the most discriminative features in the training set. Five radiomics-based ML models, namely logistic regression (LR), support vector machine (SVM), k-nearest neighbors (KNN), eXtreme Gradient Boosting (XGBoost) and random forest (RF), were developed. The predictive performance of established ML models was evaluated by the area under the receiver operating characteristic curve (AUC). The Shapley additive explanation (SHAP) was used to analyze the interpretability of ML models.</p><p><strong>Results: </strong>A total of 1218 radiomics features were extracted from the nephrographic phase CT images, and 11 features were filtered for constructing ML models. In the test set, the AUCs of LR, SVM, KNN, XGBoost, and RF were 0.803, 0.709, 0.679, 0.794, and 0.815, with corresponding accuracies of 71.7%, 69.8%, 60.4%, 75.5%, and 75.5%, respectively. RF was identified as the optimal classifier. SHAP analysis showed that texture features (gray level size zone matrix and gray level co-occurrence matrix) were significant predictors of HER2 status.</p><p><strong>Conclusions: </strong>The radiomics-based interpretable ML model provides a noninvasive tool to predict the HER2 status of BCa with satisfactory discriminatory performance.</p><p><strong>Critical relevance statement: </strong>An interpretable radiomics-based machine learning model can preoperatively predict HER2 status in bladder cancer, potentially aiding in the clinical decision-making process.</p><p><strong>Key points: </strong>The CT radiomics model could identify HER2 status in bladder cancer. The random forest model showed a more robust and accurate performance. The model demonstrated favorable interpretability through SHAP method.</p>","PeriodicalId":13639,"journal":{"name":"Insights into Imaging","volume":"15 1","pages":"262"},"PeriodicalIF":4.1000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519251/pdf/","citationCount":"0","resultStr":"{\"title\":\"A radiomics-based interpretable machine learning model to predict the HER2 status in bladder cancer: a multicenter study.\",\"authors\":\"Zongjie Wei, Xuesong Bai, Yingjie Xv, Shao-Hao Chen, Siwen Yin, Yang Li, Fajin Lv, Mingzhao Xiao, Yongpeng Xie\",\"doi\":\"10.1186/s13244-024-01840-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To develop a computed tomography (CT) radiomics-based interpretable machine learning (ML) model to preoperatively predict human epidermal growth factor receptor 2 (HER2) status in bladder cancer (BCa) with multicenter validation.</p><p><strong>Methods: </strong>In this retrospective study, 207 patients with pathologically confirmed BCa were enrolled and divided into the training set (n = 154) and test set (n = 53). Least absolute shrinkage and selection operator (LASSO) regression was used to identify the most discriminative features in the training set. Five radiomics-based ML models, namely logistic regression (LR), support vector machine (SVM), k-nearest neighbors (KNN), eXtreme Gradient Boosting (XGBoost) and random forest (RF), were developed. The predictive performance of established ML models was evaluated by the area under the receiver operating characteristic curve (AUC). The Shapley additive explanation (SHAP) was used to analyze the interpretability of ML models.</p><p><strong>Results: </strong>A total of 1218 radiomics features were extracted from the nephrographic phase CT images, and 11 features were filtered for constructing ML models. In the test set, the AUCs of LR, SVM, KNN, XGBoost, and RF were 0.803, 0.709, 0.679, 0.794, and 0.815, with corresponding accuracies of 71.7%, 69.8%, 60.4%, 75.5%, and 75.5%, respectively. RF was identified as the optimal classifier. SHAP analysis showed that texture features (gray level size zone matrix and gray level co-occurrence matrix) were significant predictors of HER2 status.</p><p><strong>Conclusions: </strong>The radiomics-based interpretable ML model provides a noninvasive tool to predict the HER2 status of BCa with satisfactory discriminatory performance.</p><p><strong>Critical relevance statement: </strong>An interpretable radiomics-based machine learning model can preoperatively predict HER2 status in bladder cancer, potentially aiding in the clinical decision-making process.</p><p><strong>Key points: </strong>The CT radiomics model could identify HER2 status in bladder cancer. The random forest model showed a more robust and accurate performance. The model demonstrated favorable interpretability through SHAP method.</p>\",\"PeriodicalId\":13639,\"journal\":{\"name\":\"Insights into Imaging\",\"volume\":\"15 1\",\"pages\":\"262\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11519251/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Insights into Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13244-024-01840-3\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insights into Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13244-024-01840-3","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
摘要
目的开发一种基于计算机断层扫描(CT)放射组学的可解释机器学习(ML)模型,用于术前预测膀胱癌(BCa)的人表皮生长因子受体2(HER2)状态,并进行多中心验证:在这项回顾性研究中,207 名病理确诊的 BCa 患者被纳入其中,并分为训练集(154 人)和测试集(53 人)。最小绝对收缩和选择算子(LASSO)回归用于识别训练集中最具鉴别力的特征。开发了五种基于放射组学的 ML 模型,即逻辑回归(LR)、支持向量机(SVM)、k-近邻(KNN)、极梯度提升(XGBoost)和随机森林(RF)。已建立的 ML 模型的预测性能通过接收者工作特征曲线下面积(AUC)进行评估。沙普利加法解释(SHAP)用于分析ML模型的可解释性:结果:从肾造影期 CT 图像中共提取了 1218 个放射组学特征,并筛选出 11 个特征用于构建 ML 模型。在测试集中,LR、SVM、KNN、XGBoost 和 RF 的 AUC 分别为 0.803、0.709、0.679、0.794 和 0.815,相应的准确率分别为 71.7%、69.8%、60.4%、75.5% 和 75.5%。RF 被确定为最佳分类器。SHAP分析表明,纹理特征(灰度级大小区矩阵和灰度级共现矩阵)是预测HER2状态的重要指标:基于放射组学的可解释 ML 模型为预测 BCa 的 HER2 状态提供了一种无创工具,其鉴别性能令人满意:基于放射组学的可解释机器学习模型可以在术前预测膀胱癌的 HER2 状态,从而为临床决策过程提供潜在帮助:CT放射组学模型可识别膀胱癌的HER2状态。随机森林模型表现得更稳健、更准确。该模型通过SHAP方法表现出良好的可解释性。
A radiomics-based interpretable machine learning model to predict the HER2 status in bladder cancer: a multicenter study.
Objective: To develop a computed tomography (CT) radiomics-based interpretable machine learning (ML) model to preoperatively predict human epidermal growth factor receptor 2 (HER2) status in bladder cancer (BCa) with multicenter validation.
Methods: In this retrospective study, 207 patients with pathologically confirmed BCa were enrolled and divided into the training set (n = 154) and test set (n = 53). Least absolute shrinkage and selection operator (LASSO) regression was used to identify the most discriminative features in the training set. Five radiomics-based ML models, namely logistic regression (LR), support vector machine (SVM), k-nearest neighbors (KNN), eXtreme Gradient Boosting (XGBoost) and random forest (RF), were developed. The predictive performance of established ML models was evaluated by the area under the receiver operating characteristic curve (AUC). The Shapley additive explanation (SHAP) was used to analyze the interpretability of ML models.
Results: A total of 1218 radiomics features were extracted from the nephrographic phase CT images, and 11 features were filtered for constructing ML models. In the test set, the AUCs of LR, SVM, KNN, XGBoost, and RF were 0.803, 0.709, 0.679, 0.794, and 0.815, with corresponding accuracies of 71.7%, 69.8%, 60.4%, 75.5%, and 75.5%, respectively. RF was identified as the optimal classifier. SHAP analysis showed that texture features (gray level size zone matrix and gray level co-occurrence matrix) were significant predictors of HER2 status.
Conclusions: The radiomics-based interpretable ML model provides a noninvasive tool to predict the HER2 status of BCa with satisfactory discriminatory performance.
Critical relevance statement: An interpretable radiomics-based machine learning model can preoperatively predict HER2 status in bladder cancer, potentially aiding in the clinical decision-making process.
Key points: The CT radiomics model could identify HER2 status in bladder cancer. The random forest model showed a more robust and accurate performance. The model demonstrated favorable interpretability through SHAP method.
期刊介绍:
Insights into Imaging (I³) is a peer-reviewed open access journal published under the brand SpringerOpen. All content published in the journal is freely available online to anyone, anywhere!
I³ continuously updates scientific knowledge and progress in best-practice standards in radiology through the publication of original articles and state-of-the-art reviews and opinions, along with recommendations and statements from the leading radiological societies in Europe.
Founded by the European Society of Radiology (ESR), I³ creates a platform for educational material, guidelines and recommendations, and a forum for topics of controversy.
A balanced combination of review articles, original papers, short communications from European radiological congresses and information on society matters makes I³ an indispensable source for current information in this field.
I³ is owned by the ESR, however authors retain copyright to their article according to the Creative Commons Attribution License (see Copyright and License Agreement). All articles can be read, redistributed and reused for free, as long as the author of the original work is cited properly.
The open access fees (article-processing charges) for this journal are kindly sponsored by ESR for all Members.
The journal went open access in 2012, which means that all articles published since then are freely available online.