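Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.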

American Journal of Roentgenology | IF 4.7 | Zone 2 (Medicine) | JCR Q1 (Radiology, Nuclear Medicine & Medical Imaging) | Pub Date: 2024-10-16 | DOI: 10.2214/AJR.24.31717
Yongxin Chen, Siyi Chen, Wenjie Tang, Qingcong Konge, Zhidan Zhong, Xiaomeng Yu, Yi Sui, Wenke Hu, Xinqing Jiang, Yuan Guo
Citations: 0

Multiparametric MRI Radiomics With Machine Learning for Differentiating HER2-Zero, -Low, and -Positive Breast Cancer: Model Development, Testing, and Interpretability Analysis.

BACKGROUND: MRI radiomics has been explored for three-tiered classification of breast cancer HER2 expression (i.e., HER2-zero, HER2-low, or HER2-positive), although understanding of how such models reach their predictions is lacking.

OBJECTIVE: To develop and test multiparametric MRI radiomics machine-learning models for differentiating three-tiered HER2 expression levels in patients with breast cancer, and to explain the contributions of model features through local and global interpretations using SHapley Additive exPlanation (SHAP) analysis.

METHODS: This retrospective study included 737 patients (mean age, 54.1±10.6 years) with breast cancer from two centers (center 1: n=578; center 2: n=159), who underwent breast MRI and had HER2 expression determined after excisional biopsy. Analysis entailed two tasks: differentiating HER2-negative (i.e., HER2-zero or HER2-low) from HER2-positive tumors (task 1), and differentiating HER2-zero from HER2-low tumors (task 2). For each task, patients from center 1 were randomly assigned in 7:3 ratio to training (task 1: n=405; task 2: n=284) or internal test (task 1: n=173; task 2: n=122) sets; those from center 2 formed an external test set (task 1: n=159; task 2: n=105). Radiomics features were extracted from early-phase dynamic contrast-enhanced images (DCE), T2-weighted images (T2WI), and DWI. For each task, a support vector machine (SVM) was used for feature selection; a multiparametric radiomics score (radscore) was computed using feature weights from SVM correlation coefficients; conventional MRI and combined models were constructed; and model performances were evaluated. SHAP analysis was used to provide local and global interpretations for model outputs.

RESULTS: In the external test set, for task 1, AUCs for the conventional MRI model, radscore, and combined model were 0.624, 0.757, and 0.762, respectively; for task 2, AUC for radscore was 0.754, and no conventional MRI model or combined model could be constructed. SHAP analysis identified early-phase DCE features as having the strongest influence for both tasks; T2WI features also had a prominent role for task 2.

CONCLUSION: The findings indicate suboptimal performance of MRI radiomics models for noninvasive characterization of HER2 expression.

CLINICAL IMPACT: The study provides an example of the use of SHAP interpretation analysis to better understand predictions of imaging-based machine learning models.
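
The workflow described in the abstract (a 7:3 random split, a weighted radiomics score, AUC evaluation, and local and global feature attributions) can be illustrated with a small standard-library sketch. This is not the authors' code. The feature names, weights, and cohort below are hypothetical and synthetic, since the abstract does not list the selected features or their coefficients. In place of an SVM and the SHAP library, the sketch uses a plain linear score and the closed-form Shapley attribution for a linear model with independent features, phi_i = w_i * (x_i - mean(x_i)).

```python
import random
from statistics import mean

# Hypothetical feature names and weights (assumption: not taken from the study).
WEIGHTS = {"dce_feature": 0.8, "t2wi_feature": 0.5, "dwi_feature": -0.3}
INTERCEPT = 0.0


def radscore(sample):
    """Linear score: intercept plus the weighted sum of feature values."""
    return INTERCEPT + sum(WEIGHTS[name] * sample[name] for name in WEIGHTS)


def split_7_3(items, seed=0):
    """Randomly assign items to training and test sets in a 7:3 ratio."""
    shuffled = items[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(0.7 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def auc(scores, labels):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def linear_shap(sample, background):
    """Local attribution for a linear score, assuming independent features:
    phi_i = w_i * (x_i - mean of x_i over the background set)."""
    return {
        name: WEIGHTS[name] * (sample[name] - mean(b[name] for b in background))
        for name in WEIGHTS
    }


def global_importance(samples, background):
    """Global view: mean absolute local attribution per feature."""
    local = [linear_shap(s, background) for s in samples]
    return {name: mean(abs(l[name]) for l in local) for name in WEIGHTS}


def make_synthetic_cohort(n=200, seed=1):
    """Synthetic (sample, label) pairs; label 1 shifts each feature's mean."""
    rng = random.Random(seed)
    cohort = []
    for _ in range(n):
        label = rng.randint(0, 1)
        sample = {
            "dce_feature": rng.gauss(1.0 * label, 1.0),
            "t2wi_feature": rng.gauss(0.5 * label, 1.0),
            "dwi_feature": rng.gauss(-0.3 * label, 1.0),
        }
        cohort.append((sample, label))
    return cohort


cohort = make_synthetic_cohort()
train, test = split_7_3(cohort)
background = [s for s, _ in train]
test_scores = [radscore(s) for s, _ in test]
test_labels = [y for _, y in test]

print("test AUC:", round(auc(test_scores, test_labels), 3))
print("local attributions:", linear_shap(test[0][0], background))
print("global importance:", global_importance([s for s, _ in test], background))
```

The local attributions for one patient sum to that patient's score minus the mean score of the background set, which is the additivity property that makes per-case explanations readable. Averaging their absolute values over many patients gives a global ranking comparable in spirit to the modality-level finding reported in RESULTS.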

Source journal: American Journal of Roentgenology
CiteScore: 12.80
Self-citation rate: 4.00%
Articles published: 920
Review time: 3 months
About the journal: Founded in 1907, the monthly American Journal of Roentgenology (AJR) is the world’s longest continuously published general radiology journal. AJR is recognized as among the specialty’s leading peer-reviewed journals and has a worldwide circulation of close to 25,000. The journal publishes clinically oriented articles across all radiology subspecialties, seeking relevance to radiologists’ daily practice. The journal publishes hundreds of articles annually with a diverse range of formats, including original research, reviews, clinical perspectives, editorials, and other short reports. The journal engages its audience through a spectrum of social media and digital communication activities.