Generalizable MRI-based Nasopharyngeal Carcinoma Delineation: Bridging Gaps across Multiple Centers and Raters with Active Learning.

Xiangde Luo, Hongqiu Wang, Jinfeng Xu, Lu Li, Yue Zhao, Yuan He, Hui Huang, Jianghong Xiao, Song Tao, Shichuan Zhang, Shaoting Zhang, Guotai Wang, Wenjun Liao
{"title":"Generalizable MRI-based Nasopharyngeal Carcinoma Delineation: Bridging Gaps across Multiple Centers and Raters with Active Learning.","authors":"Xiangde Luo, Hongqiu Wang, Jinfeng Xu, Lu Li, Yue Zhao, Yuan He, Hui Huang, Jianghong Xiao, Song Tao, Shichuan Zhang, Shaoting Zhang, Guotai Wang, Wenjun Liao","doi":"10.1016/j.ijrobp.2024.11.064","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To develop a deep learning (DL) method exploiting active learning and source-free domain adaptation for gross tumor volume (GTV) delineation in nasopharyngeal carcinoma (NPC), addressing the variability and inaccuracy when deploying segmentation models in multi-center and multi-rater settings.</p><p><strong>Materials and methods: </strong>1057 MRI scans of NPC patients from five hospitals were retrospectively collected and annotated by experts from the same medical group with consensus for multi-center adaptation evaluation. One dataset was used for model development (source domain), with the remaining four for adaptation testing (target domains). Meanwhile, another 170 NPC patients with annotations delineated by four independent experts were built for multi-rater adaptation evaluation. We evaluated the pre-trained model's migration ability to the four multi-center and four multi-rater target domains. Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95) and other metrics were used for quantitative evaluations.</p><p><strong>Results: </strong>In the adaptation of dataset5 to other datasets, our source-free active learning adaptation method only requires limited labeled target samples (only 20%) to achieve a median DSC ranging from 0.70 to 0.86 and a median HD95 ranging from 3.16mm to 7.21mm for four target centers, and 0.78 to 0.85 and 3.64mm to 6.00mm for four multi-rater datasets. For DSC, our results for three of four multi-center datasets and all multi-rater datasets showed no statistical difference compared to the fully supervised U-Net model (P-values > 0.05) and significantly surpassed comparison models for three multi-center datasets and all multi-rater datasets (P-values < 0.05). Clinical assessment showed that our method-generated delineations can be used both in multi-center and multi-rater scenarios after minor refinement (revision ratio < 10% and median time < 2 minutes).</p><p><strong>Conclusion: </strong>The proposed method effectively minimizes domain gaps and delivers encouraging performance compared with fully supervised learning models with limited labeled training samples, offering a promising and practical solution for accurate and generalizable GTV segmentation in NPC.</p>","PeriodicalId":14215,"journal":{"name":"International Journal of Radiation Oncology Biology Physics","volume":" ","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Radiation Oncology Biology Physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ijrobp.2024.11.064","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: To develop a deep learning (DL) method exploiting active learning and source-free domain adaptation for gross tumor volume (GTV) delineation in nasopharyngeal carcinoma (NPC), addressing the variability and inaccuracy when deploying segmentation models in multi-center and multi-rater settings.

Materials and methods: 1057 MRI scans of NPC patients from five hospitals were retrospectively collected and annotated by experts from the same medical group with consensus for multi-center adaptation evaluation. One dataset was used for model development (source domain), with the remaining four for adaptation testing (target domains). Meanwhile, another 170 NPC patients with annotations delineated by four independent experts were built for multi-rater adaptation evaluation. We evaluated the pre-trained model's migration ability to the four multi-center and four multi-rater target domains. Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95) and other metrics were used for quantitative evaluations.

Results: In the adaptation of dataset5 to other datasets, our source-free active learning adaptation method only requires limited labeled target samples (only 20%) to achieve a median DSC ranging from 0.70 to 0.86 and a median HD95 ranging from 3.16mm to 7.21mm for four target centers, and 0.78 to 0.85 and 3.64mm to 6.00mm for four multi-rater datasets. For DSC, our results for three of four multi-center datasets and all multi-rater datasets showed no statistical difference compared to the fully supervised U-Net model (P-values > 0.05) and significantly surpassed comparison models for three multi-center datasets and all multi-rater datasets (P-values < 0.05). Clinical assessment showed that our method-generated delineations can be used both in multi-center and multi-rater scenarios after minor refinement (revision ratio < 10% and median time < 2 minutes).

Conclusion: The proposed method effectively minimizes domain gaps and delivers encouraging performance compared with fully supervised learning models with limited labeled training samples, offering a promising and practical solution for accurate and generalizable GTV segmentation in NPC.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于核磁共振成像的鼻咽癌划定:通过主动学习缩小多个中心和评分者之间的差距
目的:开发一种利用主动学习和无源域自适应的深度学习(DL)方法,用于鼻咽癌(NPC)的肿瘤总体积(GTV)划分,解决在多中心和多评分者设置中部署分割模型时的可变性和不准确性。其中一个数据集用于模型开发(源域),其余四个数据集用于适应性测试(目标域)。同时,我们还建立了另一个 170 例鼻咽癌患者数据集,由四位独立专家进行注释,用于多方适应性评估。我们评估了预训练模型迁移到四个多中心和四个多评分者目标域的能力。我们采用了骰子相似系数(DSC)、95% Hausdorff 距离(HD95)和其他指标进行定量评估:在将数据集 5 改编为其他数据集时,我们的无源主动学习改编方法只需要有限的标注目标样本(仅 20%),就能使四个目标中心的 DSC 中值从 0.70 到 0.86 不等,HD95 中值从 3.16 毫米到 7.21 毫米不等;四个多评分者数据集的 DSC 中值从 0.78 到 0.85 不等,HD95 中值从 3.64 毫米到 6.00 毫米不等。在DSC方面,我们对四个多中心数据集中的三个数据集和所有多评分者数据集的结果显示,与完全监督的U-Net模型相比没有统计学差异(P值>0.05),而对三个多中心数据集和所有多评分者数据集的结果则明显优于比较模型(P值<0.05)。临床评估结果表明,我们的方法生成的划界在稍作改进后(修订率小于 10%,中位时间小于 2 分钟),可用于多中心和多评分者情况:结论:与标注训练样本有限的完全监督学习模型相比,所提出的方法有效地减少了领域差距,并提供了令人鼓舞的性能,为准确和可推广的鼻咽癌 GTV 分割提供了一种前景广阔的实用解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.00
自引率
7.10%
发文量
2538
审稿时长
6.6 weeks
期刊介绍: International Journal of Radiation Oncology • Biology • Physics (IJROBP), known in the field as the Red Journal, publishes original laboratory and clinical investigations related to radiation oncology, radiation biology, medical physics, and both education and health policy as it relates to the field. This journal has a particular interest in original contributions of the following types: prospective clinical trials, outcomes research, and large database interrogation. In addition, it seeks reports of high-impact innovations in single or combined modality treatment, tumor sensitization, normal tissue protection (including both precision avoidance and pharmacologic means), brachytherapy, particle irradiation, and cancer imaging. Technical advances related to dosimetry and conformal radiation treatment planning are of interest, as are basic science studies investigating tumor physiology and the molecular biology underlying cancer and normal tissue radiation response.
期刊最新文献
Docetaxel/cisplatin chemotherapy followed by pelvic radiation therapy in patients with high-risk endometrial cancer after staging surgery: a phase II study. Exploratory Evaluation of Personalized Ultra-Fractionated Stereotactic Adaptive Radiotherapy (PULSAR) with CNS-Active Drugs in Brain Metastases Treatment. Generalizable MRI-based Nasopharyngeal Carcinoma Delineation: Bridging Gaps across Multiple Centers and Raters with Active Learning. Ten-year outcomes of a phase III, multicenter, randomized controlled trial (SHIP0804) with three-month neoadjuvant androgen deprivation prior to 125I-seed transperineal prostate brachytherapy followed by nil versus nine-month adjuvant hormonal therapy in patients with intermediate-risk prostate cancer. Constructing Surrogate Lung Ventilation Maps from 4DCT-derived Subregional Respiratory Dynamics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1