Stitching, Fine-Tuning, and Re-Training: A SAM-Enabled Framework for Semi-Supervised 3D Medical Image Segmentation

Shumeng Li;Lei Qi;Qian Yu;Jing Huo;Yinghuan Shi;Yang Gao
{"title":"Stitching, Fine-Tuning, and Re-Training: A SAM-Enabled Framework for Semi-Supervised 3D Medical Image Segmentation","authors":"Shumeng Li;Lei Qi;Qian Yu;Jing Huo;Yinghuan Shi;Yang Gao","doi":"10.1109/TMI.2025.3532084","DOIUrl":null,"url":null,"abstract":"Segment Anything Model (SAM) fine-tuning has shown remarkable performance in medical image segmentation in a fully supervised manner, but requires precise annotations. To reduce the annotation cost and maintain satisfactory performance, in this work, we leverage the capabilities of SAM for establishing semi-supervised medical image segmentation models. Rethinking the requirements of effectiveness, efficiency, and compatibility, we propose a three-stage framework, i.e., Stitching, Fine-tuning, and Re-training (SFR). The current fine-tuning approaches mostly involve 2D slice-wise fine-tuning that disregards the contextual information between adjacent slices. Our stitching strategy mitigates the mismatch between natural and 3D medical images. The stitched images are then used for fine-tuning SAM, providing robust initialization of pseudo-labels. Afterwards, we train a 3D semi-supervised segmentation model while maintaining the same parameter size as the conventional segmenter such as V-Net. Our SFR framework is plug-and-play, and easily compatible with various popular semi-supervised methods. We also develop an extended framework SFR+ with selective fine-tuning and re-training through confidence estimation. Extensive experiments validate that our SFR and SFR+ achieve significant improvements in both moderate annotation and scarce annotation across five datasets. In particular, SFR framework improves the Dice score of Mean Teacher from 29.68% to 74.40% with only one labeled data of LA dataset. 
The code is available at <uri>https://github.com/ShumengLI/SFR</uri>.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 10","pages":"3909-3923"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10847777/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Segment Anything Model (SAM) fine-tuning has shown remarkable performance in fully supervised medical image segmentation, but requires precise annotations. To reduce the annotation cost while maintaining satisfactory performance, in this work we leverage the capabilities of SAM to build semi-supervised medical image segmentation models. Rethinking the requirements of effectiveness, efficiency, and compatibility, we propose a three-stage framework, i.e., Stitching, Fine-tuning, and Re-training (SFR). Current fine-tuning approaches mostly perform 2D slice-wise fine-tuning, which disregards the contextual information between adjacent slices. Our stitching strategy mitigates the mismatch between natural and 3D medical images. The stitched images are then used for fine-tuning SAM, providing a robust initialization of pseudo-labels. Afterwards, we train a 3D semi-supervised segmentation model while maintaining the same parameter size as a conventional segmenter such as V-Net. Our SFR framework is plug-and-play and easily compatible with various popular semi-supervised methods. We also develop an extended framework, SFR+, with selective fine-tuning and re-training through confidence estimation. Extensive experiments validate that SFR and SFR+ achieve significant improvements in both moderate- and scarce-annotation settings across five datasets. In particular, the SFR framework improves the Dice score of Mean Teacher from 29.68% to 74.40% with only a single labeled scan from the LA dataset. The code is available at https://github.com/ShumengLI/SFR.
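The stitching idea described above can be illustrated with a minimal sketch (a hypothetical helper, not the authors' released code): tile an n×n grid of adjacent axial slices from a 3D volume into one larger 2D image, so that its spatial extent better matches the natural images SAM was pre-trained on while preserving inter-slice context within a single input.

```python
import numpy as np

def stitch_slices(volume: np.ndarray, start: int, grid: int = 2) -> np.ndarray:
    """Tile grid*grid adjacent axial slices of a 3D volume (D, H, W)
    into a single 2D image of shape (grid*H, grid*W)."""
    d, h, w = volume.shape
    assert start + grid * grid <= d, "not enough slices to fill the grid"
    rows = []
    for r in range(grid):
        # place grid consecutive slices side by side in one row
        row = [volume[start + r * grid + c] for c in range(grid)]
        rows.append(np.concatenate(row, axis=1))
    # stack the rows vertically to form the stitched image
    return np.concatenate(rows, axis=0)

# Example: four 64x64 slices of a 16-slice volume -> one 128x128 stitched image
vol = np.random.rand(16, 64, 64).astype(np.float32)
img = stitch_slices(vol, start=0, grid=2)
print(img.shape)  # (128, 128)
```

The stitched image (and its prompt/mask, handled analogously) would then be fed to the SAM fine-tuning stage; the grid size and slice-selection policy here are illustrative assumptions, not values taken from the paper.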