{"title":"Segment anything model for few-shot medical image segmentation with domain tuning","authors":"Weili Shi, Penglong Zhang, Yuqin Li, Zhengang Jiang","doi":"10.1007/s40747-024-01625-7","DOIUrl":null,"url":null,"abstract":"<p>Medical image segmentation constitutes a crucial step in the analysis of medical images, possessing extensive applications and research significance within the realm of medical research and practice. Convolutional neural network achieved great success in medical image segmentation. However, acquiring large labeled datasets remains unattainable due to the substantial expertise and time required for image labeling, as well as heightened patient privacy concerns. To solve scarce medical image data, we propose a powerful network Domain Tuning SAM for Medical images (DT-SAM). We construct an encoder utilizing a parameter-effective fine-tuning strategy and SAM. This strategy selectively updates a small fraction of the weight increments while preserving the majority of the pre-training weights in the SAM encoder, consequently reducing the required number of training samples. Meanwhile, our approach leverages only SAM encoder structure while incorporating a decoder similar to U-Net decoder structure and redesigning skip connections to concatenate encoder-extracted features, which effectively decode the features extracted by the encoder and preserve edge information. We have conducted comprehensive experiments on three publicly available medical image segmentation datasets. The combined experimental results show that our method can effectively perform few shot medical image segmentation. With just one labeled data, achieving a Dice score of 63.51%, a HD of 17.94 and an IoU score of 73.55% on Heart Task, on Prostate Task, an average Dice score of 46.01%, a HD of 10.25 and an IoU score of 65.92% were achieved, and the Dice, HD, and IoU score reaching 88.67%, 10.63, and 90.19% on BUSI. Remarkably, with few training samples, our method consistently outperforms various based on SAM and CNN.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-024-01625-7","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Abstract
Medical image segmentation is a crucial step in the analysis of medical images, with extensive applications and research significance in medical research and practice. Convolutional neural networks have achieved great success in medical image segmentation. However, acquiring large labeled datasets remains difficult due to the substantial expertise and time required for image labeling, as well as heightened patient privacy concerns. To address the scarcity of labeled medical image data, we propose a powerful network, Domain Tuning SAM for Medical images (DT-SAM). We construct an encoder from SAM using a parameter-efficient fine-tuning strategy. This strategy selectively updates a small fraction of weight increments while preserving the majority of the pre-trained weights in the SAM encoder, consequently reducing the number of training samples required. Meanwhile, our approach retains only the SAM encoder structure while incorporating a decoder similar to the U-Net decoder and redesigning the skip connections to concatenate encoder-extracted features, which effectively decodes the features extracted by the encoder and preserves edge information. We have conducted comprehensive experiments on three publicly available medical image segmentation datasets. The combined experimental results show that our method can effectively perform few-shot medical image segmentation. With just one labeled sample, it achieves a Dice score of 63.51%, an HD of 17.94, and an IoU of 73.55% on the Heart task; an average Dice of 46.01%, an HD of 10.25, and an IoU of 65.92% on the Prostate task; and Dice, HD, and IoU of 88.67%, 10.63, and 90.19% on BUSI. Remarkably, with few training samples, our method consistently outperforms various SAM-based and CNN-based methods.
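To make the two mechanisms in the abstract concrete, the sketch below illustrates (1) a parameter-efficient fine-tuning layer that trains only low-rank weight increments while the pre-trained encoder weights stay frozen, and (2) a U-Net-style decoder block whose skip connection concatenates encoder-extracted features. This is a minimal PyTorch sketch under those assumptions, not the paper's actual DT-SAM implementation; class names such as `LowRankIncrement` and `DecoderBlock`, the rank, and all shapes are illustrative.

```python
# Minimal sketch of the two ideas described in the abstract (assumed LoRA-style
# weight increments + U-Net-style skip concatenation); names and shapes are
# illustrative, not the authors' actual DT-SAM code.
import torch
import torch.nn as nn


class LowRankIncrement(nn.Module):
    """Wraps a frozen pre-trained linear layer and adds a trainable low-rank
    weight increment, so only a small fraction of parameters is updated."""

    def __init__(self, frozen_linear: nn.Linear, rank: int = 4):
        super().__init__()
        self.frozen = frozen_linear
        for p in self.frozen.parameters():
            p.requires_grad = False  # keep the pre-trained SAM weights fixed
        in_f, out_f = frozen_linear.in_features, frozen_linear.out_features
        self.down = nn.Linear(in_f, rank, bias=False)  # project down to rank r
        self.up = nn.Linear(rank, out_f, bias=False)   # project back up
        nn.init.zeros_(self.up.weight)                 # increment starts at zero

    def forward(self, x):
        # pre-trained output plus the learned low-rank increment
        return self.frozen(x) + self.up(self.down(x))


class DecoderBlock(nn.Module):
    """Upsamples decoder features and concatenates the matching encoder
    features via a skip connection, U-Net style, to preserve edge detail."""

    def __init__(self, in_ch: int, skip_ch: int, out_ch: int):
        super().__init__()
        self.up = nn.ConvTranspose2d(in_ch, out_ch, kernel_size=2, stride=2)
        self.conv = nn.Sequential(
            nn.Conv2d(out_ch + skip_ch, out_ch, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )

    def forward(self, x, skip):
        x = self.up(x)
        x = torch.cat([x, skip], dim=1)  # fuse encoder features
        return self.conv(x)


if __name__ == "__main__":
    # Toy shapes only; the real SAM encoder produces transformer tokens.
    layer = LowRankIncrement(nn.Linear(256, 256), rank=4)
    tokens = torch.randn(2, 196, 256)
    print(layer(tokens).shape)       # torch.Size([2, 196, 256])

    block = DecoderBlock(in_ch=128, skip_ch=64, out_ch=64)
    deep = torch.randn(2, 128, 16, 16)
    skip = torch.randn(2, 64, 32, 32)
    print(block(deep, skip).shape)   # torch.Size([2, 64, 32, 32])
```

Because the increment matrices are the only trainable parameters in such a layer, an optimizer configured over `requires_grad` parameters touches only a small fraction of the encoder's weights, which is what makes training feasible with very few labeled samples.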
Journal description:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.