SK-VM++: Mamba assists skip-connections for medical image segmentation

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2025-02-11 DOI:10.1016/j.bspc.2025.107646
Renkai Wu , Liuyue Pan , Pengchen Liang , Qing Chang , Xianjin Wang , Weihuan Fang
{"title":"SK-VM++: Mamba assists skip-connections for medical image segmentation","authors":"Renkai Wu ,&nbsp;Liuyue Pan ,&nbsp;Pengchen Liang ,&nbsp;Qing Chang ,&nbsp;Xianjin Wang ,&nbsp;Weihuan Fang","doi":"10.1016/j.bspc.2025.107646","DOIUrl":null,"url":null,"abstract":"<div><div>In medical automatic image segmentation engineering, the U-shaped structure is the primary key framework. And the skip-connection operation in it is an important operation for key fusion of high and low features, which is one of the highlights of the U-shaped architecture. However, the traditional U-shaped architecture usually employs direct concatenation or different variants of convolution-based module composition. The recent emergence of Mamba, based on state-space models (SSMs), has shaken up the traditional convolution and Transformers that have long been the foundational building blocks. In this study, we analyze the impact of Mamba on skip-connection operations for U-shaped architectures and propose a novel skip-connection operation (SK-VM++) combining the UNet++ framework and Mamba. Specifically, Mamba is able to refine the fusion of high and low feature information better than traditional convolution. In addition, SK-VM++ leverages the excellent property of Mamba’s concatenation, making it significantly less sensitive to changes in computational complexity and parameters caused by changes in the number of channels. In particular, the number of channels increases from 64 to 512, and the convolution-based FLOPs and parameters rise by 8.82 and 6.22 times, respectively, compared to our proposed Mamba-based skip-connection operation. In addition, comparing with the most popular nnU-Net and VM-UNet, the DSC of SK-VM++ improves by 2.01% and 1.10% on the ISIC2017 dataset, 1.59% and 9.10% on the CVC-ClinicDB dataset, 1.23% and 18.94% on the Promise12 dataset and 46.25% and 34.01% improvement on the UWF-RHS dataset. The code is available from <span><span>https://github.com/wurenkai/SK-VMPlusPlus</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"105 ","pages":"Article 107646"},"PeriodicalIF":4.9000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425001570","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

In medical automatic image segmentation engineering, the U-shaped structure is the primary key framework. And the skip-connection operation in it is an important operation for key fusion of high and low features, which is one of the highlights of the U-shaped architecture. However, the traditional U-shaped architecture usually employs direct concatenation or different variants of convolution-based module composition. The recent emergence of Mamba, based on state-space models (SSMs), has shaken up the traditional convolution and Transformers that have long been the foundational building blocks. In this study, we analyze the impact of Mamba on skip-connection operations for U-shaped architectures and propose a novel skip-connection operation (SK-VM++) combining the UNet++ framework and Mamba. Specifically, Mamba is able to refine the fusion of high and low feature information better than traditional convolution. In addition, SK-VM++ leverages the excellent property of Mamba’s concatenation, making it significantly less sensitive to changes in computational complexity and parameters caused by changes in the number of channels. In particular, the number of channels increases from 64 to 512, and the convolution-based FLOPs and parameters rise by 8.82 and 6.22 times, respectively, compared to our proposed Mamba-based skip-connection operation. In addition, comparing with the most popular nnU-Net and VM-UNet, the DSC of SK-VM++ improves by 2.01% and 1.10% on the ISIC2017 dataset, 1.59% and 9.10% on the CVC-ClinicDB dataset, 1.23% and 18.94% on the Promise12 dataset and 46.25% and 34.01% improvement on the UWF-RHS dataset. The code is available from https://github.com/wurenkai/SK-VMPlusPlus.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
sk - vm++: Mamba协助医学图像分割的跳过连接
在医学自动图像分割工程中,u型结构是主要的关键框架。其中的跳接操作是实现高低特征键融合的重要操作,是u型结构的亮点之一。然而,传统的u型架构通常采用直接连接或基于卷积的模块组合的不同变体。最近出现的基于状态空间模型(ssm)的Mamba动摇了传统的卷积和变形,它们长期以来一直是基本的构建模块。在本研究中,我们分析了Mamba对u型架构的跳过连接操作的影响,并提出了一种结合UNet++框架和Mamba的新型跳过连接操作(sk - vm++)。具体来说,Mamba能够比传统卷积更好地改进高低特征信息的融合。此外,sk - vm++利用了Mamba串联的优良特性,使其对由通道数量变化引起的计算复杂性和参数变化的敏感性大大降低。特别是,与我们提出的基于mamba的跳过连接操作相比,通道数量从64增加到512,基于卷积的FLOPs和参数分别增加了8.82倍和6.22倍。此外,与最流行的nnU-Net和VM-UNet相比,sk - vm++在ISIC2017数据集上的DSC分别提高了2.01%和1.10%,在CVC-ClinicDB数据集上提高了1.59%和9.10%,在Promise12数据集上提高了1.23%和18.94%,在UWF-RHS数据集上提高了46.25%和34.01%。该代码可从https://github.com/wurenkai/SK-VMPlusPlus获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
PulseAI: An automated machine learning-based augmentation index detector for arterial stiffness monitoring from cuff-based measurements 3D CNN-based method for automatic reorientation of 11C-acetate cardiac PET images using anchor point detection SWDL: Stratum-Wise Difference Learning with deep Laplacian pyramid for semi-supervised 3D intracranial hemorrhage segmentation Diffusion model-based medical ultrasound segmentation network in ultrasound image The application of convolutional neural networks for brain age prediction: A systematic review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1