2MSPK-Net: A nuclei segmentation network based on multi-scale, multi-dimensional attention, and SAM prior knowledge

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2024-11-13 DOI:10.1016/j.bspc.2024.107140
Gongtao Yue , Xiaoguang Ma , Wenrui Li , Ziheng An , Chen Yang
{"title":"2MSPK-Net: A nuclei segmentation network based on multi-scale, multi-dimensional attention, and SAM prior knowledge","authors":"Gongtao Yue ,&nbsp;Xiaoguang Ma ,&nbsp;Wenrui Li ,&nbsp;Ziheng An ,&nbsp;Chen Yang","doi":"10.1016/j.bspc.2024.107140","DOIUrl":null,"url":null,"abstract":"<div><div>Refined nuclei segmentation is of great significance for diagnosing the pathological conditions of tumor tissues. Although existing encoder–decoder networks have achieved remarkable progress in nuclei segmentation tasks, practical applications still encounter obstacles, especially for challenging issues such as highly dense nuclei targets and the ambiguity of boundaries between inter-class features, resulting in unsatisfactory segmentation accuracy. In this work, a novel encoder–decoder architecture was proposed to address these issues. Specifically, we first proposed a multi-scale and multi-dimension attention module to capture the contextual dependencies between individual pixels and the overall pixels, where in cross-scale learning was achieved by fusing different scale feature information of the encoding layer. Secondly, we integrated the prior knowledge of SAM into nuclei images to enhance the network’s ability to distinguish fuzzy features. To the best of our knowledge, this was the first attempt to utilize the prior knowledge of SAM to optimize nuclei segmentation tasks. Furthermore, the network was guided to supplement missing detailed features through a reverse erasing strategy and cross-layer information flow. Comprehensive experiments illustrated that the proposed method achieved MIoU improvements of 1.26% and 0.94% on the MoNuSeg and TNBC datasets, respectively, over several SOTA methods, indicating its great potential as a backbone for cancer nuclei segmentation. Code: <span><span>https://github.com/ThirteenYue/2MSPK-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107140"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424011984","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Refined nuclei segmentation is of great significance for diagnosing the pathological conditions of tumor tissues. Although existing encoder–decoder networks have achieved remarkable progress in nuclei segmentation tasks, practical applications still encounter obstacles, especially for challenging issues such as highly dense nuclei targets and the ambiguity of boundaries between inter-class features, resulting in unsatisfactory segmentation accuracy. In this work, a novel encoder–decoder architecture was proposed to address these issues. Specifically, we first proposed a multi-scale and multi-dimension attention module to capture the contextual dependencies between individual pixels and the overall pixels, where in cross-scale learning was achieved by fusing different scale feature information of the encoding layer. Secondly, we integrated the prior knowledge of SAM into nuclei images to enhance the network’s ability to distinguish fuzzy features. To the best of our knowledge, this was the first attempt to utilize the prior knowledge of SAM to optimize nuclei segmentation tasks. Furthermore, the network was guided to supplement missing detailed features through a reverse erasing strategy and cross-layer information flow. Comprehensive experiments illustrated that the proposed method achieved MIoU improvements of 1.26% and 0.94% on the MoNuSeg and TNBC datasets, respectively, over several SOTA methods, indicating its great potential as a backbone for cancer nuclei segmentation. Code: https://github.com/ThirteenYue/2MSPK-Net.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
2MSPK-Net:基于多尺度、多维度注意力和 SAM 先验知识的细胞核分割网络
精细的细胞核分割对诊断肿瘤组织的病理状况具有重要意义。虽然现有的编码器-解码器网络在核仁分割任务中取得了显著进展,但在实际应用中仍会遇到障碍,尤其是在核仁目标高度密集、类间特征边界模糊等挑战性问题上,导致分割精度不尽人意。在这项工作中,我们提出了一种新型编码器-解码器架构来解决这些问题。具体来说,我们首先提出了一个多尺度和多维度关注模块,以捕捉单个像素和整体像素之间的上下文依赖关系,其中跨尺度学习是通过融合编码层的不同尺度特征信息来实现的。其次,我们将 SAM 的先验知识整合到核图像中,以增强网络分辨模糊特征的能力。据我们所知,这是首次尝试利用 SAM 的先验知识来优化核仁分割任务。此外,还通过反向擦除策略和跨层信息流引导网络补充缺失的细节特征。综合实验表明,在 MoNuSeg 和 TNBC 数据集上,与几种 SOTA 方法相比,所提方法的 MIoU 分别提高了 1.26% 和 0.94%,这表明它作为癌症细胞核分割骨干的巨大潜力。代码:https://github.com/ThirteenYue/2MSPK-Net。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
Innovative brain tumor detection: Stacked random support vector-based hybrid gazelle coati algorithm A novel optimized machine learning approach with texture rectified cross-attention based transformer for COVID-19 detection A lightweight model for the retinal disease classification using optical coherence tomography An improved ECG data compression scheme based on ensemble empirical mode decomposition Performance evaluation of optimal ensemble learning approaches with PCA and LDA-based feature extraction for heart disease prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1