2MSPK-Net: A nuclei segmentation network based on multi-scale, multi-dimensional attention, and SAM prior knowledge

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2024-11-13 DOI:10.1016/j.bspc.2024.107140

Gongtao Yue , Xiaoguang Ma , Wenrui Li , Ziheng An , Chen Yang

{"title":"2MSPK-Net: A nuclei segmentation network based on multi-scale, multi-dimensional attention, and SAM prior knowledge","authors":"Gongtao Yue , Xiaoguang Ma , Wenrui Li , Ziheng An , Chen Yang","doi":"10.1016/j.bspc.2024.107140","DOIUrl":null,"url":null,"abstract":"<div><div>Refined nuclei segmentation is of great significance for diagnosing the pathological conditions of tumor tissues. Although existing encoder–decoder networks have achieved remarkable progress in nuclei segmentation tasks, practical applications still encounter obstacles, especially for challenging issues such as highly dense nuclei targets and the ambiguity of boundaries between inter-class features, resulting in unsatisfactory segmentation accuracy. In this work, a novel encoder–decoder architecture was proposed to address these issues. Specifically, we first proposed a multi-scale and multi-dimension attention module to capture the contextual dependencies between individual pixels and the overall pixels, where in cross-scale learning was achieved by fusing different scale feature information of the encoding layer. Secondly, we integrated the prior knowledge of SAM into nuclei images to enhance the network’s ability to distinguish fuzzy features. To the best of our knowledge, this was the first attempt to utilize the prior knowledge of SAM to optimize nuclei segmentation tasks. Furthermore, the network was guided to supplement missing detailed features through a reverse erasing strategy and cross-layer information flow. Comprehensive experiments illustrated that the proposed method achieved MIoU improvements of 1.26% and 0.94% on the MoNuSeg and TNBC datasets, respectively, over several SOTA methods, indicating its great potential as a backbone for cancer nuclei segmentation. Code: <span><span>https://github.com/ThirteenYue/2MSPK-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107140"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424011984","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Refined nuclei segmentation is of great significance for diagnosing the pathological conditions of tumor tissues. Although existing encoder–decoder networks have achieved remarkable progress in nuclei segmentation tasks, practical applications still encounter obstacles, especially for challenging issues such as highly dense nuclei targets and the ambiguity of boundaries between inter-class features, resulting in unsatisfactory segmentation accuracy. In this work, a novel encoder–decoder architecture was proposed to address these issues. Specifically, we first proposed a multi-scale and multi-dimension attention module to capture the contextual dependencies between individual pixels and the overall pixels, where in cross-scale learning was achieved by fusing different scale feature information of the encoding layer. Secondly, we integrated the prior knowledge of SAM into nuclei images to enhance the network’s ability to distinguish fuzzy features. To the best of our knowledge, this was the first attempt to utilize the prior knowledge of SAM to optimize nuclei segmentation tasks. Furthermore, the network was guided to supplement missing detailed features through a reverse erasing strategy and cross-layer information flow. Comprehensive experiments illustrated that the proposed method achieved MIoU improvements of 1.26% and 0.94% on the MoNuSeg and TNBC datasets, respectively, over several SOTA methods, indicating its great potential as a backbone for cancer nuclei segmentation. Code: https://github.com/ThirteenYue/2MSPK-Net.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

2MSPK-Net：基于多尺度、多维度注意力和 SAM 先验知识的细胞核分割网络

精细的细胞核分割对诊断肿瘤组织的病理状况具有重要意义。虽然现有的编码器-解码器网络在核仁分割任务中取得了显著进展，但在实际应用中仍会遇到障碍，尤其是在核仁目标高度密集、类间特征边界模糊等挑战性问题上，导致分割精度不尽人意。在这项工作中，我们提出了一种新型编码器-解码器架构来解决这些问题。具体来说，我们首先提出了一个多尺度和多维度关注模块，以捕捉单个像素和整体像素之间的上下文依赖关系，其中跨尺度学习是通过融合编码层的不同尺度特征信息来实现的。其次，我们将 SAM 的先验知识整合到核图像中，以增强网络分辨模糊特征的能力。据我们所知，这是首次尝试利用 SAM 的先验知识来优化核仁分割任务。此外，还通过反向擦除策略和跨层信息流引导网络补充缺失的细节特征。综合实验表明，在 MoNuSeg 和 TNBC 数据集上，与几种 SOTA 方法相比，所提方法的 MIoU 分别提高了 1.26% 和 0.94%，这表明它作为癌症细胞核分割骨干的巨大潜力。代码：https://github.com/ThirteenYue/2MSPK-Net。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.