Xingguo Zhu , Wei Wang , Chen Zhang , Haifeng Wang
{"title":"Polyp-Mamba: A Hybrid Multi-Frequency Perception Gated Selection Network for polyp segmentation","authors":"Xingguo Zhu , Wei Wang , Chen Zhang , Haifeng Wang","doi":"10.1016/j.inffus.2024.102759","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate segmentation of polyps in the colorectal region is crucial for medical diagnosis and the localization of polyp areas. However, challenges arise from blurred boundaries due to the similarity between polyp edges and surrounding tissues, variable polyp morphology, and speckle noise. To address these challenges, we propose a Hybrid Multi-Frequency Perception Gated Selection Network (Polyp-Mamba) for precise polyp segmentation. First, we design a dual multi-frequency fusion encoder that employs Mamba and ResNet to quickly and effectively learn global and local features in polyp images. Specifically, we incorporate a novel Hybrid Multi-Frequency Fusion Module (HMFM) within the encoder, using discrete cosine transform to analyze features from multiple spectral perspectives. This approach mitigates the issue of blurred polyp boundaries caused by their similarity to surrounding tissues, effectively integrating local and global features. Additionally, we construct a Gated Selection Decoder to suppress irrelevant feature regions in the encoder and introduce deep supervision to guide decoder features to align closely with the labels. We conduct extensive experiments using five commonly used polyp test datasets. Comparisons with 14 state-of-the-art segmentation methods demonstrate that our approach surpasses traditional methods in sensitivity to different polyp images, robustness to variations in polyp size and shape, speckle noise, and distribution similarity between surrounding tissues and polyps. Overall, our method achieves superior mDice scores on five polyp test datasets compared to state-of-the-art methods, indicating better performance in polyp segmentation.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"115 ","pages":"Article 102759"},"PeriodicalIF":14.7000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524005372","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate segmentation of polyps in the colorectal region is crucial for medical diagnosis and the localization of polyp areas. However, challenges arise from blurred boundaries due to the similarity between polyp edges and surrounding tissues, variable polyp morphology, and speckle noise. To address these challenges, we propose a Hybrid Multi-Frequency Perception Gated Selection Network (Polyp-Mamba) for precise polyp segmentation. First, we design a dual multi-frequency fusion encoder that employs Mamba and ResNet to quickly and effectively learn global and local features in polyp images. Specifically, we incorporate a novel Hybrid Multi-Frequency Fusion Module (HMFM) within the encoder, using discrete cosine transform to analyze features from multiple spectral perspectives. This approach mitigates the issue of blurred polyp boundaries caused by their similarity to surrounding tissues, effectively integrating local and global features. Additionally, we construct a Gated Selection Decoder to suppress irrelevant feature regions in the encoder and introduce deep supervision to guide decoder features to align closely with the labels. We conduct extensive experiments using five commonly used polyp test datasets. Comparisons with 14 state-of-the-art segmentation methods demonstrate that our approach surpasses traditional methods in sensitivity to different polyp images, robustness to variations in polyp size and shape, speckle noise, and distribution similarity between surrounding tissues and polyps. Overall, our method achieves superior mDice scores on five polyp test datasets compared to state-of-the-art methods, indicating better performance in polyp segmentation.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.