Khaled ELKarazle, Valliappan Raman, Caslon Chua, Patrick Then
{"title":"SCABNet: A Novel Polyp Segmentation Network With Spatial-Gradient Attention and Channel Prioritization","authors":"Khaled ELKarazle, Valliappan Raman, Caslon Chua, Patrick Then","doi":"10.1002/ima.70039","DOIUrl":null,"url":null,"abstract":"<p>Current colorectal polyps detection methods often struggle with efficiency and boundary precision, especially when dealing with polyps of complex shapes and sizes. Traditional techniques may fail to precisely define the boundaries of these polyps, leading to suboptimal detection rates. Furthermore, flat and small polyps often blend into the background due to their low contrast against the mucosal wall, making them even more challenging to detect. To address these challenges, we introduce SCABNet, a novel deep learning architecture for the efficient detection of colorectal polyps. SCABNet employs an encoder-decoder structure with three novel blocks: the Feature Enhancement Block (FEB), the Channel Prioritization Block (CPB), and the Spatial-Gradient Boundary Attention Block (SGBAB). The FEB applies dilation and spatial attention to high-level features, enhancing their discriminative power and improving the model's ability to capture complex patterns. The CPB, an efficient alternative to traditional channel attention blocks, assigns prioritization weights to diverse feature channels. The SGBAB replaces conventional boundary attention mechanisms with a more efficient solution that focuses on the spatial attention of the feature map. It employs a Jacobian-based approach to construct learned convolutions on both vertical and horizontal components of the feature map. This allows the SGBAB to effectively understand the changes in the feature map across different spatial locations, which is crucial for detecting the boundaries of complex-shaped polyps. These blocks are strategically embedded within the network's skip connections, enhancing the model's boundary detection capabilities without imposing excessive computational demands. They exploit and enhance features at three levels: high, mid, and low, thereby ensuring the detection of a wide range of polyps. SCABNet has been trained on the Kvasir-SEG and CVC-ClinicDB datasets and evaluated on multiple datasets, demonstrating superior results. The code is available on: https://github.com/KhaledELKarazle97/SCABNet.</p>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ima.70039","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70039","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Current colorectal polyps detection methods often struggle with efficiency and boundary precision, especially when dealing with polyps of complex shapes and sizes. Traditional techniques may fail to precisely define the boundaries of these polyps, leading to suboptimal detection rates. Furthermore, flat and small polyps often blend into the background due to their low contrast against the mucosal wall, making them even more challenging to detect. To address these challenges, we introduce SCABNet, a novel deep learning architecture for the efficient detection of colorectal polyps. SCABNet employs an encoder-decoder structure with three novel blocks: the Feature Enhancement Block (FEB), the Channel Prioritization Block (CPB), and the Spatial-Gradient Boundary Attention Block (SGBAB). The FEB applies dilation and spatial attention to high-level features, enhancing their discriminative power and improving the model's ability to capture complex patterns. The CPB, an efficient alternative to traditional channel attention blocks, assigns prioritization weights to diverse feature channels. The SGBAB replaces conventional boundary attention mechanisms with a more efficient solution that focuses on the spatial attention of the feature map. It employs a Jacobian-based approach to construct learned convolutions on both vertical and horizontal components of the feature map. This allows the SGBAB to effectively understand the changes in the feature map across different spatial locations, which is crucial for detecting the boundaries of complex-shaped polyps. These blocks are strategically embedded within the network's skip connections, enhancing the model's boundary detection capabilities without imposing excessive computational demands. They exploit and enhance features at three levels: high, mid, and low, thereby ensuring the detection of a wide range of polyps. SCABNet has been trained on the Kvasir-SEG and CVC-ClinicDB datasets and evaluated on multiple datasets, demonstrating superior results. The code is available on: https://github.com/KhaledELKarazle97/SCABNet.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.