{"title":"Application of Multilayer Information Fusion and Optimization Network Combined With Attention Mechanism in Polyp Segmentation","authors":"Jinghui Chu;Yongpeng Wang;Qi Tian;Wei Lu","doi":"10.1109/TIM.2025.3527621","DOIUrl":null,"url":null,"abstract":"Colorectal cancer (CRC) is a multifaceted disease, but it can be effectively prevented through colonoscopy for the detection of polyps. In clinical practice, the development of automatic polyp segmentation techniques for colonoscopy images can significantly enhance the efficiency and accuracy of polyp detection and help clinicians to precisely localize the polyps. However, existing segmentation methods have several obvious limitations: 1) inadequate utilization of multilevel features extracted by feature encoders; 2) ineffective aggregation of high- and low-level features; and 3) unclear delineation of polyp boundaries. To address these challenges while enhancing the clarity of polyp boundaries in segmentation, we propose a novel multilayer information fusion and optimization network (MIFONet) consisting of the following components: 1) contextual and fine feature processing (CFFP) module, employed to effectively extract both local and global contextual information; 2) hierarchical feature integration module (HFIM), added to facilitate efficient aggregation of processed high- and low-level features and strengthen the association between contextual features; 3) multiscale contextual attention (MSCA) module, used to deeply integrate aggregated high-level features with low-level features; and 4) a novel refinement module composed of an adaptive channel attention pyramid (ACAP) part and a skip-reverse attention (SRA) part, with the ability to capture fine-grained information and refining feature representation. We conducted extensive experiments and comparative analysis of our proposed model with 19 popular or state-of-the-art (SOTA) methods on five renowned polyp benchmark datasets. To further validate the model’s generalization performance, we also designed three cross-dataset experiments. Experimental results demonstrate that MIFONet consistently achieves excellent segmentation performance across most datasets. In particular, we achieve 94.6% mean Dice on the CVC-ClinicDB dataset, which obtains superior performance compared with SOTA methods.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-15"},"PeriodicalIF":5.6000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10835248/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Colorectal cancer (CRC) is a multifaceted disease, but it can be effectively prevented through colonoscopy for the detection of polyps. In clinical practice, the development of automatic polyp segmentation techniques for colonoscopy images can significantly enhance the efficiency and accuracy of polyp detection and help clinicians to precisely localize the polyps. However, existing segmentation methods have several obvious limitations: 1) inadequate utilization of multilevel features extracted by feature encoders; 2) ineffective aggregation of high- and low-level features; and 3) unclear delineation of polyp boundaries. To address these challenges while enhancing the clarity of polyp boundaries in segmentation, we propose a novel multilayer information fusion and optimization network (MIFONet) consisting of the following components: 1) contextual and fine feature processing (CFFP) module, employed to effectively extract both local and global contextual information; 2) hierarchical feature integration module (HFIM), added to facilitate efficient aggregation of processed high- and low-level features and strengthen the association between contextual features; 3) multiscale contextual attention (MSCA) module, used to deeply integrate aggregated high-level features with low-level features; and 4) a novel refinement module composed of an adaptive channel attention pyramid (ACAP) part and a skip-reverse attention (SRA) part, with the ability to capture fine-grained information and refining feature representation. We conducted extensive experiments and comparative analysis of our proposed model with 19 popular or state-of-the-art (SOTA) methods on five renowned polyp benchmark datasets. To further validate the model’s generalization performance, we also designed three cross-dataset experiments. Experimental results demonstrate that MIFONet consistently achieves excellent segmentation performance across most datasets. In particular, we achieve 94.6% mean Dice on the CVC-ClinicDB dataset, which obtains superior performance compared with SOTA methods.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.