Jiawei Yan , Hongqing Zhu , Tong Hou , Ning Chen , Weiping Lu , Ying Wang , Bingcang Huang
{"title":"MBDA-Net: Multi-source boundary-aware prototype alignment domain adaptation for polyp segmentation","authors":"Jiawei Yan , Hongqing Zhu , Tong Hou , Ning Chen , Weiping Lu , Ying Wang , Bingcang Huang","doi":"10.1016/j.bspc.2024.106664","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate segmentation of polyps in colonoscopy images is important for the prevention and treatment of colorectal cancer. However, samples collected from different centers often possess diverse distributions, leading to poor generalization when a segmentation model trained in one center is directly employed in another. This paper proposes a multi-source boundary-aware prototype alignment domain adaptation network (MBDA-Net) to improve the performance of cross-center colonoscopy image segmentation. Specifically, we first design an image translation (IT) module based on the discrete cosine transform (DCT) to reduce the distribution gap between source and target domains by translating source domain styles into target domain styles. Then we propose a mutual perception prototype alignment (MPPA) module containing prototype inference, prototype interaction and adaptive mutual feature fusion. By learning relationships between prototypes and features among multiple domains, one can obtain mutual perception features that fuse prototype information from multiple domains. In order to fully exploit the supervised information of the source domains and optimize the prediction boundaries, we develop a boundary-aware learning (BAL) module to align the boundaries of the source domain predictions and ground truths. Moreover, to mitigate the foreground–background imbalance present in small-sized polyp images and reduce the biased predictions of the model, this study proposes a double normalization strategy (DNS) during the inference stage to improve the detection rate of small polyps. Experimental results on three challenging public datasets show that the proposed MBDA-Net outperforms existing methods on cross-center colonoscopy image segmentation, achieving state-of-the-art performance.</p></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"96 ","pages":"Article 106664"},"PeriodicalIF":4.9000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424007225","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate segmentation of polyps in colonoscopy images is important for the prevention and treatment of colorectal cancer. However, samples collected from different centers often possess diverse distributions, leading to poor generalization when a segmentation model trained in one center is directly employed in another. This paper proposes a multi-source boundary-aware prototype alignment domain adaptation network (MBDA-Net) to improve the performance of cross-center colonoscopy image segmentation. Specifically, we first design an image translation (IT) module based on the discrete cosine transform (DCT) to reduce the distribution gap between source and target domains by translating source domain styles into target domain styles. Then we propose a mutual perception prototype alignment (MPPA) module containing prototype inference, prototype interaction and adaptive mutual feature fusion. By learning relationships between prototypes and features among multiple domains, one can obtain mutual perception features that fuse prototype information from multiple domains. In order to fully exploit the supervised information of the source domains and optimize the prediction boundaries, we develop a boundary-aware learning (BAL) module to align the boundaries of the source domain predictions and ground truths. Moreover, to mitigate the foreground–background imbalance present in small-sized polyp images and reduce the biased predictions of the model, this study proposes a double normalization strategy (DNS) during the inference stage to improve the detection rate of small polyps. Experimental results on three challenging public datasets show that the proposed MBDA-Net outperforms existing methods on cross-center colonoscopy image segmentation, achieving state-of-the-art performance.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.