Dual-branch channel attention enhancement feature fusion network for diabetic retinopathy segmentation

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2025-08-01 Epub Date: 2025-03-05 DOI:10.1016/j.bspc.2025.107721

Lei Ma, Ziqian Liu, Qihang Xu, Hanyu Hong, Lei Wang, Ying Zhu, Yu Shi

{"title":"Dual-branch channel attention enhancement feature fusion network for diabetic retinopathy segmentation","authors":"Lei Ma, Ziqian Liu, Qihang Xu, Hanyu Hong, Lei Wang, Ying Zhu, Yu Shi","doi":"10.1016/j.bspc.2025.107721","DOIUrl":null,"url":null,"abstract":"<div><div>Diabetic retinopathy (DR) is an eye disease caused by diabetes that leads to impaired vision and even blindness. DR segmentation technology can assist ophthalmologists with early diagnosis, which can help to prevent the progression of this disease. However, DR segmentation is a challenging task because of the large variation in scale, high inter-class similarity, complex structures, blurred edges and different brightness contrasts of different kinds of lesions. Most existing methods tend not to adequately extract the semantic information in the channels of lesion features, which is a critical element for effectively distinguishing lesion edges. In this paper, we propose a dual-branch channel attention enhancement feature fusion network that integrates CNN and Transformer for DR segmentation. First, we introduce a Channel Crossing Attention Module (CCAM) into the U-Net framework to eliminate semantic inconsistencies between the encoder and decoder for better integration of contextual information. Moreover, we leverage Transformer’s robust global information acquisition capabilities to acquire long-range information, and further enhance the contextual information. Finally, we build a Dual-branch Channel Attention Enhancement Fusion Module (DCAE) to enhance the semantic information of the channels in both branches, which improves the discriminability of the blurred edges of lesions. Compared with the state-of-the-art methods, our method improved mAUPR, mDice, and mIOU by 1.36%, 1.85%, and 2.20% on the IDRiD dataset, and by 4.62%, 0.20%, and 2.60% on the DDR dataset, respectively. The experimental results show that the multi-scale semantic features of the two branches are effectively fused, which achieves accurate lesion segmentation.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"106 ","pages":"Article 107721"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425002320","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Diabetic retinopathy (DR) is an eye disease caused by diabetes that leads to impaired vision and even blindness. DR segmentation technology can assist ophthalmologists with early diagnosis, which can help to prevent the progression of this disease. However, DR segmentation is a challenging task because of the large variation in scale, high inter-class similarity, complex structures, blurred edges and different brightness contrasts of different kinds of lesions. Most existing methods tend not to adequately extract the semantic information in the channels of lesion features, which is a critical element for effectively distinguishing lesion edges. In this paper, we propose a dual-branch channel attention enhancement feature fusion network that integrates CNN and Transformer for DR segmentation. First, we introduce a Channel Crossing Attention Module (CCAM) into the U-Net framework to eliminate semantic inconsistencies between the encoder and decoder for better integration of contextual information. Moreover, we leverage Transformer’s robust global information acquisition capabilities to acquire long-range information, and further enhance the contextual information. Finally, we build a Dual-branch Channel Attention Enhancement Fusion Module (DCAE) to enhance the semantic information of the channels in both branches, which improves the discriminability of the blurred edges of lesions. Compared with the state-of-the-art methods, our method improved mAUPR, mDice, and mIOU by 1.36%, 1.85%, and 2.20% on the IDRiD dataset, and by 4.62%, 0.20%, and 2.60% on the DDR dataset, respectively. The experimental results show that the multi-scale semantic features of the two branches are effectively fused, which achieves accurate lesion segmentation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

双分支通道注意力增强特征融合网络用于糖尿病视网膜病变分割

糖尿病视网膜病变（DR）是一种由糖尿病引起的眼病，会导致视力受损甚至失明。DR分割技术可以帮助眼科医生进行早期诊断，有助于预防疾病的发展。然而，由于不同类型病变的尺度差异大、类间相似性高、结构复杂、边缘模糊以及亮度对比不同，DR分割是一项具有挑战性的任务。现有的方法往往不能充分提取病灶特征通道中的语义信息，而语义信息是有效识别病灶边缘的关键因素。本文提出了一种融合CNN和Transformer的双支路注意力增强特征融合网络，用于DR分割。首先，我们在U-Net框架中引入了信道交叉注意模块（CCAM），以消除编码器和解码器之间的语义不一致，从而更好地集成上下文信息。此外，我们利用Transformer健壮的全局信息获取能力来获取远程信息，并进一步增强上下文信息。最后，构建双分支通道注意力增强融合模块（Dual-branch Channel Attention Enhancement Fusion Module， DCAE），增强两个分支通道的语义信息，提高病灶模糊边缘的可分辨性。与现有方法相比，该方法在IDRiD数据集上的mAUPR、mdevice和mIOU分别提高了1.36%、1.85%和2.20%，在DDR数据集上的mAUPR、mdevice和mIOU分别提高了4.62%、0.20%和2.60%。实验结果表明，两分支的多尺度语义特征有效融合，实现了准确的病灶分割。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.