A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.

IF 2.6 4区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Medical & Biological Engineering & Computing Pub Date : 2025-07-01 Epub Date: 2025-02-07 DOI:10.1007/s11517-025-03314-0

Sultan Daud Khan, Saleh Basalamah, Ahmed Lbath

{"title":"A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.","authors":"Sultan Daud Khan, Saleh Basalamah, Ahmed Lbath","doi":"10.1007/s11517-025-03314-0","DOIUrl":null,"url":null,"abstract":"<p><p>Retinal diseases are a serious global threat to human vision, and early identification is essential for effective prevention and treatment. However, current diagnostic methods rely on manual analysis of fundus images, which heavily depends on the expertise of ophthalmologists. This manual process is time-consuming and labor-intensive and can sometimes lead to missed diagnoses. With advancements in computer vision technology, several automated models have been proposed to improve diagnostic accuracy for retinal diseases and medical imaging in general. However, these methods face challenges in accurately detecting specific diseases within images due to inherent issues associated with fundus images, including inter-class similarities, intra-class variations, limited local information, insufficient contextual understanding, and class imbalances within datasets. To address these challenges, we propose a novel deep learning framework for accurate retinal disease classification. This framework is designed to achieve high accuracy in identifying various retinal diseases while overcoming inherent challenges associated with fundus images. Generally, the framework consists of three main modules. The first module is Densely Connected Multidilated Convolution Neural Network (DCM-CNN) that extracts global contextual information by effectively integrating novel Casual Dilated Dense Convolutional Blocks (CDDCBs). The second module of the framework, namely, Local-Patch-based Convolution Neural Network (LP-CNN), utilizes Class Activation Map (CAM) (obtained from DCM-CNN) to extract local and fine-grained information. To identify the correct class and minimize the error, a synergic network is utilized that takes the feature maps of both DCM-CNN and LP-CNN and connects both maps in a fully connected fashion to identify the correct class and minimize the errors. The framework is evaluated through a comprehensive set of experiments, both quantitatively and qualitatively, using two publicly available benchmark datasets: RFMiD and ODIR-5K. Our experimental results demonstrate the effectiveness of the proposed framework and achieves higher performance on RFMiD and ODIR-5K datasets compared to reference methods.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"2029-2046"},"PeriodicalIF":2.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03314-0","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/7 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Retinal diseases are a serious global threat to human vision, and early identification is essential for effective prevention and treatment. However, current diagnostic methods rely on manual analysis of fundus images, which heavily depends on the expertise of ophthalmologists. This manual process is time-consuming and labor-intensive and can sometimes lead to missed diagnoses. With advancements in computer vision technology, several automated models have been proposed to improve diagnostic accuracy for retinal diseases and medical imaging in general. However, these methods face challenges in accurately detecting specific diseases within images due to inherent issues associated with fundus images, including inter-class similarities, intra-class variations, limited local information, insufficient contextual understanding, and class imbalances within datasets. To address these challenges, we propose a novel deep learning framework for accurate retinal disease classification. This framework is designed to achieve high accuracy in identifying various retinal diseases while overcoming inherent challenges associated with fundus images. Generally, the framework consists of three main modules. The first module is Densely Connected Multidilated Convolution Neural Network (DCM-CNN) that extracts global contextual information by effectively integrating novel Casual Dilated Dense Convolutional Blocks (CDDCBs). The second module of the framework, namely, Local-Patch-based Convolution Neural Network (LP-CNN), utilizes Class Activation Map (CAM) (obtained from DCM-CNN) to extract local and fine-grained information. To identify the correct class and minimize the error, a synergic network is utilized that takes the feature maps of both DCM-CNN and LP-CNN and connects both maps in a fully connected fashion to identify the correct class and minimize the errors. The framework is evaluated through a comprehensive set of experiments, both quantitatively and qualitatively, using two publicly available benchmark datasets: RFMiD and ODIR-5K. Our experimental results demonstrate the effectiveness of the proposed framework and achieves higher performance on RFMiD and ODIR-5K datasets compared to reference methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用视网膜图像的上下文和局部特征线索进行视网膜疾病检测的新型深度学习框架。

视网膜疾病是严重威胁人类视力的全球性疾病，早期识别对于有效预防和治疗至关重要。然而，目前的诊断方法依赖于眼底图像的人工分析，这在很大程度上依赖于眼科医生的专业知识。这种手工过程耗时耗力，有时还会导致漏诊。随着计算机视觉技术的进步，人们提出了几种自动化模型来提高视网膜疾病和医学成像的诊断准确性。然而，由于眼底图像的固有问题，这些方法在准确检测图像中的特定疾病方面面临挑战，包括类间相似性、类内变化、有限的局部信息、上下文理解不足以及数据集中的类不平衡。为了解决这些挑战，我们提出了一种新的深度学习框架，用于准确的视网膜疾病分类。该框架旨在实现识别各种视网膜疾病的高精度，同时克服与眼底图像相关的固有挑战。一般来说，该框架由三个主要模块组成。第一个模块是密集连接的多重扩展卷积神经网络（DCM-CNN），它通过有效整合新颖的随机扩展密集卷积块（CDDCBs）来提取全局上下文信息。该框架的第二个模块，即local - patch -based Convolution Neural Network (LP-CNN)，利用类激活图（Class Activation Map， CAM）（从DCM-CNN获得）提取局部和细粒度信息。为了识别正确的类并使误差最小化，我们使用了一个协同网络，该网络同时使用DCM-CNN和LP-CNN的特征图，并以全连接的方式连接这两个图，以识别正确的类并使误差最小化。通过使用两个公开可用的基准数据集：RFMiD和ODIR-5K，通过一组全面的定量和定性实验对该框架进行评估。实验结果证明了该框架的有效性，与参考方法相比，该框架在RFMiD和ODIR-5K数据集上取得了更高的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Medical & Biological Engineering & Computing 医学-工程：生物医学

CiteScore

6.00

自引率

3.10%

发文量

249

审稿时长

3.5 months

期刊介绍： Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging. MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field. MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).