Deep Learning-Based Automated Detection and Grading of Papilledema From OCT Images: A Promising Approach for Improved Clinical Diagnosis and Management
Ahmed M. Salaheldin, Manal Abdel Wahed, Manar Talaat, Neven Saleh
{"title":"Deep Learning-Based Automated Detection and Grading of Papilledema From OCT Images: A Promising Approach for Improved Clinical Diagnosis and Management","authors":"Ahmed M. Salaheldin, Manal Abdel Wahed, Manar Talaat, Neven Saleh","doi":"10.1002/ima.23133","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Papilledema is a prevalent neuro-ophthalmic condition characterized by optic disk swelling. It is known to pose a significant risk of vision loss in its advanced stages. To address the pressing need for accurate detection and grading of papilledema, this study introduces a novel approach utilizing optical coherence tomography (OCT) scans. A cascaded model that combines four transfer learning models—SqueezeNet, AlexNet, GoogleNet, and ResNet-50—for both the detection and grading phases was proposed. Additionally, a specialized convolutional neural network (CNN) model is meticulously designed to cater specifically to the complexities of papilledema analysis. Unlike the fundus camera-based models, this study integrates deep learning models for the diagnosis of papilledema from OCT scans. A new dataset of OCT scans was collected to ensure a comprehensive evaluation of the models. It encompasses a wide range of papilledema, pseudopapilledema, and normal cases. This dataset serves as a valuable resource for training and testing of the proposed models. In addition, two validation strategies have been adopted to ensure the model's generalizability and robustness. Furthermore, it enhances the model's accuracy and reliability. The results are highly promising; remarkable accuracy rates have been achieved. Specifically, the SqueezeNet, AlexNet, GoogleNet, ResNet-50, and customized CNN models achieved accuracy levels of 98.44%, 98.50%, 98.28%, 98.30%, and 96.26%, respectively, for the handout validation strategy. These findings not only demonstrate the efficacy of using deep learning in papilledema detection and grading but also establish the superiority of the proposed models when compared with other relevant studies. By addressing the challenges associated with papilledema, the study significantly contributes to the advancement of neuro-ophthalmic diagnostics. The accurate and efficient detection of papilledema from OCT scans holds immense potential for guiding timely interventions and preserving patients' visual health.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"34 4","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.23133","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Papilledema is a prevalent neuro-ophthalmic condition characterized by optic disk swelling. It is known to pose a significant risk of vision loss in its advanced stages. To address the pressing need for accurate detection and grading of papilledema, this study introduces a novel approach utilizing optical coherence tomography (OCT) scans. A cascaded model that combines four transfer learning models—SqueezeNet, AlexNet, GoogleNet, and ResNet-50—for both the detection and grading phases was proposed. Additionally, a specialized convolutional neural network (CNN) model is meticulously designed to cater specifically to the complexities of papilledema analysis. Unlike the fundus camera-based models, this study integrates deep learning models for the diagnosis of papilledema from OCT scans. A new dataset of OCT scans was collected to ensure a comprehensive evaluation of the models. It encompasses a wide range of papilledema, pseudopapilledema, and normal cases. This dataset serves as a valuable resource for training and testing of the proposed models. In addition, two validation strategies have been adopted to ensure the model's generalizability and robustness. Furthermore, it enhances the model's accuracy and reliability. The results are highly promising; remarkable accuracy rates have been achieved. Specifically, the SqueezeNet, AlexNet, GoogleNet, ResNet-50, and customized CNN models achieved accuracy levels of 98.44%, 98.50%, 98.28%, 98.30%, and 96.26%, respectively, for the handout validation strategy. These findings not only demonstrate the efficacy of using deep learning in papilledema detection and grading but also establish the superiority of the proposed models when compared with other relevant studies. By addressing the challenges associated with papilledema, the study significantly contributes to the advancement of neuro-ophthalmic diagnostics. The accurate and efficient detection of papilledema from OCT scans holds immense potential for guiding timely interventions and preserving patients' visual health.
视乳头水肿是一种以视盘肿胀为特征的常见神经眼科疾病。众所周知,晚期乳头水肿会导致视力丧失。为了满足准确检测和分级乳头水肿的迫切需要,本研究引入了一种利用光学相干断层扫描(OCT)的新方法。研究提出了一种级联模型,该模型结合了四种迁移学习模型--SqueezeNet、AlexNet、GoogleNet 和 ResNet-50,用于检测和分级阶段。此外,还精心设计了一个专门的卷积神经网络(CNN)模型,以专门应对乳头水肿分析的复杂性。与基于眼底照相机的模型不同,本研究整合了深度学习模型,用于通过 OCT 扫描诊断乳头水肿。为了确保对模型进行全面评估,我们收集了一个新的 OCT 扫描数据集。该数据集涵盖了广泛的乳头水肿、假性乳头水肿和正常病例。该数据集是训练和测试建议模型的宝贵资源。此外,还采用了两种验证策略,以确保模型的普适性和稳健性。此外,它还提高了模型的准确性和可靠性。结果非常理想,准确率非常高。具体来说,在施舍验证策略中,SqueezeNet、AlexNet、GoogleNet、ResNet-50 和定制 CNN 模型的准确率分别达到了 98.44%、98.50%、98.28%、98.30% 和 96.26%。这些发现不仅证明了在乳头水肿检测和分级中使用深度学习的有效性,而且与其他相关研究相比,也确定了所提出模型的优越性。通过应对与乳头水肿相关的挑战,该研究极大地促进了神经眼科诊断的发展。从 OCT 扫描中准确有效地检测出乳头水肿,在指导及时干预和保护患者视力健康方面具有巨大的潜力。
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.