Diabetic Retinopathy Grading based on a Sparse Network Fusion of Heterogeneous ConvNeXt Models with Category Attention

Agustin Castillo-Munguia, Gibran Benitez-Garcia, J. Olivares-Mercado, Hiroki Takahashi
{"title":"Diabetic Retinopathy Grading based on a Sparse Network Fusion of Heterogeneous ConvNeXt Models with Category Attention","authors":"Agustin Castillo-Munguia, Gibran Benitez-Garcia, J. Olivares-Mercado, Hiroki Takahashi","doi":"10.23919/MVA57639.2023.10216129","DOIUrl":null,"url":null,"abstract":"Diabetic retinopathy (DR) is an eye disease caused by high blood sugar levels that may damage vessels in the retina, leading to partial or complete loss of vision in later stages. In recent years, convolutional neural networks (CNN) have been used to help diagnose the DR severity. However, due to the slight differences between each class and the imbalanced nature of the datasets, standard CNNs often struggle to distinguish accurately between different grades of DR. To overcome these challenges, we propose combining a novel CNN model (ConvNeXt) with category-attention blocks incorporated at multiple levels of the architecture. This generates different models that can effectively extract fine-grained features and minimize the impact of dataset imbalance. Finally, we introduce a Sparse Network Fusion technique that learns to combine the outputs of all models to consolidate their individual decisions. Extensive experiments on the challenging DDR dataset show that our proposal achieves a new state-of-the-art performance, improving by about 3% grading accuracy compared with existing methods.","PeriodicalId":338734,"journal":{"name":"2023 18th International Conference on Machine Vision and Applications (MVA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 18th International Conference on Machine Vision and Applications (MVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/MVA57639.2023.10216129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Diabetic retinopathy (DR) is an eye disease caused by high blood sugar levels that may damage vessels in the retina, leading to partial or complete loss of vision in later stages. In recent years, convolutional neural networks (CNN) have been used to help diagnose the DR severity. However, due to the slight differences between each class and the imbalanced nature of the datasets, standard CNNs often struggle to distinguish accurately between different grades of DR. To overcome these challenges, we propose combining a novel CNN model (ConvNeXt) with category-attention blocks incorporated at multiple levels of the architecture. This generates different models that can effectively extract fine-grained features and minimize the impact of dataset imbalance. Finally, we introduce a Sparse Network Fusion technique that learns to combine the outputs of all models to consolidate their individual decisions. Extensive experiments on the challenging DDR dataset show that our proposal achieves a new state-of-the-art performance, improving by about 3% grading accuracy compared with existing methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于类别关注的异构ConvNeXt模型稀疏网络融合的糖尿病视网膜病变分级
糖尿病性视网膜病变(DR)是一种由高血糖引起的眼部疾病,可能会损害视网膜血管,导致后期部分或完全丧失视力。近年来,卷积神经网络(CNN)已被用于帮助诊断DR的严重程度。然而,由于每个类别之间的细微差异和数据集的不平衡性,标准CNN经常难以准确区分不同等级的dr。为了克服这些挑战,我们提出将一种新的CNN模型(ConvNeXt)与在架构的多个层次上合并的类别关注块相结合。这产生了不同的模型,可以有效地提取细粒度特征,并最大限度地减少数据集不平衡的影响。最后,我们介绍了一种稀疏网络融合技术,该技术学习组合所有模型的输出以巩固它们的个体决策。在具有挑战性的DDR数据集上进行的大量实验表明,我们的建议达到了新的最先进的性能,与现有方法相比,分级精度提高了约3%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Small Object Detection for Birds with Swin Transformer CG-based dataset generation and adversarial image conversion for deep cucumber recognition Uncertainty Criteria in Active Transfer Learning for Efficient Video-Specific Human Pose Estimation Joint Learning with Group Relation and Individual Action Diabetic Retinopathy Grading based on a Sparse Network Fusion of Heterogeneous ConvNeXt Models with Category Attention
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1