EYE-YOLO: a multi-spatial pyramid pooling and Focal-EIOU loss inspired tiny YOLOv7 for fundus eye disease detection

IF 2.2 Q3 COMPUTER SCIENCE, CYBERNETICS International Journal of Intelligent Computing and Cybernetics Pub Date : 2024-06-04 DOI:10.1108/ijicc-02-2024-0077
Akhil Kumar, R. Dhanalakshmi
{"title":"EYE-YOLO: a multi-spatial pyramid pooling and Focal-EIOU loss inspired tiny YOLOv7 for fundus eye disease detection","authors":"Akhil Kumar, R. Dhanalakshmi","doi":"10.1108/ijicc-02-2024-0077","DOIUrl":null,"url":null,"abstract":"PurposeThe purpose of this work is to present an approach for autonomous detection of eye disease in fundus images. Furthermore, this work presents an improved variant of the Tiny YOLOv7 model developed specifically for eye disease detection. The model proposed in this work is a highly useful tool for the development of applications for autonomous detection of eye diseases in fundus images that can help and assist ophthalmologists.Design/methodology/approachThe approach adopted to carry out this work is twofold. Firstly, a richly annotated dataset consisting of eye disease classes, namely, cataract, glaucoma, retinal disease and normal eye, was created. Secondly, an improved variant of the Tiny YOLOv7 model was developed and proposed as EYE-YOLO. The proposed EYE-YOLO model has been developed by integrating multi-spatial pyramid pooling in the feature extraction network and Focal-EIOU loss in the detection network of the Tiny YOLOv7 model. Moreover, at run time, the mosaic augmentation strategy has been utilized with the proposed model to achieve benchmark results. Further, evaluations have been carried out for performance metrics, namely, precision, recall, F1 Score, average precision (AP) and mean average precision (mAP).FindingsThe proposed EYE-YOLO achieved 28% higher precision, 18% higher recall, 24% higher F1 Score and 30.81% higher mAP than the Tiny YOLOv7 model. Moreover, in terms of AP for each class of the employed dataset, it achieved 9.74% higher AP for cataract, 27.73% higher AP for glaucoma, 72.50% higher AP for retina disease and 13.26% higher AP for normal eye. In comparison to the state-of-the-art Tiny YOLOv5, Tiny YOLOv6 and Tiny YOLOv8 models, the proposed EYE-YOLO achieved 6–23.32% higher mAP.Originality/valueThis work addresses the problem of eye disease recognition as a bounding box regression and detection problem. Whereas, the work in the related research is largely based on eye disease classification. The other highlight of this work is to propose a richly annotated dataset for different eye diseases useful for training deep learning-based object detectors. The major highlight of this work lies in the proposal of an improved variant of the Tiny YOLOv7 model focusing on eye disease detection. The proposed modifications in the Tiny YOLOv7 aided the proposed model in achieving better results as compared to the state-of-the-art Tiny YOLOv8 and YOLOv8 Nano.","PeriodicalId":45291,"journal":{"name":"International Journal of Intelligent Computing and Cybernetics","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Computing and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijicc-02-2024-0077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0

Abstract

PurposeThe purpose of this work is to present an approach for autonomous detection of eye disease in fundus images. Furthermore, this work presents an improved variant of the Tiny YOLOv7 model developed specifically for eye disease detection. The model proposed in this work is a highly useful tool for the development of applications for autonomous detection of eye diseases in fundus images that can help and assist ophthalmologists.Design/methodology/approachThe approach adopted to carry out this work is twofold. Firstly, a richly annotated dataset consisting of eye disease classes, namely, cataract, glaucoma, retinal disease and normal eye, was created. Secondly, an improved variant of the Tiny YOLOv7 model was developed and proposed as EYE-YOLO. The proposed EYE-YOLO model has been developed by integrating multi-spatial pyramid pooling in the feature extraction network and Focal-EIOU loss in the detection network of the Tiny YOLOv7 model. Moreover, at run time, the mosaic augmentation strategy has been utilized with the proposed model to achieve benchmark results. Further, evaluations have been carried out for performance metrics, namely, precision, recall, F1 Score, average precision (AP) and mean average precision (mAP).FindingsThe proposed EYE-YOLO achieved 28% higher precision, 18% higher recall, 24% higher F1 Score and 30.81% higher mAP than the Tiny YOLOv7 model. Moreover, in terms of AP for each class of the employed dataset, it achieved 9.74% higher AP for cataract, 27.73% higher AP for glaucoma, 72.50% higher AP for retina disease and 13.26% higher AP for normal eye. In comparison to the state-of-the-art Tiny YOLOv5, Tiny YOLOv6 and Tiny YOLOv8 models, the proposed EYE-YOLO achieved 6–23.32% higher mAP.Originality/valueThis work addresses the problem of eye disease recognition as a bounding box regression and detection problem. Whereas, the work in the related research is largely based on eye disease classification. The other highlight of this work is to propose a richly annotated dataset for different eye diseases useful for training deep learning-based object detectors. The major highlight of this work lies in the proposal of an improved variant of the Tiny YOLOv7 model focusing on eye disease detection. The proposed modifications in the Tiny YOLOv7 aided the proposed model in achieving better results as compared to the state-of-the-art Tiny YOLOv8 and YOLOv8 Nano.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
EYE-YOLO:受微小 YOLOv7 启发,用于眼底疾病检测的多空间金字塔汇集和 Focal-EIOU 损失法
目的 本作品旨在介绍一种在眼底图像中自主检测眼疾的方法。此外,本作品还介绍了专门为眼疾检测开发的 Tiny YOLOv7 模型的改进变体。这项工作中提出的模型是一个非常有用的工具,可用于开发在眼底图像中自主检测眼部疾病的应用程序,从而帮助和协助眼科医生。首先,创建了一个包含丰富眼病类别注释的数据集,即白内障、青光眼、视网膜疾病和正常眼。其次,开发了 Tiny YOLOv7 模型的改进变体,并将其命名为 EYE-YOLO。所提出的 EYE-YOLO 模型是在 Tiny YOLOv7 模型的特征提取网络中集成了多空间金字塔池,在检测网络中集成了 Focal-EIOU 损失。此外,在运行时,还将马赛克增强策略与所提出的模型结合使用,以获得基准结果。此外,还对精确度、召回率、F1 分数、平均精确度 (AP) 和平均平均精确度 (mAP) 等性能指标进行了评估。此外,在所使用数据集的每个类别中,它在白内障方面的 AP 值提高了 9.74%,在青光眼方面的 AP 值提高了 27.73%,在视网膜疾病方面的 AP 值提高了 72.50%,在正常眼方面的 AP 值提高了 13.26%。与最先进的 Tiny YOLOv5、Tiny YOLOv6 和 Tiny YOLOv8 模型相比,所提出的 EYE-YOLO 的 mAP 高出 6-23.32%。而相关研究工作主要基于眼病分类。这项工作的另一个亮点是为不同的眼疾提出了丰富的注释数据集,有助于训练基于深度学习的物体检测器。这项工作的主要亮点在于提出了 Tiny YOLOv7 模型的改进变体,重点用于眼疾检测。与最先进的 Tiny YOLOv8 和 YOLOv8 Nano 相比,对 Tiny YOLOv7 提出的修改有助于该模型取得更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.80
自引率
4.70%
发文量
26
期刊最新文献
X-News dataset for online news categorization X-News dataset for online news categorization A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label data Contextualized dynamic meta embeddings based on Gated CNNs and self-attention for Arabic machine translation Dynamic community detection algorithm based on hyperbolic graph convolution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1