An Explainable Deep Learning Network With Transformer and Custom CNN for Bean Leaf Disease Classification

IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Access Pub Date : 2025-02-26 DOI:10.1109/ACCESS.2025.3546017
R. Karthik;R. Aswin;K. S. Geetha;K. Suganthi
{"title":"An Explainable Deep Learning Network With Transformer and Custom CNN for Bean Leaf Disease Classification","authors":"R. Karthik;R. Aswin;K. S. Geetha;K. Suganthi","doi":"10.1109/ACCESS.2025.3546017","DOIUrl":null,"url":null,"abstract":"Bean rust and angular leaf spot pose significant challenges to bean cultivation, impacting yields. Prompt disease identification maximizes productivity, but traditional methods need specialized expertise. This research presents an explainable deep learning model that combines the Pyramid Vision Transformer (PVT) and Group Context Aware Depthwise Shuffle Network (GCADSN). The PVT effectively models long-range dependencies, identifying disease patterns across larger leaf areas, while the GCADSN focuses on capturing nuanced, context-specific features. This combined approach leads to a richer representation of the input image, resulting in improved disease classification. Model explainability is provided through GradCAM visualizations, highlighting the image regions crucial for the model’s predictions and enabling transparent, class-specific insights. The model’s performance was rigorously tested using the IBean dataset, a collection of images depicting various bean leaf diseases, including common rust, angular leaf spot, and healthy leaves. Our proposed network achieved a high accuracy of 97.66%, outperforming many current state-of-the-art deep learning models. Furthermore, it demonstrated strong performance across other key metrics, with an F1 score of 97.67%, precision of 97.83%, and recall of 97.67%. Importantly, the model’s computational efficiency makes it well-suited for practical application in real-world agricultural scenarios.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"38562-38573"},"PeriodicalIF":3.6000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10904208","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10904208/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Bean rust and angular leaf spot pose significant challenges to bean cultivation, impacting yields. Prompt disease identification maximizes productivity, but traditional methods need specialized expertise. This research presents an explainable deep learning model that combines the Pyramid Vision Transformer (PVT) and Group Context Aware Depthwise Shuffle Network (GCADSN). The PVT effectively models long-range dependencies, identifying disease patterns across larger leaf areas, while the GCADSN focuses on capturing nuanced, context-specific features. This combined approach leads to a richer representation of the input image, resulting in improved disease classification. Model explainability is provided through GradCAM visualizations, highlighting the image regions crucial for the model’s predictions and enabling transparent, class-specific insights. The model’s performance was rigorously tested using the IBean dataset, a collection of images depicting various bean leaf diseases, including common rust, angular leaf spot, and healthy leaves. Our proposed network achieved a high accuracy of 97.66%, outperforming many current state-of-the-art deep learning models. Furthermore, it demonstrated strong performance across other key metrics, with an F1 score of 97.67%, precision of 97.83%, and recall of 97.67%. Importantly, the model’s computational efficiency makes it well-suited for practical application in real-world agricultural scenarios.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有变压器和自定义CNN的可解释深度学习网络用于豆叶病害分类
大豆锈病和角斑病对大豆种植构成重大挑战,影响产量。及时识别疾病可使生产力最大化,但传统方法需要专门知识。本研究提出了一个可解释的深度学习模型,该模型结合了金字塔视觉变压器(PVT)和群体上下文感知深度洗牌网络(GCADSN)。PVT有效地建立了长期依赖关系的模型,在更大的叶面积上识别疾病模式,而GCADSN专注于捕捉细微的、特定于环境的特征。这种结合的方法导致输入图像的更丰富的表示,从而改进疾病分类。通过GradCAM可视化提供模型可解释性,突出显示对模型预测至关重要的图像区域,并实现透明的,特定于类的见解。该模型的性能使用IBean数据集进行了严格测试,IBean数据集是描述各种豆类叶片疾病的图像集合,包括普通锈病、角斑病和健康叶片。我们提出的网络达到了97.66%的高准确率,优于当前许多最先进的深度学习模型。此外,它在其他关键指标上表现出色,F1得分为97.67%,准确率为97.83%,召回率为97.67%。重要的是,该模型的计算效率使其非常适合于实际农业场景的实际应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Access
IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍: IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.
期刊最新文献
Integration of a Slotted Monopole Antenna With a Partially Reflective Surface for Enhanced Breast Tumor Detection DKC-LLM: Dynamic Knowledge Caching for Large Language Models in Business Applications Lightweight Wavelet Convolutional U-Net for Seismic Phase Recognition Unified Entropy–Spectral Fingerprinting of Chaotic Attractors via CFS, Lyapunov Stability, and Nonlinear Complexity Measures Abnormal Vessel Activity Detection Using a Two-Level Grid Representation of AIS Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1