复杂场景下基于变形变压器注意机制的多种水稻病害识别方法

Xinyu Zhang, Hang Dong, Liang Gong, Xin Cheng, Zhenghui Ge, Liangchao Guo
{"title":"复杂场景下基于变形变压器注意机制的多种水稻病害识别方法","authors":"Xinyu Zhang, Hang Dong, Liang Gong, Xin Cheng, Zhenghui Ge, Liangchao Guo","doi":"10.1080/1206212x.2023.2263254","DOIUrl":null,"url":null,"abstract":"AbstractPaddy disease recognition presents challenges in the agricultural industry, and existing algorithms struggle to accurately identify diseases in complex scenarios. In this paper, we propose a precise object detection framework to address the challenges of severe overlap, multi-disease detection, morphological irregularities, multi-scale object classification, and complex scenarios in real-world environments in paddy disease detection. The proposed model is based on an improved version of the DEtection TRansformer (Detr) algorithm. The enhanced network architecture fuses multi-scale features by adding a feature fusion module after the backbone network, which is able to retain more original information of the images and greatly improves the detection accuracy; the use of deformable attention module greatly reduces the computational complexity of the model. To evaluate the PDN, a dedicated paddy disease detection dataset with 1200 images is created. Experimental results demonstrate that the proposed model obtained a precision value of 100%, a recall value of 89.3%, F1-score of 94.3%, and a mean average precision (mAP) value of 60.2%. The model outperforms the existing state-of-the-art detection models in detection accuracy.KEYWORDS: Paddy disease recognitionTransformermachine vision detection Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the Jiangsu Basic Science (Natural Science) Research Projects in Higher Education Institutions (No.23KJB460034), Jiangsu province Youth Fund Project (No.BK2023040059), the China Postdoctoral Science Foundation Funded Project (No. 2022M721185), Jiangsu Agriculture Science and Technology Innovation Fund (No. CX(21)3145).Notes on contributorsXinyu ZhangXinyu Zhang is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interest is machine learning.Hang DongDr. Hang Dong is a lecturer at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the deep learning, machine learning, and robotics. Hang Dong is the corresponding author and can be contacted at hdong@yzu.edu.cn.Liang GongLiang Gong was born in Maanshan City, Anhui Province, China on October 26, 1999. He received his bachelor's degree from Anhui Polytechnic University in 2021. He is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interests are machine vision and machine learning.Xin ChengXin Cheng was born in Lian Yungang, China, in 2002.He is currently a student in Yangzhou University.His research interests include computer vision,natural language processing.Zhenghui GeZhenghui Ge is currently an associate professor at Yangzhou University, China. He received his PhD degree from Nanjing University of Aeronautics and Astronautics, China, in 2018. His researches mainly focus on electrochemical machining.Liangchao GuoDr. Liangchao Guo is a distinguished research fellow at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the fabrication and application of gas sensing, and storage devices.","PeriodicalId":39673,"journal":{"name":"International Journal of Computers and Applications","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiple paddy disease recognition methods based on deformable transformer attention mechanism in complex scenarios\",\"authors\":\"Xinyu Zhang, Hang Dong, Liang Gong, Xin Cheng, Zhenghui Ge, Liangchao Guo\",\"doi\":\"10.1080/1206212x.2023.2263254\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"AbstractPaddy disease recognition presents challenges in the agricultural industry, and existing algorithms struggle to accurately identify diseases in complex scenarios. In this paper, we propose a precise object detection framework to address the challenges of severe overlap, multi-disease detection, morphological irregularities, multi-scale object classification, and complex scenarios in real-world environments in paddy disease detection. The proposed model is based on an improved version of the DEtection TRansformer (Detr) algorithm. The enhanced network architecture fuses multi-scale features by adding a feature fusion module after the backbone network, which is able to retain more original information of the images and greatly improves the detection accuracy; the use of deformable attention module greatly reduces the computational complexity of the model. To evaluate the PDN, a dedicated paddy disease detection dataset with 1200 images is created. Experimental results demonstrate that the proposed model obtained a precision value of 100%, a recall value of 89.3%, F1-score of 94.3%, and a mean average precision (mAP) value of 60.2%. The model outperforms the existing state-of-the-art detection models in detection accuracy.KEYWORDS: Paddy disease recognitionTransformermachine vision detection Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the Jiangsu Basic Science (Natural Science) Research Projects in Higher Education Institutions (No.23KJB460034), Jiangsu province Youth Fund Project (No.BK2023040059), the China Postdoctoral Science Foundation Funded Project (No. 2022M721185), Jiangsu Agriculture Science and Technology Innovation Fund (No. CX(21)3145).Notes on contributorsXinyu ZhangXinyu Zhang is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interest is machine learning.Hang DongDr. Hang Dong is a lecturer at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the deep learning, machine learning, and robotics. Hang Dong is the corresponding author and can be contacted at hdong@yzu.edu.cn.Liang GongLiang Gong was born in Maanshan City, Anhui Province, China on October 26, 1999. He received his bachelor's degree from Anhui Polytechnic University in 2021. He is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interests are machine vision and machine learning.Xin ChengXin Cheng was born in Lian Yungang, China, in 2002.He is currently a student in Yangzhou University.His research interests include computer vision,natural language processing.Zhenghui GeZhenghui Ge is currently an associate professor at Yangzhou University, China. He received his PhD degree from Nanjing University of Aeronautics and Astronautics, China, in 2018. His researches mainly focus on electrochemical machining.Liangchao GuoDr. Liangchao Guo is a distinguished research fellow at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the fabrication and application of gas sensing, and storage devices.\",\"PeriodicalId\":39673,\"journal\":{\"name\":\"International Journal of Computers and Applications\",\"volume\":\"94 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computers and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/1206212x.2023.2263254\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computers and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/1206212x.2023.2263254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

摘要水稻病害识别在农业领域面临挑战,现有算法难以在复杂场景下准确识别病害。针对水稻病害检测中存在的严重重叠、多病害检测、形态不规则、多尺度目标分类和复杂场景等问题,提出了一种精确的目标检测框架。提出的模型是基于检测变压器(Detr)算法的改进版本。改进后的网络架构通过在骨干网后增加特征融合模块融合多尺度特征,能够保留更多图像的原始信息,大大提高检测精度;可变形注意力模块的使用大大降低了模型的计算复杂度。为了评估PDN,创建了一个包含1200张图像的专用水稻病害检测数据集。实验结果表明,该模型的准确率为100%,召回率为89.3%,f1分数为94.3%,平均精度(mAP)为60.2%。该模型在检测精度上优于现有的最先进的检测模型。关键词:水稻疾病识别变压器机器视觉检测披露声明作者未报告潜在利益冲突。项目资助:江苏省高校基础科学(自然科学)研究项目(No. 23kjb460034)、江苏省青年基金项目(No. bk2023040059)、中国博士后科学基金项目(No. 2022M721185)、江苏省农业科技创新基金项目(No. 2022M721185)。残雪(21)3145)。作者简介张新宇张新宇现任扬州大学机械工程学院机械工程专业硕士研究生。他的研究兴趣是机器学习。挂DongDr。董航是扬州大学的讲师。2019年毕业于大连理工大学机械制造及自动化专业,获博士学位。他的研究兴趣包括深度学习、机器学习和机器人技术。董航,通讯作者,联系邮箱:hdong@yzu.edu.cn.Liang龚亮,1999年10月26日出生于中国安徽省马鞍山市。他于2021年获得安徽工业大学学士学位。他目前是扬州大学机械工程学院机械工程专业的硕士研究生。主要研究方向为机器视觉和机器学习。辛成,2002年出生于中国连云港。他现在是扬州大学的学生。主要研究方向为计算机视觉、自然语言处理。葛正辉,现任中国扬州大学副教授。2018年获南京航空航天大学博士学位。主要研究方向为电化学加工。Liangchao GuoDr。郭良超,扬州大学杰出研究员。2019年毕业于大连理工大学机械制造及自动化专业,获博士学位。他的研究兴趣包括气体传感和存储器件的制造和应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multiple paddy disease recognition methods based on deformable transformer attention mechanism in complex scenarios
AbstractPaddy disease recognition presents challenges in the agricultural industry, and existing algorithms struggle to accurately identify diseases in complex scenarios. In this paper, we propose a precise object detection framework to address the challenges of severe overlap, multi-disease detection, morphological irregularities, multi-scale object classification, and complex scenarios in real-world environments in paddy disease detection. The proposed model is based on an improved version of the DEtection TRansformer (Detr) algorithm. The enhanced network architecture fuses multi-scale features by adding a feature fusion module after the backbone network, which is able to retain more original information of the images and greatly improves the detection accuracy; the use of deformable attention module greatly reduces the computational complexity of the model. To evaluate the PDN, a dedicated paddy disease detection dataset with 1200 images is created. Experimental results demonstrate that the proposed model obtained a precision value of 100%, a recall value of 89.3%, F1-score of 94.3%, and a mean average precision (mAP) value of 60.2%. The model outperforms the existing state-of-the-art detection models in detection accuracy.KEYWORDS: Paddy disease recognitionTransformermachine vision detection Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the Jiangsu Basic Science (Natural Science) Research Projects in Higher Education Institutions (No.23KJB460034), Jiangsu province Youth Fund Project (No.BK2023040059), the China Postdoctoral Science Foundation Funded Project (No. 2022M721185), Jiangsu Agriculture Science and Technology Innovation Fund (No. CX(21)3145).Notes on contributorsXinyu ZhangXinyu Zhang is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interest is machine learning.Hang DongDr. Hang Dong is a lecturer at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the deep learning, machine learning, and robotics. Hang Dong is the corresponding author and can be contacted at hdong@yzu.edu.cn.Liang GongLiang Gong was born in Maanshan City, Anhui Province, China on October 26, 1999. He received his bachelor's degree from Anhui Polytechnic University in 2021. He is currently a master's student in mechanical engineering at the School of Mechanical Engineering, Yangzhou University. His research interests are machine vision and machine learning.Xin ChengXin Cheng was born in Lian Yungang, China, in 2002.He is currently a student in Yangzhou University.His research interests include computer vision,natural language processing.Zhenghui GeZhenghui Ge is currently an associate professor at Yangzhou University, China. He received his PhD degree from Nanjing University of Aeronautics and Astronautics, China, in 2018. His researches mainly focus on electrochemical machining.Liangchao GuoDr. Liangchao Guo is a distinguished research fellow at Yangzhou University. He received his PhD degree in Mechanical Manufacture and Automation from Dalian University of Technology (2019). His research interests include the fabrication and application of gas sensing, and storage devices.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Computers and Applications
International Journal of Computers and Applications Computer Science-Computer Graphics and Computer-Aided Design
CiteScore
4.70
自引率
0.00%
发文量
20
期刊介绍: The International Journal of Computers and Applications (IJCA) is a unique platform for publishing novel ideas, research outcomes and fundamental advances in all aspects of Computer Science, Computer Engineering, and Computer Applications. This is a peer-reviewed international journal with a vision to provide the academic and industrial community a platform for presenting original research ideas and applications. IJCA welcomes four special types of papers in addition to the regular research papers within its scope: (a) Papers for which all results could be easily reproducible. For such papers, the authors will be asked to upload "instructions for reproduction'''', possibly with the source codes or stable URLs (from where the codes could be downloaded). (b) Papers with negative results. For such papers, the experimental setting and negative results must be presented in detail. Also, why the negative results are important for the research community must be explained clearly. The rationale behind this kind of paper is that this would help researchers choose the correct approaches to solve problems and avoid the (already worked out) failed approaches. (c) Detailed report, case study and literature review articles about innovative software / hardware, new technology, high impact computer applications and future development with sufficient background and subject coverage. (d) Special issue papers focussing on a particular theme with significant importance or papers selected from a relevant conference with sufficient improvement and new material to differentiate from the papers published in a conference proceedings.
期刊最新文献
Weight assignment in cloud service selection based on FAHP and rough sets The social force model: a behavioral modeling approach for information propagation during significant events A comprehensive study on social networks analysis and mining to detect opinion leaders A machine learning approach for skin lesion classification on iOS: implementing and optimizing a convolutional transfer learning model with Create ML Physical-layer security for primary users in 5G underlay cognitive radio system via artificial-noise-aided by secondary users
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1