MC-ShuffleNetV2:玉米病害识别的轻量级模型

IF 5 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Egyptian Informatics Journal Pub Date : 2024-07-06 DOI:10.1016/j.eij.2024.100503
Shaoqiu Zhu , Haitao Gao
{"title":"MC-ShuffleNetV2:玉米病害识别的轻量级模型","authors":"Shaoqiu Zhu ,&nbsp;Haitao Gao","doi":"10.1016/j.eij.2024.100503","DOIUrl":null,"url":null,"abstract":"<div><p>Maize has a long history of cultivation and is renowned for its high yield, superior quality, and adaptability. Currently, maize holds a significant position in grain cultivation and occupies a significant place in the agricultural structure. However, maize is susceptible to various diseases during its growth process, which can have a significant impact on the quality and yield. Traditional machine learning is heavily reliant on feature extraction, whereas deep learning has demonstrated notable success in image recognition for computer vision.The use of bloated models and the resulting wastage of computational resources represent significant challenges. The paper proposes a lightweight model, MC-ShuffleNetV2 (Mish + Convolutional Block Attention Module + ShuffleNetV2), to meet the practical needs of convolutional neural networks in maize disease image recognition. The model has designed with a focus on network lightweighting and accurate feature extraction. The model was constructed upon the foundation of the high-performance ShuffleNetV2 1 × network. The Convolutional Block Attention Module was integrated into the network architecture to enhance the model’s adaptive expressiveness. The depthwise separable convolution kernel of the depth-separable module was modified from a 3 × 3 kernel to a 5 × 5 kernel. This modification was implemented with the objective of expanding the image receptive field and extracting more detailed features of the image. It was necessary to modify the activation function in each stage for Mish. The model was compressed through the application of pruning operations. In the maize disease dataset test, the accuracy of the test set recognition accuracy of the network model constructed in this paper reaches 99.86 %, the model parameters are only 873,936, and the FLOPs (Floating-point Operations) are only 1,751,286. Compared with LeNet, AlexNet, MobileNetV2, and EfficientNetV2 models, the MC-ShufflenetV2 model’s recognition ability and size have obvious advantages, and it is more conducive to the actual deployment of the agricultural mobile terminal.</p></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110866524000665/pdfft?md5=b6fc6748f084b0530dace13f462b152c&pid=1-s2.0-S1110866524000665-main.pdf","citationCount":"0","resultStr":"{\"title\":\"MC-ShuffleNetV2: A lightweight model for maize disease recognition\",\"authors\":\"Shaoqiu Zhu ,&nbsp;Haitao Gao\",\"doi\":\"10.1016/j.eij.2024.100503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Maize has a long history of cultivation and is renowned for its high yield, superior quality, and adaptability. Currently, maize holds a significant position in grain cultivation and occupies a significant place in the agricultural structure. However, maize is susceptible to various diseases during its growth process, which can have a significant impact on the quality and yield. Traditional machine learning is heavily reliant on feature extraction, whereas deep learning has demonstrated notable success in image recognition for computer vision.The use of bloated models and the resulting wastage of computational resources represent significant challenges. The paper proposes a lightweight model, MC-ShuffleNetV2 (Mish + Convolutional Block Attention Module + ShuffleNetV2), to meet the practical needs of convolutional neural networks in maize disease image recognition. The model has designed with a focus on network lightweighting and accurate feature extraction. The model was constructed upon the foundation of the high-performance ShuffleNetV2 1 × network. The Convolutional Block Attention Module was integrated into the network architecture to enhance the model’s adaptive expressiveness. The depthwise separable convolution kernel of the depth-separable module was modified from a 3 × 3 kernel to a 5 × 5 kernel. This modification was implemented with the objective of expanding the image receptive field and extracting more detailed features of the image. It was necessary to modify the activation function in each stage for Mish. The model was compressed through the application of pruning operations. In the maize disease dataset test, the accuracy of the test set recognition accuracy of the network model constructed in this paper reaches 99.86 %, the model parameters are only 873,936, and the FLOPs (Floating-point Operations) are only 1,751,286. Compared with LeNet, AlexNet, MobileNetV2, and EfficientNetV2 models, the MC-ShufflenetV2 model’s recognition ability and size have obvious advantages, and it is more conducive to the actual deployment of the agricultural mobile terminal.</p></div>\",\"PeriodicalId\":56010,\"journal\":{\"name\":\"Egyptian Informatics Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1110866524000665/pdfft?md5=b6fc6748f084b0530dace13f462b152c&pid=1-s2.0-S1110866524000665-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Egyptian Informatics Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110866524000665\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866524000665","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

玉米种植历史悠久,以高产、优质和适应性强而闻名。目前,玉米在粮食种植中占有重要地位,在农业结构中占据重要位置。然而,玉米在生长过程中容易受到各种病害的侵袭,对玉米的品质和产量产生重大影响。传统的机器学习严重依赖于特征提取,而深度学习在计算机视觉的图像识别方面取得了显著的成功。本文提出了一种轻量级模型 MC-ShuffleNetV2(Mish + 卷积块注意力模块 + ShuffleNetV2),以满足卷积神经网络在玉米病害图像识别中的实际需求。该模型的设计侧重于网络轻量化和精确特征提取。该模型是在高性能 ShuffleNetV2 1 × 网络的基础上构建的。网络架构中集成了卷积块注意力模块,以增强模型的自适应表达能力。深度可分模块的深度可分卷积核从 3 × 3 核修改为 5 × 5 核。实施这一修改的目的是扩大图像感受野,提取图像中更多的细节特征。有必要修改 Mish 每个阶段的激活函数。通过剪枝操作对模型进行了压缩。在玉米病害数据集测试中,本文构建的网络模型的测试集识别准确率达到 99.86 %,模型参数仅为 873 936,FLOP(浮点运算)仅为 1 751 286。与 LeNet、AlexNet、MobileNetV2 和 EfficientNetV2 模型相比,MC-ShufflenetV2 模型的识别能力和规模具有明显优势,更有利于农业移动终端的实际部署。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MC-ShuffleNetV2: A lightweight model for maize disease recognition

Maize has a long history of cultivation and is renowned for its high yield, superior quality, and adaptability. Currently, maize holds a significant position in grain cultivation and occupies a significant place in the agricultural structure. However, maize is susceptible to various diseases during its growth process, which can have a significant impact on the quality and yield. Traditional machine learning is heavily reliant on feature extraction, whereas deep learning has demonstrated notable success in image recognition for computer vision.The use of bloated models and the resulting wastage of computational resources represent significant challenges. The paper proposes a lightweight model, MC-ShuffleNetV2 (Mish + Convolutional Block Attention Module + ShuffleNetV2), to meet the practical needs of convolutional neural networks in maize disease image recognition. The model has designed with a focus on network lightweighting and accurate feature extraction. The model was constructed upon the foundation of the high-performance ShuffleNetV2 1 × network. The Convolutional Block Attention Module was integrated into the network architecture to enhance the model’s adaptive expressiveness. The depthwise separable convolution kernel of the depth-separable module was modified from a 3 × 3 kernel to a 5 × 5 kernel. This modification was implemented with the objective of expanding the image receptive field and extracting more detailed features of the image. It was necessary to modify the activation function in each stage for Mish. The model was compressed through the application of pruning operations. In the maize disease dataset test, the accuracy of the test set recognition accuracy of the network model constructed in this paper reaches 99.86 %, the model parameters are only 873,936, and the FLOPs (Floating-point Operations) are only 1,751,286. Compared with LeNet, AlexNet, MobileNetV2, and EfficientNetV2 models, the MC-ShufflenetV2 model’s recognition ability and size have obvious advantages, and it is more conducive to the actual deployment of the agricultural mobile terminal.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Egyptian Informatics Journal
Egyptian Informatics Journal Decision Sciences-Management Science and Operations Research
CiteScore
11.10
自引率
1.90%
发文量
59
审稿时长
110 days
期刊介绍: The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.
期刊最新文献
HD-MVCNN: High-density ECG signal based diabetic prediction and classification using multi-view convolutional neural network A hybrid encryption algorithm based approach for secure privacy protection of big data in hospitals A new probabilistic linguistic decision-making process based on PL-BWM and improved three-way TODIM methods Interval valued inventory model with different payment strategies for green products under interval valued Grey Wolf optimizer Algorithm fitness function Intelligent SDN to enhance security in IoT networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1