Mushroom image classification and recognition based on improved ConvNeXt V2

IF 3.4 2区 农林科学 Q2 FOOD SCIENCE & TECHNOLOGY Journal of Food Science Pub Date : 2025-03-17 DOI:10.1111/1750-3841.70133
Shulong Zhang, Kexin Zhao, Yukang Huo, Mingyuan Yao, Lin Xue, Haihua Wang
{"title":"Mushroom image classification and recognition based on improved ConvNeXt V2","authors":"Shulong Zhang,&nbsp;Kexin Zhao,&nbsp;Yukang Huo,&nbsp;Mingyuan Yao,&nbsp;Lin Xue,&nbsp;Haihua Wang","doi":"10.1111/1750-3841.70133","DOIUrl":null,"url":null,"abstract":"<div>\n \n <section>\n \n \n <p>Using on-site images to classify and identify wild mushroom species is the most effective way to prevent incidents of harm caused by eating wild mushrooms. However, the complexity of natural scenes and the similarity of mushroom morphology bring challenges for accurate classification and recognition. To this end, this paper proposes an improved ConvNeXt V2 network model for classification and recognition of mushrooms in complex scenes and similar appearances. First, this study applies data enhancement techniques such as image flipping, adding noise and mosaic to solve the problem of dataset equalization, and constructs a mushroom image dataset containing 18 categories and the number of 10,986 images. Second, a cross-modular approach is used to extract and fuse image features of different dimensions to enhance the feature capture capability of the ConvNeXt V2 model. In addition, the model is optimized by the one-hot coding and the spatial pyramid pooling techniques. The experimental results show that the improved ConvNeXt V2 model outperforms the comparative models such as ResNet, MobileVit, Swin Transformer, ConvNeXt, and ConvNeXt V2 in terms of accuracy, precision, recall, and F1-Score, which are 96.7%, 96.84%, 96.83%, and 96.84%. The ablation experiments further verify the effectiveness and superiority of the proposed improvement strategy in enhancing the model performance, which can effectively improve the efficiency and accuracy of mushroom image classification and recognition.</p>\n \n <p><b>Practical Application</b>: The study in this paper can be used for the identification of edible and nonedible mushroom, and it can provide technical support to reduce the incidence of mushroom poisoning and ensure food safety.</p>\n </section>\n </div>","PeriodicalId":193,"journal":{"name":"Journal of Food Science","volume":"90 3","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Science","FirstCategoryId":"97","ListUrlMain":"https://ift.onlinelibrary.wiley.com/doi/10.1111/1750-3841.70133","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Using on-site images to classify and identify wild mushroom species is the most effective way to prevent incidents of harm caused by eating wild mushrooms. However, the complexity of natural scenes and the similarity of mushroom morphology bring challenges for accurate classification and recognition. To this end, this paper proposes an improved ConvNeXt V2 network model for classification and recognition of mushrooms in complex scenes and similar appearances. First, this study applies data enhancement techniques such as image flipping, adding noise and mosaic to solve the problem of dataset equalization, and constructs a mushroom image dataset containing 18 categories and the number of 10,986 images. Second, a cross-modular approach is used to extract and fuse image features of different dimensions to enhance the feature capture capability of the ConvNeXt V2 model. In addition, the model is optimized by the one-hot coding and the spatial pyramid pooling techniques. The experimental results show that the improved ConvNeXt V2 model outperforms the comparative models such as ResNet, MobileVit, Swin Transformer, ConvNeXt, and ConvNeXt V2 in terms of accuracy, precision, recall, and F1-Score, which are 96.7%, 96.84%, 96.83%, and 96.84%. The ablation experiments further verify the effectiveness and superiority of the proposed improvement strategy in enhancing the model performance, which can effectively improve the efficiency and accuracy of mushroom image classification and recognition.

Practical Application: The study in this paper can be used for the identification of edible and nonedible mushroom, and it can provide technical support to reduce the incidence of mushroom poisoning and ensure food safety.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于改进ConvNeXt V2的蘑菇图像分类与识别
利用现场图像对野生蘑菇品种进行分类鉴定是预防食用野生蘑菇危害事件发生的最有效途径。然而,自然场景的复杂性和蘑菇形态的相似性给蘑菇的准确分类和识别带来了挑战。为此,本文提出了一种改进的ConvNeXt V2网络模型,用于复杂场景和相似外观下蘑菇的分类识别。首先,采用图像翻转、加噪、拼接等数据增强技术解决数据集均衡化问题,构建了包含18个类别、10986张图像的蘑菇图像数据集。其次,采用交叉模块化方法提取和融合不同维数的图像特征,增强ConvNeXt V2模型的特征捕获能力;此外,采用单热编码和空间金字塔池化技术对模型进行了优化。实验结果表明,改进后的ConvNeXt V2模型在准确率、精密度、召回率和F1-Score方面均优于ResNet、MobileVit、Swin Transformer、ConvNeXt和ConvNeXt V2模型,分别为96.7%、96.84%、96.83%和96.84%。烧蚀实验进一步验证了所提出的改进策略在增强模型性能方面的有效性和优越性,能够有效提高蘑菇图像分类识别的效率和准确率。实际应用:本研究可用于食用菌和非食用菌的鉴别,为减少食用菌中毒的发生,保障食品安全提供技术支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Food Science
Journal of Food Science 工程技术-食品科技
CiteScore
7.10
自引率
2.60%
发文量
412
审稿时长
3.1 months
期刊介绍: The goal of the Journal of Food Science is to offer scientists, researchers, and other food professionals the opportunity to share knowledge of scientific advancements in the myriad disciplines affecting their work, through a respected peer-reviewed publication. The Journal of Food Science serves as an international forum for vital research and developments in food science. The range of topics covered in the journal include: -Concise Reviews and Hypotheses in Food Science -New Horizons in Food Research -Integrated Food Science -Food Chemistry -Food Engineering, Materials Science, and Nanotechnology -Food Microbiology and Safety -Sensory and Consumer Sciences -Health, Nutrition, and Food -Toxicology and Chemical Food Safety The Journal of Food Science publishes peer-reviewed articles that cover all aspects of food science, including safety and nutrition. Reviews should be 15 to 50 typewritten pages (including tables, figures, and references), should provide in-depth coverage of a narrowly defined topic, and should embody careful evaluation (weaknesses, strengths, explanation of discrepancies in results among similar studies) of all pertinent studies, so that insightful interpretations and conclusions can be presented. Hypothesis papers are especially appropriate in pioneering areas of research or important areas that are afflicted by scientific controversy.
期刊最新文献
Industrial Applications of Selected JFS Articles Industrial Applications of Selected JFS Articles Industrial Applications of Selected JFS Articles Interview With Dr. Brendan Niemira, Chief Science & Technology Officer, IFT Lion's Mane Mushroom: Nutritional Profile, Bioactive Compounds, Functional Properties, and Applications in Functional Food Systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1