基于多模态自监督学习的小样本黄瓜病害识别

IF 2.5 2区农林科学 Q1 AGRONOMY Crop Protection Pub Date : 2024-10-30 DOI:10.1016/j.cropro.2024.107006

Yiyi Cao , Guangling Sun , Yuan Yuan , Lei Chen

{"title":"基于多模态自监督学习的小样本黄瓜病害识别","authors":"Yiyi Cao , Guangling Sun , Yuan Yuan , Lei Chen","doi":"10.1016/j.cropro.2024.107006","DOIUrl":null,"url":null,"abstract":"<div><div>It is difficult and costly to obtain large-scale, labeled crop disease data in the field of agriculture. How to use small samples of unlabeled data for feature learning has become an urgent problem that needs to be solved. The emergence of self-supervised contrastive learning methods and self-supervised mask learning methods can solve the problem of missing labels on the training data. However, each of these paradigms comes with its own advantages and drawbacks. At the same time, the features learned by dataset in a single modality are limited, ignoring the correlation with other modal information. Hence, this paper introduced an effective framework for multimodal self-supervised learning, denoted as MMSSL, to address the task of identifying cucumber diseases with small sample sizes. Integrating image self-supervised mask learning, image self-supervised contrastive learning, and multimodal image-text contrastive learning, the model can not only learn disease feature information from different modalities, but also capture global and local disease feature information. Simultaneously, the mask learning branch was enhanced by introducing a prompt learning module based on a cross-attention network. This module aided in approximately locating the masked regions in the image data in advance, facilitating the decoder in making accurate decoding predictions. Experimental results demonstrate that the proposed method achieves a 95% accuracy in cucumber disease identification in the absence of labels. The approach effectively uncovers high-level semantic features within multimodal small-sample cucumber disease data. GradCAM is also employed for visual analysis to further understand the decision-making process of the model in disease identification. In conclusion, the proposed method in this paper is advantageous for enhancing the classification accuracy of small-sample cucumber data in a multimodal, unlabeled context, demonstrating good generalization performance.</div></div>","PeriodicalId":10785,"journal":{"name":"Crop Protection","volume":"188 ","pages":"Article 107006"},"PeriodicalIF":2.5000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Small-sample cucumber disease identification based on multimodal self-supervised learning\",\"authors\":\"Yiyi Cao , Guangling Sun , Yuan Yuan , Lei Chen\",\"doi\":\"10.1016/j.cropro.2024.107006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>It is difficult and costly to obtain large-scale, labeled crop disease data in the field of agriculture. How to use small samples of unlabeled data for feature learning has become an urgent problem that needs to be solved. The emergence of self-supervised contrastive learning methods and self-supervised mask learning methods can solve the problem of missing labels on the training data. However, each of these paradigms comes with its own advantages and drawbacks. At the same time, the features learned by dataset in a single modality are limited, ignoring the correlation with other modal information. Hence, this paper introduced an effective framework for multimodal self-supervised learning, denoted as MMSSL, to address the task of identifying cucumber diseases with small sample sizes. Integrating image self-supervised mask learning, image self-supervised contrastive learning, and multimodal image-text contrastive learning, the model can not only learn disease feature information from different modalities, but also capture global and local disease feature information. Simultaneously, the mask learning branch was enhanced by introducing a prompt learning module based on a cross-attention network. This module aided in approximately locating the masked regions in the image data in advance, facilitating the decoder in making accurate decoding predictions. Experimental results demonstrate that the proposed method achieves a 95% accuracy in cucumber disease identification in the absence of labels. The approach effectively uncovers high-level semantic features within multimodal small-sample cucumber disease data. GradCAM is also employed for visual analysis to further understand the decision-making process of the model in disease identification. In conclusion, the proposed method in this paper is advantageous for enhancing the classification accuracy of small-sample cucumber data in a multimodal, unlabeled context, demonstrating good generalization performance.</div></div>\",\"PeriodicalId\":10785,\"journal\":{\"name\":\"Crop Protection\",\"volume\":\"188 \",\"pages\":\"Article 107006\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Crop Protection\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0261219424004344\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crop Protection","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0261219424004344","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}

引用次数: 0

摘要

在农业领域，获取大规模、有标记的作物病害数据既困难又昂贵。如何利用未标记的小样本数据进行特征学习已成为亟待解决的问题。自监督对比学习方法和自监督掩码学习方法的出现可以解决训练数据中标签缺失的问题。然而，这些范式各有利弊。同时，单一模态数据集学习到的特征是有限的，忽略了与其他模态信息的相关性。因此，本文介绍了一种有效的多模态自监督学习框架（简称 MMSSL），以解决样本量较小的黄瓜病害识别任务。该模型集成了图像自监督掩码学习、图像自监督对比学习和多模态图像-文本对比学习，不仅能学习不同模态的疾病特征信息，还能捕捉全局和局部疾病特征信息。同时，通过引入基于交叉注意网络的提示学习模块，掩膜学习分支得到了增强。该模块有助于提前在图像数据中大致定位掩码区域，从而帮助解码器做出准确的解码预测。实验结果表明，所提出的方法在没有标签的情况下识别黄瓜疾病的准确率达到 95%。该方法有效地发现了多模态小样本黄瓜疾病数据中的高级语义特征。此外，还利用 GradCAM 进行了可视化分析，以进一步了解该模型在疾病识别中的决策过程。总之，本文提出的方法有利于提高多模态、无标记背景下黄瓜小样本数据的分类准确性，并表现出良好的泛化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Small-sample cucumber disease identification based on multimodal self-supervised learning

It is difficult and costly to obtain large-scale, labeled crop disease data in the field of agriculture. How to use small samples of unlabeled data for feature learning has become an urgent problem that needs to be solved. The emergence of self-supervised contrastive learning methods and self-supervised mask learning methods can solve the problem of missing labels on the training data. However, each of these paradigms comes with its own advantages and drawbacks. At the same time, the features learned by dataset in a single modality are limited, ignoring the correlation with other modal information. Hence, this paper introduced an effective framework for multimodal self-supervised learning, denoted as MMSSL, to address the task of identifying cucumber diseases with small sample sizes. Integrating image self-supervised mask learning, image self-supervised contrastive learning, and multimodal image-text contrastive learning, the model can not only learn disease feature information from different modalities, but also capture global and local disease feature information. Simultaneously, the mask learning branch was enhanced by introducing a prompt learning module based on a cross-attention network. This module aided in approximately locating the masked regions in the image data in advance, facilitating the decoder in making accurate decoding predictions. Experimental results demonstrate that the proposed method achieves a 95% accuracy in cucumber disease identification in the absence of labels. The approach effectively uncovers high-level semantic features within multimodal small-sample cucumber disease data. GradCAM is also employed for visual analysis to further understand the decision-making process of the model in disease identification. In conclusion, the proposed method in this paper is advantageous for enhancing the classification accuracy of small-sample cucumber data in a multimodal, unlabeled context, demonstrating good generalization performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Crop Protection 农林科学-农艺学

CiteScore

6.10

自引率

3.60%

发文量

200

审稿时长

29 days

期刊介绍： The Editors of Crop Protection especially welcome papers describing an interdisciplinary approach showing how different control strategies can be integrated into practical pest management programs, covering high and low input agricultural systems worldwide. Crop Protection particularly emphasizes the practical aspects of control in the field and for protected crops, and includes work which may lead in the near future to more effective control. The journal does not duplicate the many existing excellent biological science journals, which deal mainly with the more fundamental aspects of plant pathology, applied zoology and weed science. Crop Protection covers all practical aspects of pest, disease and weed control, including the following topics: -Abiotic damage- Agronomic control methods- Assessment of pest and disease damage- Molecular methods for the detection and assessment of pests and diseases- Biological control- Biorational pesticides- Control of animal pests of world crops- Control of diseases of crop plants caused by microorganisms- Control of weeds and integrated management- Economic considerations- Effects of plant growth regulators- Environmental benefits of reduced pesticide use- Environmental effects of pesticides- Epidemiology of pests and diseases in relation to control- GM Crops, and genetic engineering applications- Importance and control of postharvest crop losses- Integrated control- Interrelationships and compatibility among different control strategies- Invasive species as they relate to implications for crop protection- Pesticide application methods- Pest management- Phytobiomes for pest and disease control- Resistance management- Sampling and monitoring schemes for diseases, nematodes, pests and weeds.