{"title":"基于支持向量机的可解释视觉变换器与迁移学习相结合的高效干旱压力识别技术","authors":"Aswini Kumar Patra, Ankit Varshney, Lingaraj Sahoo","doi":"arxiv-2407.21666","DOIUrl":null,"url":null,"abstract":"Early detection of drought stress is critical for taking timely measures for\nreducing crop loss before the drought impact becomes irreversible. The subtle\nphenotypical and physiological changes in response to drought stress are\ncaptured by non-invasive imaging techniques and these imaging data serve as\nvaluable resource for machine learning methods to identify drought stress.\nWhile convolutional neural networks (CNNs) are in wide use, vision transformers\n(ViTs) present a promising alternative in capturing long-range dependencies and\nintricate spatial relationships, thereby enhancing the detection of subtle\nindicators of drought stress. We propose an explainable deep learning pipeline\nthat leverages the power of ViTs for drought stress detection in potato crops\nusing aerial imagery. We applied two distinct approaches: a synergistic\ncombination of ViT and support vector machine (SVM), where ViT extracts\nintricate spatial features from aerial images, and SVM classifies the crops as\nstressed or healthy and an end-to-end approach using a dedicated classification\nlayer within ViT to directly detect drought stress. Our key findings explain\nthe ViT model's decision-making process by visualizing attention maps. These\nmaps highlight the specific spatial features within the aerial images that the\nViT model focuses as the drought stress signature. Our findings demonstrate\nthat the proposed methods not only achieve high accuracy in drought stress\nidentification but also shedding light on the diverse subtle plant features\nassociated with drought stress. This offers a robust and interpretable solution\nfor drought stress monitoring for farmers to undertake informed decisions for\nimproved crop management.","PeriodicalId":501168,"journal":{"name":"arXiv - CS - Emerging Technologies","volume":"46 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification\",\"authors\":\"Aswini Kumar Patra, Ankit Varshney, Lingaraj Sahoo\",\"doi\":\"arxiv-2407.21666\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Early detection of drought stress is critical for taking timely measures for\\nreducing crop loss before the drought impact becomes irreversible. The subtle\\nphenotypical and physiological changes in response to drought stress are\\ncaptured by non-invasive imaging techniques and these imaging data serve as\\nvaluable resource for machine learning methods to identify drought stress.\\nWhile convolutional neural networks (CNNs) are in wide use, vision transformers\\n(ViTs) present a promising alternative in capturing long-range dependencies and\\nintricate spatial relationships, thereby enhancing the detection of subtle\\nindicators of drought stress. We propose an explainable deep learning pipeline\\nthat leverages the power of ViTs for drought stress detection in potato crops\\nusing aerial imagery. We applied two distinct approaches: a synergistic\\ncombination of ViT and support vector machine (SVM), where ViT extracts\\nintricate spatial features from aerial images, and SVM classifies the crops as\\nstressed or healthy and an end-to-end approach using a dedicated classification\\nlayer within ViT to directly detect drought stress. Our key findings explain\\nthe ViT model's decision-making process by visualizing attention maps. These\\nmaps highlight the specific spatial features within the aerial images that the\\nViT model focuses as the drought stress signature. Our findings demonstrate\\nthat the proposed methods not only achieve high accuracy in drought stress\\nidentification but also shedding light on the diverse subtle plant features\\nassociated with drought stress. This offers a robust and interpretable solution\\nfor drought stress monitoring for farmers to undertake informed decisions for\\nimproved crop management.\",\"PeriodicalId\":501168,\"journal\":{\"name\":\"arXiv - CS - Emerging Technologies\",\"volume\":\"46 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Emerging Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.21666\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.21666","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
早期检测干旱胁迫对于在干旱影响变得不可逆转之前及时采取措施减少作物损失至关重要。虽然卷积神经网络(CNNs)得到了广泛应用,但视觉变换器(ViTs)在捕捉长程依赖性和错综复杂的空间关系方面提供了一种有前途的替代方法,从而增强了对干旱胁迫微妙指标的检测。我们提出了一种可解释的深度学习流水线,利用视觉转换器的强大功能,利用航空图像检测马铃薯作物的干旱胁迫。我们采用了两种不同的方法:一种是 ViT 和支持向量机(SVM)的协同组合,其中 ViT 从航空图像中提取错综复杂的空间特征,SVM 将作物分类为受胁迫或健康;另一种是端到端方法,使用 ViT 中的专用分类层直接检测干旱胁迫。我们的主要发现通过可视化注意力地图解释了 ViT 模型的决策过程。这些地图突出显示了 ViT 模型作为干旱胁迫特征所关注的航空图像中的特定空间特征。我们的研究结果表明,所提出的方法不仅能实现高精度的干旱胁迫识别,还能揭示与干旱胁迫相关的各种细微植物特征。这为干旱胁迫监测提供了一种稳健且可解释的解决方案,农民可据此做出明智的决策,改善作物管理。
An Explainable Vision Transformer with Transfer Learning Combined with Support Vector Machine Based Efficient Drought Stress Identification
Early detection of drought stress is critical for taking timely measures for
reducing crop loss before the drought impact becomes irreversible. The subtle
phenotypical and physiological changes in response to drought stress are
captured by non-invasive imaging techniques and these imaging data serve as
valuable resource for machine learning methods to identify drought stress.
While convolutional neural networks (CNNs) are in wide use, vision transformers
(ViTs) present a promising alternative in capturing long-range dependencies and
intricate spatial relationships, thereby enhancing the detection of subtle
indicators of drought stress. We propose an explainable deep learning pipeline
that leverages the power of ViTs for drought stress detection in potato crops
using aerial imagery. We applied two distinct approaches: a synergistic
combination of ViT and support vector machine (SVM), where ViT extracts
intricate spatial features from aerial images, and SVM classifies the crops as
stressed or healthy and an end-to-end approach using a dedicated classification
layer within ViT to directly detect drought stress. Our key findings explain
the ViT model's decision-making process by visualizing attention maps. These
maps highlight the specific spatial features within the aerial images that the
ViT model focuses as the drought stress signature. Our findings demonstrate
that the proposed methods not only achieve high accuracy in drought stress
identification but also shedding light on the diverse subtle plant features
associated with drought stress. This offers a robust and interpretable solution
for drought stress monitoring for farmers to undertake informed decisions for
improved crop management.