{"title":"基于迁移学习的自监督分解作物分类","authors":"J. Jayanth, H. K. Ravikiran, K. M. Madhu","doi":"10.25081/jaa.2023.v9.8566","DOIUrl":null,"url":null,"abstract":"The 2S-DT (Self-Supervised Decomposition for Transfer Learning) model, created for crop categorization using remotely sensed data, is a unique method introduced in this paper. It deals with the difficulty of incorrectly identifying crops with comparable phenology patterns, a problem that frequently arises in agricultural remote sensing. Two datasets from Nanajangudu taluk in the Mysore district, which has a widely varied irrigated agriculture system, are used to assess the model. Using self-supervised learning, the 2S-DT model addresses the misclassification issue that frequently occurs when working with unlabeled classes, especially in high-resolution images. It uses class decomposition (CD) layer and a downstream learning approach. Using the model’s learning and the particulars of each geographical context, this layer improves the information’s arrangement. Our model architecture’s foundation is ResNet, a well-known deep learning framework. Each residual block in our ResNet architecture is made up of two 3x3 convolutional layers. Each convolutional layer is followed by batch normalization and Rectified Linear Unit (ReLU) activation functions, which improve the model’s capacity for learning. We utilized a 7x7 convolutional layer with 64 filters and a stride of 2 for Conv1 in ResNet18, resulting in an output size of 112x112x64. Conv2, which consists of Res2a and Res2b, generated an output with the dimensions 48x48x64. Conv3, which included Res3a and Res3b, produced an output with the dimensions 28x28x128. These architectural selections were made with our experimental needs in mind. The 2S-DT model’s newly added features make it easier to identify classes and update weights, improving the stability of the features’ spatial and spectral data. Extensive tests performed on two datasets show the model’s viability. Overall accuracy has improved significantly, with the 2S-DT model surpassing comparable models like TVSM, 3DCAE, and GAN Model by obtaining 95.65% accuracy for dataset 1 and 88.91% accuracy for dataset 2.","PeriodicalId":36570,"journal":{"name":"Journal of Aridland Agriculture","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of Crops through Self-Supervised Decomposition for Transfer Learning\",\"authors\":\"J. Jayanth, H. K. Ravikiran, K. M. Madhu\",\"doi\":\"10.25081/jaa.2023.v9.8566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The 2S-DT (Self-Supervised Decomposition for Transfer Learning) model, created for crop categorization using remotely sensed data, is a unique method introduced in this paper. It deals with the difficulty of incorrectly identifying crops with comparable phenology patterns, a problem that frequently arises in agricultural remote sensing. Two datasets from Nanajangudu taluk in the Mysore district, which has a widely varied irrigated agriculture system, are used to assess the model. Using self-supervised learning, the 2S-DT model addresses the misclassification issue that frequently occurs when working with unlabeled classes, especially in high-resolution images. It uses class decomposition (CD) layer and a downstream learning approach. Using the model’s learning and the particulars of each geographical context, this layer improves the information’s arrangement. Our model architecture’s foundation is ResNet, a well-known deep learning framework. Each residual block in our ResNet architecture is made up of two 3x3 convolutional layers. Each convolutional layer is followed by batch normalization and Rectified Linear Unit (ReLU) activation functions, which improve the model’s capacity for learning. We utilized a 7x7 convolutional layer with 64 filters and a stride of 2 for Conv1 in ResNet18, resulting in an output size of 112x112x64. Conv2, which consists of Res2a and Res2b, generated an output with the dimensions 48x48x64. Conv3, which included Res3a and Res3b, produced an output with the dimensions 28x28x128. These architectural selections were made with our experimental needs in mind. The 2S-DT model’s newly added features make it easier to identify classes and update weights, improving the stability of the features’ spatial and spectral data. Extensive tests performed on two datasets show the model’s viability. Overall accuracy has improved significantly, with the 2S-DT model surpassing comparable models like TVSM, 3DCAE, and GAN Model by obtaining 95.65% accuracy for dataset 1 and 88.91% accuracy for dataset 2.\",\"PeriodicalId\":36570,\"journal\":{\"name\":\"Journal of Aridland Agriculture\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Aridland Agriculture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25081/jaa.2023.v9.8566\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Aridland Agriculture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25081/jaa.2023.v9.8566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 0
摘要
本文介绍了一种独特的基于遥感数据的作物分类方法,即2S-DT (Self-Supervised Decomposition for Transfer Learning)模型。它处理了不正确地识别具有可比物候模式的作物的困难,这是农业遥感中经常出现的问题。来自迈索尔地区的Nanajangudu taluk的两个数据集被用于评估该模型,该地区拥有广泛多样的灌溉农业系统。使用自监督学习,2S-DT模型解决了在处理未标记类时经常出现的错误分类问题,特别是在高分辨率图像中。它使用类分解(CD)层和下游学习方法。这一层利用模型的学习和每个地理环境的特殊性,改进了信息的排列。我们的模型架构的基础是ResNet,一个著名的深度学习框架。我们的ResNet架构中的每个残差块由两个3x3卷积层组成。每个卷积层之后是批归一化和整流线性单元(ReLU)激活函数,提高了模型的学习能力。我们在ResNet18中使用了一个7x7的卷积层,其中包含64个过滤器,Conv1的步长为2,结果输出大小为112x112x64。由Res2a和Res2b组成的Conv2生成了尺寸为48x48x64的输出。Conv3(包括Res3a和Res3b)产生的输出尺寸为28x28x128。这些建筑的选择是考虑到我们的实验需求。2S-DT模型新增的特征使其更容易识别类别和更新权重,提高了特征空间和光谱数据的稳定性。在两个数据集上进行的广泛测试显示了该模型的可行性。总体准确率显著提高,2S-DT模型在数据集1和数据集2的准确率分别达到95.65%和88.91%,超过了TVSM、3DCAE和GAN模型等可比模型。
Classification of Crops through Self-Supervised Decomposition for Transfer Learning
The 2S-DT (Self-Supervised Decomposition for Transfer Learning) model, created for crop categorization using remotely sensed data, is a unique method introduced in this paper. It deals with the difficulty of incorrectly identifying crops with comparable phenology patterns, a problem that frequently arises in agricultural remote sensing. Two datasets from Nanajangudu taluk in the Mysore district, which has a widely varied irrigated agriculture system, are used to assess the model. Using self-supervised learning, the 2S-DT model addresses the misclassification issue that frequently occurs when working with unlabeled classes, especially in high-resolution images. It uses class decomposition (CD) layer and a downstream learning approach. Using the model’s learning and the particulars of each geographical context, this layer improves the information’s arrangement. Our model architecture’s foundation is ResNet, a well-known deep learning framework. Each residual block in our ResNet architecture is made up of two 3x3 convolutional layers. Each convolutional layer is followed by batch normalization and Rectified Linear Unit (ReLU) activation functions, which improve the model’s capacity for learning. We utilized a 7x7 convolutional layer with 64 filters and a stride of 2 for Conv1 in ResNet18, resulting in an output size of 112x112x64. Conv2, which consists of Res2a and Res2b, generated an output with the dimensions 48x48x64. Conv3, which included Res3a and Res3b, produced an output with the dimensions 28x28x128. These architectural selections were made with our experimental needs in mind. The 2S-DT model’s newly added features make it easier to identify classes and update weights, improving the stability of the features’ spatial and spectral data. Extensive tests performed on two datasets show the model’s viability. Overall accuracy has improved significantly, with the 2S-DT model surpassing comparable models like TVSM, 3DCAE, and GAN Model by obtaining 95.65% accuracy for dataset 1 and 88.91% accuracy for dataset 2.