{"title":"农业多光谱数据的比较研究和机器学习支持的高效分类","authors":"Priyanka Gupta, S. Kanga, Varun Narayan Mishra","doi":"10.21123/bsj.2023.8952","DOIUrl":null,"url":null,"abstract":"Reliable and accurate crop maps are required for food security from regional to global scale. The increased availability of satellite imagery leads to a “Big Data” problem while producing crop maps. Now, cloud-based platforms have gained a lot of attention for crop classification over large regions. The main goal of the research is to analyze crop classification using various machine learning (ML) such as Support Vector Machine (SVM), Gradient Tree Boosting (GTB), Random Forest (RF), Decision Tree (DT) as well as Classification and Regression Trees (CART) on Google Earth Engine platform. The aim is to explore the Google Earth Engine’s efficiency (GEE) when classification different crops using multi- spectral datasets of Sentinel 2 MSI and Landsat 8 OLI satellites for crop mapping of Mathura district of Uttar Pradesh, India. The best cloud free image (less than 5%) of Landsat 8 OLI and Sentinel 2 MSI datasets (\"2020-12-26\",\"2020-12-30\") were used for crop classification with the help of automatic filtering i.e. percentage cloud property on the GEE platforms. Moreover that GEE platform perform, acquiring, clarifying as well as preprocessing of satellite dataset could be organized very powerfully. Points as feature spaces were used like training datasets. Furthermore confusion matrixes are used for accuracy assessment (producer and user accuracy) and kappa coefficient. Additionally compare the outcome of the dataset on the basis of overall accuracy (OA), F1 score as well as kappa coefficient. The highest OA is found using GTB (86.7%) followed by RF (82.5%), CART (81.0%), DT (78.1%) and SVM (66.5%) for Landsat 8 OLI image. For the Sentinel 2 image, GTB achieved the highest OA of 84.2% followed by SVM (84%), RF (82.3%), DT (75.2%), and CART (75. 0%) respectively. On the basis of research, found that GTB performed well among all the classifiers to crop mapping using both multi-spectral datasets.","PeriodicalId":8687,"journal":{"name":"Baghdad Science Journal","volume":"599 3","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2023-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comparative Study and Machine Learning Enabled Efficient Classification for Multispectral Data in Agriculture\",\"authors\":\"Priyanka Gupta, S. Kanga, Varun Narayan Mishra\",\"doi\":\"10.21123/bsj.2023.8952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reliable and accurate crop maps are required for food security from regional to global scale. The increased availability of satellite imagery leads to a “Big Data” problem while producing crop maps. Now, cloud-based platforms have gained a lot of attention for crop classification over large regions. The main goal of the research is to analyze crop classification using various machine learning (ML) such as Support Vector Machine (SVM), Gradient Tree Boosting (GTB), Random Forest (RF), Decision Tree (DT) as well as Classification and Regression Trees (CART) on Google Earth Engine platform. The aim is to explore the Google Earth Engine’s efficiency (GEE) when classification different crops using multi- spectral datasets of Sentinel 2 MSI and Landsat 8 OLI satellites for crop mapping of Mathura district of Uttar Pradesh, India. The best cloud free image (less than 5%) of Landsat 8 OLI and Sentinel 2 MSI datasets (\\\"2020-12-26\\\",\\\"2020-12-30\\\") were used for crop classification with the help of automatic filtering i.e. percentage cloud property on the GEE platforms. Moreover that GEE platform perform, acquiring, clarifying as well as preprocessing of satellite dataset could be organized very powerfully. Points as feature spaces were used like training datasets. Furthermore confusion matrixes are used for accuracy assessment (producer and user accuracy) and kappa coefficient. Additionally compare the outcome of the dataset on the basis of overall accuracy (OA), F1 score as well as kappa coefficient. The highest OA is found using GTB (86.7%) followed by RF (82.5%), CART (81.0%), DT (78.1%) and SVM (66.5%) for Landsat 8 OLI image. For the Sentinel 2 image, GTB achieved the highest OA of 84.2% followed by SVM (84%), RF (82.3%), DT (75.2%), and CART (75. 0%) respectively. On the basis of research, found that GTB performed well among all the classifiers to crop mapping using both multi-spectral datasets.\",\"PeriodicalId\":8687,\"journal\":{\"name\":\"Baghdad Science Journal\",\"volume\":\"599 3\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Baghdad Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21123/bsj.2023.8952\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Baghdad Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21123/bsj.2023.8952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
从区域到全球范围的粮食安全都需要可靠而准确的作物地图。卫星图像可用性的提高导致了制作作物地图时的 "大数据 "问题。现在,基于云的平台在大面积农作物分类方面受到了广泛关注。本研究的主要目标是在谷歌地球引擎平台上使用支持向量机(SVM)、梯度树提升(GTB)、随机森林(RF)、决策树(DT)以及分类和回归树(CART)等各种机器学习(ML)分析作物分类。该研究旨在探索谷歌地球引擎在利用哨兵 2 MSI 和大地遥感卫星 8 OLI 的多光谱数据集对不同作物进行分类时的效率(GEE),以绘制印度北方邦马图拉地区的作物图。借助 GEE 平台上的自动过滤功能(即云属性百分比),Landsat 8 OLI 和 Sentinel 2 MSI 数据集("2020-12-26"、"2020-12-30")中的最佳无云图像(小于 5%)被用于作物分类。此外,GEE 平台还能对卫星数据集的获取、澄清和预处理进行有效组织。作为特征空间的点被用作训练数据集。此外,混淆矩阵被用于准确度评估(生产者和用户准确度)和卡帕系数。此外,还根据总体准确度(OA)、F1 分数和卡帕系数对数据集的结果进行了比较。对于 Landsat 8 OLI 图像,使用 GTB 的 OA 最高(86.7%),其次是 RF(82.5%)、CART(81.0%)、DT(78.1%)和 SVM(66.5%)。在哨兵 2 号图像中,GTB 的 OA 值最高,达到 84.2%,其次分别是 SVM(84%)、RF(82.3%)、DT(75.2%)和 CART(75.0%)。研究发现,在使用这两个多光谱数据集绘制作物分布图的所有分类器中,GTB 的表现都很出色。
A Comparative Study and Machine Learning Enabled Efficient Classification for Multispectral Data in Agriculture
Reliable and accurate crop maps are required for food security from regional to global scale. The increased availability of satellite imagery leads to a “Big Data” problem while producing crop maps. Now, cloud-based platforms have gained a lot of attention for crop classification over large regions. The main goal of the research is to analyze crop classification using various machine learning (ML) such as Support Vector Machine (SVM), Gradient Tree Boosting (GTB), Random Forest (RF), Decision Tree (DT) as well as Classification and Regression Trees (CART) on Google Earth Engine platform. The aim is to explore the Google Earth Engine’s efficiency (GEE) when classification different crops using multi- spectral datasets of Sentinel 2 MSI and Landsat 8 OLI satellites for crop mapping of Mathura district of Uttar Pradesh, India. The best cloud free image (less than 5%) of Landsat 8 OLI and Sentinel 2 MSI datasets ("2020-12-26","2020-12-30") were used for crop classification with the help of automatic filtering i.e. percentage cloud property on the GEE platforms. Moreover that GEE platform perform, acquiring, clarifying as well as preprocessing of satellite dataset could be organized very powerfully. Points as feature spaces were used like training datasets. Furthermore confusion matrixes are used for accuracy assessment (producer and user accuracy) and kappa coefficient. Additionally compare the outcome of the dataset on the basis of overall accuracy (OA), F1 score as well as kappa coefficient. The highest OA is found using GTB (86.7%) followed by RF (82.5%), CART (81.0%), DT (78.1%) and SVM (66.5%) for Landsat 8 OLI image. For the Sentinel 2 image, GTB achieved the highest OA of 84.2% followed by SVM (84%), RF (82.3%), DT (75.2%), and CART (75. 0%) respectively. On the basis of research, found that GTB performed well among all the classifiers to crop mapping using both multi-spectral datasets.
期刊介绍:
The journal publishes academic and applied papers dealing with recent topics and scientific concepts. Papers considered for publication in biology, chemistry, computer sciences, physics, and mathematics. Accepted papers will be freely downloaded by professors, researchers, instructors, students, and interested workers. ( Open Access) Published Papers are registered and indexed in the universal libraries.