A. H. N. Mfondoum, Sofia Hakdaoui, Roseline Batcha
{"title":"Landsat 8Bands’ 1 to 7 spectral vectors plus machine learning to improve land use/cover classification using Google Earth Engine","authors":"A. H. N. Mfondoum, Sofia Hakdaoui, Roseline Batcha","doi":"10.1080/19475683.2022.2026475","DOIUrl":null,"url":null,"abstract":"ABSTRACT This paper explores a spectral vector-based methodology on Landsat 8 bands of the visible wavelengths, that is deep-blue (1) to shortwave infrared (7), to improve the urban land features classification. Using two different ratio models, based on two and three bands’ combinations in the cloud environment of Google Earth Engine, the Uncertainty reducing Spectral Vector (USVr), the Onward Continuous Spectral Vector (OSVc) and the Onward Discontinuous Spectral Vector (OSVd) are proposed as new entries for the land use land cover (LULC) classification. Two different sizes of arrays are built, i.e. 42 vectors and 15 vectors corresponding to the same number of derivative bands and new pixels′ values. A decision tree is built in J.48 and applied to select the most suitable derivative bands for the analysis. Hereafter, the selected ones are stacked and submitted to five machine learning classifiers using a supervised process, namely, Classification and Regression Trees (CART), Random Forest (RF) Gradient Boosting (GBR), Support Vector Machine (SVM) and Minimum Distance (MD). This method was tested in the two cities of Bamenda and Foumban in west-Cameroon highlands, due to their good representativeness of tropical hilly urban areas’ spatial heterogeneity. The results are satisfying for 4/5 classifiers, up to 87% Overall Accuracy, OA, for 0.82 kappa coefficient, KC, in Bamenda, while combining SVM/OSVd. Whereas, in Foumban, the classifiers perform up to 85%OA and 0.78 KC for the combination SVM/USVr. Only the MD classifier has always performed below 80%OA. The process has been found better than performing classifiers directly on the multispectral (MS) image, by providing more possibilities of hidden spectral indices not yet explored, as far as we know, to detect and discriminate between LULC features, plus an accurate extraction of human settlements.","PeriodicalId":46270,"journal":{"name":"Annals of GIS","volume":"352 1","pages":"401 - 424"},"PeriodicalIF":2.7000,"publicationDate":"2022-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of GIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/19475683.2022.2026475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 2
Abstract
ABSTRACT This paper explores a spectral vector-based methodology on Landsat 8 bands of the visible wavelengths, that is deep-blue (1) to shortwave infrared (7), to improve the urban land features classification. Using two different ratio models, based on two and three bands’ combinations in the cloud environment of Google Earth Engine, the Uncertainty reducing Spectral Vector (USVr), the Onward Continuous Spectral Vector (OSVc) and the Onward Discontinuous Spectral Vector (OSVd) are proposed as new entries for the land use land cover (LULC) classification. Two different sizes of arrays are built, i.e. 42 vectors and 15 vectors corresponding to the same number of derivative bands and new pixels′ values. A decision tree is built in J.48 and applied to select the most suitable derivative bands for the analysis. Hereafter, the selected ones are stacked and submitted to five machine learning classifiers using a supervised process, namely, Classification and Regression Trees (CART), Random Forest (RF) Gradient Boosting (GBR), Support Vector Machine (SVM) and Minimum Distance (MD). This method was tested in the two cities of Bamenda and Foumban in west-Cameroon highlands, due to their good representativeness of tropical hilly urban areas’ spatial heterogeneity. The results are satisfying for 4/5 classifiers, up to 87% Overall Accuracy, OA, for 0.82 kappa coefficient, KC, in Bamenda, while combining SVM/OSVd. Whereas, in Foumban, the classifiers perform up to 85%OA and 0.78 KC for the combination SVM/USVr. Only the MD classifier has always performed below 80%OA. The process has been found better than performing classifiers directly on the multispectral (MS) image, by providing more possibilities of hidden spectral indices not yet explored, as far as we know, to detect and discriminate between LULC features, plus an accurate extraction of human settlements.