Filagot Mengistu , Binyam Tesfaw Hailu , Temesgen Alemayehu Abera , Janne Heiskanen , Tadesse Terefe Zeleke , Tino Johansson , Petri Pellikka
{"title":"Improving the accuracy of honey bee forage class mapping using ensemble learning and multi-source satellite data in Google Earth Engine","authors":"Filagot Mengistu , Binyam Tesfaw Hailu , Temesgen Alemayehu Abera , Janne Heiskanen , Tadesse Terefe Zeleke , Tino Johansson , Petri Pellikka","doi":"10.1016/j.sciaf.2024.e02415","DOIUrl":null,"url":null,"abstract":"<div><div>In semi-arid agro-pastoral environments of Africa, beekeeping is widely recognized as an important activity to improve and diversify livelihoods. Although the scientific identification of suitable honey bees (<em>Apis mellifera ssps.</em>) forages may guide beekeepers to set up apiaries or to timely move honey bee colonies to exploit bee forage resources available in various landscapes, the characterization and mapping of bee forage classes is challenging. We evaluated how various data sources and classification algorithms in Google Earth Engine (GEE) affect the accuracy of honey bee forage class mapping in a semi-arid region of Ethiopia. Predictors derived from multi-source satellite data, such as high-resolution Planet imagery (P), Sentinel 1 RADAR (S1), Sentinel 2 multispectral (S2), and Shuttle Radar Topographic Mission (SRTM) Digital Elevation Model (DEM) were tested and best predictors were identified using Forward Feature Selection (FFS). Four machine learning algorithms (Gradient Tree Boost (GTB), Random Forest (RF), Classification and Regression Trees (CART), and Support Vector Machine (SVM)), all available in GEE, were compared and ensembled for honey bee forage class mapping. The results show that the highest accuracy is obtained by all four algorithms when combining P, S1, S2, and DEM compared to using predictors from a single data source or any other combinations. GTB had higher overall accuracy (90.9 %) than RF (88.2 %), CART (85.5 %), or SVM (79.9 %). Nonetheless, the highest overall accuracy (94.7 %) was obtained when integrating the four machine learning algorithms in an Ensemble Learning Approach (ELA). Applying ELA improved the classification accuracy by 3.8 %, 6.5 %, 9.2 %, and 14.8 % compared to single learner classification algorithms (i.e., GTB, RF, CART, and SVM, respectively). This study demonstrates an improved classification accuracy for honey bee forage class mapping in tropical rangeland by applying ELA, which can provide a better approach for monitoring and managing bee forage resources.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"26 ","pages":"Article e02415"},"PeriodicalIF":2.7000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468227624003570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
In semi-arid agro-pastoral environments of Africa, beekeeping is widely recognized as an important activity to improve and diversify livelihoods. Although the scientific identification of suitable honey bees (Apis mellifera ssps.) forages may guide beekeepers to set up apiaries or to timely move honey bee colonies to exploit bee forage resources available in various landscapes, the characterization and mapping of bee forage classes is challenging. We evaluated how various data sources and classification algorithms in Google Earth Engine (GEE) affect the accuracy of honey bee forage class mapping in a semi-arid region of Ethiopia. Predictors derived from multi-source satellite data, such as high-resolution Planet imagery (P), Sentinel 1 RADAR (S1), Sentinel 2 multispectral (S2), and Shuttle Radar Topographic Mission (SRTM) Digital Elevation Model (DEM) were tested and best predictors were identified using Forward Feature Selection (FFS). Four machine learning algorithms (Gradient Tree Boost (GTB), Random Forest (RF), Classification and Regression Trees (CART), and Support Vector Machine (SVM)), all available in GEE, were compared and ensembled for honey bee forage class mapping. The results show that the highest accuracy is obtained by all four algorithms when combining P, S1, S2, and DEM compared to using predictors from a single data source or any other combinations. GTB had higher overall accuracy (90.9 %) than RF (88.2 %), CART (85.5 %), or SVM (79.9 %). Nonetheless, the highest overall accuracy (94.7 %) was obtained when integrating the four machine learning algorithms in an Ensemble Learning Approach (ELA). Applying ELA improved the classification accuracy by 3.8 %, 6.5 %, 9.2 %, and 14.8 % compared to single learner classification algorithms (i.e., GTB, RF, CART, and SVM, respectively). This study demonstrates an improved classification accuracy for honey bee forage class mapping in tropical rangeland by applying ELA, which can provide a better approach for monitoring and managing bee forage resources.