{"title":"Modeling water hyacinth (Eichhornia crassipes) distribution in Lake Tana, Ethiopia, using machine learning","authors":"Matiwos Belayhun , Asnake Mekuriaw","doi":"10.1016/j.rsase.2024.101273","DOIUrl":null,"url":null,"abstract":"<div><p>Aquatic invasive plant, water hyacinth poses serious environmental and socioeconomic challenges. Understanding and predicting the spatiotemporal distribution of this species is important for reducing its environmental impact. Therefore, the present study aimed to model the distribution of water hyacinths in an important ecological region (Lake Tana) of Ethiopia using four machine learning models. We used 11 variables obtained from Sentinel-1 SAR bands, Sentinel-2A bands and indices, and bioclimate data sources. The models use 458 presence and 458 randomly generated pseudoabsence data as response variables and employ a tenfold bootstrap sampling method. The area under the curve (AUC), receiver operator curve (ROC), true skill statistics (TSS), coefficient of rank correlation (COR), sensitivity, specificity, and kappa coefficient were used to evaluate the models. The findings demonstrate that the random forest model outperforms the other models, with AUC values of 0.93 and 0.95, TSS values of 0.77 and 0.82, and kappa values of 0.76 and 0.82 in the wet and dry seasons, respectively. B12 (16% and 20%), NDWI (15% and 12%), mean annual temperature (13% and 14%), and B5 (11% and 12%) were found to be the most relevant variables during the wet and dry seasons, respectively. Water hyacinths have greater spatial coverage during the wet season than during the dry season because of high rainfall, high water levels and nutrient runoff. We can conclude that to detect and predict the spatiotemporal conditions of water hyacinth accurately, integrating Sentinel image indices and bands with bioclimatic variables and using machine learning models are crucial.</p></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"36 ","pages":"Article 101273"},"PeriodicalIF":3.8000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235293852400137X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Aquatic invasive plant, water hyacinth poses serious environmental and socioeconomic challenges. Understanding and predicting the spatiotemporal distribution of this species is important for reducing its environmental impact. Therefore, the present study aimed to model the distribution of water hyacinths in an important ecological region (Lake Tana) of Ethiopia using four machine learning models. We used 11 variables obtained from Sentinel-1 SAR bands, Sentinel-2A bands and indices, and bioclimate data sources. The models use 458 presence and 458 randomly generated pseudoabsence data as response variables and employ a tenfold bootstrap sampling method. The area under the curve (AUC), receiver operator curve (ROC), true skill statistics (TSS), coefficient of rank correlation (COR), sensitivity, specificity, and kappa coefficient were used to evaluate the models. The findings demonstrate that the random forest model outperforms the other models, with AUC values of 0.93 and 0.95, TSS values of 0.77 and 0.82, and kappa values of 0.76 and 0.82 in the wet and dry seasons, respectively. B12 (16% and 20%), NDWI (15% and 12%), mean annual temperature (13% and 14%), and B5 (11% and 12%) were found to be the most relevant variables during the wet and dry seasons, respectively. Water hyacinths have greater spatial coverage during the wet season than during the dry season because of high rainfall, high water levels and nutrient runoff. We can conclude that to detect and predict the spatiotemporal conditions of water hyacinth accurately, integrating Sentinel image indices and bands with bioclimatic variables and using machine learning models are crucial.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems