Integrated machine learning and geospatial analysis enhanced gully erosion susceptibility modeling in the Erer watershed in Eastern Ethiopia

IF 3.3 3区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Frontiers in Environmental Science Pub Date : 2024-08-06 DOI:10.3389/fenvs.2024.1410741
Tadele Bedo Gelete, Pernaidu Pasala, Nigus Gebremedhn Abay, Gezahegn Weldu Woldemariam, Kalid Hassen Yasin, Erana Kebede, Ibsa Aliyi
{"title":"Integrated machine learning and geospatial analysis enhanced gully erosion susceptibility modeling in the Erer watershed in Eastern Ethiopia","authors":"Tadele Bedo Gelete, Pernaidu Pasala, Nigus Gebremedhn Abay, Gezahegn Weldu Woldemariam, Kalid Hassen Yasin, Erana Kebede, Ibsa Aliyi","doi":"10.3389/fenvs.2024.1410741","DOIUrl":null,"url":null,"abstract":"Land degradation from gully erosion poses a significant threat to the Erer watershed in Eastern Ethiopia, particularly due to agricultural activities and resource exploitation. Identifying erosion-prone areas and underlying factors using advanced machine learning algorithms (MLAs) and geospatial analysis is crucial for addressing this problem and prioritizing adaptive and mitigating strategies. However, previous studies have not leveraged machine learning (ML) and GIS-based approaches to generate susceptibility maps identifying these areas and conditioning factors, hindering sustainable watershed management solutions. This study aimed to predict gully erosion susceptibility (GES) and identify underlying areas and factors in the Erer watershed. Four ML models, namely, XGBoost, random forest (RF), support vector machine (SVM), and artificial neural network (ANN), were integrated with geospatial analysis using 22 geoenvironmental predictors and 1,200 inventory points (70% used for training and 30% for testing). Model performance and robustness were validated through the area under the curve (AUC), accuracy, precision, sensitivity, specificity, kappa coefficient, F1 score, and logarithmic loss. The relative slope position is most influential, with 100% importance in SVM and RF and 95% importance in XGBoost, while annual rainfall (AR) dominated ANN (100% importance). Notably, XGBoost demonstrated robustness and superior prediction/mapping, achieving an AUC of 0.97, 91% accuracy, 92% precision, and 81% kappa while maintaining a low logloss (0.0394). However, SVM excelled in classifying gully resistant/susceptible areas (97% sensitivity, 98% specificity, and 91% F1 score). The ANN model predicted the most areas with very high gully susceptibility (13.74%), followed by the SVM (11.69%), XGBoost (10.65%), and RF (7.85%) models, while XGBoost identified the most areas with very low susceptibility (70.19%). The ensemble technique was employed to further enhance GES modeling, and it outperformed the individual models, achieving an AUC of 0.99, 93.5% accuracy, 92.5% precision, 97.5% sensitivity, 95.4% specificity, 85.8% kappa, and 94.9% F1 score. This technique also classified the GES of the watershed as 36.48% very low, 26.51% low, 16.24% moderate, 11.55% high, and 9.22% very high. Furthermore, district-level analyses revealed the most susceptible areas, including the Babile, Fedis, Harar, and Meyumuluke districts, with high GES areas of 32.4%, 21.3%, 14.3%, and 13.6%, respectively. This study offers robust and flexible ML models with comprehensive validation metrics to enhance GES modeling and identify gully prone areas and factors, thereby supporting decision-making for sustainable watershed conservation and land degradation prevention.","PeriodicalId":12460,"journal":{"name":"Frontiers in Environmental Science","volume":"193 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Environmental Science","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.3389/fenvs.2024.1410741","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Land degradation from gully erosion poses a significant threat to the Erer watershed in Eastern Ethiopia, particularly due to agricultural activities and resource exploitation. Identifying erosion-prone areas and underlying factors using advanced machine learning algorithms (MLAs) and geospatial analysis is crucial for addressing this problem and prioritizing adaptive and mitigating strategies. However, previous studies have not leveraged machine learning (ML) and GIS-based approaches to generate susceptibility maps identifying these areas and conditioning factors, hindering sustainable watershed management solutions. This study aimed to predict gully erosion susceptibility (GES) and identify underlying areas and factors in the Erer watershed. Four ML models, namely, XGBoost, random forest (RF), support vector machine (SVM), and artificial neural network (ANN), were integrated with geospatial analysis using 22 geoenvironmental predictors and 1,200 inventory points (70% used for training and 30% for testing). Model performance and robustness were validated through the area under the curve (AUC), accuracy, precision, sensitivity, specificity, kappa coefficient, F1 score, and logarithmic loss. The relative slope position is most influential, with 100% importance in SVM and RF and 95% importance in XGBoost, while annual rainfall (AR) dominated ANN (100% importance). Notably, XGBoost demonstrated robustness and superior prediction/mapping, achieving an AUC of 0.97, 91% accuracy, 92% precision, and 81% kappa while maintaining a low logloss (0.0394). However, SVM excelled in classifying gully resistant/susceptible areas (97% sensitivity, 98% specificity, and 91% F1 score). The ANN model predicted the most areas with very high gully susceptibility (13.74%), followed by the SVM (11.69%), XGBoost (10.65%), and RF (7.85%) models, while XGBoost identified the most areas with very low susceptibility (70.19%). The ensemble technique was employed to further enhance GES modeling, and it outperformed the individual models, achieving an AUC of 0.99, 93.5% accuracy, 92.5% precision, 97.5% sensitivity, 95.4% specificity, 85.8% kappa, and 94.9% F1 score. This technique also classified the GES of the watershed as 36.48% very low, 26.51% low, 16.24% moderate, 11.55% high, and 9.22% very high. Furthermore, district-level analyses revealed the most susceptible areas, including the Babile, Fedis, Harar, and Meyumuluke districts, with high GES areas of 32.4%, 21.3%, 14.3%, and 13.6%, respectively. This study offers robust and flexible ML models with comprehensive validation metrics to enhance GES modeling and identify gully prone areas and factors, thereby supporting decision-making for sustainable watershed conservation and land degradation prevention.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
综合机器学习和地理空间分析增强了埃塞俄比亚东部埃勒流域沟壑侵蚀易发性建模能力
沟壑侵蚀造成的土地退化对埃塞俄比亚东部的埃勒流域构成了严重威胁,特别是由于农业活动和资源开发造成的土地退化。利用先进的机器学习算法(MLAs)和地理空间分析来识别易受侵蚀的地区和潜在因素,对于解决这一问题以及确定适应和缓解战略的优先次序至关重要。然而,以往的研究并没有利用机器学习(ML)和基于地理信息系统的方法来生成确定这些地区和影响因素的易感性地图,从而阻碍了可持续的流域管理解决方案。本研究旨在预测沟谷侵蚀易发性(GES),并确定埃尔河流域的潜在区域和因素。利用 22 个地理环境预测因子和 1,200 个清单点(70% 用于训练,30% 用于测试),将四种 ML 模型,即 XGBoost、随机森林 (RF)、支持向量机 (SVM) 和人工神经网络 (ANN) 与地理空间分析相结合。通过曲线下面积(AUC)、准确度、精确度、灵敏度、特异性、卡帕系数、F1 分数和对数损失验证了模型的性能和稳健性。相对坡度位置的影响最大,在 SVM 和 RF 中的重要性为 100%,在 XGBoost 中的重要性为 95%,而年降雨量(AR)在 ANN 中占主导地位(重要性为 100%)。值得注意的是,XGBoost 表现出稳健性和出色的预测/绘图能力,其 AUC 达到 0.97,准确率达到 91%,精确率达到 92%,卡帕率达到 81%,同时保持了较低的对数损失(0.0394)。然而,SVM 在沟壑抗性/易受影响区域的分类方面表现出色(灵敏度 97%、特异性 98% 和 F1 分数 91%)。ANN 模型预测出了最多的沟壑易感性极高的区域(13.74%),其次是 SVM(11.69%)、XGBoost(10.65%)和 RF(7.85%)模型,而 XGBoost 则识别出了最多的易感性极低的区域(70.19%)。采用集合技术进一步增强了 GES 建模能力,其 AUC 值为 0.99,准确率为 93.5%,精确率为 92.5%,灵敏度为 97.5%,特异性为 95.4%,卡帕率为 85.8%,F1 分数为 94.9%,表现优于单个模型。该技术还将流域的 GES 分为极低 36.48%、低 26.51%、中等 16.24%、高 11.55%、极高 9.22%。此外,区级分析显示了最易受影响的地区,包括巴比莱区、菲迪斯区、哈拉尔区和梅乌穆卢克区,其 GES 高的地区分别为 32.4%、21.3%、14.3% 和 13.6%。本研究提供了稳健灵活的 ML 模型和全面的验证指标,以加强 GES 建模并识别沟壑易发地区和因素,从而为可持续流域保护和防止土地退化的决策提供支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Environmental Science
Frontiers in Environmental Science Environmental Science-General Environmental Science
CiteScore
4.50
自引率
8.70%
发文量
2276
审稿时长
12 weeks
期刊介绍: Our natural world is experiencing a state of rapid change unprecedented in the presence of humans. The changes affect virtually all physical, chemical and biological systems on Earth. The interaction of these systems leads to tipping points, feedbacks and amplification of effects. In virtually all cases, the causes of environmental change can be traced to human activity through either direct interventions as a consequence of pollution, or through global warming from greenhouse case emissions. Well-formulated and internationally-relevant policies to mitigate the change, or adapt to the consequences, that will ensure our ability to thrive in the coming decades are badly needed. Without proper understanding of the processes involved, and deep understanding of the likely impacts of bad decisions or inaction, the security of food, water and energy is a risk. Left unchecked shortages of these basic commodities will lead to migration, global geopolitical tension and conflict. This represents the major challenge of our time. We are the first generation to appreciate the problem and we will be judged in future by our ability to determine and take the action necessary. Appropriate knowledge of the condition of our natural world, appreciation of the changes occurring, and predictions of how the future will develop are requisite to the definition and implementation of solutions. Frontiers in Environmental Science publishes research at the cutting edge of knowledge of our natural world and its various intersections with society. It bridges between the identification and measurement of change, comprehension of the processes responsible, and the measures needed to reduce their impact. Its aim is to assist the formulation of policies, by offering sound scientific evidence on environmental science, that will lead to a more inhabitable and sustainable world for the generations to come.
期刊最新文献
Spatial distribution of available phosphorus in surface road and trackway surface materials on a sheep farm in Ireland Heavy metal changes related to land use changes in a karst area: a case study in Changshun, Guizhou Province, China Long-term trends in water transparency of Tibetan Plateau lakes and the response to extreme climate events Carbon neutralization frontier tracking Occurrence, sustainable treatment technologies, potential sources, and future prospects of emerging pollutants in aquatic environments: a review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1