利用 K-means 和 XGBoost-SHAP 算法分析西藏自治区冰川融化的空间异质性及其影响因素

IF 4.8 2区 环境科学与生态学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Environmental Modelling & Software Pub Date : 2024-08-31 DOI:10.1016/j.envsoft.2024.106194
{"title":"利用 K-means 和 XGBoost-SHAP 算法分析西藏自治区冰川融化的空间异质性及其影响因素","authors":"","doi":"10.1016/j.envsoft.2024.106194","DOIUrl":null,"url":null,"abstract":"<div><p>This study employed machine learning to comprehensively analyze glacier melting in Tibet Autonomous Region (TAR) and its vital influencing factors. Existing machine learning research often lacks detailed explanations, leading to generalized predictions without considering essential driving factors necessary for yielding an insightful understanding of glacier melting dynamics. To overcome these limitations and fulfill multi-level analysis requirements for comprehending glacier melting, this study identifies factors contributing to glacier melting heterogeneity and assesses distinct melting causes in three spatial melted glacier clusters. We utilized K-means unsupervised classification to cluster Tibet melted glaciers into three categories based on temperature, sunshine hours, evapotranspiration, precipitation, normalized vegetation index, and slope. XGBoost algorithm explores the nonlinear relationships of glacier melting with these features and Shapley values were used for model transparency, quantifying feature's influence on the melting process. Investigating geographical heterogeneity among clusters enhanced our understanding of the observed changes. High fitting accuracy (&gt;0.98) enhanced the result reliability, as well. The results show that Tibetan glaciers melt significantly from 2010 to 2020, and the cluster analysis reveals its unique melting characteristics. Melting glaciers in the same cluster are not only similar in characteristics, but also in spatial and geographical distribution, with two of the clusters concentrating in the eastern part of TAR, and the third cluster scattered in the western part of the country. the XGBoost-SHAP analysis efficiently quantifies the contribution of each cluster feature to the glacier melting, revealing the different roles of different clustered features.</p></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":null,"pages":null},"PeriodicalIF":4.8000,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of the spatial heterogeneity of glacier melting in Tibet Autonomous Region and its influential factors using the K-means and XGBoost-SHAP algorithms\",\"authors\":\"\",\"doi\":\"10.1016/j.envsoft.2024.106194\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This study employed machine learning to comprehensively analyze glacier melting in Tibet Autonomous Region (TAR) and its vital influencing factors. Existing machine learning research often lacks detailed explanations, leading to generalized predictions without considering essential driving factors necessary for yielding an insightful understanding of glacier melting dynamics. To overcome these limitations and fulfill multi-level analysis requirements for comprehending glacier melting, this study identifies factors contributing to glacier melting heterogeneity and assesses distinct melting causes in three spatial melted glacier clusters. We utilized K-means unsupervised classification to cluster Tibet melted glaciers into three categories based on temperature, sunshine hours, evapotranspiration, precipitation, normalized vegetation index, and slope. XGBoost algorithm explores the nonlinear relationships of glacier melting with these features and Shapley values were used for model transparency, quantifying feature's influence on the melting process. Investigating geographical heterogeneity among clusters enhanced our understanding of the observed changes. High fitting accuracy (&gt;0.98) enhanced the result reliability, as well. The results show that Tibetan glaciers melt significantly from 2010 to 2020, and the cluster analysis reveals its unique melting characteristics. Melting glaciers in the same cluster are not only similar in characteristics, but also in spatial and geographical distribution, with two of the clusters concentrating in the eastern part of TAR, and the third cluster scattered in the western part of the country. the XGBoost-SHAP analysis efficiently quantifies the contribution of each cluster feature to the glacier melting, revealing the different roles of different clustered features.</p></div>\",\"PeriodicalId\":310,\"journal\":{\"name\":\"Environmental Modelling & Software\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2024-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Modelling & Software\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S136481522400255X\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S136481522400255X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

本研究利用机器学习全面分析了西藏自治区(TAR)的冰川融化及其重要影响因素。现有的机器学习研究往往缺乏详细的解释,导致预测结果过于笼统,没有考虑深刻理解冰川融化动态所必需的重要驱动因素。为了克服这些局限性并满足理解冰川融化的多层次分析要求,本研究确定了导致冰川融化异质性的因素,并评估了三个空间融化冰川群中不同的融化原因。我们利用 K-means 无监督分类法,根据温度、日照时数、蒸散量、降水量、归一化植被指数和坡度,将西藏融化冰川分为三类。XGBoost 算法探讨了冰川融化与这些特征之间的非线性关系,Shapley 值用于模型透明度,量化特征对融化过程的影响。对集群间地理异质性的研究加深了我们对观测到的变化的理解。高拟合精度(大于 0.98)也提高了结果的可靠性。结果表明,2010 年至 2020 年西藏冰川融化显著,聚类分析揭示了其独特的融化特征。XGBoost-SHAP 分析有效地量化了每个聚类特征对冰川融化的贡献,揭示了不同聚类特征的不同作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Analysis of the spatial heterogeneity of glacier melting in Tibet Autonomous Region and its influential factors using the K-means and XGBoost-SHAP algorithms

This study employed machine learning to comprehensively analyze glacier melting in Tibet Autonomous Region (TAR) and its vital influencing factors. Existing machine learning research often lacks detailed explanations, leading to generalized predictions without considering essential driving factors necessary for yielding an insightful understanding of glacier melting dynamics. To overcome these limitations and fulfill multi-level analysis requirements for comprehending glacier melting, this study identifies factors contributing to glacier melting heterogeneity and assesses distinct melting causes in three spatial melted glacier clusters. We utilized K-means unsupervised classification to cluster Tibet melted glaciers into three categories based on temperature, sunshine hours, evapotranspiration, precipitation, normalized vegetation index, and slope. XGBoost algorithm explores the nonlinear relationships of glacier melting with these features and Shapley values were used for model transparency, quantifying feature's influence on the melting process. Investigating geographical heterogeneity among clusters enhanced our understanding of the observed changes. High fitting accuracy (>0.98) enhanced the result reliability, as well. The results show that Tibetan glaciers melt significantly from 2010 to 2020, and the cluster analysis reveals its unique melting characteristics. Melting glaciers in the same cluster are not only similar in characteristics, but also in spatial and geographical distribution, with two of the clusters concentrating in the eastern part of TAR, and the third cluster scattered in the western part of the country. the XGBoost-SHAP analysis efficiently quantifies the contribution of each cluster feature to the glacier melting, revealing the different roles of different clustered features.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Modelling & Software
Environmental Modelling & Software 工程技术-工程:环境
CiteScore
9.30
自引率
8.20%
发文量
241
审稿时长
60 days
期刊介绍: Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.
期刊最新文献
Assessing the influence of temperature on slope stability in a temperate climate: A nationwide spatial probability analysis in Italy Research progress and prospects of urban flooding simulation: From traditional numerical models to deep learning approaches Editorial Board An integrated, automated and modular approach for real-time weather monitoring of surface meteorological variables and short-range forecasting using machine learning The Fogees system for forecasting particulate matter concentrations in urban areas
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1