Sulei Naibi , Anming Bao , Ye Yuan , Jiayu Bao , Rafiq Hamdi , Tao Yu , Xiaoran Huang , Ting Wang , Tao Li , Jingyu Jin , Gang Long , Piet Termonia
{"title":"Flood-drought shifts monitoring on arid Xinjiang, China using a novel machine learning based algorithm","authors":"Sulei Naibi , Anming Bao , Ye Yuan , Jiayu Bao , Rafiq Hamdi , Tao Yu , Xiaoran Huang , Ting Wang , Tao Li , Jingyu Jin , Gang Long , Piet Termonia","doi":"10.1016/j.ecoinf.2025.103030","DOIUrl":null,"url":null,"abstract":"<div><div>This study addresses the growing challenges of climate extremes and their impact on flood-drought shifts in Xinjiang, China, a region highly sensitive to climate variations. While existing classification models such as logistic regression (LR), support vector machines (SVMs), and geographically weighted logistic regression (GWLR) have been applied to spatial data, they exhibit limitations in handling spatial nonstationarity and balancing accuracy with interpretability. To fill this gap, we propose a novel least squares SVM (LSSVM)-based spatially varying coefficient logistic regression (LSSVM-SVCLR) model, which combines the flexibility of LSSVM with the interpretability of logistic regression and the spatial adaptability of spatially varying coefficient models. Through simulations under varying data sizes and complexity, the model achieved high accuracy, with area under the curve (<em>AUC</em>) values approaching 1 in simpler cases and around 0.8 in more complex scenarios. A case study analyzing the relationship between climate extremes and flood-drought shifts in Xinjiang demonstrated the model's applicability, achieving training and testing accuracies of 0.994 and 0.831, respectively, outperforming state-of-the-art machine learning models. Furthermore, the model revealed specific spatial effects of climate extremes on flood-drought shifts, providing probabilistic predictions across the study area. The findings highlight the potential of the proposed model to improve predictions of extreme climate-related events, offering valuable insights for disaster management and climate risk evaluation. This study provides a robust framework for analyzing the complexities of spatial nonstationarity in climate risk analysis.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103030"},"PeriodicalIF":7.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125000391","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study addresses the growing challenges of climate extremes and their impact on flood-drought shifts in Xinjiang, China, a region highly sensitive to climate variations. While existing classification models such as logistic regression (LR), support vector machines (SVMs), and geographically weighted logistic regression (GWLR) have been applied to spatial data, they exhibit limitations in handling spatial nonstationarity and balancing accuracy with interpretability. To fill this gap, we propose a novel least squares SVM (LSSVM)-based spatially varying coefficient logistic regression (LSSVM-SVCLR) model, which combines the flexibility of LSSVM with the interpretability of logistic regression and the spatial adaptability of spatially varying coefficient models. Through simulations under varying data sizes and complexity, the model achieved high accuracy, with area under the curve (AUC) values approaching 1 in simpler cases and around 0.8 in more complex scenarios. A case study analyzing the relationship between climate extremes and flood-drought shifts in Xinjiang demonstrated the model's applicability, achieving training and testing accuracies of 0.994 and 0.831, respectively, outperforming state-of-the-art machine learning models. Furthermore, the model revealed specific spatial effects of climate extremes on flood-drought shifts, providing probabilistic predictions across the study area. The findings highlight the potential of the proposed model to improve predictions of extreme climate-related events, offering valuable insights for disaster management and climate risk evaluation. This study provides a robust framework for analyzing the complexities of spatial nonstationarity in climate risk analysis.
本研究探讨了极端气候对中国新疆这个气候变化高度敏感的地区日益严峻的挑战及其对旱涝变化的影响。虽然现有的分类模型,如逻辑回归(LR)、支持向量机(svm)和地理加权逻辑回归(GWLR)已经应用于空间数据,但它们在处理空间非平稳性和平衡准确性与可解释性方面存在局限性。为了填补这一空白,我们提出了一种新的基于最小二乘支持向量机(LSSVM)的空间变系数逻辑回归(LSSVM- svclr)模型,该模型将LSSVM的灵活性与逻辑回归的可解释性和空间变系数模型的空间适应性相结合。通过不同数据规模和复杂程度的模拟,模型取得了较高的精度,简单情况下曲线下面积(area under the curve, AUC)接近1,复杂情况下AUC在0.8左右。以新疆极端气候事件与水旱变化的关系为例,验证了该模型的适用性,训练和测试准确率分别达到0.994和0.831,优于目前最先进的机器学习模型。此外,该模型还揭示了极端气候对水旱变化的特定空间影响,并提供了整个研究区域的概率预测。这些发现突出了提出的模型在改进极端气候相关事件预测方面的潜力,为灾害管理和气候风险评估提供了有价值的见解。该研究为分析气候风险分析中空间非平稳性的复杂性提供了一个强有力的框架。
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.