The study proposes a comprehensive Bayesian framework that integrates spatial clustering with binary copulas and zero-inflated binary logistic regression to model complex, non-linear dependencies between seasonal drought conditions and inter-seasonal drought persistence. Bayesian inference was performed via the RStan package, with model performance evaluated based on convergence diagnostics (R-hat, effective sample size), model fit (log posterior density, AIC, BIC), and predictive accuracy (posterior predictive checks, AUC/ROC curves, confusion matrix). Clustering analysis using the Elbow Method and Silhouette Score revealed that 4 clusters optimally capture the drought patterns across the stations. K-means clustering found most appropriate choice in term of higher Silhouette Score (winter-spring = 0.332, spring–summer = 0.391, autumn–winter = 0.363 and higher Dunn Index (winter-spring = 0.260, spring–summer = 0.260, autumn–winter = 0.181) for all seasonal transitions except summer-autumn. Medoid-Based Selection proved to be the most reliable method for selecting representative stations across all seasonal transitions with the lowest Davies–Bouldin Index (DB Index) (1.38, 1.091, 0.992, and 1.163) and highest Silhouette Score (0.273, 0.345, 0.411, and 0.348), ensuring comprehensive coverage of climatic conditions within each cluster. Archimedean copulas and Farlie–Gumbel–Morgenstern (FGM) copulas captured most seasonal transitions, while Plackett and Gumbel copulas were better for autumn–winter cases. Building on the dependence captured by the copulas, Bayesian zero-inflated binary logistic regression (ZIBLR) indicated strong effects of seasonal drought conditions on drought persistence. These models examine how drought conditions in one season influence persistence into the subsequent season, offering valuable insights into the drivers of drought dynamics in Punjab. ZIBLR parameters, across all clusters and seasonal transitions, showed statistically sound convergence with R-hat values of 1.00 and sufficient posterior sampling with high effective sample sizes (n_eff > 12,000). The confusion matrices illustrates the superior predictive accuracy and stable model performance, with high sensitivity and specificity. The ROC curves corroborate this, as evidenced by the shape of the curve, with a sharp rise towards the top-left corner of the plots. The AUC values, appear to be 1, confirming strong classification accuracy, and validating the model’s reliability in capturing drought persistence dynamics. The study contributes a novel methodological framework for understanding regional drought dynamics and provides policymakers with data driven insights to develop targeted mitigation and adaptation strategies, particularly in arid and semi-arid environments, for effective risk management.
扫码关注我们
求助内容:
应助结果提醒方式:
