Upper and lower percentiles of Flow Duration Curves (FDCs) of daily streamflow data were investigated to develop frequency curves. Upper percentiles with exceedance probability of 1, 5 and 10% (Q1, Q5, Q10) were used for high flows, and lower percentiles with non-exceedance probability of 10, 5 and 1% (Q90, Q95, Q99) for low flows. Median value (Q50) was covered to represent the average conditions of streamflow. A mixed frequency analysis based on the total probability theorem taking zero values into account was applied for the lower percentiles of FDC. Case studies were performed for three intermittent Streamflow Gauging Stations (SGSs) from Kucuk Menderes River Basin in western Turkey. An overall assessment of results shows that the best-fit probability distribution function does not change from one SGS to another considerably for low flows while each SGS has its own probability distribution function for high flows. Upper and lower percentiles, and median value were calculated at various return periods by using the identified probability distribution functions. The calculated values were plotted in the form of frequency curves of high flow percentiles and low flow percentiles. The frequency curves have a practically significant potential use in hydrological analysis, water resources management and hydraulic design under high and low flow conditions. They are yet open to further development for regionalization and their applicability can be extended to ungauged sites in river basins.
{"title":"Frequency curves of high and low flows in intermittent river basins for hydrological analysis and hydraulic design","authors":"Gokhan Sarigil, Yonca Cavus, Hafzullah Aksoy, Ebru Eris","doi":"10.1007/s00477-024-02732-0","DOIUrl":"https://doi.org/10.1007/s00477-024-02732-0","url":null,"abstract":"<p>Upper and lower percentiles of Flow Duration Curves (FDCs) of daily streamflow data were investigated to develop frequency curves. Upper percentiles with exceedance probability of 1, 5 and 10% (Q<sub>1</sub>, Q<sub>5</sub>, Q<sub>10</sub>) were used for high flows, and lower percentiles with non-exceedance probability of 10, 5 and 1% (Q<sub>90</sub>, Q<sub>95</sub>, Q<sub>99</sub>) for low flows. Median value (Q<sub>50</sub>) was covered to represent the average conditions of streamflow. A mixed frequency analysis based on the total probability theorem taking zero values into account was applied for the lower percentiles of FDC. Case studies were performed for three intermittent Streamflow Gauging Stations (SGSs) from Kucuk Menderes River Basin in western Turkey. An overall assessment of results shows that the best-fit probability distribution function does not change from one SGS to another considerably for low flows while each SGS has its own probability distribution function for high flows. Upper and lower percentiles, and median value were calculated at various return periods by using the identified probability distribution functions. The calculated values were plotted in the form of frequency curves of high flow percentiles and low flow percentiles. The frequency curves have a practically significant potential use in hydrological analysis, water resources management and hydraulic design under high and low flow conditions. They are yet open to further development for regionalization and their applicability can be extended to ungauged sites in river basins.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"18 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140941017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00477-024-02736-w
Alaa A. Jasim Al-Hasani, Shamsuddin Shahid
Reliable estimation of reference evapotranspiration (ETo), an essential component of optimal irrigation management, is challenging in many regions due to its complex dependence on meteorological factors. Alternative empirical models, often used to estimate ETo considering data limitations, provide highly unreliable estimates for Iraq. This study aimed to formulate simpler empirical models for accurate ETo estimation with fewer variables in different climate regions of Iraq. The metaheuristic Whale Optimization Algorithm (WOA) was used to finetune the coefficients of the nonlinear least square fitting regression (NLLSF) model during development. Two simpler models were developed based on (1) only mean air temperature (T) (NLLSF-T) and (2) solar radiation and T (NLLSF-R) as inputs. The performance of the models was validated using historical ground observations (2012–2021), and the ETo was estimated using the Penman–Monteith method from the reanalyzed (ERA5) datasets (1959–2021). The models' spatial, seasonal, and temporal performance in estimating daily ETo was rigorously evaluated using multiple statistical metrics and visual presentations. The Kling-Gupta Efficiency (KGE) and normalized root mean square error (NRMSE) of the NLLSF-T model were 0.95 and 0.30, respectively, compared to 0.75 and 0.40 for Kharrufa, the best-performing temperature-based models in Iraq. Similarly, NLLSF-R improved the KGE from 0.78 to 0.97 in KGE and NRMSE from 0.44 to 0.22 compared to Caprio, the best-performing radiation-based model in Iraq. The spatial assessment revealed both the models' excellent performance over most of Iraq, except in the far north, indicating their suitability in estimating ETo in arid and semi-arid regions.
{"title":"Development of radiation and temperature-based empirical models for accurate daily reference evapotranspiration estimation in Iraq","authors":"Alaa A. Jasim Al-Hasani, Shamsuddin Shahid","doi":"10.1007/s00477-024-02736-w","DOIUrl":"https://doi.org/10.1007/s00477-024-02736-w","url":null,"abstract":"<p>Reliable estimation of reference evapotranspiration (ET<sub>o</sub>), an essential component of optimal irrigation management, is challenging in many regions due to its complex dependence on meteorological factors. Alternative empirical models, often used to estimate ET<sub>o</sub> considering data limitations, provide highly unreliable estimates for Iraq. This study aimed to formulate simpler empirical models for accurate ET<sub>o</sub> estimation with fewer variables in different climate regions of Iraq. The metaheuristic Whale Optimization Algorithm (WOA) was used to finetune the coefficients of the nonlinear least square fitting regression (NLLSF) model during development. Two simpler models were developed based on (1) only mean air temperature (T) (NLLSF-T) and (2) solar radiation and T (NLLSF-R) as inputs. The performance of the models was validated using historical ground observations (2012–2021), and the ET<sub>o</sub> was estimated using the Penman–Monteith method from the reanalyzed (ERA5) datasets (1959–2021). The models' spatial, seasonal, and temporal performance in estimating daily ET<sub>o</sub> was rigorously evaluated using multiple statistical metrics and visual presentations. The Kling-Gupta Efficiency (KGE) and normalized root mean square error (NRMSE) of the NLLSF-T model were 0.95 and 0.30, respectively, compared to 0.75 and 0.40 for Kharrufa, the best-performing temperature-based models in Iraq. Similarly, NLLSF-R improved the KGE from 0.78 to 0.97 in KGE and NRMSE from 0.44 to 0.22 compared to Caprio, the best-performing radiation-based model in Iraq. The spatial assessment revealed both the models' excellent performance over most of Iraq, except in the far north, indicating their suitability in estimating ET<sub>o</sub> in arid and semi-arid regions.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"61 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140881540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00477-024-02737-9
Chao Zhang, Tingxin Qin, Wan Wang, Fengjiao Xu, Qian Zhou
The intensities and frequencies of extreme rainstorms are increasing, which may result in severe inundation of urban metro systems. Although there is some risk assessment research on regional metro systems based on spatiotemporal data, the characteristics of specific metro stations and shortcomings in the emergency response process need more consideration. In this paper, a risk analysis model for rainstorm inundation in metro systems based on a Bayesian network and a practical case study are proposed. First, the risk factors are obtained by integrating general mechanism analysis and the case study. Second, an event evolution diagram is established to represent the comprehensive evolution process of a potential event. Third, the risk analysis model is established using a Bayesian network model considering the quantitative causal relationships between risk factors. This model is used to analyze the risk of supporting emergency management, including emergency preparation based on critical risk factor sensitivity identification, prewarning response strategy development based on risk analysis as rainstorms occur, and rescue strategy development based on risk analysis as rainstorm water flows into metro tunnels. Furthermore, this model can be flexibly improved as natural hazards and metro systems change and as new problems are exposed in practical cases.
{"title":"Case-based risk analysis model for rainstorm inundation in metro systems based on a bayesian network","authors":"Chao Zhang, Tingxin Qin, Wan Wang, Fengjiao Xu, Qian Zhou","doi":"10.1007/s00477-024-02737-9","DOIUrl":"https://doi.org/10.1007/s00477-024-02737-9","url":null,"abstract":"<p>The intensities and frequencies of extreme rainstorms are increasing, which may result in severe inundation of urban metro systems. Although there is some risk assessment research on regional metro systems based on spatiotemporal data, the characteristics of specific metro stations and shortcomings in the emergency response process need more consideration. In this paper, a risk analysis model for rainstorm inundation in metro systems based on a Bayesian network and a practical case study are proposed. First, the risk factors are obtained by integrating general mechanism analysis and the case study. Second, an event evolution diagram is established to represent the comprehensive evolution process of a potential event. Third, the risk analysis model is established using a Bayesian network model considering the quantitative causal relationships between risk factors. This model is used to analyze the risk of supporting emergency management, including emergency preparation based on critical risk factor sensitivity identification, prewarning response strategy development based on risk analysis as rainstorms occur, and rescue strategy development based on risk analysis as rainstorm water flows into metro tunnels. Furthermore, this model can be flexibly improved as natural hazards and metro systems change and as new problems are exposed in practical cases.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"161 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140881503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00477-024-02724-0
Valentin Waeselynck, Gary Johnson, David Schmidt, Max A. Moritz, David Saah
This article provides a precise, quantitative description of the sampling error on burn counts in Monte-Carlo wildfire simulations - that is, the prediction variability introduced by the fact that the set of simulated fires is random and finite. We show that the marginal burn counts are (very nearly) Poisson-distributed in typical settings and infer through Bayesian updating that Gamma distributions are suitable summaries of the remaining uncertainty. In particular, the coefficient of variation of the burn count is equal to the inverse square root of its expected value, and this expected value is proportional to the number of simulated fires multiplied by the asymptotic burn probability. From these results, we derive practical guidelines for choosing the number of simulated fires and estimating the sampling error. Notably, the required number of simulated years is expressed as a power law. Such findings promise to relieve fire modelers of resource-consuming iterative experiments for sizing simulations and assessing their convergence: statistical theory provides better answers, faster.
{"title":"Quantifying the sampling error on burn counts in Monte-Carlo wildfire simulations using Poisson and Gamma distributions","authors":"Valentin Waeselynck, Gary Johnson, David Schmidt, Max A. Moritz, David Saah","doi":"10.1007/s00477-024-02724-0","DOIUrl":"https://doi.org/10.1007/s00477-024-02724-0","url":null,"abstract":"<p>This article provides a precise, quantitative description of the sampling error on burn counts in Monte-Carlo wildfire simulations - that is, the prediction variability introduced by the fact that the set of simulated fires is random and finite. We show that the marginal burn counts are (very nearly) Poisson-distributed in typical settings and infer through Bayesian updating that Gamma distributions are suitable summaries of the remaining uncertainty. In particular, the coefficient of variation of the burn count is equal to the inverse square root of its expected value, and this expected value is proportional to the number of simulated fires multiplied by the asymptotic burn probability. From these results, we derive practical guidelines for choosing the number of simulated fires and estimating the sampling error. Notably, the required number of simulated years is expressed as a power law. Such findings promise to relieve fire modelers of resource-consuming iterative experiments for sizing simulations and assessing their convergence: statistical theory provides better answers, faster.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"41 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140889756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-03DOI: 10.1007/s00477-024-02729-9
Faisal Mehraj Wani, Jayaprakash Vemuri, K. S. K. Karthik Reddy, Chenna Rajaram
The duration characteristics of near-fault earthquake ground motions play a significant role in the dynamic response of a structure. Linear regression-based models are extensively used to forecast ground motions and duration parameters. However, such an approach fails to account for the complexity arising from the non-linear patterns in the data set. Nevertheless, implementing machine learning algorithms has the ability to uncover these unexplored patterns as well as the unique characteristics of ground motions comprised in the datasets. In this study, statistical relationships between several duration metrics and intensity measures of near-fault ground motions are evaluated using machine learning algorithms. Four different machine learning algorithms, namely Regression, Decision Tree, Support Vector machines, and Gaussian Process regression model are trained to determine the optimum model. All these machine learning models were examined using the selected database of 200 near-fault pulse-like ground motions, which was split into two parts, with 75% of data used for training and the remaining 25% for testing. The results indicate that the fine tree model for bracketed duration, stepwise linear regression model for uniform duration, and the exponential and rational gaussian process regression model for significant and effective duration, showed more accurate and reliable results as compared to other models.
{"title":"Forecasting duration characteristics of near fault pulse-like ground motions using machine learning algorithms","authors":"Faisal Mehraj Wani, Jayaprakash Vemuri, K. S. K. Karthik Reddy, Chenna Rajaram","doi":"10.1007/s00477-024-02729-9","DOIUrl":"https://doi.org/10.1007/s00477-024-02729-9","url":null,"abstract":"<p>The duration characteristics of near-fault earthquake ground motions play a significant role in the dynamic response of a structure. Linear regression-based models are extensively used to forecast ground motions and duration parameters. However, such an approach fails to account for the complexity arising from the non-linear patterns in the data set. Nevertheless, implementing machine learning algorithms has the ability to uncover these unexplored patterns as well as the unique characteristics of ground motions comprised in the datasets. In this study, statistical relationships between several duration metrics and intensity measures of near-fault ground motions are evaluated using machine learning algorithms. Four different machine learning algorithms, namely Regression, Decision Tree, Support Vector machines, and Gaussian Process regression model are trained to determine the optimum model. All these machine learning models were examined using the selected database of 200 near-fault pulse-like ground motions, which was split into two parts, with 75% of data used for training and the remaining 25% for testing. The results indicate that the fine tree model for bracketed duration, stepwise linear regression model for uniform duration, and the exponential and rational gaussian process regression model for significant and effective duration, showed more accurate and reliable results as compared to other models.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"91 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140881507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-29DOI: 10.1007/s00477-024-02727-x
Zongwang Wu, Hossein Moayedi, Marjan Salari, Binh Nguyen Le, Atefeh Ahmadi Dehrashid
In developing countries, evaluating irrigation water quality using conventional methods can be costly and time-consuming. To overcome these challenges, this study explores the potential of utilizing physical parameters and artificial intelligence (AI) models for predicting and evaluating the quality indicators of irrigation water in aquifer systems. To achieve this goal, novel hybrid methods, namely the Whale Optimization Algorithm (WOA) and Wind-Driven Optimization (WDO), are employed in conjunction with Artificial Neural Network (ANN) models. The specific objective of this study is to forecast the Sodium Adsorption Ratio (SAR) by considering independent variables such as Na+, Mg2+, Ca2+, Na percent, K+, SO42−, Cl−, pH, and HCO3−. A dataset of 540 samples from the Shiraz plain, collected over a statistical period of 16 years (2002–2018), is used to estimate the groundwater quality variables. A pre-processing technique is applied in the AI approach to enhance the model's efficiency. The results indicate that the WDO-ANN model exhibits higher accuracy (R2 = 0.9983 and RMSE = 0.10618) than the WOA-ANN model (R2 = 0.9957 and RMSE = 0.16957). The optimization of computational parameters and comparison of AI model structures demonstrate that the WDO-ANN model outperforms the WOA-ANN model in predictive ability. In general, using AI models as a tool for low-cost and timely prediction of underground water quality using physical parameters as input variables has a high potential.
{"title":"Assessment of sodium adsorption ratio (SAR) in groundwater: Integrating experimental data with cutting-edge swarm intelligence approaches","authors":"Zongwang Wu, Hossein Moayedi, Marjan Salari, Binh Nguyen Le, Atefeh Ahmadi Dehrashid","doi":"10.1007/s00477-024-02727-x","DOIUrl":"https://doi.org/10.1007/s00477-024-02727-x","url":null,"abstract":"<p>In developing countries, evaluating irrigation water quality using conventional methods can be costly and time-consuming. To overcome these challenges, this study explores the potential of utilizing physical parameters and artificial intelligence (AI) models for predicting and evaluating the quality indicators of irrigation water in aquifer systems. To achieve this goal, novel hybrid methods, namely the Whale Optimization Algorithm (WOA) and Wind-Driven Optimization (WDO), are employed in conjunction with Artificial Neural Network (ANN) models. The specific objective of this study is to forecast the Sodium Adsorption Ratio (SAR) by considering independent variables such as Na<sup>+</sup>, Mg<sup>2+</sup>, Ca<sup>2+</sup>, Na percent, K<sup>+</sup>, SO<sub>4</sub><sup>2−</sup>, Cl<sup>−</sup>, pH, and HCO<sub>3</sub><sup>−</sup>. A dataset of 540 samples from the Shiraz plain, collected over a statistical period of 16 years (2002–2018), is used to estimate the groundwater quality variables. A pre-processing technique is applied in the AI approach to enhance the model's efficiency. The results indicate that the WDO-ANN model exhibits higher accuracy (R<sup>2</sup> = 0.9983 and RMSE = 0.10618) than the WOA-ANN model (R<sup>2</sup> = 0.9957 and RMSE = 0.16957). The optimization of computational parameters and comparison of AI model structures demonstrate that the WDO-ANN model outperforms the WOA-ANN model in predictive ability. In general, using AI models as a tool for low-cost and timely prediction of underground water quality using physical parameters as input variables has a high potential.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"20 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140811647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-29DOI: 10.1007/s00477-024-02718-y
Yao Kang, Feilong Lu, Danshu Sheng, Shuhui Wang
Count time series exhibiting seasonal patterns are frequently encountered in practical scenarios. For example, the number of hospital emergency service arrivals may show seasonal behavior (Moriña et al. 2011 Stat Med 30:3125–3136). Numerous models have been proposed for the analysis of seasonal count time series with an unbounded support, yet seasonal patterns in bounded count time series, which are sometimes suffered in environmental science such as the number of monthly rainy-days and air quality level data, have not received formal attention. The contribution of this article lies in coping with the modeling challenges associated with seasonal count time series with a bounded support, which is beneficial for enhancing the applicability of environmental science data. This is achieved by introducing a seasonal structure and seasonally varying model parameters into the first-order binomial autoregressive (BAR(1)) model (McKenzie 1985 J Am Water Resour Assoc 21:645–650). The probabilistic and statistical properties, marginal distribution and some special cases of the proposed model are studied. Estimation of model parameters is conducted using the Yule-Walker, conditional least squares and maximum likelihood methods. The asymptotic normality of the estimators is also presented. To demonstrate the utility of our model in environmental data, applications are carried out on the monthly number of rainy-days in two Russian cities.
在实际应用中,经常会遇到呈现季节性模式的计数时间序列。例如,医院急诊服务到达人数可能表现出季节性行为(Moriña 等,2011 Stat Med 30:3125-3136)。人们已经提出了许多用于分析无界支持的季节性计数时间序列的模型,但有界计数时间序列中的季节性模式还没有得到正式关注,环境科学中有时会遇到这种情况,例如月雨日数和空气质量水平数据。本文的贡献在于应对与有界支持的季节性计数时间序列相关的建模挑战,这有利于提高环境科学数据的适用性。这是通过在一阶二项自回归(BAR(1))模型(McKenzie 1985 J Am Water Resour Assoc 21:645-650)中引入季节结构和随季节变化的模型参数来实现的。研究了拟议模型的概率和统计特性、边际分布和一些特殊情况。采用 Yule-Walker、条件最小二乘法和最大似然法对模型参数进行了估计。此外,还介绍了估计值的渐近正态性。为了证明我们的模型在环境数据中的实用性,对俄罗斯两个城市的月降雨日数进行了应用。
{"title":"A seasonal binomial autoregressive process with applications to monthly rainy-days counts","authors":"Yao Kang, Feilong Lu, Danshu Sheng, Shuhui Wang","doi":"10.1007/s00477-024-02718-y","DOIUrl":"https://doi.org/10.1007/s00477-024-02718-y","url":null,"abstract":"<p>Count time series exhibiting seasonal patterns are frequently encountered in practical scenarios. For example, the number of hospital emergency service arrivals may show seasonal behavior (Moriña et al. 2011 Stat Med 30:3125–3136). Numerous models have been proposed for the analysis of seasonal count time series with an unbounded support, yet seasonal patterns in bounded count time series, which are sometimes suffered in environmental science such as the number of monthly rainy-days and air quality level data, have not received formal attention. The contribution of this article lies in coping with the modeling challenges associated with seasonal count time series with a bounded support, which is beneficial for enhancing the applicability of environmental science data. This is achieved by introducing a seasonal structure and seasonally varying model parameters into the first-order binomial autoregressive (BAR(1)) model (McKenzie 1985 J Am Water Resour Assoc 21:645–650). The probabilistic and statistical properties, marginal distribution and some special cases of the proposed model are studied. Estimation of model parameters is conducted using the Yule-Walker, conditional least squares and maximum likelihood methods. The asymptotic normality of the estimators is also presented. To demonstrate the utility of our model in environmental data, applications are carried out on the monthly number of rainy-days in two Russian cities.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"22 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140811646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Accurate streamflow prediction is significant for water resources management. However, due to the impact of climate change and human activities, accurately identifying the input factors of the streamflow prediction model and achieving high-precision results presents a significant challenge. In this study, past streamflow, meteorological, and climate factors were utilized as inputs to develop a predictive scenario for the bi-decomposition of input factors and streamflow series, i.e. Scenario 3 (S3). Mutual information (MI) was applied to recognize the input factors prediction potential. Based on the predictive potentials, factors were progressively incorporated into the kernel extreme learning machine (KELM) and hybrid kernel extreme learning machine (HKELM) models optimized by the gazelle optimization algorithm (GOA) to ascertain the optimal input configuration for each sub-series. The prediction results of S3-KELM and S3-HKELM models were obtained by reconstructing the optimal prediction results of each sub-series. The monthly streamflow of the upper Fenhe River Basin, which is in the semi-humid and semi-arid climate zone, was selected as a case study. The results indicate that in comparison to both undecomposed and singly decomposed scenarios, the input–output bi-decomposed scenario more accurately identifies the input factors and constructs high-precision prediction models. The Nash–Sutcliffe efficiency (NSE) of both the S3-KELM and S3-HKELM models exceeds 0.85. Specifically, the S3-HKELM model demonstrates superior performance, capable of handling more complex inputs, with its NSE reaching up to 0.93. Importantly, meteorological and climate factors contribute to the accuracy of streamflow predictions across different scenarios.
{"title":"Enhanced monthly streamflow prediction using an input–output bi-decomposition data driven model considering meteorological and climate information","authors":"Qiucen Guo, Xuehua Zhao, Yuhang Zhao, Zhijing Ren, Huifang Wang, Wenjun Cai","doi":"10.1007/s00477-024-02731-1","DOIUrl":"https://doi.org/10.1007/s00477-024-02731-1","url":null,"abstract":"<p>Accurate streamflow prediction is significant for water resources management. However, due to the impact of climate change and human activities, accurately identifying the input factors of the streamflow prediction model and achieving high-precision results presents a significant challenge. In this study, past streamflow, meteorological, and climate factors were utilized as inputs to develop a predictive scenario for the bi-decomposition of input factors and streamflow series, i.e. Scenario 3 (S3). Mutual information (MI) was applied to recognize the input factors prediction potential. Based on the predictive potentials, factors were progressively incorporated into the kernel extreme learning machine (KELM) and hybrid kernel extreme learning machine (HKELM) models optimized by the gazelle optimization algorithm (GOA) to ascertain the optimal input configuration for each sub-series. The prediction results of S3-KELM and S3-HKELM models were obtained by reconstructing the optimal prediction results of each sub-series. The monthly streamflow of the upper Fenhe River Basin, which is in the semi-humid and semi-arid climate zone, was selected as a case study. The results indicate that in comparison to both undecomposed and singly decomposed scenarios, the input–output bi-decomposed scenario more accurately identifies the input factors and constructs high-precision prediction models. The Nash–Sutcliffe efficiency (NSE) of both the S3-KELM and S3-HKELM models exceeds 0.85. Specifically, the S3-HKELM model demonstrates superior performance, capable of handling more complex inputs, with its NSE reaching up to 0.93. Importantly, meteorological and climate factors contribute to the accuracy of streamflow predictions across different scenarios.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"56 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140804733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-26DOI: 10.1007/s00477-024-02722-2
Wajid Hussain, Hong Shu, Hasnain Abbas, Sajid Hussain, Isma Kulsoom, Saqib Hussain, Hajra Mustafa, Aftab Ahmed Khan, Muhammad Ismail, Javed Iqbal
Gilgit-Baltistan, Pakistan, is particularly susceptible to landslides due to various geological, tectonics, meteorological, and anthropogenic factors consequently. However, the persisting conundrum of landslide database/data imbalance stands as a formidable challenge within this domain. To better stabilize the objective of landslide prediction, stacking ensemble Machine Learning and Generative Adversarial Network (GAN) were applied, because previous research in this area has mostly been limited by a lack of data. GAN is employed to synthesize training samples, ensuring the creation of a balanced dataset. Stacking ensemble architecture involves two stages of learning: the first class of learners incorporates diverse machine learning algorithms, while, the second level logistic regression model integrates prediction based on the strong learner, thereby enhancing overall prediction performance. To investigate landslide susceptibility in District Chilas, Northern Pakistan, we employed optical remote sensing and introduced a GAN with a Multi-Layers Hybrid Model (MLHM). This study involved the preparation of a spatial database with a total of 106 landslides and ten major landslide factors. We utilized a hybrid ensemble model and compared its performance with different algorithms like Conventional Neural Network, Artificial Neural network, Decision Tree, K-Nearest Neighbouring, and Hybrid Model, achieving accuracies of 0.91, 0.92, 0.90, 0.89, and 0.93, respectively. this approach has with Hybrid architecture learning accuracy of 0.98. The GAN with MLHM developed improved landslide susceptibility assessment with cross-comparison of Persistent Scattered Interferometric Synthetic Aperture Radar (PS-InSAR) investigation to ensure the safe functioning of KKH.
{"title":"The generative adversarial neural network with multi-layers stack ensemble hybrid model for landslide prediction in case of training sample imbalance","authors":"Wajid Hussain, Hong Shu, Hasnain Abbas, Sajid Hussain, Isma Kulsoom, Saqib Hussain, Hajra Mustafa, Aftab Ahmed Khan, Muhammad Ismail, Javed Iqbal","doi":"10.1007/s00477-024-02722-2","DOIUrl":"https://doi.org/10.1007/s00477-024-02722-2","url":null,"abstract":"<p>Gilgit-Baltistan, Pakistan, is particularly susceptible to landslides due to various geological, tectonics, meteorological, and anthropogenic factors consequently. However, the persisting conundrum of landslide database/data imbalance stands as a formidable challenge within this domain. To better stabilize the objective of landslide prediction, stacking ensemble Machine Learning and Generative Adversarial Network (GAN) were applied, because previous research in this area has mostly been limited by a lack of data. GAN is employed to synthesize training samples, ensuring the creation of a balanced dataset. Stacking ensemble architecture involves two stages of learning: the first class of learners incorporates diverse machine learning algorithms, while, the second level logistic regression model integrates prediction based on the strong learner, thereby enhancing overall prediction performance. To investigate landslide susceptibility in District Chilas, Northern Pakistan, we employed optical remote sensing and introduced a GAN with a Multi-Layers Hybrid Model (MLHM). This study involved the preparation of a spatial database with a total of 106 landslides and ten major landslide factors. We utilized a hybrid ensemble model and compared its performance with different algorithms like Conventional Neural Network, Artificial Neural network, Decision Tree, K-Nearest Neighbouring, and Hybrid Model, achieving accuracies of 0.91, 0.92, 0.90, 0.89, and 0.93, respectively. this approach has with Hybrid architecture learning accuracy of 0.98. The GAN with MLHM developed improved landslide susceptibility assessment with cross-comparison of Persistent Scattered Interferometric Synthetic Aperture Radar (PS-InSAR) investigation to ensure the safe functioning of KKH. </p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"71 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140806777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-22DOI: 10.1007/s00477-024-02720-4
Alwan Fadlurohman, Achmad Choiruddin, Jorge Mateu
The inhomogeneous Log-Gaussian Cox Process (LGCP) defines a flexible point process model for the analysis of spatial point patterns featuring inhomogeneity/spatial trend and aggregation patterns. To fit an LGCP model to spatial point pattern data and study the spatial trend, one could link the intensity function with continuous spatial covariates. Although non-continuous covariates are becoming more common in practice, the existing estimation methods so far only cover covariates in continuous form. As a consequence, to implement such methods, the non-continuous covariates are replaced by the continuous ones by applying some transformation techniques, which are many times problematic. In this paper, we develop a technique for inhomogeneous LGCP involving non-continuous covariates, termed piecewise constant covariates. The method does not require covariates transformation and likelihood approximation, resulting in an estimation technique equivalent to the one for generalized linear models. We apply our method for modeling COVID-19 transmission risk in East Java, Indonesia, which involves five piecewise constant covariates representing population density and sources of crowd. We outline that population density and industry density are significant covariates affecting the COVID-19 transmission risk in East Java.
{"title":"Inhomogeneous log-Gaussian Cox processes with piecewise constant covariates: a case study in modeling of COVID-19 transmission risk in East Java","authors":"Alwan Fadlurohman, Achmad Choiruddin, Jorge Mateu","doi":"10.1007/s00477-024-02720-4","DOIUrl":"https://doi.org/10.1007/s00477-024-02720-4","url":null,"abstract":"<p>The inhomogeneous Log-Gaussian Cox Process (LGCP) defines a flexible point process model for the analysis of spatial point patterns featuring inhomogeneity/spatial trend and aggregation patterns. To fit an LGCP model to spatial point pattern data and study the spatial trend, one could link the intensity function with continuous spatial covariates. Although non-continuous covariates are becoming more common in practice, the existing estimation methods so far only cover covariates in continuous form. As a consequence, to implement such methods, the non-continuous covariates are replaced by the continuous ones by applying some transformation techniques, which are many times problematic. In this paper, we develop a technique for inhomogeneous LGCP involving non-continuous covariates, termed piecewise constant covariates. The method does not require covariates transformation and likelihood approximation, resulting in an estimation technique equivalent to the one for generalized linear models. We apply our method for modeling COVID-19 transmission risk in East Java, Indonesia, which involves five piecewise constant covariates representing population density and sources of crowd. We outline that population density and industry density are significant covariates affecting the COVID-19 transmission risk in East Java.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"1 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140636402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}