In storm surge (SS) simulation, data-driven methods can establish the relationship between predictor variables and the predictand, enabling long-term SS level reconstructions. Here, using the U.S. East Coast as an example, we explored the capabilities of four machine learning algorithms, namely Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Light Gradient Boosting Machine (LightGBM), and Extreme Gradient Boosting (XGBoost) in reconstructing hourly SS levels from 1979 to 2018 under an all-site modeling framework. Four atmospheric parameters, time index, and tide gauge coordinates from 51 tide gauges are used as predictors. The model performance was evaluated at both the tide gauge and coastal scales. Results indicate that LightGBM and XGBoost models outperform ANN and LSTM in SS reconstructions, with XGBoost showing better overall performance, especially for extreme SSs and historical extreme events. XGBoost can capture the temporal evolution of SSs with higher accuracy, producing reconstructions comparable to observations under the all-site modeling framework. The model interpretability analysis focusing on XGBoost reveals that the spatial distribution of feature importance varies for each predictor. Mean sea level pressure and the 10 m eastward wind component are the two most important predictors, followed by time index, latitude, and longitude under the all-site modeling framework and selected stations. These results indicate that data-driven models under this framework have the potential to capture region-specific and physically reasonable relationships between SS levels and atmospheric drivers.
{"title":"Exploring Machine Learning Capabilities for High Spatiotemporal Resolution Storm Surge Reconstructions","authors":"Qi Feng, Taoyong Jin, Lianjun Yang, Jiancheng Li","doi":"10.1029/2024EA004161","DOIUrl":"https://doi.org/10.1029/2024EA004161","url":null,"abstract":"<p>In storm surge (SS) simulation, data-driven methods can establish the relationship between predictor variables and the predictand, enabling long-term SS level reconstructions. Here, using the U.S. East Coast as an example, we explored the capabilities of four machine learning algorithms, namely Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Light Gradient Boosting Machine (LightGBM), and Extreme Gradient Boosting (XGBoost) in reconstructing hourly SS levels from 1979 to 2018 under an all-site modeling framework. Four atmospheric parameters, time index, and tide gauge coordinates from 51 tide gauges are used as predictors. The model performance was evaluated at both the tide gauge and coastal scales. Results indicate that LightGBM and XGBoost models outperform ANN and LSTM in SS reconstructions, with XGBoost showing better overall performance, especially for extreme SSs and historical extreme events. XGBoost can capture the temporal evolution of SSs with higher accuracy, producing reconstructions comparable to observations under the all-site modeling framework. The model interpretability analysis focusing on XGBoost reveals that the spatial distribution of feature importance varies for each predictor. Mean sea level pressure and the 10 m eastward wind component are the two most important predictors, followed by time index, latitude, and longitude under the all-site modeling framework and selected stations. These results indicate that data-driven models under this framework have the potential to capture region-specific and physically reasonable relationships between SS levels and atmospheric drivers.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 11","pages":""},"PeriodicalIF":2.6,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA004161","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Farshad Salajegheh, Xiaoli Deng, Ole Baltazar Andersen, Richard Coleman, Mehdi Khaki
<p>This paper presents BathDNN25, a global bathymetry model developed using gravity data derived from wide-swath altimetry collected by the Surface Water and Ocean Topography (SWOT) mission, with shipborne bathymetry serving as training data in a deep neural network (DNN) framework. BathDNN25 integrates multiple geophysical inputs, including gravity anomalies <span></span><math>