Nan Yu, Changhong Hu, Jinghuan Wang, Weibo Rao, Siwei Liu, Minghui Yang, Gang Chen, Jinze Li
El Niño and the Southern Oscillation (ENSO) is the strongest inter-annual signal in the global climate system with worldwide climatic, ecological, and societal impacts. Over the past decades, the research on ENSO prediction and predictability has attracted broad attention. Typical prediction efforts based on physically coupled models (e.g., SINTEX-F, CanCM4) demonstrate skill at short lead times (approximately 6–12 months) but tend to lose predictability rapidly over longer horizons. Benefiting from their ability to capture nonlinear dependencies and improve long-term accuracy, deep learning methods such as Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) networks have been widely applied in the prediction of ENSO-related indices, such as the Niño 3.4 index. In this study, we propose a newly designed neural network, named ACTNet, by incorporating a self-attention mechanism into a CNN + LSTM architecture. ACTNet is designed to process the past 12 months of global sea surface temperature (SST), heat content, zonal wind (UA), and meridional wind (VA) as inputs, where CNN layers extract spatial patterns, LSTM layers capture temporal dependencies, and a self-attention mechanism highlights critical spatiotemporal relationships for accurate ENSO prediction. It can predict the Niño 3.4 index at a monthly resolution up to 24 months in advance reasonably well, achieving correlation coefficients exceeding 0.5. Compared to conventional CNN and CNN + LSTM models, ACTNet demonstrates improved spatiotemporal feature extraction and long-lead prediction skill. Another, independent aspect is ENSO-type prediction based on historical observed SST anomalies. Since ENSO events manifest in different types—such as Eastern Pacific and Central Pacific El Niño, as well as their La Niña counterparts—distinguishing these types is crucial for understanding regional climate impacts. To this end, we further employed an LSTM model to classify events into six defined ENSO types based on Niño 3 and Niño 4 indices, achieving a classification accuracy of 70.5% at a 12-month lead time.
{"title":"Spatio-Temporal Network With Self-Attention Mechanism for Improved ENSO Prediction","authors":"Nan Yu, Changhong Hu, Jinghuan Wang, Weibo Rao, Siwei Liu, Minghui Yang, Gang Chen, Jinze Li","doi":"10.1029/2024EA004179","DOIUrl":"https://doi.org/10.1029/2024EA004179","url":null,"abstract":"<p>El Niño and the Southern Oscillation (ENSO) is the strongest inter-annual signal in the global climate system with worldwide climatic, ecological, and societal impacts. Over the past decades, the research on ENSO prediction and predictability has attracted broad attention. Typical prediction efforts based on physically coupled models (e.g., SINTEX-F, CanCM4) demonstrate skill at short lead times (approximately 6–12 months) but tend to lose predictability rapidly over longer horizons. Benefiting from their ability to capture nonlinear dependencies and improve long-term accuracy, deep learning methods such as Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) networks have been widely applied in the prediction of ENSO-related indices, such as the Niño 3.4 index. In this study, we propose a newly designed neural network, named ACTNet, by incorporating a self-attention mechanism into a CNN + LSTM architecture. ACTNet is designed to process the past 12 months of global sea surface temperature (SST), heat content, zonal wind (UA), and meridional wind (VA) as inputs, where CNN layers extract spatial patterns, LSTM layers capture temporal dependencies, and a self-attention mechanism highlights critical spatiotemporal relationships for accurate ENSO prediction. It can predict the Niño 3.4 index at a monthly resolution up to 24 months in advance reasonably well, achieving correlation coefficients exceeding 0.5. Compared to conventional CNN and CNN + LSTM models, ACTNet demonstrates improved spatiotemporal feature extraction and long-lead prediction skill. Another, independent aspect is ENSO-type prediction based on historical observed SST anomalies. Since ENSO events manifest in different types—such as Eastern Pacific and Central Pacific El Niño, as well as their La Niña counterparts—distinguishing these types is crucial for understanding regional climate impacts. To this end, we further employed an LSTM model to classify events into six defined ENSO types based on Niño 3 and Niño 4 indices, achieving a classification accuracy of 70.5% at a 12-month lead time.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 12","pages":""},"PeriodicalIF":2.6,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA004179","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145626415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Drought poses a major threat to agriculture and food security in the Horn of Africa (HOA), where monitoring efforts are hindered by sparse in situ observations and a lack of ground truth data. In this paper, a new self-supervised drought classification model, Seasonal Anomaly Embedding with Vision Transformers (SAED-ViT) is proposed using satellite-derived seasonal anomalies of NDVI, Land Surface Temperature (LST), and precipitation. The method employs the masked autoencoders with Vision Transformers (MAE-ViT) to learn robust spatiotemporal representations from 25 years of satellite Earth observation data (2000–2024). The learned latent features are clustered using unsupervised K-Means to identify semantically meaningful drought regimes, which are then mapped to standardized severity classes without requiring predefined thresholds or labeled data. The results exhibit high spatial accuracy and temporal coherence across a broad range of agro-climatic regions with capturing the large-scale droughts in 2011, 2017, and 2022. Quantitatively compared and verified against Standardized Precipitation Evapotranspiration Index (SPEI), the agreement is strong (r = −0.91, P-value < 0.01) and better when compared to the conventional indexes like NDVI, TCI, and VHI. SAED-ViT achieved robust and label-free drought-severity classification across multiple satellite data sources.
{"title":"A Self-Supervised Seasonal Anomaly Embedding ViT for Label-Free Drought Mapping in the Horn of Africa","authors":"Nasser A. M. Abdelrahim, Shuanggen Jin, Shiyu Li","doi":"10.1029/2025EA004716","DOIUrl":"https://doi.org/10.1029/2025EA004716","url":null,"abstract":"<p>Drought poses a major threat to agriculture and food security in the Horn of Africa (HOA), where monitoring efforts are hindered by sparse in situ observations and a lack of ground truth data. In this paper, a new self-supervised drought classification model, Seasonal Anomaly Embedding with Vision Transformers (SAED-ViT) is proposed using satellite-derived seasonal anomalies of NDVI, Land Surface Temperature (LST), and precipitation. The method employs the masked autoencoders with Vision Transformers (MAE-ViT) to learn robust spatiotemporal representations from 25 years of satellite Earth observation data (2000–2024). The learned latent features are clustered using unsupervised K-Means to identify semantically meaningful drought regimes, which are then mapped to standardized severity classes without requiring predefined thresholds or labeled data. The results exhibit high spatial accuracy and temporal coherence across a broad range of agro-climatic regions with capturing the large-scale droughts in 2011, 2017, and 2022. Quantitatively compared and verified against Standardized Precipitation Evapotranspiration Index (SPEI), the agreement is strong (<i>r</i> = −0.91, P-value < 0.01) and better when compared to the conventional indexes like NDVI, TCI, and VHI. SAED-ViT achieved robust and label-free drought-severity classification across multiple satellite data sources.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 12","pages":""},"PeriodicalIF":2.6,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025EA004716","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145618869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lei Fan, Xiaoning Xie, Cailing Wang, Jianing Guo, Heng Liu, Xiyue Mao, Zhengguo Shi
Precipitation downscaling is essential for generating high-resolution data from coarse-resolution global climate models and assessing the environmental impacts of climate change at regional and local scales. Convolutional Neural Networks (CNNs) are an emerging critical deep-learning technique that promises significant improvements over other downscaling methods. This study employs three attention-enhanced CNNs including Attention-based Laplacian Pyramid Network (AttLap), Attention and Convolutional Mix Network (ACMix) and Multi-scale Attention Network (MAN), and further evaluates their performance in regional precipitation downscaling. Focusing on the Middle Reaches of the Yellow River in China (MRYR), we utilize ERA5 atmospheric variables and Global Precipitation Measurement (GPM) data for model training and testing. Our results indicate that all the attention-enhanced CNNs improve spatio-temporal precipitation simulations across daily, monthly, and annual timescales compared to the conventional CNN model. Notably, the AttLap model shows the greatest improvements compared to the conventional CNN, reducing root-mean-square error of daily precipitation by 10.1% and increasing the correlation coefficient by 16.7% for the regional mean. Moreover, the attention mechanism improves the model's ability to simulate extreme precipitation, showing that the 95th and 99th percentiles of predicted precipitation are much closer to that of GPM data. Meanwhile, the probability density function for daily precipitation in the attention-enhanced CNNs exhibits better agreement with GPM data, particularly for heavy precipitation, further confirming the advantage of the attention mechanism in simulating extreme precipitation. These findings indicate that the attention-enhanced CNNs significantly improve the ability to capture the spatio-temporal precipitation features, thereby enhancing the downscaling accuracy. The study highlights the potential of attention-enhanced models for regional precipitation downscaling, providing valuable tools for climate projection and water resource management in complex terrain regions.
{"title":"Applications of Attention-Enhanced CNN Models to Regional Precipitation Downscaling","authors":"Lei Fan, Xiaoning Xie, Cailing Wang, Jianing Guo, Heng Liu, Xiyue Mao, Zhengguo Shi","doi":"10.1029/2025EA004465","DOIUrl":"https://doi.org/10.1029/2025EA004465","url":null,"abstract":"<p>Precipitation downscaling is essential for generating high-resolution data from coarse-resolution global climate models and assessing the environmental impacts of climate change at regional and local scales. Convolutional Neural Networks (CNNs) are an emerging critical deep-learning technique that promises significant improvements over other downscaling methods. This study employs three attention-enhanced CNNs including Attention-based Laplacian Pyramid Network (AttLap), Attention and Convolutional Mix Network (ACMix) and Multi-scale Attention Network (MAN), and further evaluates their performance in regional precipitation downscaling. Focusing on the Middle Reaches of the Yellow River in China (MRYR), we utilize ERA5 atmospheric variables and Global Precipitation Measurement (GPM) data for model training and testing. Our results indicate that all the attention-enhanced CNNs improve spatio-temporal precipitation simulations across daily, monthly, and annual timescales compared to the conventional CNN model. Notably, the AttLap model shows the greatest improvements compared to the conventional CNN, reducing root-mean-square error of daily precipitation by 10.1% and increasing the correlation coefficient by 16.7% for the regional mean. Moreover, the attention mechanism improves the model's ability to simulate extreme precipitation, showing that the 95th and 99th percentiles of predicted precipitation are much closer to that of GPM data. Meanwhile, the probability density function for daily precipitation in the attention-enhanced CNNs exhibits better agreement with GPM data, particularly for heavy precipitation, further confirming the advantage of the attention mechanism in simulating extreme precipitation. These findings indicate that the attention-enhanced CNNs significantly improve the ability to capture the spatio-temporal precipitation features, thereby enhancing the downscaling accuracy. The study highlights the potential of attention-enhanced models for regional precipitation downscaling, providing valuable tools for climate projection and water resource management in complex terrain regions.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 12","pages":""},"PeriodicalIF":2.6,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025EA004465","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145585310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alex P. Hoffmann, Hyeonhu Park, Wooin Jo, Ho Jin, Mark B. Moldwin, Eftyhia Zesta, Ian Garrick-Bethell
The Korea Pathfinder Lunar Orbiter (KPLO) spacecraft utilizes the KPLO Magnetometer (KMAG) payload, a three-fluxgate magnetometer array mounted on a 1.2 m boom, to measure crustal and induced lunar magnetic fields. The short boom length exposes the magnetometers to intricate, multi-source stray magnetic fields. These interference signals include a low-frequency, 20 nT peak-to-peak signal from the solar panels and batteries as the spacecraft transitions between sunlight and darkness during certain orbital phases. These stray magnetic fields impede the analysis of lunar magnetic anomalies with magnitudes up to 3 nT at a 100 km altitude. Additionally, downlink issues during the mission's initial stages occasionally resulted in data gaps of up to 12 min (approximately 13% of the orbit) in several orbits. To overcome these data quality challenges, we present a comprehensive three-component method: (a) the Recurrent Forecasting Multichannel Singular Spectrum Analysis (M-SSA) algorithm interpolates data gaps, (b) Wavelet-Adaptive Interference Cancellation for Underdetermined Platforms (WAIC-UP) removes stray magnetic fields from the continuous magnetometer measurements, and (c) the Removal Algorithm for Magnetometer Environmental Noise (RAMEN) gradiometry algorithm corrects low-frequency trends not observed by WAIC-UP. We demonstrate the efficacy of our approach by comparing the results with contemporaneous magnetic field measurements from the lunar-orbiting ARTEMIS-P1 spacecraft and lunar crustal magnetic field maps from the Lunar Prospector and Kaguya missions. This integrated application of M-SSA, WAIC-UP, and RAMEN enables KMAG to reliably investigate lunar magnetic fields despite non-dipolar spacecraft interference and intermittent data gaps.
{"title":"Enhancing Magnetic Field Analysis on the KMAG Instrument: Applying WAIC-UP for Spacecraft Interference Removal and Interpolating Data Gaps","authors":"Alex P. Hoffmann, Hyeonhu Park, Wooin Jo, Ho Jin, Mark B. Moldwin, Eftyhia Zesta, Ian Garrick-Bethell","doi":"10.1029/2025EA004427","DOIUrl":"https://doi.org/10.1029/2025EA004427","url":null,"abstract":"<p>The Korea Pathfinder Lunar Orbiter (KPLO) spacecraft utilizes the KPLO Magnetometer (KMAG) payload, a three-fluxgate magnetometer array mounted on a 1.2 m boom, to measure crustal and induced lunar magnetic fields. The short boom length exposes the magnetometers to intricate, multi-source stray magnetic fields. These interference signals include a low-frequency, 20 nT peak-to-peak signal from the solar panels and batteries as the spacecraft transitions between sunlight and darkness during certain orbital phases. These stray magnetic fields impede the analysis of lunar magnetic anomalies with magnitudes up to 3 nT at a 100 km altitude. Additionally, downlink issues during the mission's initial stages occasionally resulted in data gaps of up to 12 min (approximately 13% of the orbit) in several orbits. To overcome these data quality challenges, we present a comprehensive three-component method: (a) the Recurrent Forecasting Multichannel Singular Spectrum Analysis (M-SSA) algorithm interpolates data gaps, (b) Wavelet-Adaptive Interference Cancellation for Underdetermined Platforms (WAIC-UP) removes stray magnetic fields from the continuous magnetometer measurements, and (c) the Removal Algorithm for Magnetometer Environmental Noise (RAMEN) gradiometry algorithm corrects low-frequency trends not observed by WAIC-UP. We demonstrate the efficacy of our approach by comparing the results with contemporaneous magnetic field measurements from the lunar-orbiting ARTEMIS-P1 spacecraft and lunar crustal magnetic field maps from the Lunar Prospector and Kaguya missions. This integrated application of M-SSA, WAIC-UP, and RAMEN enables KMAG to reliably investigate lunar magnetic fields despite non-dipolar spacecraft interference and intermittent data gaps.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 11","pages":""},"PeriodicalIF":2.6,"publicationDate":"2025-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025EA004427","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145626020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Badarvada Yadidya, Brian K. Arbic, Jay F. Shriver, Edward D. Zaron, Maarten C. Buijsman, Loren Carrère, Michel Tchilibou, Takaya Uchida
Internal tides are sub-surface inertia-gravity waves that generate significant sea surface height signals detectable with satellite altimetry. The Surface Water and Ocean Topography (SWOT) mission provides an exciting opportunity to characterize these signals with unprecedented spatial detail. Separating tidal and non-tidal oceanic signals is necessary for achieving the SWOT mission's objective of advancing our understanding of mesoscale and submesoscale processes. In this study, we evaluate the performance of a data-assimilative HYbrid Coordinate Ocean Model (HYCOM) forecast system in resolving both phase-locked and non-phase-locked internal tides during the SWOT Cal/Val period. We compare HYCOM's effectiveness to the High-Resolution Empirical Tide model (HRET22), which is currently used for internal tide corrections but only accounts for the phase-locked component. HYCOM achieves an average of 5% greater reduction in phase-locked internal tide variance and a 24.6% greater total variance reduction compared to HRET22 by also accounting for non-phase-locked internal tides. At the