Liu Zeyang, Tan Yujun, Zhou Shengnan, Li Yarong, Zhang Jing, Yang Yadong, Shi Zhongrong, Zhou Xiancun
Accurate aerosol optical depth (AOD) prediction remains challenging due to complex aerosol-radiation interactions and highly variable spatio-temporal patterns. Three critical scientific issues motivate this work: understanding whether and how physical principles can enhance deep learning predictions, identifying which aerosol properties most strongly govern AOD variations, and improving the prediction of extreme AOD events critical for air quality management. Herein, utilizing MERRA-2 reanalysis data (1980–2024) over the Huaihe River Basin in eastern China, a Physics-Guided deep learning framework is presented for Aerosol Optical Depth (AOD) prediction. The model proposed integrates Convolutional Neural Networks (CNN), Long Short-TermMemory (LSTM) networks, and multi-head attention mechanisms to capture both spatio-temporal features and physical relationships of aerosol properties. Three key aspects are involved: First, a hybrid deep learning model is developed and evaluated, which combines CNNs for spatial correlation extraction, bidirectional LSTM for temporal dependency modeling, and multi-head attention for feature interaction learning. Second, a comprehensive feature importance analysis is conducted by examining the relationships between different aerosol properties (mass concentration, scattering coefficient, and Ångström exponent) and AOD prediction, offering physical insights into the model's decision-making process. Third, a specialized approach is proposed for extreme AOD event prediction, focusing on early detection and accurate forecasting of high-AOD episodes. Overall, the results demonstrate the model's efficacy in capturing both regular AOD variations and extreme events, with the Physics-Guided architecture showing superior performance compared to traditional methods. This integrated approach enhances AOD prediction accuracy and deepens insights into aerosol-radiation interactions, thereby improving atmospheric monitoring and air quality forecasting. While MERRA-2 has inherent temporal delays, this framework provides valuable capabilities for historical trend analysis, numerical model validation, and can be readily adapted for real-time applications through transfer learning with satellite observations.
{"title":"Physics-Guided CNN-LSTM Model With Multi-Head Attention for Aerosol Optical Depth Prediction","authors":"Liu Zeyang, Tan Yujun, Zhou Shengnan, Li Yarong, Zhang Jing, Yang Yadong, Shi Zhongrong, Zhou Xiancun","doi":"10.1029/2025EA004461","DOIUrl":"https://doi.org/10.1029/2025EA004461","url":null,"abstract":"<p>Accurate aerosol optical depth (AOD) prediction remains challenging due to complex aerosol-radiation interactions and highly variable spatio-temporal patterns. Three critical scientific issues motivate this work: understanding whether and how physical principles can enhance deep learning predictions, identifying which aerosol properties most strongly govern AOD variations, and improving the prediction of extreme AOD events critical for air quality management. Herein, utilizing MERRA-2 reanalysis data (1980–2024) over the Huaihe River Basin in eastern China, a Physics-Guided deep learning framework is presented for Aerosol Optical Depth (AOD) prediction. The model proposed integrates Convolutional Neural Networks (CNN), Long Short-TermMemory (LSTM) networks, and multi-head attention mechanisms to capture both spatio-temporal features and physical relationships of aerosol properties. Three key aspects are involved: First, a hybrid deep learning model is developed and evaluated, which combines CNNs for spatial correlation extraction, bidirectional LSTM for temporal dependency modeling, and multi-head attention for feature interaction learning. Second, a comprehensive feature importance analysis is conducted by examining the relationships between different aerosol properties (mass concentration, scattering coefficient, and Ångström exponent) and AOD prediction, offering physical insights into the model's decision-making process. Third, a specialized approach is proposed for extreme AOD event prediction, focusing on early detection and accurate forecasting of high-AOD episodes. Overall, the results demonstrate the model's efficacy in capturing both regular AOD variations and extreme events, with the Physics-Guided architecture showing superior performance compared to traditional methods. This integrated approach enhances AOD prediction accuracy and deepens insights into aerosol-radiation interactions, thereby improving atmospheric monitoring and air quality forecasting. While MERRA-2 has inherent temporal delays, this framework provides valuable capabilities for historical trend analysis, numerical model validation, and can be readily adapted for real-time applications through transfer learning with satellite observations.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"13 1","pages":""},"PeriodicalIF":2.6,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025EA004461","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
<p>Lidar observations of atmospheric gravity waves (GWs) have been made spanning 14 years above McMurdo Station, Antarctica. Using these extensive observations and interleaved data processing techniques which enable bias-free/noise-floor-free estimation of GW parameters, this study forms seasonal baselines for GW potential energy densities (<span></span><math>