{"title":"影响太阳耀斑多分类预报模型性能的主要因素分析","authors":"Changtian Xiang, Yanfang Zheng, Xuebao Li, Jinfang Wei, Pengchao Yan, Yingzhen Si, Xusheng Huang, Liang Dong, Shuainan Yan, Hengrui Lou, Hongwei Ye, Xuefeng Li, Shunhuang Zhang, Yexin Pan, Huiwen Wu","doi":"10.1007/s10509-024-04356-w","DOIUrl":null,"url":null,"abstract":"<div><p>Efficient forecasting of solar flares is of significant importance for better risk prevention. Currently, there is relatively rare research on multi/four-classification of flares, and the influence of the number of time steps and data feature dimensions on the prediction performance of multi-class models has not been considered. In this study, we utilize the Space-weather HMI Active Region Patch (SHARP) data to develop two categories of models for multiclass flare prediction within 24 hr, including direct output four-classification models and four-classification models using a cascading scheme. The former encompasses Random Forest (RF) model, Long Short-Term Memory (LSTM) model, and Bidirectional LSTM (BLSTM) model, while the latter includes BLSTM Cascade (BLSTM-C) model and BLSTM Cascade with Attention Mechanism (BLSTM-C-A) model. These two categories of models are employed to contrast the impact of different numbers of time steps and the predictive performance in solar flare multi/four-classification. Additionally, we conduct, for the first time, feature importance analysis for multi/four-classification solar flare prediction using deep learning models. The main results are as follows: (1) As the number of time steps increases, the True Skill Statistic (TSS) scores of the four deep learning models improve, showing an overall upward trend in predictive performance. The models achieve their optimal performance when the number of time steps reaches 120. (2) Among the direct output four-class models, deep learning models (LSTM and BLSTM) outperform traditional machine learning model (RF). In both multi-class and binary-class predictions using deep learning, the BLSTM-C model performs better than other deep learning models (LSTM, BLSTM, and BLSTM-C-A). (3) In the feature importance analysis, the top-ranked important features include SAVNCPP and R_VALUE, while the least important features include SHRGT45 and MEANPOT.</p></div>","PeriodicalId":8644,"journal":{"name":"Astrophysics and Space Science","volume":"369 8","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of the main factors affecting the performance of multi-classification forecast model for solar flares\",\"authors\":\"Changtian Xiang, Yanfang Zheng, Xuebao Li, Jinfang Wei, Pengchao Yan, Yingzhen Si, Xusheng Huang, Liang Dong, Shuainan Yan, Hengrui Lou, Hongwei Ye, Xuefeng Li, Shunhuang Zhang, Yexin Pan, Huiwen Wu\",\"doi\":\"10.1007/s10509-024-04356-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Efficient forecasting of solar flares is of significant importance for better risk prevention. Currently, there is relatively rare research on multi/four-classification of flares, and the influence of the number of time steps and data feature dimensions on the prediction performance of multi-class models has not been considered. In this study, we utilize the Space-weather HMI Active Region Patch (SHARP) data to develop two categories of models for multiclass flare prediction within 24 hr, including direct output four-classification models and four-classification models using a cascading scheme. The former encompasses Random Forest (RF) model, Long Short-Term Memory (LSTM) model, and Bidirectional LSTM (BLSTM) model, while the latter includes BLSTM Cascade (BLSTM-C) model and BLSTM Cascade with Attention Mechanism (BLSTM-C-A) model. These two categories of models are employed to contrast the impact of different numbers of time steps and the predictive performance in solar flare multi/four-classification. Additionally, we conduct, for the first time, feature importance analysis for multi/four-classification solar flare prediction using deep learning models. The main results are as follows: (1) As the number of time steps increases, the True Skill Statistic (TSS) scores of the four deep learning models improve, showing an overall upward trend in predictive performance. The models achieve their optimal performance when the number of time steps reaches 120. (2) Among the direct output four-class models, deep learning models (LSTM and BLSTM) outperform traditional machine learning model (RF). In both multi-class and binary-class predictions using deep learning, the BLSTM-C model performs better than other deep learning models (LSTM, BLSTM, and BLSTM-C-A). (3) In the feature importance analysis, the top-ranked important features include SAVNCPP and R_VALUE, while the least important features include SHRGT45 and MEANPOT.</p></div>\",\"PeriodicalId\":8644,\"journal\":{\"name\":\"Astrophysics and Space Science\",\"volume\":\"369 8\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Astrophysics and Space Science\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10509-024-04356-w\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astrophysics and Space Science","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1007/s10509-024-04356-w","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
Analysis of the main factors affecting the performance of multi-classification forecast model for solar flares
Efficient forecasting of solar flares is of significant importance for better risk prevention. Currently, there is relatively rare research on multi/four-classification of flares, and the influence of the number of time steps and data feature dimensions on the prediction performance of multi-class models has not been considered. In this study, we utilize the Space-weather HMI Active Region Patch (SHARP) data to develop two categories of models for multiclass flare prediction within 24 hr, including direct output four-classification models and four-classification models using a cascading scheme. The former encompasses Random Forest (RF) model, Long Short-Term Memory (LSTM) model, and Bidirectional LSTM (BLSTM) model, while the latter includes BLSTM Cascade (BLSTM-C) model and BLSTM Cascade with Attention Mechanism (BLSTM-C-A) model. These two categories of models are employed to contrast the impact of different numbers of time steps and the predictive performance in solar flare multi/four-classification. Additionally, we conduct, for the first time, feature importance analysis for multi/four-classification solar flare prediction using deep learning models. The main results are as follows: (1) As the number of time steps increases, the True Skill Statistic (TSS) scores of the four deep learning models improve, showing an overall upward trend in predictive performance. The models achieve their optimal performance when the number of time steps reaches 120. (2) Among the direct output four-class models, deep learning models (LSTM and BLSTM) outperform traditional machine learning model (RF). In both multi-class and binary-class predictions using deep learning, the BLSTM-C model performs better than other deep learning models (LSTM, BLSTM, and BLSTM-C-A). (3) In the feature importance analysis, the top-ranked important features include SAVNCPP and R_VALUE, while the least important features include SHRGT45 and MEANPOT.
期刊介绍:
Astrophysics and Space Science publishes original contributions and invited reviews covering the entire range of astronomy, astrophysics, astrophysical cosmology, planetary and space science and the astrophysical aspects of astrobiology. This includes both observational and theoretical research, the techniques of astronomical instrumentation and data analysis and astronomical space instrumentation. We particularly welcome papers in the general fields of high-energy astrophysics, astrophysical and astrochemical studies of the interstellar medium including star formation, planetary astrophysics, the formation and evolution of galaxies and the evolution of large scale structure in the Universe. Papers in mathematical physics or in general relativity which do not establish clear astrophysical applications will no longer be considered.
The journal also publishes topically selected special issues in research fields of particular scientific interest. These consist of both invited reviews and original research papers. Conference proceedings will not be considered. All papers published in the journal are subject to thorough and strict peer-reviewing.
Astrophysics and Space Science features short publication times after acceptance and colour printing free of charge.