Kai Wang , Shuo Shan , Weijing Dou , Haikun Wei , Kanjian Zhang
{"title":"A cross-modal deep learning method for enhancing photovoltaic power forecasting with satellite imagery and time series data","authors":"Kai Wang , Shuo Shan , Weijing Dou , Haikun Wei , Kanjian Zhang","doi":"10.1016/j.enconman.2024.119218","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate photovoltaic (PV) power forecasting improves grid stability and energy utilization efficiency. Integrating large-scale cloud information from satellite imagery can enhance the accuracy of ultra-short-term PV power forecasts. However, existing satellite-based forecasting methods consider the global features of satellite images but overlook the impact of localized cloud movements on future PV generation in the target area. The focus on local information, such as PV time series and nearby clouds in the region of interest, contributes to more efficient feature extraction of satellite images. In this study, a deep learning method is proposed to strengthen the cross-modal correlation of global and local information in satellite image encoding and the multi-modal fusion stage. A novel satellite image encoder is designed by using the dual-branch spatio-temporal vision transformer to compress large-scale cloud features into the features of the region of interest. Satellite image features are then combined with PV time-series features using a cross transformer with rotary position embedding. The proposed method was validated using data from ten PV stations, demonstrating forecast skill of 47.29%–58.23% for PV power forecasts up to 4 h ahead. Compared to ViT, ViViT, CrossViT, and Perceiver, the proposed method achieves an average improvement of 2.39%–3.75%, and a minimum of 8.98% improvement in scenarios where PV time-series data is unavailable. Moreover, the proposed method outperforms the state-of-the-art methods by 2.85%–5.53%. The experimental results highlight that the proposed method shows accurate and robust forecasting performance and is a reliable alternative to PV power forecasting.</div></div>","PeriodicalId":11664,"journal":{"name":"Energy Conversion and Management","volume":"323 ","pages":"Article 119218"},"PeriodicalIF":9.9000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Conversion and Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0196890424011592","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate photovoltaic (PV) power forecasting improves grid stability and energy utilization efficiency. Integrating large-scale cloud information from satellite imagery can enhance the accuracy of ultra-short-term PV power forecasts. However, existing satellite-based forecasting methods consider the global features of satellite images but overlook the impact of localized cloud movements on future PV generation in the target area. The focus on local information, such as PV time series and nearby clouds in the region of interest, contributes to more efficient feature extraction of satellite images. In this study, a deep learning method is proposed to strengthen the cross-modal correlation of global and local information in satellite image encoding and the multi-modal fusion stage. A novel satellite image encoder is designed by using the dual-branch spatio-temporal vision transformer to compress large-scale cloud features into the features of the region of interest. Satellite image features are then combined with PV time-series features using a cross transformer with rotary position embedding. The proposed method was validated using data from ten PV stations, demonstrating forecast skill of 47.29%–58.23% for PV power forecasts up to 4 h ahead. Compared to ViT, ViViT, CrossViT, and Perceiver, the proposed method achieves an average improvement of 2.39%–3.75%, and a minimum of 8.98% improvement in scenarios where PV time-series data is unavailable. Moreover, the proposed method outperforms the state-of-the-art methods by 2.85%–5.53%. The experimental results highlight that the proposed method shows accurate and robust forecasting performance and is a reliable alternative to PV power forecasting.
期刊介绍:
The journal Energy Conversion and Management provides a forum for publishing original contributions and comprehensive technical review articles of interdisciplinary and original research on all important energy topics.
The topics considered include energy generation, utilization, conversion, storage, transmission, conservation, management and sustainability. These topics typically involve various types of energy such as mechanical, thermal, nuclear, chemical, electromagnetic, magnetic and electric. These energy types cover all known energy resources, including renewable resources (e.g., solar, bio, hydro, wind, geothermal and ocean energy), fossil fuels and nuclear resources.