{"title":"基于张量-训练嵌入表和光伏预测的数据驱动大规模深度学习推荐模型训练优化","authors":"Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang","doi":"10.1109/TSMC.2024.3485960","DOIUrl":null,"url":null,"abstract":"Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 1","pages":"573-586"},"PeriodicalIF":8.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast\",\"authors\":\"Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang\",\"doi\":\"10.1109/TSMC.2024.3485960\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.\",\"PeriodicalId\":48915,\"journal\":{\"name\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"volume\":\"55 1\",\"pages\":\"573-586\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10752362/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10752362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast
Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.
期刊介绍:
The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.