Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning

IF 10.3 1区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Wireless Communications Pub Date : 2024-11-19 DOI:10.1109/TWC.2024.3495812

Guangliang Pan;Qihui Wu;Bo Zhou;Jie Li;Wei Wang;Guoru Ding;David K. Y. Yau

{"title":"Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning","authors":"Guangliang Pan;Qihui Wu;Bo Zhou;Jie Li;Wei Wang;Guoru Ding;David K. Y. Yau","doi":"10.1109/TWC.2024.3495812","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a deep learning (DL)-based task-driven spectrum prediction framework, named DeepSPred. The DeepSPred comprises a feature encoder and a task predictor, where the encoder extracts spectrum usage pattern features, and the predictor configures different networks according to the task requirements to predict future spectrum. Based on the DeepSPred, we first propose a novel 3D spectrum prediction method combining a flow processing strategy with 3D vision Transformer (ViT, i.e., Swin) and a pyramid to serve possible applications such as spectrum monitoring task, named 3D-SwinSTB. 3D-SwinSTB unique \n<italic>3D Patch Merging ViT-to-3D ViT Patch Expanding</i>\n and pyramid designs help the model accurately learn the potential correlation of the evolution of the spectrogram over time. Then, we propose a novel spectrum occupancy rate (SOR) method by redesigning a predictor consisting exclusively of 3D convolutional and linear layers to serve possible applications such as dynamic spectrum access (DSA) task, named 3D-SwinLinear. Unlike the 3D-SwinSTB output spectrogram, 3D-SwinLinear projects the spectrogram directly as the SOR. Finally, we employ transfer learning (TL) to ensure the applicability of our two methods to diverse spectrum services. The results show that our 3D-SwinSTB outperforms recent benchmarks by more than 5%, while our 3D-SwinLinear achieves a 90% accuracy, with a performance improvement exceeding 10%.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 1","pages":"509-525"},"PeriodicalIF":10.3000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10758371/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we propose a deep learning (DL)-based task-driven spectrum prediction framework, named DeepSPred. The DeepSPred comprises a feature encoder and a task predictor, where the encoder extracts spectrum usage pattern features, and the predictor configures different networks according to the task requirements to predict future spectrum. Based on the DeepSPred, we first propose a novel 3D spectrum prediction method combining a flow processing strategy with 3D vision Transformer (ViT, i.e., Swin) and a pyramid to serve possible applications such as spectrum monitoring task, named 3D-SwinSTB. 3D-SwinSTB unique 3D Patch Merging ViT-to-3D ViT Patch Expanding and pyramid designs help the model accurately learn the potential correlation of the evolution of the spectrogram over time. Then, we propose a novel spectrum occupancy rate (SOR) method by redesigning a predictor consisting exclusively of 3D convolutional and linear layers to serve possible applications such as dynamic spectrum access (DSA) task, named 3D-SwinLinear. Unlike the 3D-SwinSTB output spectrogram, 3D-SwinLinear projects the spectrogram directly as the SOR. Finally, we employ transfer learning (TL) to ensure the applicability of our two methods to diverse spectrum services. The results show that our 3D-SwinSTB outperforms recent benchmarks by more than 5%, while our 3D-SwinLinear achieves a 90% accuracy, with a performance improvement exceeding 10%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用深度三维金字塔视觉变换器学习进行频谱预测

在本文中，我们提出了一个基于深度学习（DL）的任务驱动频谱预测框架，名为DeepSPred。DeepSPred包括一个特征编码器和一个任务预测器，其中编码器提取频谱使用模式特征，预测器根据任务要求配置不同的网络来预测未来的频谱。基于DeepSPred，我们首先提出了一种新的3D光谱预测方法，该方法将流处理策略与3D视觉变压器（ViT，即Swin）和金字塔相结合，以服务于频谱监测任务等可能的应用，称为3D- swinstb。3D- swinstb独特的3D Patch合并ViT到3D ViT Patch扩展和金字塔设计有助于模型准确地了解光谱图随时间演变的潜在相关性。然后，我们提出了一种新的频谱占用率（SOR）方法，通过重新设计一个仅由3D卷积层和线性层组成的预测器来服务于动态频谱访问（DSA）任务等可能的应用，称为3D- swinlinear。与3D-SwinSTB输出谱图不同，3D-SwinLinear直接将谱图投影为SOR。最后，我们使用迁移学习（TL）来确保我们的两种方法适用于不同的频谱服务。结果表明，我们的3D-SwinSTB比最近的基准测试高出5%以上，而我们的3D-SwinLinear达到90%的准确率，性能提高超过10%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Wireless Communications 工程技术-电信学

CiteScore

18.60

自引率

10.60%

发文量

708

审稿时长

5.6 months

期刊介绍： The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols. The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies. Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.