Time Complexity of Training DNNs With Parallel Computing for Wireless Communications

IF 4.8 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Open Journal of Vehicular Technology Pub Date : 2025-01-07 DOI:10.1109/OJVT.2025.3526847
Pengyu Cong;Chenyang Yang;Shengqian Han;Shuangfeng Han;Xiaoyun Wang
{"title":"Time Complexity of Training DNNs With Parallel Computing for Wireless Communications","authors":"Pengyu Cong;Chenyang Yang;Shengqian Han;Shuangfeng Han;Xiaoyun Wang","doi":"10.1109/OJVT.2025.3526847","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) have been widely used for learning various wireless communication policies. While DNNs have demonstrated the ability to reduce the time complexity of inference, their training often incurs a high computational cost. Since practical wireless systems require retraining due to operating in open and dynamic environments, it is crucial to analyze the factors affecting the training complexity, which can guide the DNN architecture selection and the hyper-parameter tuning for efficient policy learning. As a metric of time complexity, the number of floating-point operations (FLOPs) for inference has been analyzed in the literature. However, the time complexity of training DNNs for learning wireless communication policies has only been evaluated in terms of runtime. In this paper, we introduce the number of serial FLOPs (se-FLOPs) as a new metric of time complexity, accounting for the ability of parallel computing. The se-FLOPs metric is consistent with actual runtime, making it suitable for measuring the time complexity of training DNNs. Since graph neural networks (GNNs) can learn a multitude of wireless communication policies efficiently and their architectures depend on specific policies, no universal GNN architecture is available for analyzing complexities across different policies. Thus, we first use precoder learning as an example to demonstrate the derivation of the numbers of se-FLOPs required to train several DNNs. Then, we compare the results with the se-FLOPs for inference of the DNNs and for executing a popular numerical algorithm, and provide the scaling laws of these complexities with respect to the numbers of antennas and users. Finally, we extend the analyses to the learning of general wireless communication policies. We use simulations to validate the analyses and compare the time complexity of each DNN trained for achieving the best learning performance and achieving an expected performance.","PeriodicalId":34270,"journal":{"name":"IEEE Open Journal of Vehicular Technology","volume":"6 ","pages":"359-384"},"PeriodicalIF":4.8000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10830510","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Vehicular Technology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10830510/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Deep neural networks (DNNs) have been widely used for learning various wireless communication policies. While DNNs have demonstrated the ability to reduce the time complexity of inference, their training often incurs a high computational cost. Since practical wireless systems require retraining due to operating in open and dynamic environments, it is crucial to analyze the factors affecting the training complexity, which can guide the DNN architecture selection and the hyper-parameter tuning for efficient policy learning. As a metric of time complexity, the number of floating-point operations (FLOPs) for inference has been analyzed in the literature. However, the time complexity of training DNNs for learning wireless communication policies has only been evaluated in terms of runtime. In this paper, we introduce the number of serial FLOPs (se-FLOPs) as a new metric of time complexity, accounting for the ability of parallel computing. The se-FLOPs metric is consistent with actual runtime, making it suitable for measuring the time complexity of training DNNs. Since graph neural networks (GNNs) can learn a multitude of wireless communication policies efficiently and their architectures depend on specific policies, no universal GNN architecture is available for analyzing complexities across different policies. Thus, we first use precoder learning as an example to demonstrate the derivation of the numbers of se-FLOPs required to train several DNNs. Then, we compare the results with the se-FLOPs for inference of the DNNs and for executing a popular numerical algorithm, and provide the scaling laws of these complexities with respect to the numbers of antennas and users. Finally, we extend the analyses to the learning of general wireless communication policies. We use simulations to validate the analyses and compare the time complexity of each DNN trained for achieving the best learning performance and achieving an expected performance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
无线通信中并行计算训练dnn的时间复杂度
深度神经网络(dnn)已被广泛用于学习各种无线通信策略。虽然深度神经网络已经证明了降低推理时间复杂度的能力,但它们的训练通常会产生很高的计算成本。由于实际无线系统在开放和动态环境中运行需要再训练,因此分析影响训练复杂性的因素至关重要,这可以指导深度神经网络的架构选择和超参数调整,以实现有效的策略学习。作为时间复杂度的度量,用于推理的浮点运算次数已经在文献中进行了分析。然而,训练dnn学习无线通信策略的时间复杂度仅在运行时间方面进行了评估。在本文中,我们引入了串行FLOPs (se-FLOPs)的数量作为时间复杂度的新度量,以考虑并行计算的能力。se-FLOPs度量与实际运行时一致,适合于测量训练dnn的时间复杂度。由于图神经网络(GNN)可以有效地学习大量无线通信策略,并且它们的架构依赖于特定的策略,因此没有通用的GNN架构可用于分析不同策略之间的复杂性。因此,我们首先使用预编码器学习作为示例来演示训练多个dnn所需的se- flop数量的推导。然后,我们将结果与用于dnn推理和执行流行数值算法的se-FLOPs进行比较,并提供这些复杂性相对于天线和用户数量的缩放规律。最后,我们将分析扩展到一般无线通信策略的学习。我们使用模拟来验证分析,并比较每个DNN训练的时间复杂度,以实现最佳学习性能和实现预期性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.60
自引率
0.00%
发文量
25
审稿时长
10 weeks
期刊最新文献
Stochastic Modeling of EV On-Board Chargers for Fast Frequency Response Under Communication Delays Toward UAV-Assisted 3D UL-Heavy NOMA for Low-Altitude Economy: Joint Bandwidth, Power Allocation and Stereoscopic Trajectory Design RPDMA: A PAPR-Aware Multiple Access Scheme Profiling on EV Chargers: Attack Surface Assessment and a Deep Learning-Based Approach Deep Reinforcement Learning for Energy Management in Hybrid Electric Vehicles With Softmax Double-Actor Regularized Critics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1