Actor-critic learning-based energy optimization for UAV access and backhaul networks.

IF 2.3 4区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC EURASIP Journal on Wireless Communications and Networking Pub Date : 2021-01-01 Epub Date: 2021-04-07 DOI:10.1186/s13638-021-01960-0
Yaxiong Yuan, Lei Lei, Thang X Vu, Symeon Chatzinotas, Sumei Sun, Björn Ottersten
{"title":"Actor-critic learning-based energy optimization for UAV access and backhaul networks.","authors":"Yaxiong Yuan,&nbsp;Lei Lei,&nbsp;Thang X Vu,&nbsp;Symeon Chatzinotas,&nbsp;Sumei Sun,&nbsp;Björn Ottersten","doi":"10.1186/s13638-021-01960-0","DOIUrl":null,"url":null,"abstract":"<p><p>In unmanned aerial vehicle (UAV)-assisted networks, UAV acts as an aerial base station which acquires the requested data via backhaul link and then serves ground users (GUs) through an access network. In this paper, we investigate an energy minimization problem with a limited power supply for both backhaul and access links. The difficulties for solving such a non-convex and combinatorial problem lie at the high computational complexity/time. In solution development, we consider the approaches from both actor-critic deep reinforcement learning (AC-DRL) and optimization perspectives. First, two offline non-learning algorithms, i.e., an optimal and a heuristic algorithms, based on piecewise linear approximation and relaxation are developed as benchmarks. Second, toward real-time decision-making, we improve the conventional AC-DRL and propose two learning schemes: AC-based user group scheduling and backhaul power allocation (ACGP), and joint AC-based user group scheduling and optimization-based backhaul power allocation (ACGOP). Numerical results show that the computation time of both ACGP and ACGOP is reduced tenfold to hundredfold compared to the offline approaches, and ACGOP is better than ACGP in energy savings. The results also verify the superiority of proposed learning solutions in terms of guaranteeing the feasibility and minimizing the system energy compared to the conventional AC-DRL.</p>","PeriodicalId":12040,"journal":{"name":"EURASIP Journal on Wireless Communications and Networking","volume":"2021 1","pages":"78"},"PeriodicalIF":2.3000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13638-021-01960-0","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EURASIP Journal on Wireless Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1186/s13638-021-01960-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/4/7 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 3

Abstract

In unmanned aerial vehicle (UAV)-assisted networks, UAV acts as an aerial base station which acquires the requested data via backhaul link and then serves ground users (GUs) through an access network. In this paper, we investigate an energy minimization problem with a limited power supply for both backhaul and access links. The difficulties for solving such a non-convex and combinatorial problem lie at the high computational complexity/time. In solution development, we consider the approaches from both actor-critic deep reinforcement learning (AC-DRL) and optimization perspectives. First, two offline non-learning algorithms, i.e., an optimal and a heuristic algorithms, based on piecewise linear approximation and relaxation are developed as benchmarks. Second, toward real-time decision-making, we improve the conventional AC-DRL and propose two learning schemes: AC-based user group scheduling and backhaul power allocation (ACGP), and joint AC-based user group scheduling and optimization-based backhaul power allocation (ACGOP). Numerical results show that the computation time of both ACGP and ACGOP is reduced tenfold to hundredfold compared to the offline approaches, and ACGOP is better than ACGP in energy savings. The results also verify the superiority of proposed learning solutions in terms of guaranteeing the feasibility and minimizing the system energy compared to the conventional AC-DRL.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于actor - critical学习的无人机接入和回程网络能量优化。
在无人机辅助网络中,无人机作为空中基站,通过回程链路获取请求数据,然后通过接入网为地面用户提供服务。在这篇论文中,我们研究了在有限的电力供应下,回传和接入链路的能量最小化问题。求解此类非凸组合问题的难点在于计算复杂度/时间较高。在解决方案开发中,我们考虑了actor-critic深度强化学习(AC-DRL)和优化视角的方法。首先,提出了基于分段线性逼近和松弛的两种离线非学习算法,即最优算法和启发式算法作为基准。其次,在实时决策方面,对传统的AC-DRL进行了改进,提出了基于交流的用户组调度和回程功率分配(ACGP)和基于交流的用户组调度和优化回程功率分配(ACGOP)两种学习方案。数值结果表明,ACGP和ACGOP的计算时间都比离线方法缩短了10倍到100倍,ACGOP在节能方面优于ACGP。与传统的AC-DRL相比,所提出的学习方案在保证可行性和最小化系统能量方面具有优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.70
自引率
3.80%
发文量
109
审稿时长
8.0 months
期刊介绍: The overall aim of the EURASIP Journal on Wireless Communications and Networking (EURASIP JWCN) is to bring together science and applications of wireless communications and networking technologies with emphasis on signal processing techniques and tools. It is directed at both practicing engineers and academic researchers. EURASIP Journal on Wireless Communications and Networking will highlight the continued growth and new challenges in wireless technology, for both application development and basic research. Articles should emphasize original results relating to the theory and/or applications of wireless communications and networking. Review articles, especially those emphasizing multidisciplinary views of communications and networking, are also welcome. EURASIP Journal on Wireless Communications and Networking employs a paperless, electronic submission and evaluation system to promote a rapid turnaround in the peer-review process. The journal is an Open Access journal since 2004.
期刊最新文献
Anti-jamming for cognitive radio networks with Stackelberg game-assisted DSSS approach A SAR analysis of hexagonal-shaped UWB antenna for healthcare applications Successive interference cancellation with multiple feedback in NOMA-enabled massive IoT network Performance analysis of shared relay CR-NOMA network based on SWIPT Computational offloading into UAV swarm networks: a systematic literature review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1