DRL-AdCAR: Adaptive Coding-Aware Routing With Maximum Coding Opportunities and High-Quality via Deep Reinforcement Learning in FANET

IF 7.1 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Vehicular Technology Pub Date : 2024-09-13 DOI:10.1109/TVT.2024.3461161
Bo Song;Lei Xu;Ping Wang;Xiulin Qiu;Yaqi Ke;Junjie Gu;Yuwang Yang;Anhong Chen;Fan Zhang
{"title":"DRL-AdCAR: Adaptive Coding-Aware Routing With Maximum Coding Opportunities and High-Quality via Deep Reinforcement Learning in FANET","authors":"Bo Song;Lei Xu;Ping Wang;Xiulin Qiu;Yaqi Ke;Junjie Gu;Yuwang Yang;Anhong Chen;Fan Zhang","doi":"10.1109/TVT.2024.3461161","DOIUrl":null,"url":null,"abstract":"Research on flying ad-hoc networks (FANETs) has become important with the development of unmanned aerial vehicle (UAV) systems. Routing design in FANET is a major challenge due to its inherent characteristics, including dynamic network topology and network self-organization. Coding-aware routing based on network coding improves the performance of the network by selecting paths with more coding opportunities. However, current methods are mostly used in networks with relatively fixed topologies, which are difficult to adapt to the FANET environment. To address these issues, we propose an adaptive coding-aware routing algorithm via deep reinforcement learning (DRL-AdCAR). First, we transform the routing problem into a Markov decision model. Then, coding opportunities, coding gains, and link quality are considered simultaneously in the reward function to avoid the drawbacks of coding-aware routing algorithms that simply aim to increase coding opportunities while greatly affecting other aspects of network performance. In addition, we present an improvement of the deep deterministic policy gradient (DDPG) algorithm for FANET, combining the gated recurrent unit (GRU) and long short-term memory networks (LSTM) algorithms to replace the traditional neural network structure, which ensures prediction accuracy and improves the training efficiency. The experimental results showed that DRL-AdCAR can adaptively select the transmission paths with the most suitable coding opportunities according to environmental changes in FANET, improving coding performance and enhancing the network throughput and packet delivery rate.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 1","pages":"1280-1295"},"PeriodicalIF":7.1000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10680358/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Research on flying ad-hoc networks (FANETs) has become important with the development of unmanned aerial vehicle (UAV) systems. Routing design in FANET is a major challenge due to its inherent characteristics, including dynamic network topology and network self-organization. Coding-aware routing based on network coding improves the performance of the network by selecting paths with more coding opportunities. However, current methods are mostly used in networks with relatively fixed topologies, which are difficult to adapt to the FANET environment. To address these issues, we propose an adaptive coding-aware routing algorithm via deep reinforcement learning (DRL-AdCAR). First, we transform the routing problem into a Markov decision model. Then, coding opportunities, coding gains, and link quality are considered simultaneously in the reward function to avoid the drawbacks of coding-aware routing algorithms that simply aim to increase coding opportunities while greatly affecting other aspects of network performance. In addition, we present an improvement of the deep deterministic policy gradient (DDPG) algorithm for FANET, combining the gated recurrent unit (GRU) and long short-term memory networks (LSTM) algorithms to replace the traditional neural network structure, which ensures prediction accuracy and improves the training efficiency. The experimental results showed that DRL-AdCAR can adaptively select the transmission paths with the most suitable coding opportunities according to environmental changes in FANET, improving coding performance and enhancing the network throughput and packet delivery rate.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DRL-AdCAR:通过 FANET 中的深度强化学习实现具有最大编码机会和高质量的自适应编码感知路由选择
随着无人机(UAV)系统的发展,对飞行自组织网络(fanet)的研究变得越来越重要。由于FANET的动态拓扑和网络自组织等固有特性,路由设计是一项重大挑战。基于网络编码的编码感知路由通过选择具有更多编码机会的路径来提高网络性能。然而,现有的方法大多用于拓扑结构相对固定的网络,难以适应FANET环境。为了解决这些问题,我们提出了一种通过深度强化学习(DRL-AdCAR)的自适应编码感知路由算法。首先,将路由问题转化为马尔可夫决策模型。然后,在奖励函数中同时考虑编码机会、编码增益和链路质量,以避免编码感知路由算法的缺点,这些算法仅仅旨在增加编码机会,同时极大地影响网络性能的其他方面。此外,我们提出了一种针对FANET的深度确定性策略梯度(DDPG)算法的改进,结合门控循环单元(GRU)和长短期记忆网络(LSTM)算法来取代传统的神经网络结构,保证了预测精度,提高了训练效率。实验结果表明,DRL-AdCAR可以根据环境变化自适应选择具有最合适编码机会的传输路径,提高编码性能,提高网络吞吐量和分组传输速率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.00
自引率
8.80%
发文量
1245
审稿时长
6.3 months
期刊介绍: The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.
期刊最新文献
Statistical Beamforming Design for the Downlink MISO-NOMA System Enhancement of Large Language Models Driving Knowledge for Practical Autonomous Driving Decision Making Game-Theoretic Market Mechanism for Dynamic Wireless Resource Allocation Deep Auction Based Joint Resource Allocation for Space-Air-Ground Integrated V2X Communications Anti In-Band Full-Duplex Interception For Wireless Tactical Networks By Joint Passive Defense, Active Defense, and AES-Based Encryption
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1