DRL-AdCAR: Adaptive Coding-Aware Routing With Maximum Coding Opportunities and High-Quality via Deep Reinforcement Learning in FANET

IF 7.1 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Vehicular Technology Pub Date : 2024-09-13 DOI:10.1109/TVT.2024.3461161

Bo Song;Lei Xu;Ping Wang;Xiulin Qiu;Yaqi Ke;Junjie Gu;Yuwang Yang;Anhong Chen;Fan Zhang

{"title":"DRL-AdCAR: Adaptive Coding-Aware Routing With Maximum Coding Opportunities and High-Quality via Deep Reinforcement Learning in FANET","authors":"Bo Song;Lei Xu;Ping Wang;Xiulin Qiu;Yaqi Ke;Junjie Gu;Yuwang Yang;Anhong Chen;Fan Zhang","doi":"10.1109/TVT.2024.3461161","DOIUrl":null,"url":null,"abstract":"Research on flying ad-hoc networks (FANETs) has become important with the development of unmanned aerial vehicle (UAV) systems. Routing design in FANET is a major challenge due to its inherent characteristics, including dynamic network topology and network self-organization. Coding-aware routing based on network coding improves the performance of the network by selecting paths with more coding opportunities. However, current methods are mostly used in networks with relatively fixed topologies, which are difficult to adapt to the FANET environment. To address these issues, we propose an adaptive coding-aware routing algorithm via deep reinforcement learning (DRL-AdCAR). First, we transform the routing problem into a Markov decision model. Then, coding opportunities, coding gains, and link quality are considered simultaneously in the reward function to avoid the drawbacks of coding-aware routing algorithms that simply aim to increase coding opportunities while greatly affecting other aspects of network performance. In addition, we present an improvement of the deep deterministic policy gradient (DDPG) algorithm for FANET, combining the gated recurrent unit (GRU) and long short-term memory networks (LSTM) algorithms to replace the traditional neural network structure, which ensures prediction accuracy and improves the training efficiency. The experimental results showed that DRL-AdCAR can adaptively select the transmission paths with the most suitable coding opportunities according to environmental changes in FANET, improving coding performance and enhancing the network throughput and packet delivery rate.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 1","pages":"1280-1295"},"PeriodicalIF":7.1000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10680358/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Research on flying ad-hoc networks (FANETs) has become important with the development of unmanned aerial vehicle (UAV) systems. Routing design in FANET is a major challenge due to its inherent characteristics, including dynamic network topology and network self-organization. Coding-aware routing based on network coding improves the performance of the network by selecting paths with more coding opportunities. However, current methods are mostly used in networks with relatively fixed topologies, which are difficult to adapt to the FANET environment. To address these issues, we propose an adaptive coding-aware routing algorithm via deep reinforcement learning (DRL-AdCAR). First, we transform the routing problem into a Markov decision model. Then, coding opportunities, coding gains, and link quality are considered simultaneously in the reward function to avoid the drawbacks of coding-aware routing algorithms that simply aim to increase coding opportunities while greatly affecting other aspects of network performance. In addition, we present an improvement of the deep deterministic policy gradient (DDPG) algorithm for FANET, combining the gated recurrent unit (GRU) and long short-term memory networks (LSTM) algorithms to replace the traditional neural network structure, which ensures prediction accuracy and improves the training efficiency. The experimental results showed that DRL-AdCAR can adaptively select the transmission paths with the most suitable coding opportunities according to environmental changes in FANET, improving coding performance and enhancing the network throughput and packet delivery rate.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DRL-AdCAR：通过 FANET 中的深度强化学习实现具有最大编码机会和高质量的自适应编码感知路由选择

随着无人机（UAV）系统的发展，对飞行自组织网络（fanet）的研究变得越来越重要。由于FANET的动态拓扑和网络自组织等固有特性，路由设计是一项重大挑战。基于网络编码的编码感知路由通过选择具有更多编码机会的路径来提高网络性能。然而，现有的方法大多用于拓扑结构相对固定的网络，难以适应FANET环境。为了解决这些问题，我们提出了一种通过深度强化学习（DRL-AdCAR）的自适应编码感知路由算法。首先，将路由问题转化为马尔可夫决策模型。然后，在奖励函数中同时考虑编码机会、编码增益和链路质量，以避免编码感知路由算法的缺点，这些算法仅仅旨在增加编码机会，同时极大地影响网络性能的其他方面。此外，我们提出了一种针对FANET的深度确定性策略梯度（DDPG）算法的改进，结合门控循环单元（GRU）和长短期记忆网络（LSTM）算法来取代传统的神经网络结构，保证了预测精度，提高了训练效率。实验结果表明，DRL-AdCAR可以根据环境变化自适应选择具有最合适编码机会的传输路径，提高编码性能，提高网络吞吐量和分组传输速率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Vehicular Technology 工程技术-电信学

CiteScore

6.00

自引率

8.80%

发文量

1245

审稿时长

6.3 months

期刊介绍： The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.