首页 > 最新文献

Physical Communication最新文献

英文 中文
Adaptive underreach protection relay using zero-mode band time-delay dynamics 基于零模带时延动态的自适应欠伸保护继电器
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-05 DOI: 10.1016/j.phycom.2026.102998
Fusheng Li , Fan Zhang , Bin Qian , Xiaodong Zhou , Yi Luo , Xiangyong Feng
Traditional three-stage power system protection faces inherent conflicts among its four features. When protection settings avoid mal-operation during severe external faults without boundary elements, internal fault sensitivity is compromised. This limitation causes potential over-reach or under-reach mis-operation in conventional under-reach schemes. In this paper, a novel underreach local measurement-based protection scheme is proposed using zero-mode band time-delay dynamics. First, abrupt voltage changes detected in real-time by a Kalman filter initiate the proposed protection. Second, wavelet packet decomposition processes the zero-mode reverse traveling wave signal across multiple frequency bands. Two specific bands are extracted, Hilbert-transformed, and their first wave-head arrival time difference is calculated as the adaptive characteristic criterion. This value dynamically adjusts to fault locations. The theoretical travel time difference of arrival (TTDoA) serves as the dynamic action threshold. Simulations confirm accurate identification of abrupt changes and activation criteria for single-phase-to-ground faults at varying distances. The scheme reliably protects over 80% of the line length, precisely distinguishes internal from external faults, and operates correctly under high-resistance grounding faults.
传统的三级电力系统保护在其四个特征之间面临着内在的冲突。当保护设置避免了外部严重故障而没有边界元素时的误操作时,内部故障灵敏度就会降低。这一限制在传统的欠伸方案中会导致潜在的过伸或欠伸误操作。本文提出了一种基于零模带时延动态的欠伸局部测量保护方案。首先,通过卡尔曼滤波器实时检测电压突变,启动所提出的保护。其次,小波包分解处理跨多个频带的零模反向行波信号。提取两个特定波段,进行希尔伯特变换,计算其第一波头到达时间差作为自适应特征判据。该值根据故障位置动态调整。理论到达时间差(TTDoA)作为动态动作阈值。模拟验证了在不同距离上单相接地故障突变和激活判据的准确识别。该方案能够可靠地保护80%以上的线路长度,能够准确地区分内部故障和外部故障,在高阻接地故障下也能正常工作。
{"title":"Adaptive underreach protection relay using zero-mode band time-delay dynamics","authors":"Fusheng Li ,&nbsp;Fan Zhang ,&nbsp;Bin Qian ,&nbsp;Xiaodong Zhou ,&nbsp;Yi Luo ,&nbsp;Xiangyong Feng","doi":"10.1016/j.phycom.2026.102998","DOIUrl":"10.1016/j.phycom.2026.102998","url":null,"abstract":"<div><div>Traditional three-stage power system protection faces inherent conflicts among its four features. When protection settings avoid mal-operation during severe external faults without boundary elements, internal fault sensitivity is compromised. This limitation causes potential over-reach or under-reach mis-operation in conventional under-reach schemes. In this paper, a novel underreach local measurement-based protection scheme is proposed using zero-mode band time-delay dynamics. First, abrupt voltage changes detected in real-time by a Kalman filter initiate the proposed protection. Second, wavelet packet decomposition processes the zero-mode reverse traveling wave signal across multiple frequency bands. Two specific bands are extracted, Hilbert-transformed, and their first wave-head arrival time difference is calculated as the adaptive characteristic criterion. This value dynamically adjusts to fault locations. The theoretical travel time difference of arrival (TTDoA) serves as the dynamic action threshold. Simulations confirm accurate identification of abrupt changes and activation criteria for single-phase-to-ground faults at varying distances. The scheme reliably protects over 80% of the line length, precisely distinguishes internal from external faults, and operates correctly under high-resistance grounding faults.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 102998"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145915145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight generative channel estimation with adaptive regularization in massive MIMO systems 大规模MIMO系统中自适应正则化的轻量级生成信道估计
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-02-19 DOI: 10.1016/j.phycom.2026.103052
Moonil Kim, Jaewoo So
In massive multiple-input–multiple-output (MIMO) systems, as the number of transmit antennas increases, the number of transmit pilots increases for the channel estimation of user equipment. To reduce pilot overheads, generative-model-based channel estimation methods have been studied recently. In this paper, we propose a lightweight generative adversarial network (GAN)-based channel estimation scheme with adaptive regularization. Additionally, the proposed scheme groups users based on the channel collinearity matrix obtained using pre-acquired channel realizations for each user, and uses GANs trained on a per-user group basis. We design lightweight GAN architectures based on the locally centralized sparse characteristics of the beamspace channel. Here, we develop a novel objective function that adaptively determines the relative importance between the received pilots and the pre-trained generator. The simulation results show that the proposed scheme significantly improves performance in terms of the estimation accuracy and complexity when compared with conventional GAN-based channel estimation schemes. Moreover, the proposed scheme can achieve a flexible trade-off between the performance and the complexity of the training network.
在大规模多输入多输出(MIMO)系统中,随着发射天线数量的增加,用于用户设备信道估计的发射导频数量也随之增加。为了减少导频开销,近年来研究了基于生成模型的信道估计方法。本文提出了一种基于自适应正则化的轻量级生成对抗网络(GAN)信道估计方案。此外,该方案根据每个用户使用预获取的信道实现获得的信道共线性矩阵对用户进行分组,并使用以每个用户组为基础训练的gan。我们基于波束空间信道的局部集中稀疏特性设计了轻量级GAN架构。在这里,我们开发了一个新的目标函数,自适应地确定接收到的飞行员和预训练的发电机之间的相对重要性。仿真结果表明,与传统的基于gan的信道估计方案相比,该方案在估计精度和复杂度方面都有显著提高。此外,该方案还能在训练网络的性能和复杂度之间实现灵活的权衡。
{"title":"Lightweight generative channel estimation with adaptive regularization in massive MIMO systems","authors":"Moonil Kim,&nbsp;Jaewoo So","doi":"10.1016/j.phycom.2026.103052","DOIUrl":"10.1016/j.phycom.2026.103052","url":null,"abstract":"<div><div>In massive multiple-input–multiple-output (MIMO) systems, as the number of transmit antennas increases, the number of transmit pilots increases for the channel estimation of user equipment. To reduce pilot overheads, generative-model-based channel estimation methods have been studied recently. In this paper, we propose a lightweight generative adversarial network (GAN)-based channel estimation scheme with adaptive regularization. Additionally, the proposed scheme groups users based on the channel collinearity matrix obtained using pre-acquired channel realizations for each user, and uses GANs trained on a per-user group basis. We design lightweight GAN architectures based on the locally centralized sparse characteristics of the beamspace channel. Here, we develop a novel objective function that adaptively determines the relative importance between the received pilots and the pre-trained generator. The simulation results show that the proposed scheme significantly improves performance in terms of the estimation accuracy and complexity when compared with conventional GAN-based channel estimation schemes. Moreover, the proposed scheme can achieve a flexible trade-off between the performance and the complexity of the training network.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103052"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147397591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-based outlier-tolerant adaptive beamforming algorithm 基于变压器的离群容错自适应波束形成算法
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-27 DOI: 10.1016/j.phycom.2026.103024
Zheng Xu , Meng Wang , Zihao Pan , Ning Yang , Daoxing Guo
Adaptive beamforming technology plays a crucial role in modern communication systems, but existing deep learning methods lack robustness against data acquisition outliers. We propose an outlier-tolerant adaptive beamforming method based on the Transformer architecture for a 16-element linear antenna array with uniform inter-element spacing. First, an array signal model accounting for mutual coupling errors and data acquisition outliers is established, and an in-depth analysis is conducted of the performance degradation mechanism caused by data outliers arising from multiple factors, including computational precision, quantization errors, and system implementation. Second, a Transformer-based outlier signal classifier is designed that intelligently recognizes different types of outlier signals by fusing signal angle information and antenna weight features. On a large-scale dataset containing 56,373,835 samples, the classifier achieves an accuracy of 99.9990%. Finally, a two-stage Transformer adaptive beamforming method incorporating an outlier-aware mechanism is proposed. By combining the outlier-aware preprocessing module with the Transformer-based beamforming method, the system’s robustness to datasets with outliers is significantly enhanced. Experimental results demonstrate that under ideal conditions, the Transformer-based method achieves improvements in main lobe accuracy of approximately 15% and 16% compared to the traditional Null Steering Beamforming (NSB) and the Gated Recurrent Unit (GRU)-based adaptive beamforming methods, respectively, with approximately 10% improvement in the Signal to Interference plus Noise Ratio (SINR) performance. More importantly, under outlier levels of 10%-30%, the proposed outlier-tolerant method achieves significant performance improvement of approximately 75% in both main lobe accuracy and null accuracy compared to the conventional Transformer method, with main lobe accuracy stably maintained within 0.5° and null accuracy controlled within 0.1°, effectively addressing the data outlier robustness problem faced by deep learning beamforming algorithms in practical applications.
自适应波束形成技术在现代通信系统中起着至关重要的作用,但现有的深度学习方法缺乏对数据采集异常值的鲁棒性。针对均匀单元间距的16元线性天线阵列,提出了一种基于Transformer结构的离群容错自适应波束形成方法。首先,建立了考虑互耦误差和数据采集离群值的阵列信号模型,深入分析了计算精度、量化误差和系统实现等多种因素对数据离群值导致的性能下降机制。其次,设计了基于变压器的离群信号分类器,通过融合信号角度信息和天线权重特征,对不同类型的离群信号进行智能识别;在包含56,373,835个样本的大规模数据集上,分类器的准确率达到99.9990%。最后,提出了一种结合离群值感知机制的两级变压器自适应波束形成方法。通过将异常点感知预处理模块与基于变压器的波束形成方法相结合,显著增强了系统对异常点数据集的鲁棒性。实验结果表明,在理想条件下,与传统的零转向波束形成(NSB)和基于门控循环单元(GRU)的自适应波束形成方法相比,基于变压器的方法的主瓣精度分别提高了约15%和16%,信噪比(SINR)性能提高了约10%。更重要的是,在10% ~ 30%的离群值水平下,本文提出的容离群值方法在主瓣精度和零精度方面均较传统的Transformer方法取得了约75%的显著性能提升,主瓣精度稳定保持在0.5°以内,零精度控制在0.1°以内,有效解决了深度学习波束形成算法在实际应用中面临的数据离群值鲁棒性问题。
{"title":"Transformer-based outlier-tolerant adaptive beamforming algorithm","authors":"Zheng Xu ,&nbsp;Meng Wang ,&nbsp;Zihao Pan ,&nbsp;Ning Yang ,&nbsp;Daoxing Guo","doi":"10.1016/j.phycom.2026.103024","DOIUrl":"10.1016/j.phycom.2026.103024","url":null,"abstract":"<div><div>Adaptive beamforming technology plays a crucial role in modern communication systems, but existing deep learning methods lack robustness against data acquisition outliers. We propose an outlier-tolerant adaptive beamforming method based on the Transformer architecture for a 16-element linear antenna array with uniform inter-element spacing. First, an array signal model accounting for mutual coupling errors and data acquisition outliers is established, and an in-depth analysis is conducted of the performance degradation mechanism caused by data outliers arising from multiple factors, including computational precision, quantization errors, and system implementation. Second, a Transformer-based outlier signal classifier is designed that intelligently recognizes different types of outlier signals by fusing signal angle information and antenna weight features. On a large-scale dataset containing 56,373,835 samples, the classifier achieves an accuracy of 99.9990%. Finally, a two-stage Transformer adaptive beamforming method incorporating an outlier-aware mechanism is proposed. By combining the outlier-aware preprocessing module with the Transformer-based beamforming method, the system’s robustness to datasets with outliers is significantly enhanced. Experimental results demonstrate that under ideal conditions, the Transformer-based method achieves improvements in main lobe accuracy of approximately 15% and 16% compared to the traditional Null Steering Beamforming (NSB) and the Gated Recurrent Unit (GRU)-based adaptive beamforming methods, respectively, with approximately 10% improvement in the Signal to Interference plus Noise Ratio (SINR) performance. More importantly, under outlier levels of 10%-30%, the proposed outlier-tolerant method achieves significant performance improvement of approximately 75% in both main lobe accuracy and null accuracy compared to the conventional Transformer method, with main lobe accuracy stably maintained within 0.5° and null accuracy controlled within 0.1°, effectively addressing the data outlier robustness problem faced by deep learning beamforming algorithms in practical applications.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103024"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147397925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi – task reinforcement learning for UAV-enabled urban systems: Balancing trajectory planning and communication fairness 基于无人机的城市系统多任务强化学习:平衡轨迹规划和通信公平性
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-02-24 DOI: 10.1016/j.phycom.2026.103046
Xinqi He , Yun Li , Rongling Zhang
Enhancing multi-task performance in unmanned aerial vehicle (UAV)-enabled urban communication systems remains challenging due to conflicting objectives, particularly the trade-off between trajectory-planning efficiency and communication fairness. This paper addresses this issue by jointly optimizing UAV path planning and hovering communication tasks. A UAV Network Fairness-Efficiency Model (UFEM) is proposed to quantify overall system performance by integrating an energy-efficiency metric for trajectory planning with a communication-fairness index for user scheduling, thereby capturing the inherent trade-off between these objectives. Building on this framework, a reinforcement learning (RL)-based urban downlink communication system is developed to track dynamic operating conditions through three-dimensional position observations and state-transition functions, while incorporating key environmental uncertainties such as a random wind model and a probabilistic line-of-sight (LoS) model. Task-specific reward functions are further designed to balance competing objectives and enable adaptive task switching. Based on these components, we introduce the Multi-Task Reinforcement Learning for UAV Maneuvers (MRLUM) algorithm, which jointly optimizes path planning and communication scheduling by fusing flight-state information and communication-channel data through an adaptive task-switching strategy. Simulation results demonstrate that MRLUM significantly improves both trajectory-planning efficiency and communication fairness under the UFEM metric, offering a promising solution for UAV-enabled urban communication systems facing multi-task conflicts and environmental uncertainties.
由于目标冲突,特别是在轨迹规划效率和通信公平性之间的权衡,提高无人机(UAV)城市通信系统的多任务性能仍然具有挑战性。本文通过联合优化无人机路径规划和悬停通信任务来解决这一问题。提出了一种无人机网络公平效率模型(UFEM),通过集成用于轨迹规划的能效指标和用于用户调度的通信公平指标来量化系统的整体性能,从而捕获这些目标之间的内在权衡。在此框架的基础上,开发了基于强化学习(RL)的城市下行通信系统,通过三维位置观测和状态转换函数跟踪动态运行条件,同时纳入关键的环境不确定性,如随机风模型和概率视距(LoS)模型。任务特定的奖励功能进一步设计,以平衡竞争目标和实现自适应任务切换。在此基础上,引入了无人机机动多任务强化学习(MRLUM)算法,该算法通过自适应任务切换策略融合飞行状态信息和通信信道数据,共同优化路径规划和通信调度。仿真结果表明,在UFEM度量下,MRLUM显著提高了轨迹规划效率和通信公平性,为面临多任务冲突和环境不确定性的无人机城市通信系统提供了一种有希望的解决方案。
{"title":"Multi – task reinforcement learning for UAV-enabled urban systems: Balancing trajectory planning and communication fairness","authors":"Xinqi He ,&nbsp;Yun Li ,&nbsp;Rongling Zhang","doi":"10.1016/j.phycom.2026.103046","DOIUrl":"10.1016/j.phycom.2026.103046","url":null,"abstract":"<div><div>Enhancing multi-task performance in unmanned aerial vehicle (UAV)-enabled urban communication systems remains challenging due to conflicting objectives, particularly the trade-off between trajectory-planning efficiency and communication fairness. This paper addresses this issue by jointly optimizing UAV path planning and hovering communication tasks. A UAV Network Fairness-Efficiency Model (UFEM) is proposed to quantify overall system performance by integrating an energy-efficiency metric for trajectory planning with a communication-fairness index for user scheduling, thereby capturing the inherent trade-off between these objectives. Building on this framework, a reinforcement learning (RL)-based urban downlink communication system is developed to track dynamic operating conditions through three-dimensional position observations and state-transition functions, while incorporating key environmental uncertainties such as a random wind model and a probabilistic line-of-sight (LoS) model. Task-specific reward functions are further designed to balance competing objectives and enable adaptive task switching. Based on these components, we introduce the Multi-Task Reinforcement Learning for UAV Maneuvers (MRLUM) algorithm, which jointly optimizes path planning and communication scheduling by fusing flight-state information and communication-channel data through an adaptive task-switching strategy. Simulation results demonstrate that MRLUM significantly improves both trajectory-planning efficiency and communication fairness under the UFEM metric, offering a promising solution for UAV-enabled urban communication systems facing multi-task conflicts and environmental uncertainties.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103046"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147397948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel beam training method for near-field wideband extremely large antenna arrays 一种新的近场宽带超大天线阵波束训练方法
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-31 DOI: 10.1016/j.phycom.2026.103029
Deyang Zhou, Ke Huang, Jingyi Guan, Yang Wang, Jianhe Du
Extremely large antenna arrays (ELAA) is an important technology for future sixth-generation (6G) wireless networks and enables significantly high data transmission rates. However, existing beam training methods either require high overhead or perform poorly under limited training resources due to the joint estimation of angle and distance in the near-field. To address this problem, we design a novel three-stage beam training method for near-field wideband ELAA systems. Specifically, the base station (BS) first employs wide beams designed based on beam split analysis and sub-array partitioning for initial angle estimation to narrow the angle search space. Refined angle estimation of the user equipment (UE) is then achieved using symmetric narrow beams in the second stage. Finally, distance estimation is performed by leveraging the refined angle estimates to identify the optimal beam in the near-field. Simulation results show that our method achieves a near-optimal achievable rate with low training overhead. In addition, the proposed method exhibits smaller angle estimation errors compared with the existing beam training schemes.
超大型天线阵列(ELAA)是未来第六代(6G)无线网络的一项重要技术,可实现显著的高数据传输速率。然而,现有的波束训练方法由于近场角度和距离的联合估计,要么开销大,要么训练资源有限,训练效果不佳。为了解决这一问题,我们设计了一种新的近场宽带ELAA系统的三级波束训练方法。具体而言,基站首先采用基于波束劈裂分析和子阵列划分设计的宽波束进行初始角度估计,以缩小角度搜索空间。然后在第二阶段使用对称窄波束实现用户设备(UE)的精确角度估计。最后,利用改进的角度估计进行距离估计,以识别近场的最佳波束。仿真结果表明,该方法在较低的训练开销下达到了接近最优的可达率。此外,与现有的波束训练方法相比,该方法具有较小的角度估计误差。
{"title":"A novel beam training method for near-field wideband extremely large antenna arrays","authors":"Deyang Zhou,&nbsp;Ke Huang,&nbsp;Jingyi Guan,&nbsp;Yang Wang,&nbsp;Jianhe Du","doi":"10.1016/j.phycom.2026.103029","DOIUrl":"10.1016/j.phycom.2026.103029","url":null,"abstract":"<div><div>Extremely large antenna arrays (ELAA) is an important technology for future sixth-generation (6G) wireless networks and enables significantly high data transmission rates. However, existing beam training methods either require high overhead or perform poorly under limited training resources due to the joint estimation of angle and distance in the near-field. To address this problem, we design a novel three-stage beam training method for near-field wideband ELAA systems. Specifically, the base station (BS) first employs wide beams designed based on beam split analysis and sub-array partitioning for initial angle estimation to narrow the angle search space. Refined angle estimation of the user equipment (UE) is then achieved using symmetric narrow beams in the second stage. Finally, distance estimation is performed by leveraging the refined angle estimates to identify the optimal beam in the near-field. Simulation results show that our method achieves a near-optimal achievable rate with low training overhead. In addition, the proposed method exhibits smaller angle estimation errors compared with the existing beam training schemes.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103029"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147397955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MA-PPO driven autonomous decision system for UAV swarms: Integrating semantic parsing and anti-jamming RL control 基于MA-PPO驱动的无人机群自主决策系统:集成语义解析和抗干扰RL控制
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-21 DOI: 10.1016/j.phycom.2026.103016
Yiming Xiang , Han Yang
The operational effectiveness of drone swarms in complex electromagnetic environments is fundamentally limited by autonomous decision-making capabilities, particularly under dynamic interference and stringent real-time constraints. This study develops an autonomous decision-making system centered on an enhanced Multi-Agent Proximal Policy Optimization (MA-PPO) algorithm, with a core focus on joint communication-control optimization. A hierarchical policy network architecture is designed to tightly couple global task planning with local anti-interference control. Crucially, a dynamic interference model is integrated to co-optimize communication power allocation and flight trajectory planning in real-time, thereby enhancing robustness against channel uncertainty and adversarial jamming. Experimental results under -80 dBm interference intensity demonstrate a 9.4% improvement in task completion rate over MADDPG, a communication interruption rate reduced to 7.1% (19.5% of traditional PID methods), and a 107% enhancement in energy efficiency (8.9 tasks/kWh). The primary contributions are threefold: (1) a hierarchical decision architecture that enables deep coupling between planning and interference-aware control; (2) a joint optimization framework that dynamically balances communication quality with motion constraints; (3) quantitative validation in a realistic electromagnetic environment, confirming the engineering feasibility of the proposed approach for reliable swarm operations. This work provides a scalable and robust solution for autonomous drone swarms, advancing the state-of-the-art in physical-layer aware cooperative control.
无人机群在复杂电磁环境中的作战效能从根本上受到自主决策能力的限制,特别是在动态干扰和严格的实时性约束下。本研究开发了一个以增强型多智能体近端策略优化(MA-PPO)算法为中心的自主决策系统,其核心是联合通信控制优化。设计了一种分层策略网络结构,将全局任务规划与局部抗干扰控制紧密耦合。关键是,集成了动态干扰模型,实时优化通信功率分配和飞行轨迹规划,从而增强了对信道不确定性和对抗性干扰的鲁棒性。实验结果表明,在-80 dBm干扰强度下,与madpg相比,任务完成率提高了9.4%,通信中断率降低到7.1%(传统PID方法的19.5%),能源效率提高了107%(8.9个任务/千瓦时)。主要贡献有三个方面:(1)分层决策架构,实现了计划和干扰感知控制之间的深度耦合;(2)动态平衡通信质量和运动约束的联合优化框架;(3)在现实电磁环境中进行定量验证,验证了所提出的方法在可靠的群体作战中的工程可行性。这项工作为自主无人机群提供了一个可扩展和强大的解决方案,推进了物理层感知协同控制的最新技术。
{"title":"MA-PPO driven autonomous decision system for UAV swarms: Integrating semantic parsing and anti-jamming RL control","authors":"Yiming Xiang ,&nbsp;Han Yang","doi":"10.1016/j.phycom.2026.103016","DOIUrl":"10.1016/j.phycom.2026.103016","url":null,"abstract":"<div><div>The operational effectiveness of drone swarms in complex electromagnetic environments is fundamentally limited by autonomous decision-making capabilities, particularly under dynamic interference and stringent real-time constraints. This study develops an autonomous decision-making system centered on an enhanced Multi-Agent Proximal Policy Optimization (MA-PPO) algorithm, with a core focus on joint communication-control optimization. A hierarchical policy network architecture is designed to tightly couple global task planning with local anti-interference control. Crucially, a dynamic interference model is integrated to co-optimize communication power allocation and flight trajectory planning in real-time, thereby enhancing robustness against channel uncertainty and adversarial jamming. Experimental results under -80 dBm interference intensity demonstrate a 9.4% improvement in task completion rate over MADDPG, a communication interruption rate reduced to 7.1% (19.5% of traditional PID methods), and a 107% enhancement in energy efficiency (8.9 tasks/kWh). The primary contributions are threefold: (1) a hierarchical decision architecture that enables deep coupling between planning and interference-aware control; (2) a joint optimization framework that dynamically balances communication quality with motion constraints; (3) quantitative validation in a realistic electromagnetic environment, confirming the engineering feasibility of the proposed approach for reliable swarm operations. This work provides a scalable and robust solution for autonomous drone swarms, advancing the state-of-the-art in physical-layer aware cooperative control.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103016"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DQN-based optimization for enhancing the performance of RIS-NOMA system 基于dqn的RIS-NOMA系统性能优化
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2025-12-30 DOI: 10.1016/j.phycom.2025.102990
Ying Lin, Haomin Li, Bowen Zheng, Xuefeng Jing, Xiangcheng Wang
In recent years, with the continuous deepening of research in the field of communication, the utilization rate of spectrum resources and the performance improvement of communication systems in specific scenarios have become the focus of attention. In this context, the integration of non-orthogonal multiple access (NOMA) technology for multi-user spectrum resource sharing with the groundbreaking innovation of reconfigurable intelligent surfaces (RIS) represents a promising direction for in-depth exploration in the era of 6 G wireless communications.This study addresses the challenges posed by complex channel environments and introduces deep reinforcement learning into RIS-NOMA systems.By achieving real-time optimization in ultra-high-dimensional spaces, the aim is to determine novel and effective transmission strategies.Specifically, the Deep Q-Network (DQN) algorithm is employed to optimize high-dimensional decision-making in the dynamic environment of RIS-NOMA systems. By leveraging the adaptive optimization capability of DQN for dynamic channel reconstruction, this method is integrated into the RIS-NOMA system.Simulation results demonstrate that the proposed DQN-based RIS-NOMA system achieves significant improvements in key performance metrics such as achievable data rate, system throughput, and energy efficiency, substantially outperforming traditional schemes. The system throughput is increased by approximately 29 % compared to conventional methods, thereby validating the effectiveness and advancement of the proposed design. The synergistic mechanism between RIS phase regulation and NOMA power allocation provides both theoretical support and practical guidance for the future deployment of RIS-NOMA systems.
近年来,随着通信领域研究的不断深入,频谱资源的利用率和通信系统在特定场景下的性能提升成为人们关注的焦点。在此背景下,将多用户频谱资源共享的非正交多址(NOMA)技术与可重构智能表面(RIS)的突破性创新相结合,是5g无线通信时代深入探索的一个有前景的方向。本研究解决了复杂通道环境带来的挑战,并将深度强化学习引入RIS-NOMA系统。通过在超高维空间中实现实时优化,目标是确定新颖有效的传输策略。具体而言,采用Deep Q-Network (DQN)算法对RIS-NOMA系统动态环境下的高维决策进行优化。利用DQN动态信道重建的自适应优化能力,将该方法集成到RIS-NOMA系统中。仿真结果表明,提出的基于dqn的RIS-NOMA系统在可实现的数据速率、系统吞吐量和能源效率等关键性能指标上取得了显著改进,大大优于传统方案。与传统方法相比,系统吞吐量提高了约29%,从而验证了所提出设计的有效性和先进性。RIS相位调节与NOMA功率分配之间的协同机制为RIS-NOMA系统的未来部署提供了理论支持和实践指导。
{"title":"DQN-based optimization for enhancing the performance of RIS-NOMA system","authors":"Ying Lin,&nbsp;Haomin Li,&nbsp;Bowen Zheng,&nbsp;Xuefeng Jing,&nbsp;Xiangcheng Wang","doi":"10.1016/j.phycom.2025.102990","DOIUrl":"10.1016/j.phycom.2025.102990","url":null,"abstract":"<div><div>In recent years, with the continuous deepening of research in the field of communication, the utilization rate of spectrum resources and the performance improvement of communication systems in specific scenarios have become the focus of attention. In this context, the integration of non-orthogonal multiple access (NOMA) technology for multi-user spectrum resource sharing with the groundbreaking innovation of reconfigurable intelligent surfaces (RIS) represents a promising direction for in-depth exploration in the era of 6 G wireless communications.This study addresses the challenges posed by complex channel environments and introduces deep reinforcement learning into RIS-NOMA systems.By achieving real-time optimization in ultra-high-dimensional spaces, the aim is to determine novel and effective transmission strategies.Specifically, the Deep Q-Network (DQN) algorithm is employed to optimize high-dimensional decision-making in the dynamic environment of RIS-NOMA systems. By leveraging the adaptive optimization capability of DQN for dynamic channel reconstruction, this method is integrated into the RIS-NOMA system.Simulation results demonstrate that the proposed DQN-based RIS-NOMA system achieves significant improvements in key performance metrics such as achievable data rate, system throughput, and energy efficiency, substantially outperforming traditional schemes. The system throughput is increased by approximately 29 % compared to conventional methods, thereby validating the effectiveness and advancement of the proposed design. The synergistic mechanism between RIS phase regulation and NOMA power allocation provides both theoretical support and practical guidance for the future deployment of RIS-NOMA systems.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 102990"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive fluid antenna deployment for improved wireless reliability 自适应流体天线部署,提高无线可靠性
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-13 DOI: 10.1016/j.phycom.2026.103002
Ahmed S. Alwakeel , Mohamed H. Saad , Mohamed S. Elbakry
Fluid Antenna System (FAS) have emerged as a promising solution for improving wireless communication by allowing an antenna’s placement within a device to dynamically adjust to its surroundings. This flexibility improves signal quality, link stability, and spectrum efficiency without requiring the deployment of extra antennas. However, realizing the full potential of FAS necessitates determining the ideal antenna arrangement, which is a difficult, multidimensional challenge driven by user locations and signal propagation parameters. To address this issue, this research proposes using the Whale Optimization Algorithm (WHO) for efficient FAS tuning. WHO automatically searches the solution space for ideal antenna placements that improve network performance while reducing deployment complexity. Simulation results show that WHO outperforms traditional methods such as Gaussian approximation (GA) and Particle Swarm Optimization (PSO), achieving better connection with fewer antennas–only three vs four and five for GA and PSO, respectively. WHO improves convergence by 49.6% compared to GA and reduces inference time by 35% compared to Differential Evolution (DE), making it suitable for real-time, adaptive, and resource-efficient wireless networks.
流体天线系统(FAS)是一种很有前途的无线通信解决方案,它允许天线在设备内的位置根据周围环境进行动态调整。这种灵活性提高了信号质量、链路稳定性和频谱效率,而无需部署额外的天线。然而,要充分发挥FAS的潜力,必须确定理想的天线布置,这是一项困难的、多维的挑战,受用户位置和信号传播参数的驱动。为了解决这个问题,本研究提出使用鲸鱼优化算法(WHO)进行有效的FAS调整。世卫组织自动搜索解决方案空间,寻找理想的天线放置位置,以提高网络性能,同时降低部署复杂性。仿真结果表明,WHO优于传统方法,如高斯近似(GA)和粒子群优化(PSO),用更少的天线实现了更好的连接——GA和PSO分别只有3个天线和5个天线。与遗传算法相比,WHO的收敛性提高了49.6%,与差分进化(DE)相比,推理时间缩短了35%,使其适用于实时、自适应和资源高效的无线网络。
{"title":"Adaptive fluid antenna deployment for improved wireless reliability","authors":"Ahmed S. Alwakeel ,&nbsp;Mohamed H. Saad ,&nbsp;Mohamed S. Elbakry","doi":"10.1016/j.phycom.2026.103002","DOIUrl":"10.1016/j.phycom.2026.103002","url":null,"abstract":"<div><div>Fluid Antenna System (FAS) have emerged as a promising solution for improving wireless communication by allowing an antenna’s placement within a device to dynamically adjust to its surroundings. This flexibility improves signal quality, link stability, and spectrum efficiency without requiring the deployment of extra antennas. However, realizing the full potential of FAS necessitates determining the ideal antenna arrangement, which is a difficult, multidimensional challenge driven by user locations and signal propagation parameters. To address this issue, this research proposes using the Whale Optimization Algorithm (WHO) for efficient FAS tuning. WHO automatically searches the solution space for ideal antenna placements that improve network performance while reducing deployment complexity. Simulation results show that WHO outperforms traditional methods such as Gaussian approximation (GA) and Particle Swarm Optimization (PSO), achieving better connection with fewer antennas–only three vs four and five for GA and PSO, respectively. WHO improves convergence by 49.6% compared to GA and reduces inference time by 35% compared to Differential Evolution (DE), making it suitable for real-time, adaptive, and resource-efficient wireless networks.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103002"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autoencoder-Pelican optimization for nonlinear impairment mitigation in long-haul optical fiber systems 远距离光纤系统非线性损伤缓解的自编码器-鹈鹕优化
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-13 DOI: 10.1016/j.phycom.2025.102972
Zahid Zaman , Yousaf Khan , Farman Ali , Ammar Armghan , Muhammad Kamran Shereen , Sultan S. Aldkeelalah , Mardeni Roslee
Long-haul optical transmission (LHOT) systems are affected by nonlinear impairments (NIs), including self-phase modulation (SPM), cross-phase modulation (XPM), four-wave mixing (FWM), amplified spontaneous emission (ASE) noise, and Kerr nonlinearities, which limit achievable data rates and system reach. Conventional methods, such as digital back-propagation (DBP), optical phase conjugation (OPC), and DSP-assisted receivers, have demonstrated mitigation capabilities but suffer from high computational complexity, latency, and power consumption, making them impractical for large-scale networks. Machine learning (ML) approaches, including label propagation and transformer-based schemes, reduce some processing overhead yet do not perform dimensionality reduction for feature compression and lack a mechanism to jointly handle multiple nonlinear effects across LHOT. Furthermore, most reported works do not align with optical communication standards, such as ITU-T G.652.D or OS1/OS2 fibers, which limits their practical implementation in standardized infrastructures.
This work proposes an autoencoder-based pelican optimization algorithm (APOA) for NIs mitigation in LHOT systems. The autoencoder compresses high-dimensional signal distortions into a latent space that preserves nonlinear mappings, reducing computational load while maintaining representation accuracy. The POA performs parameter tuning to optimize signal recovery in the presence of nonlinear effects and noise. The transmission channel is modeled using the nonlinear Schrŏdinger equation (NLSE), with propagation distortions characterized by ITU-T G.652.D single-mode fiber (SMF) parameters: attenuation of 0.20 dB/km, chromatic dispersion of  ∼ 17 ps/nm/km at 1550 nm, effective area of 80 µm2, and nonlinear coefficient γ ≈ 1.3 W1km1. Simulations are conducted using parameter settings aligned with OS1/OS2 fiber specifications (9 µm core diameter) and representative optical communication terminal (OCT) configurations, to reflect realistic long-haul transmission environments. Performance evaluation across multiple OSNR levels, fiber lengths, and modulation formats uses FEC thresholds and operating ranges that are consistent with IEEE 802.3 Ethernet and ITU-T G.709 OTN reference values, showing that APOA achieves BER values below the adopted FEC thresholds, increases spectral efficiency, and extends transmission reach.
远程光传输(LHOT)系统受到非线性损伤(NIs)的影响,包括自相位调制(SPM)、交叉相位调制(XPM)、四波混频(FWM)、放大自发发射(ASE)噪声和克尔非线性,这些非线性损伤限制了可实现的数据速率和系统覆盖范围。传统的方法,如数字反向传播(DBP)、光相位共轭(OPC)和dsp辅助接收器,已经证明了缓解能力,但存在较高的计算复杂性、延迟和功耗,使它们不适用于大规模网络。机器学习(ML)方法,包括标签传播和基于变压器的方案,减少了一些处理开销,但没有对特征压缩进行降维,并且缺乏一种机制来共同处理跨LHOT的多个非线性效应。此外,大多数报道的工作不符合光通信标准,如ITU-T G.652。D或OS1/OS2光纤,这限制了它们在标准化基础设施中的实际实现。本文提出了一种基于自编码器的鹈鹕优化算法(APOA),用于LHOT系统中的NIs缓解。自动编码器将高维信号失真压缩到保留非线性映射的潜在空间中,在保持表示精度的同时减少了计算负荷。在存在非线性效应和噪声的情况下,POA执行参数调谐以优化信号恢复。传输信道使用非线性Schrŏdinger方程(NLSE)建模,传输失真由ITU-T G.652表征。D单模光纤(SMF)参数:衰减0.20 dB/km,色散在1550 nm处为 ~ 17 ps/nm/km,有效面积80µm2,非线性系数γ ≈ 1.3 W−1km−1。模拟采用OS1/OS2光纤规格(芯径9µm)和典型光通信终端(OCT)配置进行参数设置,以反映真实的长途传输环境。跨多种OSNR水平、光纤长度和调制格式的性能评估使用与IEEE 802.3以太网和ITU-T G.709 OTN参考值一致的FEC阈值和工作范围,表明APOA实现了低于所采用的FEC阈值的误码率值,提高了频谱效率,并延长了传输距离。
{"title":"Autoencoder-Pelican optimization for nonlinear impairment mitigation in long-haul optical fiber systems","authors":"Zahid Zaman ,&nbsp;Yousaf Khan ,&nbsp;Farman Ali ,&nbsp;Ammar Armghan ,&nbsp;Muhammad Kamran Shereen ,&nbsp;Sultan S. Aldkeelalah ,&nbsp;Mardeni Roslee","doi":"10.1016/j.phycom.2025.102972","DOIUrl":"10.1016/j.phycom.2025.102972","url":null,"abstract":"<div><div>Long-haul optical transmission (LHOT) systems are affected by nonlinear impairments (NIs), including self-phase modulation (SPM), cross-phase modulation (XPM), four-wave mixing (FWM), amplified spontaneous emission (ASE) noise, and Kerr nonlinearities, which limit achievable data rates and system reach. Conventional methods, such as digital back-propagation (DBP), optical phase conjugation (OPC), and DSP-assisted receivers, have demonstrated mitigation capabilities but suffer from high computational complexity, latency, and power consumption, making them impractical for large-scale networks. Machine learning (ML) approaches, including label propagation and transformer-based schemes, reduce some processing overhead yet do not perform dimensionality reduction for feature compression and lack a mechanism to jointly handle multiple nonlinear effects across LHOT. Furthermore, most reported works do not align with optical communication standards, such as ITU-T G.652.D or OS1/OS2 fibers, which limits their practical implementation in standardized infrastructures.</div><div>This work proposes an autoencoder-based pelican optimization algorithm (APOA) for NIs mitigation in LHOT systems. The autoencoder compresses high-dimensional signal distortions into a latent space that preserves nonlinear mappings, reducing computational load while maintaining representation accuracy. The POA performs parameter tuning to optimize signal recovery in the presence of nonlinear effects and noise. The transmission channel is modeled using the nonlinear Schrŏdinger equation (NLSE), with propagation distortions characterized by ITU-T G.652.D single-mode fiber (SMF) parameters: attenuation of 0.20 dB/km, chromatic dispersion of  ∼ 17 ps/nm/km at 1550 nm, effective area of 80 µm<sup>2</sup>, and nonlinear coefficient <em>γ</em> ≈ 1.3 W<span><math><msup><mrow></mrow><mrow><mo>−</mo><mn>1</mn></mrow></msup></math></span>km<span><math><msup><mrow></mrow><mrow><mo>−</mo><mn>1</mn></mrow></msup></math></span>. Simulations are conducted using parameter settings aligned with OS1/OS2 fiber specifications (9 µm core diameter) and representative optical communication terminal (OCT) configurations, to reflect realistic long-haul transmission environments. Performance evaluation across multiple OSNR levels, fiber lengths, and modulation formats uses FEC thresholds and operating ranges that are consistent with IEEE 802.3 Ethernet and ITU-T G.709 OTN reference values, showing that APOA achieves BER values below the adopted FEC thresholds, increases spectral efficiency, and extends transmission reach.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 102972"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint optimization of resource and position for UAV secure two-Way relay systems using reinforcement learning 基于强化学习的无人机安全双向中继系统资源与位置联合优化
IF 2.2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-03-01 Epub Date: 2026-01-08 DOI: 10.1016/j.phycom.2026.103004
Bin Li , Jie Ding , Hui Li , Jinlong Shi , Xin Zuo
With the rapid development of 5G and the forthcoming B5G/6G networks, unmanned aerial vehicles (UAVs) have been widely adopted in communication systems for their flexible deployment and integrated air-space-ground coverage capabilities. However, UAV communications are highly vulnerable to eavesdropping and jamming attacks, posing a severe threat to communication security. To address this problem, we construct a joint resource and position optimization framework based on the soft Actor-Critic (SAC) algorithm for a secure Two-Way Relay (TWR) system of UAV enabled with Non-Orthogonal Multiple Access (NOMA) technology. In this framework, NOMA technology is incorporated into the TWR relay transmission to achieve spectrum reuse and multi-user parallel communication. The UAV’s position and power allocation are modeled as a Markov Decision Process (MDP), which is intelligently optimized using deep reinforcement learning. We aim to maximize the overall secrecy rate of the system in a dynamic environment while minimizing constraint violations and eavesdropping risks. Simulation results demonstrate that, compared with A2C and PPO algorithms, the proposed SAC-based approach achieves superior convergence speed, stability, and anti-eavesdropping performance, providing technical references for NOMA-based secure UAV communications in B5G/6G networks.
随着5G和即将到来的B5G/6G网络的快速发展,无人机因其灵活部署和综合空-地覆盖能力在通信系统中得到广泛应用。然而,无人机通信极易受到窃听和干扰攻击,对通信安全构成严重威胁。为了解决这一问题,针对采用非正交多址(NOMA)技术的无人机安全双向中继(TWR)系统,构建了一个基于软actor - critical (SAC)算法的联合资源和位置优化框架。在该框架中,将NOMA技术融入TWR中继传输中,实现频谱复用和多用户并行通信。将无人机的位置和功率分配建模为马尔可夫决策过程(MDP),利用深度强化学习对其进行智能优化。我们的目标是在动态环境下最大化系统的整体保密率,同时最小化约束违反和窃听风险。仿真结果表明,与A2C和PPO算法相比,本文提出的基于sac的方法具有更好的收敛速度、稳定性和抗窃听性能,为B5G/6G网络中基于noma的无人机安全通信提供了技术参考。
{"title":"Joint optimization of resource and position for UAV secure two-Way relay systems using reinforcement learning","authors":"Bin Li ,&nbsp;Jie Ding ,&nbsp;Hui Li ,&nbsp;Jinlong Shi ,&nbsp;Xin Zuo","doi":"10.1016/j.phycom.2026.103004","DOIUrl":"10.1016/j.phycom.2026.103004","url":null,"abstract":"<div><div>With the rapid development of 5G and the forthcoming B5G/6G networks, unmanned aerial vehicles (UAVs) have been widely adopted in communication systems for their flexible deployment and integrated air-space-ground coverage capabilities. However, UAV communications are highly vulnerable to eavesdropping and jamming attacks, posing a severe threat to communication security. To address this problem, we construct a joint resource and position optimization framework based on the soft Actor-Critic (SAC) algorithm for a secure Two-Way Relay (TWR) system of UAV enabled with Non-Orthogonal Multiple Access (NOMA) technology. In this framework, NOMA technology is incorporated into the TWR relay transmission to achieve spectrum reuse and multi-user parallel communication. The UAV’s position and power allocation are modeled as a Markov Decision Process (MDP), which is intelligently optimized using deep reinforcement learning. We aim to maximize the overall secrecy rate of the system in a dynamic environment while minimizing constraint violations and eavesdropping risks. Simulation results demonstrate that, compared with A2C and PPO algorithms, the proposed SAC-based approach achieves superior convergence speed, stability, and anti-eavesdropping performance, providing technical references for NOMA-based secure UAV communications in B5G/6G networks.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"75 ","pages":"Article 103004"},"PeriodicalIF":2.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Physical Communication
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1