首页 > 最新文献

IEEE Transactions on Network and Service Management最新文献

英文 中文
Joint Multi-Agent Reinforcement Learning and Message-Passing for Resilient Multi-UAV Networks 弹性多无人机网络的联合多智能体强化学习与消息传递
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-02 DOI: 10.1109/TNSM.2025.3650697
Yeryeong Cho;Sungwon Yi;Soohyun Park
This paper introduces a novel resilient algorithm designed for distributed un-crewed aerial vehicles (UAVs) in dynamic and unreliable network environments. Initially, the UAVs should be trained via multi-agent reinforcement learning (MARL) for autonomous mission-critical operations and are fundamentally grounded by centralized training and decentralized execution (CTDE) using a centralized MARL server. In this situation, it is crucial to consider the case where several UAVs cannot receive CTDE-based MARL learning parameters for resilient operations in unreliable network conditions. To tackle this issue, a communication graph is used where its edges are established when two UAVs/nodes are communicable. Then, the edge-connected UAVs can share their training data if one of the UAVs cannot be connected to the CTDE-based MARL server under unreliable network conditions. Additionally, the edge cost considers power efficiency. Based on this given communication graph, message-passing is used for electing the UAVs that can provide their MARL learning parameters to their edge-connected peers. Lastly, performance evaluations demonstrate the superiority of our proposed algorithm in terms of power efficiency and resilient UAV task management, outperforming existing benchmark algorithms.
本文介绍了一种针对分布式无人飞行器在动态和不可靠网络环境下的弹性算法。最初,无人机应该通过多智能体强化学习(MARL)进行自主关键任务操作的训练,并通过使用集中式MARL服务器的集中训练和分散执行(CTDE)从根本上建立基础。在这种情况下,考虑几种无人机在不可靠网络条件下无法接收基于ctde的MARL学习参数以进行弹性操作的情况至关重要。为了解决这个问题,使用了一个通信图,当两个无人机/节点是可通信的时,它的边缘被建立。然后,在不可靠的网络条件下,如果其中一架无人机无法连接到基于ctde的MARL服务器,则边缘连接无人机可以共享其训练数据。此外,边缘成本考虑了功率效率。在此通信图的基础上,采用消息传递的方法选择能够向边缘连接的节点提供MARL学习参数的无人机。最后,性能评估证明了我们提出的算法在功率效率和弹性无人机任务管理方面的优势,优于现有的基准算法。
{"title":"Joint Multi-Agent Reinforcement Learning and Message-Passing for Resilient Multi-UAV Networks","authors":"Yeryeong Cho;Sungwon Yi;Soohyun Park","doi":"10.1109/TNSM.2025.3650697","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3650697","url":null,"abstract":"This paper introduces a novel resilient algorithm designed for distributed un-crewed aerial vehicles (UAVs) in dynamic and unreliable network environments. Initially, the UAVs should be trained via multi-agent reinforcement learning (MARL) for autonomous mission-critical operations and are fundamentally grounded by centralized training and decentralized execution (CTDE) using a centralized MARL server. In this situation, it is crucial to consider the case where several UAVs cannot receive CTDE-based MARL learning parameters for resilient operations in unreliable network conditions. To tackle this issue, a communication graph is used where its edges are established when two UAVs/nodes are communicable. Then, the edge-connected UAVs can share their training data if one of the UAVs cannot be connected to the CTDE-based MARL server under unreliable network conditions. Additionally, the edge cost considers power efficiency. Based on this given communication graph, message-passing is used for electing the UAVs that can provide their MARL learning parameters to their edge-connected peers. Lastly, performance evaluations demonstrate the superiority of our proposed algorithm in terms of power efficiency and resilient UAV task management, outperforming existing benchmark algorithms.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"2051-2063"},"PeriodicalIF":5.4,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cooperative Multi-Agent Strategy for Caching of Transient Data in Edge-Assisted IoT Networks 边缘辅助物联网中瞬态数据缓存的协同多智能体策略
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-30 DOI: 10.1109/TNSM.2025.3649503
Surabhi Sharma;Sateesh Kumar Peddoju
Internet of Things (IoT) applications continuously generate large volumes of transient data. Delivering transient data efficiently is challenging because it is short-lived, highly dynamic, and often critical for time-sensitive services. Caching at the Edge offers a practical solution by storing frequently requested content closer to users, reducing delivery delays, and easing network congestion. However, existing caching approaches in Edge-assisted IoT networks face four significant limitations: (i) lack of freshness-aware policies, leading to outdated data, (ii) static or centralized coordination, which restricts scalability, (iii) inability to adapt to bursty and heterogeneous traffic patterns, and (iv) inefficient handling of resource-constrained Edge nodes. IoT-Cooperative Caching (IoT-C ${}^{2}$ ) addresses these issues with a framework based on multi-agent reinforcement learning. Using the framework, Edge servers make decentralized, adaptive decisions that account for both user demand and data freshness. IoT-C2 introduces topic-based grouping of Edge nodes and a hierarchical state model that supports collaboration across local, group, and global states. Experiments show that IoT-C2 increases cache hit rates, reduces latency, and improves freshness compared with state-of-the-art techniques. These improvements make the proposed approach well-suited for time-critical IoT applications like smart cities, healthcare, and industrial networks.
物联网(IoT)应用不断产生大量的瞬态数据。有效地交付暂态数据是一项挑战,因为它是短暂的、高度动态的,并且对于时间敏感的服务通常是至关重要的。Edge上的缓存提供了一个实用的解决方案,它将频繁请求的内容存储在离用户更近的地方,减少了交付延迟,并缓解了网络拥塞。然而,边缘辅助物联网网络中现有的缓存方法面临四个重大限制:(i)缺乏新鲜度感知策略,导致数据过时;(ii)静态或集中协调,限制了可扩展性;(iii)无法适应突发和异构流量模式;(iv)对资源受限边缘节点的低效处理。IoT-Cooperative Caching (IoT-C ${}^{2}$)通过基于多智能体强化学习的框架解决了这些问题。使用该框架,Edge服务器可以根据用户需求和数据新鲜度做出分散的自适应决策。IoT-C2引入了基于主题的Edge节点分组和分层状态模型,支持跨本地、组和全局状态的协作。实验表明,与最先进的技术相比,IoT-C2提高了缓存命中率,减少了延迟,并提高了新鲜度。这些改进使所提出的方法非常适合智能城市、医疗保健和工业网络等时间关键型物联网应用。
{"title":"Cooperative Multi-Agent Strategy for Caching of Transient Data in Edge-Assisted IoT Networks","authors":"Surabhi Sharma;Sateesh Kumar Peddoju","doi":"10.1109/TNSM.2025.3649503","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3649503","url":null,"abstract":"Internet of Things (IoT) applications continuously generate large volumes of transient data. Delivering transient data efficiently is challenging because it is short-lived, highly dynamic, and often critical for time-sensitive services. Caching at the Edge offers a practical solution by storing frequently requested content closer to users, reducing delivery delays, and easing network congestion. However, existing caching approaches in Edge-assisted IoT networks face four significant limitations: (i) lack of freshness-aware policies, leading to outdated data, (ii) static or centralized coordination, which restricts scalability, (iii) inability to adapt to bursty and heterogeneous traffic patterns, and (iv) inefficient handling of resource-constrained Edge nodes. IoT-Cooperative Caching (IoT-C<inline-formula> <tex-math>${}^{2}$ </tex-math></inline-formula>) addresses these issues with a framework based on multi-agent reinforcement learning. Using the framework, Edge servers make decentralized, adaptive decisions that account for both user demand and data freshness. IoT-C2 introduces topic-based grouping of Edge nodes and a hierarchical state model that supports collaboration across local, group, and global states. Experiments show that IoT-C2 increases cache hit rates, reduces latency, and improves freshness compared with state-of-the-art techniques. These improvements make the proposed approach well-suited for time-critical IoT applications like smart cities, healthcare, and industrial networks.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1369-1380"},"PeriodicalIF":5.4,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dynamic PAPR Reduction Method Using PTS-ESSA for MIMO Generalized FDM Wireless System 基于PTS-ESSA的MIMO广义FDM无线系统动态PAPR减小方法
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-30 DOI: 10.1109/TNSM.2025.3619945
Surendra Kumar;Jitendra Kumar Samriya;Rajeev Tiwari;Mohit Kumar;Shilpi Harnal;Neeraj Kumar;Mohsen Guizani
Generalized Frequency Division Multiplexing (GFDM) is considered a strong candidate to replace Orthogonal Frequency Division Multiplexing (OFDM) in 5G MIMO networks because of its enhanced spectral utilization and design flexibility. Despite these advantages, GFDM faces the drawback of producing a relatively high Peak-to-Average Power Ratio (PAPR), which limits the efficiency of power amplifiers. To address this issue, the Partial Transmit Sequence (PTS) method is often employed for PAPR reduction. Nevertheless, the effectiveness of PTS is hindered by the intensive computational effort required for searching multiple phase factors. To overcome this challenge, we propose a method that integrates the Enhanced Squirrel Search Algorithm (ESSA) with an adaptive parameter control mechanism and a Grey Wolf Optimizer (GWO), enabling a dynamic balance between exploration and exploitation during phase factor selection. This improvement reduces the computational overhead, accelerates the convergence, and enhances the robustness of the phase sequence optimization. Simulation results show that the Hybrid PTS-ESSA-GWO-RPSM model achieves superior PAPR reduction compared to conventional ESSA-based approaches, while also providing better BER and SNR performance under varying channel conditions. The proposed method therefore offers an efficient trade-off between complexity and PAPR reduction, making it suitable for practical deployment in MIMO-GFDM-based 5G systems. The proposed scheme is evaluated against related methods by analyzing key performance indicators, including Complementary Cumulative Distribution Function (CCDF), Bit Error Rate (BER), Peak-to-Average Power Ratio (PAPR), and Signal-to-Noise Ratio (SNR).
广义频分复用(GFDM)由于其提高了频谱利用率和设计灵活性,被认为是5G MIMO网络中取代正交频分复用(OFDM)的有力候选技术。尽管有这些优点,GFDM面临的缺点是产生相对较高的峰值平均功率比(PAPR),这限制了功率放大器的效率。为了解决这个问题,通常采用部分传输序列(PTS)方法来降低PAPR。然而,PTS的有效性受到搜索多个相位因子所需的大量计算工作量的阻碍。为了克服这一挑战,我们提出了一种将增强型松鼠搜索算法(ESSA)与自适应参数控制机制和灰狼优化器(GWO)相结合的方法,在相位因子选择过程中实现了勘探和开发之间的动态平衡。这种改进减少了计算量,加快了收敛速度,增强了相序优化的鲁棒性。仿真结果表明,与传统的基于essa的方法相比,混合PTS-ESSA-GWO-RPSM模型具有更好的PAPR降低效果,同时在不同信道条件下也具有更好的BER和信噪比性能。因此,所提出的方法在复杂性和PAPR降低之间提供了有效的权衡,使其适用于基于mimo - gfdm的5G系统的实际部署。通过分析互补累积分布函数(CCDF)、误码率(BER)、峰均功率比(PAPR)和信噪比(SNR)等关键性能指标,对该方案进行了评价。
{"title":"A Dynamic PAPR Reduction Method Using PTS-ESSA for MIMO Generalized FDM Wireless System","authors":"Surendra Kumar;Jitendra Kumar Samriya;Rajeev Tiwari;Mohit Kumar;Shilpi Harnal;Neeraj Kumar;Mohsen Guizani","doi":"10.1109/TNSM.2025.3619945","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3619945","url":null,"abstract":"Generalized Frequency Division Multiplexing (GFDM) is considered a strong candidate to replace Orthogonal Frequency Division Multiplexing (OFDM) in 5G MIMO networks because of its enhanced spectral utilization and design flexibility. Despite these advantages, GFDM faces the drawback of producing a relatively high Peak-to-Average Power Ratio (PAPR), which limits the efficiency of power amplifiers. To address this issue, the Partial Transmit Sequence (PTS) method is often employed for PAPR reduction. Nevertheless, the effectiveness of PTS is hindered by the intensive computational effort required for searching multiple phase factors. To overcome this challenge, we propose a method that integrates the Enhanced Squirrel Search Algorithm (ESSA) with an adaptive parameter control mechanism and a Grey Wolf Optimizer (GWO), enabling a dynamic balance between exploration and exploitation during phase factor selection. This improvement reduces the computational overhead, accelerates the convergence, and enhances the robustness of the phase sequence optimization. Simulation results show that the Hybrid PTS-ESSA-GWO-RPSM model achieves superior PAPR reduction compared to conventional ESSA-based approaches, while also providing better BER and SNR performance under varying channel conditions. The proposed method therefore offers an efficient trade-off between complexity and PAPR reduction, making it suitable for practical deployment in MIMO-GFDM-based 5G systems. The proposed scheme is evaluated against related methods by analyzing key performance indicators, including Complementary Cumulative Distribution Function (CCDF), Bit Error Rate (BER), Peak-to-Average Power Ratio (PAPR), and Signal-to-Noise Ratio (SNR).","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1162-1175"},"PeriodicalIF":5.4,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145852470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HMCGeo: IP Region Prediction Based on Hierarchical Multi-Label Classification HMCGeo:基于分层多标签分类的IP区域预测
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-26 DOI: 10.1109/TNSM.2025.3648815
Tianzi Zhao;Xinran Liu;Zhaoxin Zhang;Dong Zhao;Ning Li;Zhichao Zhang;Xinye Wang
Fine-grained IP geolocation plays a critical role in applications such as location-based services and cybersecurity. Most existing fine-grained IP geolocation methods treat the task as a coordinate regression problem. However, due to inherent noise in the IP features, they frequently produce kilometer-scale coordinate errors, which in turn lead to inaccuracies when the predicted coordinates are mapped to geographic regions. To alleviate this issue, this paper introduces a novel IP region prediction framework HMCGeo, framing IP region prediction as a hierarchical multi-label classification problem. City administrators divide urban areas into regions at multiple granularities. The proposed framework employs residual connection-based feature extraction units to obtain IP representations at each granularity and introduces class prototype attention to predict the IP’s belonging region at the current granularity. Additionally, it adopts an output fusion strategy combined with hierarchical focal loss to further enhance region prediction performance. We evaluate HMCGeo on real-world datasets from New York, Los Angeles, and Shanghai. It significantly outperforms existing methods in region prediction across all granularities and achieves lower coordinate errors on most samples by similarity-weighted averaging of candidate region centers.
细粒度IP地理定位在诸如基于位置的服务和网络安全等应用中起着至关重要的作用。大多数现有的细粒度IP地理定位方法都将该任务视为坐标回归问题。然而,由于IP特征中固有的噪声,它们经常产生千米尺度的坐标误差,从而导致在将预测坐标映射到地理区域时不准确。为了解决这一问题,本文引入了一种新的IP区域预测框架HMCGeo,将IP区域预测框架化为分层多标签分类问题。城市管理者将城市划分为多个粒度的区域。该框架采用基于残差连接的特征提取单元来获得每个粒度的IP表示,并引入类原型关注来预测当前粒度的IP所属区域。此外,采用了一种结合层次焦点损失的输出融合策略,进一步提高了区域预测性能。我们在纽约、洛杉矶和上海的真实数据集上评估了HMCGeo。该方法在所有粒度的区域预测方面都明显优于现有方法,并且通过对候选区域中心进行相似度加权平均,在大多数样本上实现了较低的坐标误差。
{"title":"HMCGeo: IP Region Prediction Based on Hierarchical Multi-Label Classification","authors":"Tianzi Zhao;Xinran Liu;Zhaoxin Zhang;Dong Zhao;Ning Li;Zhichao Zhang;Xinye Wang","doi":"10.1109/TNSM.2025.3648815","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3648815","url":null,"abstract":"Fine-grained IP geolocation plays a critical role in applications such as location-based services and cybersecurity. Most existing fine-grained IP geolocation methods treat the task as a coordinate regression problem. However, due to inherent noise in the IP features, they frequently produce kilometer-scale coordinate errors, which in turn lead to inaccuracies when the predicted coordinates are mapped to geographic regions. To alleviate this issue, this paper introduces a novel IP region prediction framework HMCGeo, framing IP region prediction as a hierarchical multi-label classification problem. City administrators divide urban areas into regions at multiple granularities. The proposed framework employs residual connection-based feature extraction units to obtain IP representations at each granularity and introduces class prototype attention to predict the IP’s belonging region at the current granularity. Additionally, it adopts an output fusion strategy combined with hierarchical focal loss to further enhance region prediction performance. We evaluate HMCGeo on real-world datasets from New York, Los Angeles, and Shanghai. It significantly outperforms existing methods in region prediction across all granularities and achieves lower coordinate errors on most samples by similarity-weighted averaging of candidate region centers.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1913-1926"},"PeriodicalIF":5.4,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks 基于阈值触发深度q -网络的自主软件定义iiot边缘网络自修复框架
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-24 DOI: 10.1109/TNSM.2025.3647853
Agrippina Mwangi;León Navarro-Hilfiker;Lukasz Brewka;Mikkel Gryning;Elena Fumagalli;Madeleine Gibescu
Stochastic disruptions such as flash events arising from benign traffic bursts and switch thermal fluctuations are major contributors to intermittent service degradation in software-defined industrial networks. These events violate IEC 61850-derived quality of service requirements and user-defined service-level agreements, hindering the reliable and timely delivery of control, monitoring, and best-effort traffic in IEC 61400-25-compliant wind power plants. Failure to maintain these requirements often results in delayed or lost control signals, reduced operational efficiency, and increased risk of wind turbine generator downtime. To address these challenges, this study proposes a threshold-triggered Deep Q-Network self-healing agent that autonomically detects, analyzes, and mitigates network disruptions while adapting routing behavior and resource allocation in real time. The proposed agent was trained, validated, and tested on an emulated tri-clustered switch network deployed in a cloud-based proof-of-concept testbed. Simulation results show that the proposed agent improves disruption recovery performance by 53.84% compared to a baseline shortest-path and load-balanced routing approach, and outperforms state-of-the-art methods, including the Adaptive Network-based Fuzzy Inference System by 13.1% and the Deep Q-Network and Traffic Prediction-based Routing Optimization method by 21.5%, in a super-spine leaf data-plane architecture. Additionally, the agent maintains switch thermal stability by proactively initiating external rack cooling when required. These findings highlight the potential of deep reinforcement learning in building resilience in software-defined industrial networks deployed in mission-critical, time-sensitive application scenarios.
随机中断,如由良性流量突发和交换机热波动引起的闪电事件,是软件定义工业网络中间歇性服务退化的主要原因。这些事件违反了IEC 61850衍生的服务质量要求和用户自定义的服务水平协议,阻碍了符合IEC 61400-25标准的风力发电厂可靠、及时地提供控制、监测和尽力而为的流量。如果不能保持这些要求,通常会导致控制信号延迟或丢失,降低运行效率,并增加风力发电机停机的风险。为了应对这些挑战,本研究提出了一种阈值触发的Deep Q-Network自愈代理,该代理可以自动检测、分析和减轻网络中断,同时实时适应路由行为和资源分配。提出的代理在部署在基于云的概念验证测试平台中的模拟三集群交换网络上进行了训练、验证和测试。仿真结果表明,与基线最短路径和负载均衡路由方法相比,所提出的智能体的中断恢复性能提高了53.84%,并且在超级脊柱叶数据平面架构中优于最先进的方法,包括基于自适应网络的模糊推理系统(Adaptive Network-based Fuzzy Inference System) 13.1%和基于深度Q-Network和流量预测的路由优化方法21.5%。此外,代理保持开关热稳定性主动启动外部机架冷却时需要。这些发现突出了深度强化学习在关键任务、时间敏感应用场景中部署的软件定义工业网络中建立弹性的潜力。
{"title":"A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks","authors":"Agrippina Mwangi;León Navarro-Hilfiker;Lukasz Brewka;Mikkel Gryning;Elena Fumagalli;Madeleine Gibescu","doi":"10.1109/TNSM.2025.3647853","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3647853","url":null,"abstract":"Stochastic disruptions such as flash events arising from benign traffic bursts and switch thermal fluctuations are major contributors to intermittent service degradation in software-defined industrial networks. These events violate IEC 61850-derived quality of service requirements and user-defined service-level agreements, hindering the reliable and timely delivery of control, monitoring, and best-effort traffic in IEC 61400-25-compliant wind power plants. Failure to maintain these requirements often results in delayed or lost control signals, reduced operational efficiency, and increased risk of wind turbine generator downtime. To address these challenges, this study proposes a threshold-triggered Deep Q-Network self-healing agent that autonomically detects, analyzes, and mitigates network disruptions while adapting routing behavior and resource allocation in real time. The proposed agent was trained, validated, and tested on an emulated tri-clustered switch network deployed in a cloud-based proof-of-concept testbed. Simulation results show that the proposed agent improves disruption recovery performance by 53.84% compared to a baseline shortest-path and load-balanced routing approach, and outperforms state-of-the-art methods, including the Adaptive Network-based Fuzzy Inference System by 13.1% and the Deep Q-Network and Traffic Prediction-based Routing Optimization method by 21.5%, in a super-spine leaf data-plane architecture. Additionally, the agent maintains switch thermal stability by proactively initiating external rack cooling when required. These findings highlight the potential of deep reinforcement learning in building resilience in software-defined industrial networks deployed in mission-critical, time-sensitive application scenarios.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1297-1311"},"PeriodicalIF":5.4,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11313855","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Strategy-Proof Cost-Sharing Mechanism for Dynamic Adaptability Service in Vehicle Computing 车辆计算中动态适应性服务的无策略成本分担机制
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-22 DOI: 10.1109/TNSM.2025.3646778
Xi Liu;Jun Liu;Weidong Li
Vehicle computing has emerged as a promising paradigm for delivering time-sensitive computing services to Internet of Things applications. Intelligent vehicles (IVs) offer onboard computing and sensing capabilities for delivering a wide range of services. In this paper, we propose a dynamic adaptability service model that leverages the swift mobility of vehicles to adjust the distribution of IVs to users’ dynamically changing locations. There are two types of areas in our model: the user area and the parking area. The former is where services are provided, while the latter serves as the preparation zone for backup IVs. IVs in the parking area are dispatched to service areas, where existing vehicle resources cannot meet user demand, and they return to the parking area after delivering the service. Multiple users share sensing resources, and our model allocates the costs among them. To ensure strategy-proofness, we introduce the concepts of no additional cost and allocation stability. We propose a strategy-proof cost-sharing mechanism for dynamic adaptability service. The proposed mechanism achieves no positive transfers, voluntary participation, individual rationality, consumer sovereignty, budget balance, no additional costs, and allocation stability. Moreover, the proposed mechanism’s approximation performance is analyzed. We further use comprehensive simulations to verify the effectiveness and efficiency of the proposed mechanism.
车载计算已经成为向物联网应用程序提供时间敏感计算服务的一个有前途的范例。智能车辆(IVs)提供车载计算和传感能力,以提供广泛的服务。本文提出了一种动态适应性服务模型,该模型利用车辆的快速移动性,根据用户动态变化的位置来调整车辆的分布。在我们的模型中有两种类型的区域:用户区和停车区。前者是提供服务的地方,后者是备份iv的准备区。停车区域的IVs被派往现有车辆资源无法满足用户需求的服务区,完成服务后返回停车区域。多个用户共享感知资源,我们的模型在用户之间分配成本。为了确保策略的正确性,我们引入了无额外成本和分配稳定性的概念。提出了一种不受策略约束的动态适应性服务成本分担机制。该机制没有实现正向转移、自愿参与、个人理性、消费者主权、预算平衡、无额外成本和分配稳定。此外,还分析了该机构的逼近性能。我们进一步使用综合仿真来验证所提出机制的有效性和效率。
{"title":"Strategy-Proof Cost-Sharing Mechanism for Dynamic Adaptability Service in Vehicle Computing","authors":"Xi Liu;Jun Liu;Weidong Li","doi":"10.1109/TNSM.2025.3646778","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3646778","url":null,"abstract":"Vehicle computing has emerged as a promising paradigm for delivering time-sensitive computing services to Internet of Things applications. Intelligent vehicles (IVs) offer onboard computing and sensing capabilities for delivering a wide range of services. In this paper, we propose a dynamic adaptability service model that leverages the swift mobility of vehicles to adjust the distribution of IVs to users’ dynamically changing locations. There are two types of areas in our model: the user area and the parking area. The former is where services are provided, while the latter serves as the preparation zone for backup IVs. IVs in the parking area are dispatched to service areas, where existing vehicle resources cannot meet user demand, and they return to the parking area after delivering the service. Multiple users share sensing resources, and our model allocates the costs among them. To ensure strategy-proofness, we introduce the concepts of no additional cost and allocation stability. We propose a strategy-proof cost-sharing mechanism for dynamic adaptability service. The proposed mechanism achieves no positive transfers, voluntary participation, individual rationality, consumer sovereignty, budget balance, no additional costs, and allocation stability. Moreover, the proposed mechanism’s approximation performance is analyzed. We further use comprehensive simulations to verify the effectiveness and efficiency of the proposed mechanism.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1942-1959"},"PeriodicalIF":5.4,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network Slicing in MEC-Based RANs With Nonlinear Cost Rate Functions 基于mec的具有非线性代价率函数的局域网网络切片
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-19 DOI: 10.1109/TNSM.2025.3646478
Jiahe Xu;Jing Fu;Bige Yang;Zengfu Wang;Jingjin Wu;Xinyu Wang;Moshe Zukerman
This paper addresses network slicing in a large-scale Multi-Access Edge Computing (MEC)-enabled Radio Access Network (RAN) comprising heterogeneous edge nodes with varying computing and storage resource capacities. These resources are dynamically allocated to slice requests and released when the service of a slice request is completed. Our objective is to optimize the resource allocation for each admitted arriving slice request, considering its demands for computing and storage resources, to maximize the long-run average Earning Before Interest and Taxes (EBIT) of the MEC slicing system. We formulate the optimization problem as a Restless Multi-Armed Bandit (RMAB)-based resource allocation problem with a nonlinear cost rate function. To solve this, we introduce a new policy called Prioritizing-the-Future-Approximated earning per request (PFA) where for each admitted slice request, we always prioritize the allocation of the resource combination that gives the highest achievable earning, considering the future effects of this allocation. PFA is designed to be scalable and applicable to large-scale networks. We numerically demonstrate the superior performance of PFA in maximizing long-run average EBIT through simulations, comparing it with two baseline policies, at various cases of parameter values. Moreover, our findings offer insights for network operators in resource allocation policy selection.
本文讨论了大规模多访问边缘计算(MEC)无线接入网(RAN)中的网络切片,该网络由具有不同计算和存储资源容量的异构边缘节点组成。这些资源被动态地分配给片请求,并在片请求的服务完成时释放。我们的目标是优化每个被允许到达的切片请求的资源分配,考虑其对计算和存储资源的需求,以最大化MEC切片系统的长期平均息税前收益(EBIT)。我们将优化问题表述为一个具有非线性成本率函数的基于不动多臂强盗(RMAB)的资源分配问题。为了解决这个问题,我们引入了一种新的策略,称为优先考虑每个请求的未来近似收益(PFA),其中对于每个被接受的切片请求,我们总是优先考虑可实现最高收益的资源组合的分配,同时考虑到这种分配的未来影响。PFA具有可扩展性,适用于大规模网络。我们通过模拟在数值上证明了PFA在最大化长期平均息税前利润方面的优越性能,并将其与两种基线策略在各种参数值情况下进行了比较。此外,我们的研究结果为网络运营商的资源配置策略选择提供了见解。
{"title":"Network Slicing in MEC-Based RANs With Nonlinear Cost Rate Functions","authors":"Jiahe Xu;Jing Fu;Bige Yang;Zengfu Wang;Jingjin Wu;Xinyu Wang;Moshe Zukerman","doi":"10.1109/TNSM.2025.3646478","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3646478","url":null,"abstract":"This paper addresses network slicing in a large-scale Multi-Access Edge Computing (MEC)-enabled Radio Access Network (RAN) comprising heterogeneous edge nodes with varying computing and storage resource capacities. These resources are dynamically allocated to slice requests and released when the service of a slice request is completed. Our objective is to optimize the resource allocation for each admitted arriving slice request, considering its demands for computing and storage resources, to maximize the long-run average Earning Before Interest and Taxes (EBIT) of the MEC slicing system. We formulate the optimization problem as a Restless Multi-Armed Bandit (RMAB)-based resource allocation problem with a nonlinear cost rate function. To solve this, we introduce a new policy called Prioritizing-the-Future-Approximated earning per request (PFA) where for each admitted slice request, we always prioritize the allocation of the resource combination that gives the highest achievable earning, considering the future effects of this allocation. PFA is designed to be scalable and applicable to large-scale networks. We numerically demonstrate the superior performance of PFA in maximizing long-run average EBIT through simulations, comparing it with two baseline policies, at various cases of parameter values. Moreover, our findings offer insights for network operators in resource allocation policy selection.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1989-2005"},"PeriodicalIF":5.4,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Traffic Camouflage Against Network Analyzers via Deep Reinforcement Learning 通过深度强化学习对网络分析器进行在线流量伪装
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-19 DOI: 10.1109/TNSM.2025.3646259
Wenhao Li;Jie Chen;Zhaoxuan Li;Shuai Wang;Huamin Jin;Xiao-Yu Zhang
Traffic analysis plays a pivotal role in network management. However, despite the prevalence of encryption, attackers are still able to deduce privacy elements such as user behavior and OS identification through advanced learning-based methods that exploit side-channel features. Existing defense strategies, which manipulate feature distribution to evade traffic analyzers, are often hampered by the need for impractical decoder deployment across all routes in symmetric framework methods. Moreover, reversing feature distribution modifications to real-time traffic, especially through dummy packet crafting or padding, is a complex task. In response to these challenges, we propose Veil, a novel and practical defender designed to protect live connections against encrypted network traffic analyzers. Leveraging an asymmetric deployment structure, Veil is capable of reconstructing live streams at the packet-block level, thereby allowing for seamless deployment on any connection node while enforcing transmission constraints. By employing a traffic-customized DQN framework, Veil not only reverses statistical feature perturbations back to the traffic space but also directs the distribution towards a target class. Extensive experiments conducted on real-world datasets validate the efficacy of Veil in efficiently evading analyzers in both targeted and untargeted modes, outperforming existing defense mechanisms. Notably, Veil addresses the key issues of impractical decoder deployment and complex real-time traffic manipulation, offering a more viable solution for network traffic privacy protection. The source code is publicly available at https://github.com/SecTeamPolaris/Veil, facilitating further research and application in the field of network security.
流量分析在网络管理中起着举足轻重的作用。然而,尽管加密盛行,攻击者仍然能够通过利用侧信道特性的高级基于学习的方法推断出隐私元素,例如用户行为和操作系统识别。现有的防御策略,通过操纵特征分布来逃避流量分析,经常受到需要在对称框架方法中跨所有路由部署不切实际的解码器的阻碍。此外,逆转实时流量的特征分布修改,特别是通过虚拟数据包制作或填充,是一项复杂的任务。为了应对这些挑战,我们提出了Veil,这是一种新颖实用的防御器,旨在保护实时连接免受加密网络流量分析器的攻击。利用非对称部署结构,Veil能够在包块级别重构实时流,从而允许在任何连接节点上无缝部署,同时强制传输约束。通过采用流量定制的DQN框架,Veil不仅将统计特征扰动逆转回流量空间,而且还将分布指向目标类。在真实世界数据集上进行的大量实验验证了Veil在靶向和非靶向模式下有效逃避分析器的功效,优于现有的防御机制。值得注意的是,Veil解决了解码器部署不切实际和复杂的实时流量操纵的关键问题,为网络流量隐私保护提供了更可行的解决方案。源代码可在https://github.com/SecTeamPolaris/Veil上公开获取,以促进在网络安全领域的进一步研究和应用。
{"title":"Online Traffic Camouflage Against Network Analyzers via Deep Reinforcement Learning","authors":"Wenhao Li;Jie Chen;Zhaoxuan Li;Shuai Wang;Huamin Jin;Xiao-Yu Zhang","doi":"10.1109/TNSM.2025.3646259","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3646259","url":null,"abstract":"Traffic analysis plays a pivotal role in network management. However, despite the prevalence of encryption, attackers are still able to deduce privacy elements such as user behavior and OS identification through advanced learning-based methods that exploit side-channel features. Existing defense strategies, which manipulate feature distribution to evade traffic analyzers, are often hampered by the need for impractical decoder deployment across all routes in symmetric framework methods. Moreover, reversing feature distribution modifications to real-time traffic, especially through dummy packet crafting or padding, is a complex task. In response to these challenges, we propose <monospace>Veil</monospace>, a novel and practical defender designed to protect live connections against encrypted network traffic analyzers. Leveraging an asymmetric deployment structure, <monospace>Veil</monospace> is capable of reconstructing live streams at the packet-block level, thereby allowing for seamless deployment on any connection node while enforcing transmission constraints. By employing a traffic-customized DQN framework, <monospace>Veil</monospace> not only reverses statistical feature perturbations back to the traffic space but also directs the distribution towards a target class. Extensive experiments conducted on real-world datasets validate the efficacy of <monospace>Veil</monospace> in efficiently evading analyzers in both targeted and untargeted modes, outperforming existing defense mechanisms. Notably, <monospace>Veil</monospace> addresses the key issues of impractical decoder deployment and complex real-time traffic manipulation, offering a more viable solution for network traffic privacy protection. The source code is publicly available at <uri>https://github.com/SecTeamPolaris/Veil</uri>, facilitating further research and application in the field of network security.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1927-1941"},"PeriodicalIF":5.4,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GAN4RM: A CWGAN-Based Framework for Radio Maps Generation in Real Cellular Networks GAN4RM:真实蜂窝网络中基于cwgan的无线地图生成框架
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-18 DOI: 10.1109/TNSM.2025.3645305
Lei Zhang;Wanting Su;Qin Ni;Jiawangnan Lu;Bin Chen
With the evolution of mobile networks towards Artificial Intelligence as a Service (AIaaS), generative radio maps not only need to reflect the signal strength distribution in specific areas, but also possess the capability of proactive prediction. However, due to the rapid updates in urban infrastructure and the network iterations, crafting radio maps in complex urban environments represents a substantial challenge. In this paper, a multi-output framework for generating radio maps in real multi-building scenarios is proposed, based on Reference Signal Received Power (RSRP) and Reference Signal Received Quality (RSRQ) extracted from actual urban and suburban Measurement Reports (MRs). Specifically, An image encoding method integrating environmental features and base station system information is designed, while considering the sector antenna characteristics in actual communication environments. Then, a multi-output Conditional Wasserstein Generative Adversarial Network (CWGAN) is constructed for image conversion, and the radio maps are generated by learning the mapping from environmental & system information to RSRP & RSRQ radio maps, on the basis of image encoding that incorporates the physical laws of radio propagation. By calculating the priority of communication link gains at receiving points, it provides generative networks with reliable theoretical basis and conditional information, for serving cells and first neighboring cells. Experimental results show that the root mean square errors (RMSE) of the proposed method for RSRP / RSRQ of serving and neighboring cells are 1.7821 / 2.2251 and 0.8108 / 1.5121, which demonstrates the proposed method outperforms the baseline results. Simultaneously radio maps generation endows the cellular network with a certain “prophetic” capability, significantly enhancing the live service experience.
随着移动网络向人工智能即服务(AIaaS)的发展,生成式无线地图不仅需要反映特定区域的信号强度分布,还需要具备主动预测的能力。然而,由于城市基础设施的快速更新和网络的迭代,在复杂的城市环境中制作无线电地图是一项重大挑战。本文提出了一种基于参考信号接收功率(RSRP)和参考信号接收质量(RSRQ)的多输出框架,用于在实际多建筑场景下生成无线电地图。具体而言,在考虑实际通信环境扇形天线特性的同时,设计了一种综合环境特征和基站系统信息的图像编码方法。然后,构建多输出条件Wasserstein生成对抗网络(CWGAN)进行图像转换,在结合无线电传播物理规律的图像编码基础上,通过学习环境和系统信息到RSRP和RSRQ无线电地图的映射,生成无线电地图。通过计算接收点通信链路增益的优先级,为服务小区和第一相邻小区的生成网络提供可靠的理论依据和条件信息。实验结果表明,所提方法对服务单元和相邻单元的RSRP / RSRQ的均方根误差(RMSE)分别为1.7821 / 2.2251和0.8108 / 1.5121,表明所提方法优于基线结果。同时,无线地图的生成赋予了蜂窝网络一定的“预见性”能力,显著提升了现场服务体验。
{"title":"GAN4RM: A CWGAN-Based Framework for Radio Maps Generation in Real Cellular Networks","authors":"Lei Zhang;Wanting Su;Qin Ni;Jiawangnan Lu;Bin Chen","doi":"10.1109/TNSM.2025.3645305","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3645305","url":null,"abstract":"With the evolution of mobile networks towards Artificial Intelligence as a Service (AIaaS), generative radio maps not only need to reflect the signal strength distribution in specific areas, but also possess the capability of proactive prediction. However, due to the rapid updates in urban infrastructure and the network iterations, crafting radio maps in complex urban environments represents a substantial challenge. In this paper, a multi-output framework for generating radio maps in real multi-building scenarios is proposed, based on Reference Signal Received Power (RSRP) and Reference Signal Received Quality (RSRQ) extracted from actual urban and suburban Measurement Reports (MRs). Specifically, An image encoding method integrating environmental features and base station system information is designed, while considering the sector antenna characteristics in actual communication environments. Then, a multi-output Conditional Wasserstein Generative Adversarial Network (CWGAN) is constructed for image conversion, and the radio maps are generated by learning the mapping from environmental & system information to RSRP & RSRQ radio maps, on the basis of image encoding that incorporates the physical laws of radio propagation. By calculating the priority of communication link gains at receiving points, it provides generative networks with reliable theoretical basis and conditional information, for serving cells and first neighboring cells. Experimental results show that the root mean square errors (RMSE) of the proposed method for RSRP / RSRQ of serving and neighboring cells are 1.7821 / 2.2251 and 0.8108 / 1.5121, which demonstrates the proposed method outperforms the baseline results. Simultaneously radio maps generation endows the cellular network with a certain “prophetic” capability, significantly enhancing the live service experience.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1329-1341"},"PeriodicalIF":5.4,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segment Routing Header (SRH)-Aware Traffic Engineering in Hybrid IP/SRv6 Networks With Deep Reinforcement Learning 基于深度强化学习的混合IP/SRv6网络中SRH感知流量工程
IF 5.4 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-18 DOI: 10.1109/TNSM.2025.3645463
Shuyi Liu;Yuang Chen;Zhengze Li;Fangyu Zhang;Hancheng Lu;Xiaobo Guo;Lizhe Liu
Segment Routing over IPv6 (SRv6) gives operators explicit path control and alleviates network congestion, making it a compelling technique for traffic engineering (TE). Yet two practical hurdles slow adoption. First, a one-shot upgrade of every traditional device is prohibitively expensive, so operators must prioritize which devices to upgrade. Second, the Segment Routing Header (SRH) increases packet size; if TE algorithms ignore this overhead, they will underestimate link load and may cause congestion in practice. We address both challenges with DRL-TE, an algorithm that couples deep reinforcement learning (DRL) with a lightweight local search (LS) step to minimize the network’s maximum link utilization (MLU). DRL-TE first identifies the smallest set of critical devices whose upgrade yields the largest drop in MLU, enabling hybrid IP/SRv6 networks to approach optimal performance with minimal investment. It then computes SRH-aware routes, and the DRL agent, augmented by a fast LS refinement, rapidly reduces MLU even under traffic variation. Experiments on an 11-node hardware testbed and three larger simulated topologies show that upgrading about 30% of devices allows DRL-TE to match fully upgraded networks and reduce MLU by up to 34% compared with existing algorithms. DRL-TE also maintains high performance under link failures and traffic variations, offering a cost-effective and robust path toward incremental SRv6 deployment.
IPv6分段路由(SRv6)为运营商提供了明确的路径控制,减轻了网络拥塞,使其成为交通工程(TE)的一项引人注目的技术。然而,有两个实际障碍阻碍了采用。首先,对每个传统设备进行一次性升级是非常昂贵的,因此运营商必须优先考虑升级哪些设备。第二,段路由头(SRH)增加包的大小;如果TE算法忽略这个开销,它们将低估链路负载,并可能在实践中导致拥塞。我们通过DRL- te解决了这两个挑战,DRL- te是一种将深度强化学习(DRL)与轻量级本地搜索(LS)步骤相结合的算法,以最小化网络的最大链路利用率(MLU)。DRL-TE首先识别出最小的关键设备集,这些设备的升级产生最大的MLU下降,使混合IP/SRv6网络能够以最小的投资接近最佳性能。然后计算srh感知路由,DRL代理通过快速LS细化增强,即使在流量变化的情况下也能快速减少MLU。在11节点硬件测试平台和三个更大的模拟拓扑上进行的实验表明,升级约30%的设备使DRL-TE能够匹配完全升级的网络,与现有算法相比,MLU最多可减少34%。DRL-TE还在链路故障和流量变化下保持高性能,为增量SRv6部署提供了经济有效且稳健的途径。
{"title":"Segment Routing Header (SRH)-Aware Traffic Engineering in Hybrid IP/SRv6 Networks With Deep Reinforcement Learning","authors":"Shuyi Liu;Yuang Chen;Zhengze Li;Fangyu Zhang;Hancheng Lu;Xiaobo Guo;Lizhe Liu","doi":"10.1109/TNSM.2025.3645463","DOIUrl":"https://doi.org/10.1109/TNSM.2025.3645463","url":null,"abstract":"Segment Routing over IPv6 (SRv6) gives operators explicit path control and alleviates network congestion, making it a compelling technique for traffic engineering (TE). Yet two practical hurdles slow adoption. First, a one-shot upgrade of every traditional device is prohibitively expensive, so operators must prioritize which devices to upgrade. Second, the Segment Routing Header (SRH) increases packet size; if TE algorithms ignore this overhead, they will underestimate link load and may cause congestion in practice. We address both challenges with DRL-TE, an algorithm that couples deep reinforcement learning (DRL) with a lightweight local search (LS) step to minimize the network’s maximum link utilization (MLU). DRL-TE first identifies the smallest set of critical devices whose upgrade yields the largest drop in MLU, enabling hybrid IP/SRv6 networks to approach optimal performance with minimal investment. It then computes SRH-aware routes, and the DRL agent, augmented by a fast LS refinement, rapidly reduces MLU even under traffic variation. Experiments on an 11-node hardware testbed and three larger simulated topologies show that upgrading about 30% of devices allows DRL-TE to match fully upgraded networks and reduce MLU by up to 34% compared with existing algorithms. DRL-TE also maintains high performance under link failures and traffic variations, offering a cost-effective and robust path toward incremental SRv6 deployment.","PeriodicalId":13423,"journal":{"name":"IEEE Transactions on Network and Service Management","volume":"23 ","pages":"1260-1275"},"PeriodicalIF":5.4,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Network and Service Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1