基于深度强化学习的不确定性感知加权公平路由器排队

2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE) Pub Date : 2021-12-17 DOI:10.1109/ICECE54449.2021.9674580

Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu

{"title":"基于深度强化学习的不确定性感知加权公平路由器排队","authors":"Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu","doi":"10.1109/ICECE54449.2021.9674580","DOIUrl":null,"url":null,"abstract":"In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.","PeriodicalId":166178,"journal":{"name":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning\",\"authors\":\"Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu\",\"doi\":\"10.1109/ICECE54449.2021.9674580\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.\",\"PeriodicalId\":166178,\"journal\":{\"name\":\"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECE54449.2021.9674580\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECE54449.2021.9674580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在当前的计算机通信网络中，由于业务量的增加而导致的丢包和时延的增加成为实现理想的服务质量(QoS)的瓶颈。加权公平排队可以根据每个报文所关联的SLA (Service Level Agreement)提供差异化的服务。然而，由于对真实路由器的队列使用率、丢包率和时延的测量并不准确，而且真实网络系统的固有特性是总会存在一些不可预测的流量模式，因此现有的WFQ更新方法还可以进一步改进和扩展。本文介绍了一种具有不确定性感知的软行为-评论代理。首先，学习到的权重更新策略是一种最大熵策略，在估计和模型误差下具有鲁棒性。其次，将模型不确定性估计技术引入到智能体中，使其能够检测到在训练期间未见过的新状态，从而便于策略切换框架;该算法显示了将强化学习用于WFQ权值更新的潜力，并通过监测模型的不确定性与现有技术相兼容，使系统更加鲁棒和稳定。通过仿真研究验证了该算法的优越性，为进一步探索提供了良好的方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning

In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)

自引率

0.00%

发文量