基于深度强化学习的不确定性感知加权公平路由器排队

Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu
{"title":"基于深度强化学习的不确定性感知加权公平路由器排队","authors":"Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu","doi":"10.1109/ICECE54449.2021.9674580","DOIUrl":null,"url":null,"abstract":"In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.","PeriodicalId":166178,"journal":{"name":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning\",\"authors\":\"Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu\",\"doi\":\"10.1109/ICECE54449.2021.9674580\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.\",\"PeriodicalId\":166178,\"journal\":{\"name\":\"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECE54449.2021.9674580\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECE54449.2021.9674580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

在当前的计算机通信网络中,由于业务量的增加而导致的丢包和时延的增加成为实现理想的服务质量(QoS)的瓶颈。加权公平排队可以根据每个报文所关联的SLA (Service Level Agreement)提供差异化的服务。然而,由于对真实路由器的队列使用率、丢包率和时延的测量并不准确,而且真实网络系统的固有特性是总会存在一些不可预测的流量模式,因此现有的WFQ更新方法还可以进一步改进和扩展。本文介绍了一种具有不确定性感知的软行为-评论代理。首先,学习到的权重更新策略是一种最大熵策略,在估计和模型误差下具有鲁棒性。其次,将模型不确定性估计技术引入到智能体中,使其能够检测到在训练期间未见过的新状态,从而便于策略切换框架;该算法显示了将强化学习用于WFQ权值更新的潜力,并通过监测模型的不确定性与现有技术相兼容,使系统更加鲁棒和稳定。通过仿真研究验证了该算法的优越性,为进一步探索提供了良好的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning
In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design of Emergency Rescue Command Platform Based on Satellite Mobile Communication System Multi-Dimensional Spectrum Data Denoising Based on Tensor Theory Predicting COVID-19 Severe Patients and Evaluation Method of 3 Stages Severe Level by Machine Learning A Novel Stacking Framework Based On Hybrid of Gradient Boosting-Adaptive Boosting-Multilayer Perceptron for Crash Injury Severity Prediction and Analysis Key Techniques on Unified Identity Authentication in OpenMBEE Integration
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1