Combinatorial Sleeping Bandits with Fairness Constraints and Long-Term Non-Availability of Arms

2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA). Pub Date: 2020-11-05. DOI: 10.1109/ICECA49313.2020.9297371

Vivek Kuchibhotla, P. Harshitha, Divitha Elugoti


Citations: 1

Abstract




This paper analyzes long-term non-availability of arms in the combinatorial sleeping bandits problem. The multi-armed sleeping bandit model with fairness constraints is widely used to model real-world systems such as a network switch. Long-term non-availability of an arm is a common occurrence in such systems. When it happens, the queue length used by queuing-based techniques grows rapidly and causes system instability. The algorithm proposed in this paper handles this case while still maintaining the regret bounds and the queue fairness constraints. A better way of estimating fairness, one that takes the long-term non-availability of arms into account, is also proposed. An extension of the UCB algorithm is used to deal with the exploration-versus-exploitation dilemma. Mathematical proofs of the regret bounds and of feasibility optimality are given at the end.
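The abstract does not give the update rules, so the following is only a minimal Python sketch of the general setting it describes: a UCB-style index combined with per-arm virtual fairness queues, where an arm that stays asleep for a long stretch can make its queue grow without bound. Every name here (`select_arms`, `update_queues`, `simulate`, `min_rate`, `eta`, `outage`) and the confidence-bonus constant are assumptions made for illustration. In particular, the `freeze_unavailable` switch is one simple hypothetical remedy (no fairness debt accrues while an arm is asleep) and should not be read as the fairness estimator the paper proposes.

```python
import math
import random


def select_arms(available, queues, means, counts, t, m, eta=1.0):
    """Pick up to m available arms, ranked by queue length + eta * UCB index."""
    def ucb(i):
        if counts[i] == 0:
            return 1.0  # optimistic value for an unplayed arm (rewards lie in [0, 1])
        # Assumed bonus constant; any standard UCB-style bonus would do here.
        bonus = math.sqrt(3.0 * math.log(t) / (2.0 * counts[i]))
        return min(means[i] + bonus, 1.0)

    ranked = sorted(available, key=lambda i: queues[i] + eta * ucb(i), reverse=True)
    return ranked[:m]


def update_queues(queues, available, played, min_rate, freeze_unavailable=True):
    """Virtual queue: debt grows by min_rate each round, shrinks by 1 when played."""
    for i in range(len(queues)):
        if freeze_unavailable and i not in available:
            continue  # assumption: a sleeping arm accrues no fairness debt
        served = 1.0 if i in played else 0.0
        queues[i] = max(queues[i] + min_rate[i] - served, 0.0)


def simulate(horizon, true_means, min_rate, m, outage, freeze_unavailable, seed=0):
    """Run the sketch; arm 0 sleeps for rounds outage[0] <= t < outage[1].

    Returns the largest virtual-queue length observed for arm 0.
    """
    rng = random.Random(seed)
    n = len(true_means)
    queues = [0.0] * n
    means = [0.0] * n
    counts = [0] * n
    max_queue = 0.0
    for t in range(1, horizon + 1):
        available = [i for i in range(n)
                     if not (i == 0 and outage[0] <= t < outage[1])]
        played = select_arms(available, queues, means, counts, t, m)
        for i in played:
            reward = 1.0 if rng.random() < true_means[i] else 0.0  # Bernoulli reward
            counts[i] += 1
            means[i] += (reward - means[i]) / counts[i]  # running average
        update_queues(queues, available, set(played), min_rate, freeze_unavailable)
        max_queue = max(max_queue, queues[0])
    return max_queue
```

Calling `simulate(1000, [0.5, 0.6, 0.7], [0.3, 0.3, 0.3], 1, (100, 600), False)` lets the queue of the sleeping arm grow by `min_rate[0]` in every round of the outage, which is the instability the abstract refers to. Passing `True` as the last argument keeps that queue at its pre-outage level until the arm wakes up.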

