Combinatorial Sleeping Bandits with Fairness Constraints and Long-Term Non-Availability of Arms
Authors: Vivek Kuchibhotla, P. Harshitha, Divitha Elugoti
Published in: 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 2020-11-05
DOI: 10.1109/ICECA49313.2020.9297371
This paper analyzes long-term non-availability of arms in the combinatorial sleeping bandits problem. The sleeping multi-armed bandit model with fairness constraints is widely used to model real-world systems such as a network switch. Long-term non-availability of an arm is a common occurrence in such settings. When it happens, the queue length used by queue-based fairness techniques grows rapidly and causes system instability. The algorithm proposed in this paper handles this problem while still maintaining the regret bounds and the queue fairness constraints. A better way of estimating fairness, one that takes the long-term non-availability of arms into account, is also proposed. An extension of the UCB algorithm is used to deal with the exploration versus exploitation dilemma. Mathematical proofs of the regret bounds and of feasibility optimality are given at the end.
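The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea under stated assumptions: each arm keeps a virtual queue that tracks its unmet fairness demand, arms are ranked by queue length plus a weighted UCB index, and the queue is frozen while the arm is asleep. The freezing rule, the function names (`ucb_index`, `update_queue`, `select_arms`, `simulate`) and the parameters (`min_rates`, `eta`, `m`, `avail_fn`) are all hypothetical choices made for illustration and are not taken from the paper.

```python
import math
import random


def ucb_index(mean, pulls, t):
    """UCB-style index capped at 1.0; unplayed arms get the maximal value."""
    if pulls == 0:
        return 1.0
    return min(mean + math.sqrt(3.0 * math.log(t) / (2.0 * pulls)), 1.0)


def update_queue(queue, min_rate, available, selected):
    """Virtual-queue update for one arm in one round."""
    if not available:
        # Assumption: no fairness debt accrues while the arm is asleep,
        # so a long outage cannot make the queue grow without bound.
        return queue
    served = 1.0 if selected else 0.0
    return max(queue + min_rate - served, 0.0)


def select_arms(available, queues, means, pulls, t, m, eta):
    """Pick up to m available arms with the largest queue + eta * UCB score."""
    def score(i):
        return queues[i] + eta * ucb_index(means[i], pulls[i], t)
    return sorted(available, key=score, reverse=True)[:m]


def simulate(true_means, min_rates, avail_fn, horizon, m=2, eta=10.0, seed=0):
    """Run the sketch on Bernoulli arms; avail_fn(t) returns the awake arms."""
    rng = random.Random(seed)
    n = len(true_means)
    queues, means, pulls = [0.0] * n, [0.0] * n, [0] * n
    for t in range(1, horizon + 1):
        available = avail_fn(t)
        chosen = set(select_arms(available, queues, means, pulls, t, m, eta))
        for i in range(n):
            queues[i] = update_queue(
                queues[i], min_rates[i], i in available, i in chosen
            )
        for i in chosen:
            reward = 1.0 if rng.random() < true_means[i] else 0.0
            pulls[i] += 1
            means[i] += (reward - means[i]) / pulls[i]  # running average
    return queues, pulls
```

For example, calling `simulate([0.9, 0.5, 0.1], [0.1, 0.1, 0.3], lambda t: {1, 2} if t <= 200 else {0, 1, 2}, horizon=400)` keeps arm 0 asleep for the first 200 rounds. With the frozen-queue rule its queue stays at zero during the outage, whereas an update that adds `min_rate` every round would have accumulated a backlog of 20 by the time the arm wakes up.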