Overlapping Multi-Bandit Best Arm Identification

2019 IEEE International Symposium on Information Theory (ISIT) Pub Date : 2019-07-01 DOI:10.1109/ISIT.2019.8849327

J. Scarlett, Ilija Bogunovic, V. Cevher

引用次数: 8

Abstract

In the multi-armed bandit literature, the multibandit best-arm identification problem consists of determining each best arm in a number of disjoint groups of arms, with as few total arm pulls as possible. In this paper, we introduce a variant of the multi-bandit problem with overlapping groups, and present two algorithms for this problem based on successive elimination and lower/upper confidence bounds (LUCB). We bound the number of total arm pulls required for high-probability best-arm identification in every group, and we complement these bounds with a near-matching algorithm-independent lower bound.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

重叠多强盗最佳武器识别

在多臂强盗文献中，多臂强盗最佳臂识别问题包括在许多不相交的臂组中确定每个最佳臂，并且总臂拉力尽可能少。本文引入了一种具有重叠群的多盗匪问题的变体，并给出了两种基于逐次消去和上下置信区间(LUCB)的算法。我们限定了每组中高概率最佳手臂识别所需的总手臂拉拔次数，并用一个接近匹配的与算法无关的下界来补充这些边界。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE International Symposium on Information Theory (ISIT)

自引率

0.00%

发文量

期刊最新文献

Gambling and Rényi Divergence Irregular Product Coded Computation for High-Dimensional Matrix Multiplication Error Exponents in Distributed Hypothesis Testing of Correlations Pareto Optimal Schemes in Coded Caching Constrained de Bruijn Codes and their Applications