Online learning in decentralized multi-user spectrum access with synchronized explorations

MILCOM 2012 - 2012 IEEE Military Communications Conference. Pub Date: 2012-10-01. DOI: 10.1109/MILCOM.2012.6415693

Cem Tekin, M. Liu

Citations: 19

Abstract

In this paper, we consider decentralized multi-user online learning of unused spectrum bands, formulated as an opportunistic spectrum access (OSA) problem. A set of M secondary users exploits the spectrum opportunities in K channels. We develop a distributed algorithm with which the secondary users learn the optimal allocation with logarithmic regret. Because logarithmic regret is order optimal, the algorithm converges to the optimal allocation at the fastest achievable rate, up to order. In a more general framework, our algorithm gives an order-optimal solution to the decentralized multi-player multi-armed bandit problem with general reward functions.
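To make the terms "optimal allocation" and "regret" concrete, the following is a minimal sketch under assumptions that are not taken from the paper: each of the M users picks one of K channels, and a hypothetical function `mean_reward(k, n)` gives the expected reward of a user on channel k when n users share it. The function names and the brute-force search are illustrative only; this is not the authors' distributed algorithm, which must learn these quantities online without central coordination.

```python
from itertools import product


def total_reward(allocation, mean_reward):
    """Expected sum reward of an allocation (a tuple: user index -> channel).

    mean_reward(k, n) is a hypothetical model of the expected reward each
    user receives on channel k when n users occupy it.
    """
    counts = {}
    for k in allocation:
        counts[k] = counts.get(k, 0) + 1
    return sum(mean_reward(k, counts[k]) for k in allocation)


def optimal_allocation(M, K, mean_reward):
    """Brute-force search over all K**M allocations (small M and K only)."""
    return max(product(range(K), repeat=M),
               key=lambda a: total_reward(a, mean_reward))


def regret(allocations, M, K, mean_reward):
    """Cumulative expected loss of a sequence of allocations versus the optimum."""
    best = total_reward(optimal_allocation(M, K, mean_reward), mean_reward)
    return sum(best - total_reward(a, mean_reward) for a in allocations)
```

As a usage example, with assumed availability rates `[0.9, 0.5, 0.2]` and equal sharing, `mean_reward = lambda k, n: rates[k] / n`, two users do best on channels 0 and 1, and every round in which both sit on channel 0 adds 0.5 to the regret. An algorithm with logarithmic regret keeps this cumulative loss growing no faster than a constant times log T over T rounds.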


