Concentration inequalities for the empirical distribution of discrete distributions: beyond the method of types

IF 1.6 | Zone 4, Mathematics | Q2 Mathematics, Applied | Information and Inference: A Journal of the IMA | Pub Date: 2020-12-16 | DOI: 10.1093/imaiai/iaz025

Jay Mardia, Jiantao Jiao, Ervin Tánczos, R. Nowak, T. Weissman

Citations: 29

Abstract

We study concentration inequalities for the Kullback–Leibler (KL) divergence between the empirical distribution and the true distribution. Applying a recursion technique, we improve over the method of types bound uniformly in all regimes of sample size n and alphabet size k, and the improvement becomes more significant when k is large. We discuss applications of our results to obtaining tighter concentration inequalities for L1 deviations of the empirical distribution from the true distribution, and to the difference between concentration around the expectation and concentration around zero. We also obtain asymptotically tight bounds on the variance of the KL divergence between the empirical and true distributions, and show that these bounds behave quantitatively differently depending on whether the sample size is small or large relative to the alphabet size.
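As a rough illustration of the quantities in the abstract, the sketch below computes the KL divergence between an empirical distribution and the true distribution, and compares a simulated tail probability with the classical method of types bound, P(D(P̂ₙ‖P) ≥ ε) ≤ C(n+k−1, k−1)·e^(−nε). This is the baseline the paper improves on, not the paper's improved bound, which the abstract does not state. All function names here are hypothetical and chosen for illustration only.

```python
import math
import random
from collections import Counter


def kl_divergence(p_hat, p):
    """D(p_hat || p) in nats, using the convention 0 * log(0 / q) = 0."""
    return sum(a * math.log(a / b) for a, b in zip(p_hat, p) if a > 0)


def empirical_distribution(samples, k):
    """Relative frequencies of the symbols 0..k-1 in `samples`."""
    counts = Counter(samples)
    n = len(samples)
    return [counts.get(i, 0) / n for i in range(k)]


def method_of_types_bound(n, k, eps):
    """Classical bound C(n+k-1, k-1) * exp(-n * eps), capped at 1.

    Computed in log space so that large n and k do not overflow.
    """
    log_num_types = math.lgamma(n + k) - math.lgamma(k) - math.lgamma(n + 1)
    return math.exp(min(0.0, log_num_types - n * eps))


def simulate_tail(p, n, eps, trials=2000, seed=0):
    """Monte Carlo estimate of P(D(P_hat_n || P) >= eps) for i.i.d. samples."""
    rng = random.Random(seed)
    k = len(p)
    hits = 0
    for _ in range(trials):
        samples = rng.choices(range(k), weights=p, k=n)
        if kl_divergence(empirical_distribution(samples, k), p) >= eps:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    p = [1 / 3, 1 / 3, 1 / 3]  # assumed example: uniform over k = 3 symbols
    n, eps = 50, 0.2
    print("simulated tail:", simulate_tail(p, n, eps))
    print("method of types bound:", method_of_types_bound(n, len(p), eps))
```

The number of types C(n+k−1, k−1) grows polynomially in n for fixed k, so the bound is loosest when k is large relative to n, which is the regime where the abstract says the improvement matters most.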


