MDL model selection using the ML plug-in code

International Symposium on Information Theory and its Applications. International Symposium on Information Theory and its Applications Pub Date : 2005-01-01 DOI:10.1109/ISIT.2005.1523439

S. D. Rooij, P. Grünwald

引用次数: 0

Abstract

We analyse the behaviour of the ML plug-in code, also known as the Rissanen-Dawid prequential ML code, relative to single parameter exponential families M. If the data are i.i.d. according to an (essentially) arbitrary P, then the redundancy grows at 1/2c log n. We find that, in contrast to other important universal codes such as the 2-part MDL, Shtarkov and Bayesian codes where c = 1, here c equals the ratio between the variance of P and the variance of the element of M that is closest to P in KL-divergence. We show how this behaviour can impair model selection performance in a simple setting in which we select between the Poisson and geometric models

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用ML插件代码进行MDL模型选择

我们分析了ML插件代码的行为，也称为rissanen - david先验ML代码，相对于单参数指数族M.如果数据是根据(本质上)任意P的i.id，那么冗余度以1/2c log n增长。我们发现，与其他重要的通用代码相比，如2部分MDL, Shtarkov和贝叶斯代码，其中c = 1，这里c等于P的方差与k -散度中最接近P的M元素的方差之比。我们在一个简单的设置中展示了这种行为如何损害模型选择性能，其中我们在泊松模型和几何模型之间进行选择

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Symposium on Information Theory and its Applications. International Symposium on Information Theory and its Applications

自引率

0.00%

发文量

期刊最新文献

Rank Preserving Code-based Signature Buddhism and the Religious Other Statistical Inference and Exact Saddle Point Approximations Topological structures on DMC spaces A computer-aided investigation on the fundamental limits of caching