Why BDeu? Regular Bayesian network structure learning with discrete and continuous variables

IF 5.4 2区数学 Q1 STATISTICS & PROBABILITY Wiley Interdisciplinary Reviews-Computational Statistics Pub Date : 2021-03-01 DOI:10.1002/wics.1554

J. Suzuki

引用次数: 2

Abstract

We consider the problem of Bayesian network structure learning (BNSL) from data. In particular, we focus on the score‐based approach rather than the constraint‐based approach and address what score we should use for the purpose. The Bayesian Dirichlet equivalent uniform (BDeu) has been mainly used within the community of BNs (not outside of it). We know that for any model selection and any data, the fitter the data to a model, the more complex the model, and vice versa. However, recently, it was proven that BDeu violates regularity, which means that it does not balance the two factors, although it works satisfactorily (consistently) when the sample size is infinitely large. In addition, we claim that the merit of using the regular scores over the BDeu is that tighter bounds of pruning rules are available when we consider efficient BNSL. Finally, using experiments, we compare the performances of the procedures to examine the claim. (This paper is for review and gives a unified viewpoint from the recent progress on the topic.)

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

为什么选择BDU？具有离散和连续变量的正则贝叶斯网络结构学习

研究了基于数据的贝叶斯网络结构学习问题。特别是，我们专注于基于分数的方法，而不是基于约束的方法，并解决我们应该使用什么分数来达到目的。贝叶斯狄利克雷等效均匀性(BDeu)主要在bn社区内使用(而不是在其外)。我们知道，对于任何模型选择和任何数据，数据越适合模型，模型就越复杂，反之亦然。然而，最近，人们证明BDeu违反了规律性，这意味着它不能平衡这两个因素，尽管它在样本量无限大时工作得令人满意(一致)。此外，我们声称使用正则分数优于BDeu的优点是，当我们考虑有效的BNSL时，可以使用更严格的修剪规则边界。最后，通过实验，我们比较了程序的性能来检验索赔。(本文仅供回顾，并从该主题的最新进展中给出一个统一的观点。)

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Wiley Interdisciplinary Reviews-Computational Statistics STATISTICS & PROBABILITY-

CiteScore

6.20

自引率

0.00%

发文量