Scalable Learning of Bayesian Network Classifiers

J. Mach. Learn. Res. Pub Date : 2016-01-01 DOI:10.5555/2946645.2946689

Ana M. Martínez, Geoffrey I. Webb, Shenglei Chen, Nayyar Zaidi

引用次数: 57

Abstract

Ever increasing data quantity makes ever more urgent the need for highly scalable learners that have good classification performance. Therefore, an out-of-core learner with excellent time and space complexity, along with high expressivity (that is, capacity to learn very complex multivariate probability distributions) is extremely desirable. This paper presents such a learner. We propose an extension to the k-dependence Bayesian classifier (KDB) that discriminatively selects a sub-model of a full KDB classifier. It requires only one additional pass through the training data, making it a three-pass learner. Our extensive experimental evaluation on 16 large data sets reveals that this out-of-core algorithm achieves competitive classification performance, and substantially better training and classification time than state-of-the-art in-core learners such as random forest and linear and non-linear logistic regression.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

贝叶斯网络分类器的可扩展学习

随着数据量的不断增加，对具有良好分类性能的高可扩展性学习器的需求日益迫切。因此，一个具有出色的时间和空间复杂性，以及高表达能力(即学习非常复杂的多元概率分布的能力)的外核学习器是非常可取的。本文提出了这样一个学习器。我们提出了k依赖贝叶斯分类器(KDB)的扩展，该扩展可以判别地选择完整KDB分类器的子模型。它只需要额外通过一次训练数据，使其成为一个三次学习。我们对16个大型数据集进行了广泛的实验评估，结果表明，这种核心外算法实现了具有竞争力的分类性能，并且比最先进的核心内学习器(如随机森林和线性和非线性逻辑回归)的训练和分类时间要短得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

J. Mach. Learn. Res.

自引率

0.00%

发文量

期刊最新文献

Scalable Computation of Causal Bounds A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning Adaptive False Discovery Rate Control with Privacy Guarantee Fairlearn: Assessing and Improving Fairness of AI Systems Generalization Bounds for Adversarial Contrastive Learning