Why Stacked Models Perform Effective Collective Classification

2008 Eighth IEEE International Conference on Data Mining Pub Date : 2008-12-15 DOI:10.1109/ICDM.2008.126

A. Fast, David D. Jensen

引用次数: 33

Abstract

Collective classification techniques jointly infer all class labels of a relational data set, using the inferences about one class label to influence inferences about related class labels. Kou and Cohen recently introduced an efficient relational model based on stacking that, despite its simplicity, has equivalent accuracy to more sophisticated joint inference approaches. Using experiments on both real and synthetic data, we show that the primary cause for the performance of the stacked model is the reduction in bias from learning the stacked model on inferred labels rather than true labels. The reduction in variance due to conditional inference also contributes to the effect but it is not as strong. In addition, we show that the performance of the joint inference and stacked learners can be attributed to an implicit weighting of local and relational features at learning time.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

为什么堆叠模型能有效地进行集体分类

集体分类技术联合推断一个关系数据集的所有类标签，利用一个类标签的推断来影响相关类标签的推断。Kou和Cohen最近介绍了一种基于堆叠的高效关系模型，尽管它很简单，但与更复杂的联合推理方法具有相同的准确性。通过对真实和合成数据的实验，我们表明堆叠模型性能的主要原因是通过在推断标签而不是真实标签上学习堆叠模型来减少偏差。由于条件推断而导致的方差减少也有助于这种效果，但它没有那么强。此外，我们还证明了联合推理和堆叠学习器的性能可以归因于学习时局部特征和关系特征的隐式加权。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2008 Eighth IEEE International Conference on Data Mining

自引率

0.00%

发文量