A Bayesian analysis of co-training algorithm with insufficient views

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) Pub Date : 2012-07-02 DOI:10.1109/ISSPA.2012.6310456

Luca Didaci, F. Roli

引用次数: 2

Abstract

The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

视图不足情况下协同训练算法的贝叶斯分析

如果一个数据集允许两个不同的特征集(两个视图)表示，则可以应用协同训练算法。然而，它的最优性仅在a)每个视图的充分性和b)给定类的条件独立性的条件下被证明。我们处理条件a)不成立的情况，这在具体应用中经常发生。在这种情况下，协同训练无法收敛到最优贝叶斯分类器，因为添加到训练集中的样本不按照类条件分布分布，即使它们分配的标签是正确的。这些结果有助于更好地理解当类仅在“统计”上可分离时，协同训练算法的行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)

自引率

0.00%

发文量

期刊最新文献

Online mvbf adaptation under diffuse noise environments with mimo based noise pre-filtering Hierarchical scheme for Arabic text recognition Precoder selection and rank adaptation in MIMO-OFDM Head detection using Kinect camera and its application to fall detection Wavelength and code division multiplexing toward diffuse optical imaging