{"title":"A Bayesian analysis of co-training algorithm with insufficient views","authors":"Luca Didaci, F. Roli","doi":"10.1109/ISSPA.2012.6310456","DOIUrl":null,"url":null,"abstract":"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.