{"title":"Difference based query strategies in active learning","authors":"Dávid Papp, G. Szücs, Zsolt Knoll","doi":"10.1109/SISY47553.2019.9111587","DOIUrl":null,"url":null,"abstract":"In this paper two active learning methods are proposed in the machine learning literature, both of them based on difference calculation idea. One of the new methods is difference sampling query strategy. This strategy calculates a novel difference list and the elements of this list are then able to influence the uncertainty measure of the appropriate unlabelled instance. By taking the ratio of these measures a new informativeness metric is defined, and the aim of the difference sampling strategy is to minimize this ratio. Besides that, expected difference change query strategy was developed using a new metric, the global difference metric for each step. This strategy combines expected model change and uncertainty sampling strategies by taking the expectation of the difference of uncertainty values. The aim of this combined strategy is to query the instance that will most likely result the greatest change in global difference of the next step. The experimental results on image dataset show that both of the difference based sampling query strategies surpass the competitive methods in literature.","PeriodicalId":256922,"journal":{"name":"2019 IEEE 17th International Symposium on Intelligent Systems and Informatics (SISY)","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 17th International Symposium on Intelligent Systems and Informatics (SISY)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SISY47553.2019.9111587","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper two active learning methods are proposed in the machine learning literature, both of them based on difference calculation idea. One of the new methods is difference sampling query strategy. This strategy calculates a novel difference list and the elements of this list are then able to influence the uncertainty measure of the appropriate unlabelled instance. By taking the ratio of these measures a new informativeness metric is defined, and the aim of the difference sampling strategy is to minimize this ratio. Besides that, expected difference change query strategy was developed using a new metric, the global difference metric for each step. This strategy combines expected model change and uncertainty sampling strategies by taking the expectation of the difference of uncertainty values. The aim of this combined strategy is to query the instance that will most likely result the greatest change in global difference of the next step. The experimental results on image dataset show that both of the difference based sampling query strategies surpass the competitive methods in literature.