{"title":"ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy","authors":"Longfei Pan, Xiaojun Wang","doi":"10.1145/3318299.3318362","DOIUrl":null,"url":null,"abstract":"Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates \"most discriminative\" and \"most representative\" to seek \"worthy\" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3318299.3318362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates "most discriminative" and "most representative" to seek "worthy" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.