Eliciting and Learning with Soft Labels from Every Annotator

Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing Pub Date : 2022-07-02 DOI:10.48550/arXiv.2207.00810

K. M. Collins, Umang Bhatt, Adrian Weller

{"title":"Eliciting and Learning with Soft Labels from Every Annotator","authors":"K. M. Collins, Umang Bhatt, Adrian Weller","doi":"10.48550/arXiv.2207.00810","DOIUrl":null,"url":null,"abstract":"The labels used to train machine learning (ML) models are of paramount importance. Typically for ML classification tasks, datasets contain hard labels, yet learning using soft labels has been shown to yield benefits for model generalization, robustness, and calibration. Earlier work found success in forming soft labels from multiple annotators' hard labels; however, this approach may not converge to the best labels and necessitates many annotators, which can be expensive and inefficient. We focus on efficiently eliciting soft labels from individual annotators. We collect and release a dataset of soft labels (which we call CIFAR-10S) over the CIFAR-10 test set via a crowdsourcing study (N=248). We demonstrate that learning with our labels achieves comparable model performance to prior approaches while requiring far fewer annotators -- albeit with significant temporal costs per elicitation. Our elicitation methodology therefore shows nuanced promise in enabling practitioners to enjoy the benefits of improved model performance and reliability with fewer annotators, and serves as a guide for future dataset curators on the benefits of leveraging richer information, such as categorical uncertainty, from individual annotators.","PeriodicalId":87339,"journal":{"name":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","volume":"13 1","pages":"40-52"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2207.00810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

Abstract

The labels used to train machine learning (ML) models are of paramount importance. Typically for ML classification tasks, datasets contain hard labels, yet learning using soft labels has been shown to yield benefits for model generalization, robustness, and calibration. Earlier work found success in forming soft labels from multiple annotators' hard labels; however, this approach may not converge to the best labels and necessitates many annotators, which can be expensive and inefficient. We focus on efficiently eliciting soft labels from individual annotators. We collect and release a dataset of soft labels (which we call CIFAR-10S) over the CIFAR-10 test set via a crowdsourcing study (N=248). We demonstrate that learning with our labels achieves comparable model performance to prior approaches while requiring far fewer annotators -- albeit with significant temporal costs per elicitation. Our elicitation methodology therefore shows nuanced promise in enabling practitioners to enjoy the benefits of improved model performance and reliability with fewer annotators, and serves as a guide for future dataset curators on the benefits of leveraging richer information, such as categorical uncertainty, from individual annotators.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用每个注释者的软标签引出和学习

用于训练机器学习(ML)模型的标签至关重要。通常对于ML分类任务，数据集包含硬标签，但使用软标签学习已被证明对模型泛化、鲁棒性和校准有好处。早期的研究发现，从多个注释者的硬标签中形成软标签是成功的;然而，这种方法可能不会收敛到最好的标签，并且需要许多注释器，这可能是昂贵和低效的。我们专注于从单个注释器中有效地提取软标签。我们通过众包研究(N=248)在CIFAR-10测试集上收集并发布了软标签数据集(我们称之为CIFAR-10S)。我们证明，使用我们的标签学习可以达到与之前的方法相当的模型性能，同时需要更少的注释器——尽管每次引出的时间成本很大。因此，我们的启发方法显示出细微的承诺，使从业者能够使用更少的注释者享受改进的模型性能和可靠性的好处，并作为未来数据集管理员利用更丰富信息(如分类不确定性)的好处的指南，来自单个注释者。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing

自引率

0.00%

发文量