Training Classifiers with Natural Language Explanations

Proceedings of the conference. Association for Computational Linguistics. Meeting Pub Date : 2018-05-10 DOI:10.18653/v1/P18-1175

Braden Hancock, P. Varma, Stephanie Wang, Martin Bringmann, Percy Liang, C. Ré

引用次数: 140

Abstract

Training accurate classifiers requires many labels, but each label provides only limited information (one bit for binary classification). In this work, we propose BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision. A semantic parser converts these explanations into programmatic labeling functions that generate noisy labels for an arbitrary amount of unlabeled data, which is used to train a classifier. On three relation extraction tasks, we find that users are able to train classifiers with comparable F1 scores from 5-100 faster by providing explanations instead of just labels. Furthermore, given the inherent imperfection of labeling functions, we find that a simple rule-based semantic parser suffices.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用自然语言解释训练分类器

训练准确的分类器需要许多标签，但是每个标签只提供有限的信息(对于二值分类来说是一个比特)。在这项工作中，我们提出了BabbleLabble，这是一个用于训练分类器的框架，其中注释器为每个标记决策提供自然语言解释。语义解析器将这些解释转换为可编程标记函数，该函数为任意数量的未标记数据生成嘈杂的标签，用于训练分类器。在三个关系提取任务中，我们发现用户能够通过提供解释而不仅仅是标签来更快地训练具有5-100分可比较F1分数的分类器。此外，考虑到标记功能的固有缺陷，我们发现一个简单的基于规则的语义解析器就足够了。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the conference. Association for Computational Linguistics. Meeting

自引率

0.00%

发文量