Azimuth: Systematic Error Analysis for Text Classification

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing Pub Date : 2022-12-16 DOI:10.48550/arXiv.2212.08216

Gabrielle Gauthier Melançon, Orlando Marquez Ayala, Lindsay D. Brin, Chris Tyler, Frederic Branchaud-Charron, Joseph Marinier, Karine Grande, Dieu-Thu Le

{"title":"Azimuth: Systematic Error Analysis for Text Classification","authors":"Gabrielle Gauthier Melançon, Orlando Marquez Ayala, Lindsay D. Brin, Chris Tyler, Frederic Branchaud-Charron, Joseph Marinier, Karine Grande, Dieu-Thu Le","doi":"10.48550/arXiv.2212.08216","DOIUrl":null,"url":null,"abstract":"We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis more systematic, we propose an approach comprising dataset analysis and model quality assessment, which Azimuth facilitates. We aim to help AI practitioners discover and address areas where the model does not generalize by leveraging and integrating a range of ML techniques, such as saliency maps, similarity, uncertainty, and behavioral analyses, all in one tool. Our code and documentation are available at github.com/servicenow/azimuth.","PeriodicalId":74540,"journal":{"name":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","volume":"71 1","pages":"298-310"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.08216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis more systematic, we propose an approach comprising dataset analysis and model quality assessment, which Azimuth facilitates. We aim to help AI practitioners discover and address areas where the model does not generalize by leveraging and integrating a range of ML techniques, such as saliency maps, similarity, uncertainty, and behavioral analyses, all in one tool. Our code and documentation are available at github.com/servicenow/azimuth.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

方位角:文本分类系统误差分析

我们介绍了Azimuth，一个开源和易于使用的工具，用于执行文本分类的错误分析。与机器学习开发周期的其他阶段(如模型训练和超参数调优)相比，错误分析阶段的过程和工具不太成熟。然而，这一阶段对于开发可靠和值得信赖的人工智能系统至关重要。为了使误差分析更加系统化，我们提出了一种包含数据集分析和模型质量评估的方法。我们的目标是通过利用和集成一系列ML技术，如显著性地图、相似性、不确定性和行为分析，帮助人工智能从业者发现和解决模型不能泛化的领域。我们的代码和文档可在github.com/servicenow/azimuth上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

自引率

0.00%

发文量