Interpreting convolutional networks trained on textual data

Reza Marzban, C. Crick
{"title":"Interpreting convolutional networks trained on textual data","authors":"Reza Marzban, C. Crick","doi":"10.5220/0010205901960203","DOIUrl":null,"url":null,"abstract":"There have been many advances in the artificial intelligence field due to the emergence of deep learning. In almost all sub-fields, artificial neural networks have reached or exceeded human-level performance. However, most of the models are not interpretable. As a result, it is hard to trust their decisions, especially in life and death scenarios. In recent years, there has been a movement toward creating explainable artificial intelligence, but most work to date has concentrated on image processing models, as it is easier for humans to perceive visual patterns. There has been little work in other fields like natural language processing. In this paper, we train a convolutional model on textual data and analyze the global logic of the model by studying its filter values. In the end, we find the most important words in our corpus to our models logic and remove the rest (95%). New models trained on just the 5% most important words can achieve the same performance as the original model while reducing training time by more than half. Approaches such as this will help us to understand NLP models, explain their decisions according to their word choices, and improve them by finding blind spots and biases.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Pattern Recognition Applications and Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0010205901960203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

There have been many advances in the artificial intelligence field due to the emergence of deep learning. In almost all sub-fields, artificial neural networks have reached or exceeded human-level performance. However, most of these models are not interpretable, so it is hard to trust their decisions, especially in life-and-death scenarios. In recent years there has been a movement toward creating explainable artificial intelligence, but most work to date has concentrated on image processing models, as it is easier for humans to perceive visual patterns. There has been little work in other fields such as natural language processing. In this paper, we train a convolutional model on textual data and analyze the global logic of the model by studying its filter values. Finally, we identify the words in our corpus that are most important to the model's logic and remove the rest (95%). New models trained on just the 5% most important words achieve the same performance as the original model while reducing training time by more than half. Approaches such as this will help us understand NLP models, explain their decisions according to their word choices, and improve them by finding blind spots and biases.
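To make the described pipeline concrete, the sketch below shows one plausible setup, not the authors' actual code: a Keras-style 1D convolutional text classifier and a simplified proxy for word importance that scores each vocabulary word by the strongest activation any convolutional filter would produce on a window made up of that word alone. All hyperparameters (VOCAB_SIZE, EMB_DIM, MAX_LEN, filter counts) and the helper word_importance are hypothetical choices for illustration.

```python
# A minimal sketch, assuming a Keras-style text CNN; not the paper's exact model.
import numpy as np
from tensorflow.keras import layers, models

VOCAB_SIZE = 20000   # assumed vocabulary size (hypothetical)
EMB_DIM = 100        # assumed embedding dimension (hypothetical)
MAX_LEN = 400        # assumed maximum document length (hypothetical)

# Text CNN: embedding -> 1D convolution -> global max pooling -> classifier.
model = models.Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB_SIZE, EMB_DIM),
    layers.Conv1D(filters=128, kernel_size=3, activation="relu"),
    layers.GlobalMaxPooling1D(),
    layers.Dense(64, activation="relu"),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
# model.fit(x_train, y_train, epochs=5)   # train on the tokenized corpus

def word_importance(model):
    """Score each vocabulary word by the strongest activation any
    convolutional filter would produce on a window consisting of that
    word alone -- a simplified proxy for the paper's filter analysis."""
    emb_layer = next(l for l in model.layers if isinstance(l, layers.Embedding))
    conv_layer = next(l for l in model.layers if isinstance(l, layers.Conv1D))
    emb = emb_layer.get_weights()[0]        # (vocab_size, emb_dim)
    kernel = conv_layer.get_weights()[0]    # (kernel_size, emb_dim, n_filters)
    proj = emb @ kernel.sum(axis=0)         # (vocab_size, n_filters)
    return proj.max(axis=1)                 # one importance score per word

# scores = word_importance(model)
# top_words = np.argsort(scores)[-int(0.05 * VOCAB_SIZE):]  # keep the top 5%
```

Under this reading, a reduced corpus would be built by masking or dropping every token outside top_words and retraining the same architecture on the filtered text.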