Better Hit the Nail on the Head than Beat around the Bush: Removing Protected Attributes with a Single Projection

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing Pub Date : 2022-12-08 DOI:10.48550/arXiv.2212.04273

P. Haghighatkhah, Antske Fokkens, Pia Sommerauer, B. Speckmann, Kevin Verbeek

{"title":"Better Hit the Nail on the Head than Beat around the Bush: Removing Protected Attributes with a Single Projection","authors":"P. Haghighatkhah, Antske Fokkens, Pia Sommerauer, B. Speckmann, Kevin Verbeek","doi":"10.48550/arXiv.2212.04273","DOIUrl":null,"url":null,"abstract":"Bias elimination and recent probing studies attempt to remove specific information from embedding spaces. Here it is important to remove as much of the target information as possible, while preserving any other information present. INLP is a popular recent method which removes specific information through iterative nullspace projections.Multiple iterations, however, increase the risk that information other than the target is negatively affected.We introduce two methods that find a single targeted projection: Mean Projection (MP, more efficient) and Tukey Median Projection (TMP, with theoretical guarantees). Our comparison between MP and INLP shows that (1) one MP projection removes linear separability based on the target and (2) MP has less impact on the overall space.Further analysis shows that applying random projections after MP leads to the same overall effects on the embedding space as the multiple projections of INLP. Applying one targeted (MP) projection hence is methodologically cleaner than applying multiple (INLP) projections that introduce random effects.","PeriodicalId":74540,"journal":{"name":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","volume":"53 1","pages":"8395-8416"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.04273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Bias elimination and recent probing studies attempt to remove specific information from embedding spaces. Here it is important to remove as much of the target information as possible, while preserving any other information present. INLP is a popular recent method which removes specific information through iterative nullspace projections.Multiple iterations, however, increase the risk that information other than the target is negatively affected.We introduce two methods that find a single targeted projection: Mean Projection (MP, more efficient) and Tukey Median Projection (TMP, with theoretical guarantees). Our comparison between MP and INLP shows that (1) one MP projection removes linear separability based on the target and (2) MP has less impact on the overall space.Further analysis shows that applying random projections after MP leads to the same overall effects on the embedding space as the multiple projections of INLP. Applying one targeted (MP) projection hence is methodologically cleaner than applying multiple (INLP) projections that introduce random effects.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

切中要害，不要拐弯抹角:用一次投射移除受保护的属性

偏见消除和最近的探索研究试图从嵌入空间中去除特定信息。在这里，重要的是要尽可能多地删除目标信息，同时保留现有的任何其他信息。INLP是最近流行的一种通过迭代零空间投影去除特定信息的方法。然而，多次迭代会增加目标以外的信息受到负面影响的风险。我们介绍了寻找单个目标投影的两种方法:均值投影(MP，更有效)和Tukey中值投影(TMP，有理论保证)。我们对MP和INLP的比较表明:(1)一个MP投影消除了基于目标的线性可分性;(2)MP对整体空间的影响较小。进一步分析表明，在MP后应用随机投影对嵌入空间的总体影响与INLP的多次投影相同。因此，应用一个目标(MP)投影比应用引入随机效应的多个(INLP)投影在方法上更干净。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

自引率

0.00%

发文量