EdgeInfer: Robust Truth Inference under Data Poisoning Attack

Farnaz Tahmasebian, Li Xiong, Mani Sotoodeh, V. Sunderam
{"title":"EdgeInfer: Robust Truth Inference under Data Poisoning Attack","authors":"Farnaz Tahmasebian, Li Xiong, Mani Sotoodeh, V. Sunderam","doi":"10.1109/SMDS49396.2020.00013","DOIUrl":null,"url":null,"abstract":"As crowdsourcing is becoming more widely used for annotating data from a large group of users, attackers have strong incentives to manipulate the system. Deriving the true answer of tasks in crowdsourcing systems based on user-provided data is susceptible to data poisoning attacks, whereby malicious users may intentionally or strategically report incorrect information to mislead the system into inferring the wrong truth for a set of tasks. Recent work has proposed several attacks on the crowdsourcing systems and showed that existing truth inference methods may be vulnerable to such attacks. In this paper, we propose solutions to enhance the robustness of existing truth inference methods. Our solutions base on 1) detecting and augmenting the answers for the boundary tasks in which users could not reach a strong consensus and hence are subjective to potential manipulation, and 2) enhancing inference method with a stronger prior. We empirically evaluate these defense mechanisms by designing attack scenarios that aim to decrease the accuracy of the system. Experiments show that our method is effective and significantly improves the robustness of the system under attack.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Smart Data Services (SMDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMDS49396.2020.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

As crowdsourcing is becoming more widely used for annotating data from a large group of users, attackers have strong incentives to manipulate the system. Deriving the true answer of tasks in crowdsourcing systems based on user-provided data is susceptible to data poisoning attacks, whereby malicious users may intentionally or strategically report incorrect information to mislead the system into inferring the wrong truth for a set of tasks. Recent work has proposed several attacks on the crowdsourcing systems and showed that existing truth inference methods may be vulnerable to such attacks. In this paper, we propose solutions to enhance the robustness of existing truth inference methods. Our solutions base on 1) detecting and augmenting the answers for the boundary tasks in which users could not reach a strong consensus and hence are subjective to potential manipulation, and 2) enhancing inference method with a stronger prior. We empirically evaluate these defense mechanisms by designing attack scenarios that aim to decrease the accuracy of the system. Experiments show that our method is effective and significantly improves the robustness of the system under attack.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
EdgeInfer:数据中毒攻击下的鲁棒真值推断
随着众包越来越广泛地用于注释来自一大群用户的数据,攻击者有强烈的动机操纵系统。在众包系统中,基于用户提供的数据得出任务的真实答案容易受到数据中毒攻击,恶意用户可能有意或有策略地报告不正确的信息,误导系统对一组任务推断出错误的真相。最近的研究提出了几种针对众包系统的攻击,并表明现有的真相推理方法可能容易受到这种攻击。在本文中,我们提出了增强现有真值推理方法的鲁棒性的解决方案。我们的解决方案基于1)检测和增强用户无法达成强烈共识的边界任务的答案,因此对潜在的操纵是主观的;2)用更强的先验增强推理方法。我们通过设计旨在降低系统准确性的攻击场景来经验地评估这些防御机制。实验表明,该方法是有效的,显著提高了系统在攻击下的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
S3QLRDF: Property Table Partitioning Scheme for Distributed SPARQL Querying of large-scale RDF data BC-Sketch: A Simple Reversible Sketch for Detecting Network Anomalies 2020 IEEE International Conference on Smart Data Services (SMDS) SMDS 2020 Scalable and Hybrid Ensemble-Based Causality Discovery Stargazer: A Deep Learning Approach for Estimating the Performance of Edge- Based Clustering Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1