Segmentation Based Backdoor Attack Detection

2020 International Conference on Machine Learning and Cybernetics (ICMLC) Pub Date : 2020-12-02 DOI:10.1109/ICMLC51923.2020.9469037

Natasha Kees, Yaxuan Wang, Yiling Jiang, Fang Lue, P. Chan

{"title":"Segmentation Based Backdoor Attack Detection","authors":"Natasha Kees, Yaxuan Wang, Yiling Jiang, Fang Lue, P. Chan","doi":"10.1109/ICMLC51923.2020.9469037","DOIUrl":null,"url":null,"abstract":"Backdoor attacks have become a serious security concern because of the rising popularity of unverified third party machine learning resources such as datasets, pretrained models, and processors. Pre-trained models and shared datasets have become popular due to the high training requirement of deep learning. This raises a serious security concern since the shared models and datasets may be modified intentionally in order to reduce system efficacy. A backdoor attack is difficult to detect since the embedded adversarial decision rule will only be triggered by a pre-chosen pattern, and the contaminated model behaves normally on benign samples. This paper devises a backdoor attack detection method to identify whether a sample is attacked for image-related applications. The information consistence provided by an image without each segment is considered. The absence of the segment containing a trigger strongly affects the consistence since the trigger dominates the decision. Our proposed method is evaluated empirically to confirm the effectiveness in various settings. As there is no restrictive assumption on the trigger of backdoor attacks, we expect our proposed model is generalizable and can defend against a wider range of modern attacks.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC51923.2020.9469037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Backdoor attacks have become a serious security concern because of the rising popularity of unverified third party machine learning resources such as datasets, pretrained models, and processors. Pre-trained models and shared datasets have become popular due to the high training requirement of deep learning. This raises a serious security concern since the shared models and datasets may be modified intentionally in order to reduce system efficacy. A backdoor attack is difficult to detect since the embedded adversarial decision rule will only be triggered by a pre-chosen pattern, and the contaminated model behaves normally on benign samples. This paper devises a backdoor attack detection method to identify whether a sample is attacked for image-related applications. The information consistence provided by an image without each segment is considered. The absence of the segment containing a trigger strongly affects the consistence since the trigger dominates the decision. Our proposed method is evaluated empirically to confirm the effectiveness in various settings. As there is no restrictive assumption on the trigger of backdoor attacks, we expect our proposed model is generalizable and can defend against a wider range of modern attacks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于分段的后门攻击检测

由于未经验证的第三方机器学习资源(如数据集、预训练模型和处理器)越来越受欢迎，后门攻击已经成为一个严重的安全问题。由于深度学习的高训练要求，预训练模型和共享数据集已成为流行。这引起了严重的安全问题，因为共享模型和数据集可能会被有意地修改，以降低系统效率。由于嵌入的对抗决策规则只会由预先选择的模式触发，并且受污染的模型在良性样本上表现正常，因此后门攻击很难被检测到。本文针对图像相关应用，设计了一种后门攻击检测方法来识别样本是否受到攻击。不考虑每个片段的图像所提供的信息一致性。缺少包含触发器的片段会严重影响一致性，因为触发器主导决策。我们提出的方法进行了经验评估，以确认在各种设置的有效性。由于对后门攻击的触发没有限制性假设，我们希望我们提出的模型具有通用性，并且可以抵御更大范围的现代攻击。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 International Conference on Machine Learning and Cybernetics (ICMLC)

自引率

0.00%

发文量

期刊最新文献

Behavioral Decision Makings: Reconciling Behavioral Economics and Decision Systems Operating System Classification: A Minimalist Approach Research on Hotspot Mining Method of Twitter News Report Based on LDA and Sentiment Analysis Conservative Generalisation for Small Data Analytics –An Extended Lattice Machine Approach ICMLC 2020 Cover Page