Natural Scene Statistics for Detecting Adversarial Examples in Deep Neural Networks

Anouar Kherchouche, Sid Ahmed Fezza, W. Hamidouche, O. Déforges

2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP), published 2020-09-21
DOI: 10.1109/MMSP48831.2020.9287056
Citations: 7
Abstract
Deep neural networks (DNNs) have been adopted in a wide spectrum of applications. However, it has been demonstrated that they are vulnerable to adversarial examples (AEs): carefully crafted perturbations added to a clean input image that cause the DNN to classify it incorrectly. It is therefore imperative to develop methods for detecting AEs so that DNNs can be defended. In this paper, we propose to characterize adversarial perturbations through the use of natural scene statistics. We demonstrate that these statistical properties are altered by the presence of adversarial perturbations. Based on this finding, we design a classifier that exploits these scene statistics to determine whether an input is adversarial or not. The proposed method has been evaluated against four prominent adversarial attacks and on three standard datasets. The experimental results show that the proposed method achieves high detection accuracy, even against strong attacks, while maintaining a low false-positive rate.
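To make the idea concrete, the sketch below shows one way a natural-scene-statistics detector of this kind can be assembled. It is a minimal illustration, not the authors' exact pipeline: it computes BRISQUE-style mean subtracted contrast normalized (MSCN) coefficients, summarizes them with a few moment statistics, and trains a binary SVM to separate clean from adversarial images. The nss_features helper, the specific moments used, and the choice of an RBF-kernel SVM are illustrative assumptions, not details taken from the paper.

# Minimal sketch of an NSS-based adversarial-example detector.
# Assumption: adversarial noise disturbs the statistics of MSCN
# coefficients that natural images exhibit, so a simple classifier
# over NSS features can flag suspicious inputs.
import numpy as np
from scipy.ndimage import gaussian_filter
from sklearn.svm import SVC

def mscn_coefficients(image, sigma=7.0 / 6.0, eps=1e-8):
    """Mean Subtracted Contrast Normalized coefficients of a grayscale image."""
    image = image.astype(np.float64)
    mu = gaussian_filter(image, sigma)                      # local mean
    var = gaussian_filter(image * image, sigma) - mu * mu   # local variance
    return (image - mu) / (np.sqrt(np.maximum(var, 0.0)) + eps)

def nss_features(image):
    """Moment summaries of the MSCN map used as NSS features.
    The paper's feature set is richer (e.g., fitted distribution
    parameters); these low-order moments are a simplification."""
    m = mscn_coefficients(image)
    c = m - m.mean()
    return np.array([m.mean(), m.var(),
                     (c ** 3).mean(),   # third central moment (skew proxy)
                     (c ** 4).mean()])  # fourth central moment (kurtosis proxy)

def train_detector(clean_images, adv_images):
    """Fit a binary SVM on NSS features: label 0 = clean, 1 = adversarial."""
    images = list(clean_images) + list(adv_images)
    X = np.stack([nss_features(im) for im in images])
    y = np.concatenate([np.zeros(len(clean_images)), np.ones(len(adv_images))])
    return SVC(kernel="rbf").fit(X, y)

def is_adversarial(detector, image):
    """Flag a single grayscale image as adversarial (True) or clean (False)."""
    return bool(detector.predict(nss_features(image).reshape(1, -1))[0])

The design intuition is that MSCN coefficients of natural images follow a characteristic near-Gaussian distribution, and additive adversarial perturbations measurably shift even these low-order statistics, which is what the classifier learns to pick up.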