FTG: Score-based black-box watermarking by fragile trigger generation for deep model integrity verification

Journal of Information and Intelligence Pub Date : 2024-01-01 DOI:10.1016/j.jiixd.2023.10.006

Heng Yin , Zhaoxia Yin , Zhenzhe Gao , Hang Su , Xinpeng Zhang , Bin Luo

{"title":"FTG: Score-based black-box watermarking by fragile trigger generation for deep model integrity verification","authors":"Heng Yin , Zhaoxia Yin , Zhenzhe Gao , Hang Su , Xinpeng Zhang , Bin Luo","doi":"10.1016/j.jiixd.2023.10.006","DOIUrl":null,"url":null,"abstract":"<div><p>Deep neural networks (DNNs) are widely used in real-world applications, thanks to their exceptional performance in image recognition. However, their vulnerability to attacks, such as Trojan and data poison, can compromise the integrity and stability of DNN applications. Therefore, it is crucial to verify the integrity of DNN models to ensure their security. Previous research on model watermarking for integrity detection has encountered the issue of overexposure of model parameters during embedding and extraction of the watermark. To address this problem, we propose a novel score-based black-box DNN fragile watermarking framework called fragile trigger generation (FTG). The FTG framework only requires the prediction probability distribution of the final output of the classifier during the watermarking process. It generates different fragile samples as the trigger, based on the classification prediction probability of the target classifier and a specified prediction probability mask to watermark it. Different prediction probability masks can promote the generation of fragile samples in corresponding distribution types. The whole watermarking process does not affect the performance of the target classifier. When verifying the watermarking information, the FTG only needs to compare the prediction results of the model on the samples with the previous label. As a result, the required model parameter information is reduced, and the FTG only needs a few samples to detect slight modifications in the model. Experimental results demonstrate the effectiveness of our proposed method and show its superiority over related work. The FTG framework provides a robust solution for verifying the integrity of DNN models, and its effectiveness in detecting slight modifications makes it a valuable tool for ensuring the security and stability of DNN applications.</p></div>","PeriodicalId":100790,"journal":{"name":"Journal of Information and Intelligence","volume":"2 1","pages":"Pages 28-41"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949715923000641/pdfft?md5=60f402130fb47c84b855a467ea72516c&pid=1-s2.0-S2949715923000641-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information and Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949715923000641","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deep neural networks (DNNs) are widely used in real-world applications, thanks to their exceptional performance in image recognition. However, their vulnerability to attacks, such as Trojan and data poison, can compromise the integrity and stability of DNN applications. Therefore, it is crucial to verify the integrity of DNN models to ensure their security. Previous research on model watermarking for integrity detection has encountered the issue of overexposure of model parameters during embedding and extraction of the watermark. To address this problem, we propose a novel score-based black-box DNN fragile watermarking framework called fragile trigger generation (FTG). The FTG framework only requires the prediction probability distribution of the final output of the classifier during the watermarking process. It generates different fragile samples as the trigger, based on the classification prediction probability of the target classifier and a specified prediction probability mask to watermark it. Different prediction probability masks can promote the generation of fragile samples in corresponding distribution types. The whole watermarking process does not affect the performance of the target classifier. When verifying the watermarking information, the FTG only needs to compare the prediction results of the model on the samples with the previous label. As a result, the required model parameter information is reduced, and the FTG only needs a few samples to detect slight modifications in the model. Experimental results demonstrate the effectiveness of our proposed method and show its superiority over related work. The FTG framework provides a robust solution for verifying the integrity of DNN models, and its effectiveness in detecting slight modifications makes it a valuable tool for ensuring the security and stability of DNN applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

FTG：通过脆弱触发器生成基于分数的黑盒水印，用于深度模型完整性验证

深度神经网络（DNN）因其在图像识别方面的卓越性能而被广泛应用于现实世界。然而，它们易受木马和数据中毒等攻击的影响，会损害 DNN 应用程序的完整性和稳定性。因此，验证 DNN 模型的完整性以确保其安全性至关重要。以往针对完整性检测的模型水印研究遇到了水印嵌入和提取过程中模型参数过度暴露的问题。为解决这一问题，我们提出了一种新颖的基于分数的黑盒 DNN 脆弱水印框架，称为脆弱触发生成（FTG）。FTG 框架在水印处理过程中只需要分类器最终输出的预测概率分布。它根据目标分类器的分类预测概率和指定的预测概率掩码，生成不同的易损样本作为触发器，对其进行水印处理。不同的预测概率掩码可促进生成相应分布类型的易损样本。整个水印过程不会影响目标分类器的性能。在验证水印信息时，FTG 只需比较模型对样本的预测结果与之前的标签。因此，所需的模型参数信息减少了，FTG 只需要几个样本就能检测到模型的细微变化。实验结果证明了我们提出的方法的有效性，并显示出其优于相关工作。FTG 框架为验证 DNN 模型的完整性提供了一个稳健的解决方案，它在检测轻微修改方面的有效性使其成为确保 DNN 应用安全性和稳定性的重要工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Information and Intelligence

自引率

0.00%

发文量

期刊最新文献

Editorial Board Editorial Board Boosting brain-computer interface performance through cognitive training: A brain-centric approach Hand-aware graph convolution network for skeleton-based sign language recognition Composite fixed-length ordered features with index-of-max transformation for high-performing and secure palmprint template protection