Protecting ownership rights of ML models using watermarking in the light of adversarial attacks

AI and ethics Pub Date : 2024-02-23 DOI:10.1007/s43681-023-00412-3

Katarzyna Kapusta, Lucas Mattioli, Boussad Addad, Mohammed Lansari

引用次数: 0

Abstract

In this paper, we present and analyze two novel—and seemingly distant—research trends in Machine Learning: ML watermarking and adversarial patches. First, we show how ML watermarking uses specially crafted inputs to provide a proof of model ownership. Second, we demonstrate how an attacker can craft adversarial samples in order to trigger an abnormal behavior in a model and thus perform an ambiguity attack on ML watermarking. Finally, we describe three countermeasures that could be applied in order to prevent ambiguity attacks. We illustrate our works using the example of a binary classification model for welding inspection.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

在对抗性攻击中使用水印保护 ML 模型的所有权

在本文中，我们介绍并分析了机器学习领域的两个新颖且看似遥远的研究趋势：ML 水印和对抗补丁。首先，我们展示了 ML 水印如何使用特制输入来提供模型所有权证明。其次，我们展示了攻击者如何制作对抗样本，以触发模型中的异常行为，从而对 ML 水印进行模糊攻击。最后，我们介绍了可用于防止模糊攻击的三种对策。我们以用于焊接检测的二进制分类模型为例说明我们的工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

AI and ethics

自引率

0.00%

发文量