Content-Aware Differential Privacy with Conditional Invertible Neural Networks

DeCaF/FAIR@MICCAI Pub Date : 2022-07-29 DOI:10.48550/arXiv.2207.14625

Malte Tölle, U. Köthe, F. André, B. Meder, S. Engelhardt

{"title":"Content-Aware Differential Privacy with Conditional Invertible Neural Networks","authors":"Malte Tölle, U. Köthe, F. André, B. Meder, S. Engelhardt","doi":"10.48550/arXiv.2207.14625","DOIUrl":null,"url":null,"abstract":"Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the simple application of noise infeasible. Invertible Neural Networks (INN) have shown excellent generative performance while still providing the ability to quantify the exact likelihood. Their principle is based on transforming a complicated distribution into a simple one e.g. an image into a spherical Gaussian. We hypothesize that adding noise to the latent space of an INN can enable differentially private image modification. Manipulation of the latent space leads to a modified image while preserving important details. Further, by conditioning the INN on meta-data provided with the dataset we aim at leaving dimensions important for downstream tasks like classification untouched while altering other parts that potentially contain identifying information. We term our method content-aware differential privacy (CADP). We conduct experiments on publicly available benchmarking datasets as well as dedicated medical ones. In addition, we show the generalizability of our method to categorical data. The source code is publicly available at https://github.com/Cardio-AI/CADP.","PeriodicalId":347091,"journal":{"name":"DeCaF/FAIR@MICCAI","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DeCaF/FAIR@MICCAI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2207.14625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the simple application of noise infeasible. Invertible Neural Networks (INN) have shown excellent generative performance while still providing the ability to quantify the exact likelihood. Their principle is based on transforming a complicated distribution into a simple one e.g. an image into a spherical Gaussian. We hypothesize that adding noise to the latent space of an INN can enable differentially private image modification. Manipulation of the latent space leads to a modified image while preserving important details. Further, by conditioning the INN on meta-data provided with the dataset we aim at leaving dimensions important for downstream tasks like classification untouched while altering other parts that potentially contain identifying information. We term our method content-aware differential privacy (CADP). We conduct experiments on publicly available benchmarking datasets as well as dedicated medical ones. In addition, we show the generalizability of our method to categorical data. The source code is publicly available at https://github.com/Cardio-AI/CADP.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于条件可逆神经网络的内容感知差分隐私

差分隐私(DP)已成为保护数据集中个人隐私的黄金标准，它通过在每个数据样本中添加校准噪声来保护个人隐私。虽然分类数据的应用很简单，但它在图像上下文中的可用性受到限制。与分类数据相反，图像的意义固有于相邻像素的空间相关性，使得简单的噪声应用不可行。可逆神经网络(INN)在提供精确似然量化能力的同时，表现出了优异的生成性能。它们的原理是基于将复杂的分布转换成简单的分布，例如将图像转换成球形高斯分布。我们假设在隐空间中加入噪声可以实现差分私有图像修改。对潜在空间的处理可以在保留重要细节的同时修改图像。此外，通过调整与数据集一起提供的元数据上的INN，我们的目标是保留对下游任务(如分类)重要的维度，同时改变可能包含识别信息的其他部分。我们将这种方法称为内容感知差分隐私(content-aware differential privacy, CADP)。我们在公开可用的基准数据集以及专用的医疗数据集上进行实验。此外，我们还展示了我们的方法对分类数据的泛化性。源代码可在https://github.com/Cardio-AI/CADP上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

DeCaF/FAIR@MICCAI

自引率

0.00%

发文量