Privacy and Fairness Analysis in the Post-Processed Differential Privacy Framework

IF 8 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS IEEE Transactions on Information Forensics and Security Pub Date : 2025-01-10 DOI:10.1109/TIFS.2025.3528222

Ying Zhao;Kai Zhang;Longxiang Gao;Jinjun Chen

{"title":"Privacy and Fairness Analysis in the Post-Processed Differential Privacy Framework","authors":"Ying Zhao;Kai Zhang;Longxiang Gao;Jinjun Chen","doi":"10.1109/TIFS.2025.3528222","DOIUrl":null,"url":null,"abstract":"The post-processed Differential Privacy (DP) framework has been routinely adopted to preserve privacy while maintaining important invariant characteristics of datasets in data-release applications such as census data. Typical invariant characteristics include non-negative counts and total population. Subspace DP has been proposed to preserve total population while guaranteeing DP for sub-populations. Non-negativity post-processing has been identified to inherently incur fairness issues. In this work, we study privacy and unfairness (i.e., accuracy disparity) concerns in the post-processed DP framework. On one hand, we propose the post-processed DP framework with both non-negativity and accurate total population as constraints would inadvertently violate privacy guarantee desired by it. Instead, we propose the post-processed subspace DP framework to accurately define privacy guarantees against adversaries. On the other hand, we identify unfairness level is dependent on privacy budget, count sizes as well as their imbalance level via empirical analysis. Particularly concerning is severe unfairness in the setting of strict privacy budgets. We further trace unfairness back to uniform privacy budget setting over different population subgroups. To address this, we propose a varying privacy budget setting method and develop optimization approaches using ternary search and golden ratio search to identify optimal privacy budget ranges that minimize unfairness while maintaining privacy guarantees. Our extensive theoretical and empirical analysis demonstrates the effectiveness of our approaches in addressing severe unfairness issues across different privacy settings and several canonical privacy mechanisms. Using datasets of Australian Census data, Adult dataset, and delinquent children by county and household head education level, we validate both our privacy analysis framework and fairness optimization methods, showing significant reduction in accuracy disparities while maintaining strong privacy guarantees.","PeriodicalId":13492,"journal":{"name":"IEEE Transactions on Information Forensics and Security","volume":"20 ","pages":"2412-2423"},"PeriodicalIF":8.0000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Information Forensics and Security","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10836853/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

The post-processed Differential Privacy (DP) framework has been routinely adopted to preserve privacy while maintaining important invariant characteristics of datasets in data-release applications such as census data. Typical invariant characteristics include non-negative counts and total population. Subspace DP has been proposed to preserve total population while guaranteeing DP for sub-populations. Non-negativity post-processing has been identified to inherently incur fairness issues. In this work, we study privacy and unfairness (i.e., accuracy disparity) concerns in the post-processed DP framework. On one hand, we propose the post-processed DP framework with both non-negativity and accurate total population as constraints would inadvertently violate privacy guarantee desired by it. Instead, we propose the post-processed subspace DP framework to accurately define privacy guarantees against adversaries. On the other hand, we identify unfairness level is dependent on privacy budget, count sizes as well as their imbalance level via empirical analysis. Particularly concerning is severe unfairness in the setting of strict privacy budgets. We further trace unfairness back to uniform privacy budget setting over different population subgroups. To address this, we propose a varying privacy budget setting method and develop optimization approaches using ternary search and golden ratio search to identify optimal privacy budget ranges that minimize unfairness while maintaining privacy guarantees. Our extensive theoretical and empirical analysis demonstrates the effectiveness of our approaches in addressing severe unfairness issues across different privacy settings and several canonical privacy mechanisms. Using datasets of Australian Census data, Adult dataset, and delinquent children by county and household head education level, we validate both our privacy analysis framework and fairness optimization methods, showing significant reduction in accuracy disparities while maintaining strong privacy guarantees.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

后处理差分隐私框架下的隐私与公平分析

在数据发布应用（如人口普查数据）中，通常采用后处理差分隐私（DP）框架来保护隐私，同时保持数据集的重要不变特征。典型的不变量特征包括非负计数和总人口。在保证子种群的DP的同时，提出了子空间DP。非消极性后处理已被确定为固有的公平性问题。在这项工作中，我们研究了后处理DP框架中的隐私和不公平（即准确性差异）问题。一方面，我们提出了具有非负性和准确总人口的后处理DP框架，因为约束会无意中违反其期望的隐私保证。相反，我们提出了后处理子空间DP框架来准确定义针对对手的隐私保证。另一方面，我们通过实证分析确定了不公平程度取决于隐私预算、计数大小及其不平衡程度。尤其令人担忧的是，在制定严格的隐私预算方面存在严重的不公平。我们进一步将不公平追溯到不同人口子组的统一隐私预算设置。为了解决这个问题，我们提出了一种不同的隐私预算设置方法，并开发了使用三元搜索和黄金比例搜索的优化方法，以确定在保持隐私保证的同时最小化不公平的最佳隐私预算范围。我们广泛的理论和实证分析证明了我们的方法在解决不同隐私设置和几种规范隐私机制中的严重不公平问题方面的有效性。使用澳大利亚人口普查数据集、成人数据集以及按县和户主教育程度划分的犯罪儿童数据集，我们验证了我们的隐私分析框架和公平优化方法，在保持强大隐私保障的同时显着减少了准确性差异。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Information Forensics and Security 工程技术-工程：电子与电气

CiteScore

14.40

自引率

7.40%

发文量

234

审稿时长

6.5 months

期刊介绍： The IEEE Transactions on Information Forensics and Security covers the sciences, technologies, and applications relating to information forensics, information security, biometrics, surveillance and systems applications that incorporate these features