Non-interactive (t, n)-Incidence Counting from Differentially Private Indicator Vectors

Proceedings of the 3rd ACM on International Workshop on Security And Privacy Analytics Pub Date : 2017-03-24 DOI:10.1145/3041008.3041017

Mohammad Alaggan, M. Cunche, M. Minier

{"title":"Non-interactive (t, n)-Incidence Counting from Differentially Private Indicator Vectors","authors":"Mohammad Alaggan, M. Cunche, M. Minier","doi":"10.1145/3041008.3041017","DOIUrl":null,"url":null,"abstract":"We present a novel non-interactive (t,n)-incidence count estimation for indicator vectors ensuring Differential Privacy. Given one or two differentially private indicator vectors, estimating the distinct count of elements in each and their intersection cardinality (equivalently, their inner product) have been studied in the literature, along with other extensions for estimating the cardinality set intersection in case the elements are hashed prior to insertion. The core contribution behind all these studies was to address the problem of estimating the Hamming weight (the number of bits set to one) of a bit vector from its differentially private version, and in the case of inner product and set intersection, estimating the number of positions which are jointly set to one in both bit vectors. We develop the most general case of estimating the number of positions which are set to one in exactly t out of n bit vectors (this quantity is denoted the (t,n)-incidence count), given access only to the differentially private version of those bit vectors. This means that if each bit vector belongs to a different owner, each can locally sanitize their bit vector prior to sharing it, hence the non-interactive nature of our algorithm. Our main contribution is a novel algorithm that simultaneously estimates the (t,n)-incidence counts for all t'{0,...,n}. We provide upper and lower bounds to the estimation error. Our lower bound is achieved by generalizing the limit of two-party differential privacy into $n$-party differential privacy, which is a contribution of independent interest. In particular we prove a lower bound on the additive error that must be incurred by any n-wise inner product of $n$ mutually differentially-private bit vectors. Our results are very general and are not limited to differentially private bit vectors. They should apply to a large class of sanitization mechanism of bit vectors which depend on flipping the bits with a constant probability. Some potential applications for our technique include physical mobility analytics, call-detail-record analysis, and similarity metrics computation.","PeriodicalId":137012,"journal":{"name":"Proceedings of the 3rd ACM on International Workshop on Security And Privacy Analytics","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd ACM on International Workshop on Security And Privacy Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3041008.3041017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

We present a novel non-interactive (t,n)-incidence count estimation for indicator vectors ensuring Differential Privacy. Given one or two differentially private indicator vectors, estimating the distinct count of elements in each and their intersection cardinality (equivalently, their inner product) have been studied in the literature, along with other extensions for estimating the cardinality set intersection in case the elements are hashed prior to insertion. The core contribution behind all these studies was to address the problem of estimating the Hamming weight (the number of bits set to one) of a bit vector from its differentially private version, and in the case of inner product and set intersection, estimating the number of positions which are jointly set to one in both bit vectors. We develop the most general case of estimating the number of positions which are set to one in exactly t out of n bit vectors (this quantity is denoted the (t,n)-incidence count), given access only to the differentially private version of those bit vectors. This means that if each bit vector belongs to a different owner, each can locally sanitize their bit vector prior to sharing it, hence the non-interactive nature of our algorithm. Our main contribution is a novel algorithm that simultaneously estimates the (t,n)-incidence counts for all t'{0,...,n}. We provide upper and lower bounds to the estimation error. Our lower bound is achieved by generalizing the limit of two-party differential privacy into $n$-party differential privacy, which is a contribution of independent interest. In particular we prove a lower bound on the additive error that must be incurred by any n-wise inner product of $n$ mutually differentially-private bit vectors. Our results are very general and are not limited to differentially private bit vectors. They should apply to a large class of sanitization mechanism of bit vectors which depend on flipping the bits with a constant probability. Some potential applications for our technique include physical mobility analytics, call-detail-record analysis, and similarity metrics computation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

非交互的(t, n)-从差分私有指标向量的发生率计数

我们提出了一种新的非交互式(t,n)事件计数估计，以确保差分隐私。给定一个或两个差分私有指示向量，在文献中已经研究了每个指示向量中元素的不同计数及其相交基数(即它们的内积)，以及在元素在插入之前被散列的情况下估计基数集交集的其他扩展。所有这些研究背后的核心贡献是解决了从其差分私有版本估计位向量的汉明权值(设置为1的比特数)的问题，以及在内积和集合相交的情况下，估计两个位向量中共同设置为1的位置的数量。我们开发了最一般的情况来估计被设置为1的位置的数量，正好在n个位向量中的t中(这个数量表示为(t,n)-发生率计数)，给定只能访问这些位向量的差分私有版本。这意味着，如果每个位向量属于不同的所有者，每个人都可以在共享之前对其位向量进行局部消毒，因此我们的算法具有非交互式的性质。我们的主要贡献是一种新的算法，可以同时估计所有t'{0，…，n}的(t,n)-发生率计数。给出了估计误差的上界和下界。我们的下界是通过将两方差分隐私的极限推广到$n$方差分隐私来实现的，这是对独立兴趣的贡献。特别地，我们证明了加性误差的下界，该误差是由任意n个互微分私有位向量的n向内积引起的。我们的结果是非常普遍的，并不局限于微分私有位向量。它们应该适用于一类依赖于以恒定概率翻转比特的位向量的处理机制。我们的技术的一些潜在应用包括身体移动分析、呼叫详细记录分析和相似度量计算。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 3rd ACM on International Workshop on Security And Privacy Analytics

自引率

0.00%

发文量