{"title":"Yet another privacy metric for publishing micro-data","authors":"Keith B. Frikken, Yihua Zhang","doi":"10.1145/1456403.1456423","DOIUrl":null,"url":null,"abstract":"Recently many schemes, including <i>k</i>-anonymity [8], <i>l</i>-diversity [6] and <i>t</i>-closeness [5] have been introduced for preserving individual privacy when publishing database tables. Furthermore <i>k</i>-anonymity and <i>l</i>-diversity have been shown to have weaknesses. In this paper, we show that <i>t</i>-closeness also has limitations, more specifically we argue that: i) choosing the correct value for <i>t</i> is difficult, ii) <i>t</i>-closeness does not allow some values of sensitive attributes to be more sensitive than other values, and iii) to prevent certain types of privacy leaks <i>t</i> must be set to such a small value that it produces low-quality published data. In this paper we propose a new privacy metric,(α<sub><i>i</i></sub>, β<sub><i>i</i></sub>)-closeness, that mitigates these problems. We also show how to calculate an optimal release table (in the full domain model) that satisfies (α<sub><i>i</i></sub>, β<sub><i>i</i></sub>)-closeness and we present experimental results that show that the data quality provided by 9α<sub><i>i</i></sub>, β;<sub><i>i</i></sub>),-closeness is higher than <i>t</i>-closeness, <i>k</i>-anonymity, and <i>l</i>-diversity while achieving the same privacy goals.","PeriodicalId":74537,"journal":{"name":"Proceedings of the ACM Workshop on Privacy in the Electronic Society. ACM Workshop on Privacy in the Electronic Society","volume":"94 1","pages":"117-122"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Workshop on Privacy in the Electronic Society. ACM Workshop on Privacy in the Electronic Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1456403.1456423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
Recently many schemes, including k-anonymity [8], l-diversity [6] and t-closeness [5] have been introduced for preserving individual privacy when publishing database tables. Furthermore k-anonymity and l-diversity have been shown to have weaknesses. In this paper, we show that t-closeness also has limitations, more specifically we argue that: i) choosing the correct value for t is difficult, ii) t-closeness does not allow some values of sensitive attributes to be more sensitive than other values, and iii) to prevent certain types of privacy leaks t must be set to such a small value that it produces low-quality published data. In this paper we propose a new privacy metric,(αi, βi)-closeness, that mitigates these problems. We also show how to calculate an optimal release table (in the full domain model) that satisfies (αi, βi)-closeness and we present experimental results that show that the data quality provided by 9αi, β;i),-closeness is higher than t-closeness, k-anonymity, and l-diversity while achieving the same privacy goals.