{"title":"TopDown-KACA:一种高效的k-匿名局部编码算法","authors":"Juan Yu, Jianmin Han, Jianmin Chen, Zanzhu Xia","doi":"10.1109/GRC.2009.5255024","DOIUrl":null,"url":null,"abstract":"K-anonymity is an effective model for protecting privacy while publishing data. KACA algorithm is a typical generalization algorithm for k-anonymity, which can generate small information loss, but its efficiency is low, especially when dataset is large. Another generalization algorithm, topDown, has high efficiency but generates heavy information loss. In this paper, we propose an efficient generalization algorithm for k-anonymity, called topDown-KACA, which combines the topDown algorithm with the KACA algorithm. The idea of topDown-KACA algorithm is to partition the whole dataset into some subsets by topDown algorithm at first, and then k-anonymize these subsets by KACA algorithm respectively. Experiments show that the proposed algorithm is more efficient than KACA algorithm with similar information loss, and generates less information loss than topDown algorithm with similar execution time.","PeriodicalId":388774,"journal":{"name":"2009 IEEE International Conference on Granular Computing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"TopDown-KACA: An efficient local-recoding algorithm for k-anonymity\",\"authors\":\"Juan Yu, Jianmin Han, Jianmin Chen, Zanzhu Xia\",\"doi\":\"10.1109/GRC.2009.5255024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"K-anonymity is an effective model for protecting privacy while publishing data. KACA algorithm is a typical generalization algorithm for k-anonymity, which can generate small information loss, but its efficiency is low, especially when dataset is large. Another generalization algorithm, topDown, has high efficiency but generates heavy information loss. In this paper, we propose an efficient generalization algorithm for k-anonymity, called topDown-KACA, which combines the topDown algorithm with the KACA algorithm. The idea of topDown-KACA algorithm is to partition the whole dataset into some subsets by topDown algorithm at first, and then k-anonymize these subsets by KACA algorithm respectively. Experiments show that the proposed algorithm is more efficient than KACA algorithm with similar information loss, and generates less information loss than topDown algorithm with similar execution time.\",\"PeriodicalId\":388774,\"journal\":{\"name\":\"2009 IEEE International Conference on Granular Computing\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 IEEE International Conference on Granular Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GRC.2009.5255024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Granular Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRC.2009.5255024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
摘要
k -匿名是一种有效的数据发布隐私保护模式。KACA算法是一种典型的k-匿名泛化算法,其产生的信息损失较小,但效率较低,特别是在数据集较大的情况下。另一种泛化算法topDown效率高,但信息损失大。本文将topDown算法与KACA算法相结合,提出了一种高效的k-匿名泛化算法topDown-KACA。topDown-KACA算法的思想是首先通过topDown算法将整个数据集划分为若干子集,然后分别通过KACA算法对这些子集进行k匿名化。实验表明,该算法在信息损失相似的情况下比KACA算法效率更高,在执行时间相似的情况下比topDown算法产生的信息损失更小。
TopDown-KACA: An efficient local-recoding algorithm for k-anonymity
K-anonymity is an effective model for protecting privacy while publishing data. KACA algorithm is a typical generalization algorithm for k-anonymity, which can generate small information loss, but its efficiency is low, especially when dataset is large. Another generalization algorithm, topDown, has high efficiency but generates heavy information loss. In this paper, we propose an efficient generalization algorithm for k-anonymity, called topDown-KACA, which combines the topDown algorithm with the KACA algorithm. The idea of topDown-KACA algorithm is to partition the whole dataset into some subsets by topDown algorithm at first, and then k-anonymize these subsets by KACA algorithm respectively. Experiments show that the proposed algorithm is more efficient than KACA algorithm with similar information loss, and generates less information loss than topDown algorithm with similar execution time.