Anonymizing collections of tree-structured data

2016 IEEE 32nd International Conference on Data Engineering (ICDE) Pub Date : 2016-05-16 DOI:10.1109/ICDE.2016.7498404

Olga Gkountouna, Manolis Terrovitis

引用次数: 17

Abstract

Collections of real-world data usually have implicit or explicit structural relations. For example, databases link records through foreign keys, and XML documents express associations between different values through syntax. Privacy preservation, until now, has focused either on data with a very simple structure, e.g. relational tables, or on data with very complex structure e.g. social network graphs, but has ignored intermediate cases, which are the most frequent in practice. In this work, we focus on tree structured data. The paper defines k(m;n)-anonymity, which provides protection against identity disclosure and proposes a greedy anonymization heuristic that is able to sanitize large datasets. The algorithm and the quality of the anonymization are evaluated experimentally.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

对树状结构数据集合进行匿名化

实际数据的集合通常具有隐式或显式的结构关系。例如，数据库通过外键链接记录，XML文档通过语法表达不同值之间的关联。到目前为止，隐私保护要么集中在结构非常简单的数据上，比如关系表，要么集中在结构非常复杂的数据上，比如社交网络图，但是忽略了在实践中最常见的中间情况。在这项工作中，我们专注于树结构数据。本文定义了k(m;n)-匿名，它提供了防止身份泄露的保护，并提出了一种能够对大型数据集进行清理的贪婪匿名启发式算法。通过实验对算法和匿名化质量进行了评价。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE 32nd International Conference on Data Engineering (ICDE)

自引率

0.00%

发文量

期刊最新文献

Data profiling SEED: A system for entity exploration and debugging in large-scale knowledge graphs TemProRA: Top-k temporal-probabilistic results analysis Durable graph pattern queries on historical graphs SCouT: Scalable coupled matrix-tensor factorization - algorithm and discoveries