基于灵敏度的多维混合泛化匿名化

2018 Thirteenth International Conference on Digital Information Management (ICDIM) Pub Date : 2018-09-01 DOI:10.1109/ICDIM.2018.8847000

Esther Gachanga, Michael W. Kimwele, L. Nderu

{"title":"基于灵敏度的多维混合泛化匿名化","authors":"Esther Gachanga, Michael W. Kimwele, L. Nderu","doi":"10.1109/ICDIM.2018.8847000","DOIUrl":null,"url":null,"abstract":"Sensitive information about individuals must not be revealed when sharing data, but a data set must remain useful for research and analysis when published. Anonymization methods have been considered as a possible solution for protecting the privacy of individuals. This is achieved by transforming data in a way that guarantees a certain degree of protection from re-identification threats. In the process, it is important to ensure that the quality of data is preserved. K-anonymity is the most commonly used approach for the anonymization of published datasets. However, the approach causes a decline in data utility. The key challenge for data publishers is how to anonymize data without causing a significant decline in data utility. The paper addresses this challenge by proposing a multidimensional mixed generalization. We conduct experiments with mixed generalization. Our results show that mixed generalization preserves the quality of data for classification.","PeriodicalId":120884,"journal":{"name":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","volume":"17 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sensitivity Based Anonymization with Multi-dimensional Mixed Generalization\",\"authors\":\"Esther Gachanga, Michael W. Kimwele, L. Nderu\",\"doi\":\"10.1109/ICDIM.2018.8847000\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sensitive information about individuals must not be revealed when sharing data, but a data set must remain useful for research and analysis when published. Anonymization methods have been considered as a possible solution for protecting the privacy of individuals. This is achieved by transforming data in a way that guarantees a certain degree of protection from re-identification threats. In the process, it is important to ensure that the quality of data is preserved. K-anonymity is the most commonly used approach for the anonymization of published datasets. However, the approach causes a decline in data utility. The key challenge for data publishers is how to anonymize data without causing a significant decline in data utility. The paper addresses this challenge by proposing a multidimensional mixed generalization. We conduct experiments with mixed generalization. Our results show that mixed generalization preserves the quality of data for classification.\",\"PeriodicalId\":120884,\"journal\":{\"name\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"volume\":\"17 8\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDIM.2018.8847000\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2018.8847000","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在共享数据时不能泄露个人的敏感信息，但数据集在发布时必须对研究和分析有用。匿名化方法被认为是保护个人隐私的一种可能的解决方案。这是通过以某种方式转换数据来实现的，这种方式保证了一定程度的保护，以防止再次识别威胁。在这个过程中，确保数据的质量是很重要的。k -匿名是发表数据集匿名化最常用的方法。然而，这种方法会导致数据效用的下降。数据发布者面临的主要挑战是如何在不导致数据效用显著下降的情况下对数据进行匿名化。本文通过提出一个多维混合泛化来解决这一挑战。我们进行混合泛化实验。我们的研究结果表明，混合泛化保留了分类数据的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Sensitivity Based Anonymization with Multi-dimensional Mixed Generalization

Sensitive information about individuals must not be revealed when sharing data, but a data set must remain useful for research and analysis when published. Anonymization methods have been considered as a possible solution for protecting the privacy of individuals. This is achieved by transforming data in a way that guarantees a certain degree of protection from re-identification threats. In the process, it is important to ensure that the quality of data is preserved. K-anonymity is the most commonly used approach for the anonymization of published datasets. However, the approach causes a decline in data utility. The key challenge for data publishers is how to anonymize data without causing a significant decline in data utility. The paper addresses this challenge by proposing a multidimensional mixed generalization. We conduct experiments with mixed generalization. Our results show that mixed generalization preserves the quality of data for classification.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 Thirteenth International Conference on Digital Information Management (ICDIM)

自引率

0.00%

发文量

期刊最新文献

Attention Based Neural Architecture for Rumor Detection with Author Context Awareness Urdu Text Classification: A comparative study using machine learning techniques The Effect of Different Type of Information on Trust in Facebook Page Towards scalable standards for web content usability Ontology Coverage Tool and Document Browser for Learning Material Exploration