基于差分隐私模型的政府数据发布研究

Chunhui Piao, Yajuan Shi, Yunzuo Zhang, Xuehong Jiang
{"title":"基于差分隐私模型的政府数据发布研究","authors":"Chunhui Piao, Yajuan Shi, Yunzuo Zhang, Xuehong Jiang","doi":"10.1109/ICEBE.2017.21","DOIUrl":null,"url":null,"abstract":"With the enforcement of the policies of opening and sharing information resources, protection of citizens' privacy has become a key issue concerned by the government and public. This paper discusses the risk of citizens' privacy disclosure related to government data publishing, and analyzes the main privacy-preserving methods for data publishing. Aiming at the problem that most of the existing privacy protection models for data publishing cannot resist the attacks based on the growing background knowledge, a differential privacy framework for publishing governmental statistical data is established. Based on the framework, a data publishing algorithm using MaxDiff histogram is proposed. Applying differential method, Laplace noises are added to the original dataset, which prevents citizens' privacy from disclosure even if attackers get strong background knowledge. According to the maximum frequency difference, the adjacent data bins are grouped, then the differential privacy histogram with minimum average error can be constructed. Through theoretical analysis and experimental comparison, it is demonstrated that the proposed data publishing algorithm can not only be used to effectively protect citizens' privacy, but also reduce the query sensitivity and improve the utility of the data published.","PeriodicalId":347774,"journal":{"name":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Research on Government Data Publishing Based on Differential Privacy Model\",\"authors\":\"Chunhui Piao, Yajuan Shi, Yunzuo Zhang, Xuehong Jiang\",\"doi\":\"10.1109/ICEBE.2017.21\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the enforcement of the policies of opening and sharing information resources, protection of citizens' privacy has become a key issue concerned by the government and public. This paper discusses the risk of citizens' privacy disclosure related to government data publishing, and analyzes the main privacy-preserving methods for data publishing. Aiming at the problem that most of the existing privacy protection models for data publishing cannot resist the attacks based on the growing background knowledge, a differential privacy framework for publishing governmental statistical data is established. Based on the framework, a data publishing algorithm using MaxDiff histogram is proposed. Applying differential method, Laplace noises are added to the original dataset, which prevents citizens' privacy from disclosure even if attackers get strong background knowledge. According to the maximum frequency difference, the adjacent data bins are grouped, then the differential privacy histogram with minimum average error can be constructed. Through theoretical analysis and experimental comparison, it is demonstrated that the proposed data publishing algorithm can not only be used to effectively protect citizens' privacy, but also reduce the query sensitivity and improve the utility of the data published.\",\"PeriodicalId\":347774,\"journal\":{\"name\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEBE.2017.21\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEBE.2017.21","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

随着信息资源开放和共享政策的实施,公民隐私保护已成为政府和公众关注的重点问题。本文讨论了政府数据公开中公民隐私泄露的风险,分析了政府数据公开中主要的隐私保护方法。针对现有的数据发布隐私保护模型大多无法抵抗基于背景知识不断增长的攻击的问题,建立了政府统计数据发布的差分隐私保护框架。在此基础上,提出了一种基于MaxDiff直方图的数据发布算法。采用差分方法,在原始数据集中加入拉普拉斯噪声,即使攻击者获得较强的背景知识,也不会泄露公民的隐私。根据最大频率差对相邻数据箱进行分组,构造平均误差最小的差分隐私直方图。通过理论分析和实验对比,表明所提出的数据发布算法不仅可以有效地保护公民隐私,还可以降低查询灵敏度,提高发布数据的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Research on Government Data Publishing Based on Differential Privacy Model
With the enforcement of the policies of opening and sharing information resources, protection of citizens' privacy has become a key issue concerned by the government and public. This paper discusses the risk of citizens' privacy disclosure related to government data publishing, and analyzes the main privacy-preserving methods for data publishing. Aiming at the problem that most of the existing privacy protection models for data publishing cannot resist the attacks based on the growing background knowledge, a differential privacy framework for publishing governmental statistical data is established. Based on the framework, a data publishing algorithm using MaxDiff histogram is proposed. Applying differential method, Laplace noises are added to the original dataset, which prevents citizens' privacy from disclosure even if attackers get strong background knowledge. According to the maximum frequency difference, the adjacent data bins are grouped, then the differential privacy histogram with minimum average error can be constructed. Through theoretical analysis and experimental comparison, it is demonstrated that the proposed data publishing algorithm can not only be used to effectively protect citizens' privacy, but also reduce the query sensitivity and improve the utility of the data published.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Integrated System Optimization Based on the Boiler Combustion and Denitration with Denitration Operating Cost Consideration Chinese Questions Classification in the Law Domain Dust Removal with Boundary and Spatial Constraint for Videos Captured in Car Indexing for Large Scale Data Querying Based on Spark SQL Finding K-Most Influential Users in Social Networks for Information Diffusion Based on Network Structure and Different User Behavioral Patterns
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1