最大化数据效用,同时通过数据库碎片保护隐私

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-05-10 Epub Date: 2025-02-15 DOI:10.1016/j.eswa.2025.126873
Ali Amiri
{"title":"最大化数据效用,同时通过数据库碎片保护隐私","authors":"Ali Amiri","doi":"10.1016/j.eswa.2025.126873","DOIUrl":null,"url":null,"abstract":"<div><div>Efficiently managing databases that balance data privacy with utility is a critical challenge in today’s data-driven landscape. This study addresses the problem of database fragmentation, which involves dividing a database into smaller fragments, each containing a subset of attributes. The primary objective is to strike a balance between safeguarding the confidentiality of sensitive attribute sets and optimizing the database’s utility. Sensitive attribute sets include combinations of attributes that could disclose private information or identify individuals, such as personal quasi-identifiers, necessitating their separation into distinct fragments to reduce the risk of sensitive data exposure. Conversely, utility attribute sets consist of attributes that enhance data usability and query efficiency. Maximizing utility requires grouping attributes from the same utility set into as few fragments as possible. To effectively solve this complex NP-hard problem, A column generation-based solution leveraging a set partitioning formulation is presented. Experimental evaluations on real and synthetic datasets validate the efficiency of the proposed approach, demonstrating its superiority over the state-of-the-art commercial solver, CPLEX.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"273 ","pages":"Article 126873"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Maximizing data utility while preserving privacy through database fragmentation\",\"authors\":\"Ali Amiri\",\"doi\":\"10.1016/j.eswa.2025.126873\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Efficiently managing databases that balance data privacy with utility is a critical challenge in today’s data-driven landscape. This study addresses the problem of database fragmentation, which involves dividing a database into smaller fragments, each containing a subset of attributes. The primary objective is to strike a balance between safeguarding the confidentiality of sensitive attribute sets and optimizing the database’s utility. Sensitive attribute sets include combinations of attributes that could disclose private information or identify individuals, such as personal quasi-identifiers, necessitating their separation into distinct fragments to reduce the risk of sensitive data exposure. Conversely, utility attribute sets consist of attributes that enhance data usability and query efficiency. Maximizing utility requires grouping attributes from the same utility set into as few fragments as possible. To effectively solve this complex NP-hard problem, A column generation-based solution leveraging a set partitioning formulation is presented. Experimental evaluations on real and synthetic datasets validate the efficiency of the proposed approach, demonstrating its superiority over the state-of-the-art commercial solver, CPLEX.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"273 \",\"pages\":\"Article 126873\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425004956\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/15 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425004956","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

在当今数据驱动的环境中,有效地管理平衡数据隐私和实用性的数据库是一项关键挑战。本研究解决了数据库碎片的问题,这涉及到将数据库划分为更小的片段,每个片段包含一个属性子集。主要目标是在保护敏感属性集的机密性和优化数据库的实用性之间取得平衡。敏感属性集包括可能泄露私人信息或识别个人的属性组合,例如个人准标识符,因此需要将其分离为不同的片段,以降低敏感数据暴露的风险。相反,实用工具属性集由增强数据可用性和查询效率的属性组成。最大化效用需要将来自同一效用集的属性分组到尽可能少的片段中。为了有效地解决这个复杂的np困难问题,提出了一种基于列生成的解决方案,利用集合划分公式。在真实和合成数据集上的实验评估验证了所提出方法的有效性,证明了其优于最先进的商业求解器CPLEX。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Maximizing data utility while preserving privacy through database fragmentation
Efficiently managing databases that balance data privacy with utility is a critical challenge in today’s data-driven landscape. This study addresses the problem of database fragmentation, which involves dividing a database into smaller fragments, each containing a subset of attributes. The primary objective is to strike a balance between safeguarding the confidentiality of sensitive attribute sets and optimizing the database’s utility. Sensitive attribute sets include combinations of attributes that could disclose private information or identify individuals, such as personal quasi-identifiers, necessitating their separation into distinct fragments to reduce the risk of sensitive data exposure. Conversely, utility attribute sets consist of attributes that enhance data usability and query efficiency. Maximizing utility requires grouping attributes from the same utility set into as few fragments as possible. To effectively solve this complex NP-hard problem, A column generation-based solution leveraging a set partitioning formulation is presented. Experimental evaluations on real and synthetic datasets validate the efficiency of the proposed approach, demonstrating its superiority over the state-of-the-art commercial solver, CPLEX.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
Iterative conceptual query expansion for biomedical information retrieval MS-Bi-PRM: A dynamic-ready bidirectional probabilistic roadmap algorithm with multi-strategy sampling for high-efficiency robotic manipulator path planning Multi-hop semantic association-based adversarial attack and defense method for natural language processing systems Dice-GAN: Generative adversarial network with diversity injection and consistency enhancement ARMS: Attention-driven representation reconstruction with multi-signal pseudo-label selection for semi-supervised text classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1