On masking topical intent in keyword search

2014 IEEE 30th International Conference on Data Engineering Pub Date : 2014-05-19 DOI:10.1109/ICDE.2014.6816656

Peng Wang, C. Ravishankar

引用次数: 12

Abstract

Text-based search queries reveal user intent to the search engine, compromising privacy. Topical Intent Obfuscation (TIO) is a promising new approach to preserving user privacy. TIO masks topical intent by mixing real user queries with dummy queries matching various different topics. Dummy queries are generated using a Dummy Query Generation Algorithm (DGA). We demonstrate various shortcomings in current TIO schemes, and show how to correct them. Current schemes assume that DGA details are unknown to the adversary. We argue that this is a flawed assumption, and show how DGA details can be used to construct efficient attacks on TIO schemes, using an iterative DGA as an example. Our extensive experiments on real data sets show that our attacks can flag up to 80% of dummy queries. We also propose HDGA, a new DGA that we prove to be immune to the attacks based on DGA semantics that we describe.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

关键词搜索中主题意图的掩蔽

基于文本的搜索查询向搜索引擎揭示了用户的意图，损害了隐私。局部意图混淆(TIO)是一种很有前途的保护用户隐私的新方法。TIO通过混合真实用户查询和匹配各种不同主题的虚拟查询来掩盖主题意图。虚拟查询是使用虚拟查询生成算法(DGA)生成的。我们展示了当前TIO方案的各种缺点，并展示了如何纠正它们。目前的方案假设对手不知道DGA的细节。我们认为这是一个有缺陷的假设，并展示了如何使用DGA细节来构建对TIO方案的有效攻击，并以迭代DGA为例。我们在真实数据集上的大量实验表明，我们的攻击可以标记高达80%的虚拟查询。我们还提出了HDGA，一种新的DGA，我们证明了它对基于我们描述的DGA语义的攻击免疫。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE 30th International Conference on Data Engineering

自引率

0.00%

发文量

期刊最新文献

Managing uncertainty in spatial and spatio-temporal data Locality-sensitive operators for parallel main-memory database clusters KnowLife: A knowledge graph for health and life sciences We can learn your #hashtags: Connecting tweets to explicit topics A demonstration of MNTG - A web-based road network traffic generator