Privacy-aware document retrieval with two-level inverted indexing

IF 1.7 3区 计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Information Retrieval Journal Pub Date : 2023-11-17 DOI:10.1007/s10791-023-09428-z
Yifan Qiao, Shiyu Ji, Changhai Wang, Jinjin Shao, Tao Yang
{"title":"Privacy-aware document retrieval with two-level inverted indexing","authors":"Yifan Qiao, Shiyu Ji, Changhai Wang, Jinjin Shao, Tao Yang","doi":"10.1007/s10791-023-09428-z","DOIUrl":null,"url":null,"abstract":"<p>Previous work on privacy-aware ranking has addressed the minimization of information leakage when scoring top <i>k</i> documents, and has not studied on how to retrieve these top documents and their features for ranking. This paper proposes a privacy-aware document retrieval scheme with a two-level inverted index structure. In this scheme, posting records are grouped with bucket tags and runtime query processing produces query-specific tags in order to gather encoded features of matched documents with a privacy protection during index traversal. To thwart leakage-abuse attacks, our design minimizes the chance that a server processes unauthorized queries or identifies document sharing across posting lists through index inspection or across-query association. This paper presents the evaluation and analytic results of the proposed scheme to demonstrate the tradeoffs in its design considerations for privacy, efficiency, and relevance.</p>","PeriodicalId":54352,"journal":{"name":"Information Retrieval Journal","volume":"576 ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Retrieval Journal","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10791-023-09428-z","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Previous work on privacy-aware ranking has addressed the minimization of information leakage when scoring top k documents, and has not studied on how to retrieve these top documents and their features for ranking. This paper proposes a privacy-aware document retrieval scheme with a two-level inverted index structure. In this scheme, posting records are grouped with bucket tags and runtime query processing produces query-specific tags in order to gather encoded features of matched documents with a privacy protection during index traversal. To thwart leakage-abuse attacks, our design minimizes the chance that a server processes unauthorized queries or identifies document sharing across posting lists through index inspection or across-query association. This paper presents the evaluation and analytic results of the proposed scheme to demonstrate the tradeoffs in its design considerations for privacy, efficiency, and relevance.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有两级倒排索引的隐私感知文档检索
之前关于隐私感知排序的工作主要是在对前k个文档进行评分时最小化信息泄漏,而没有研究如何检索这些顶级文档及其特征进行排序。提出了一种具有两级倒排索引结构的感知隐私的文档检索方案。在此方案中,张贴记录与桶标记分组,运行时查询处理生成特定于查询的标记,以便在索引遍历期间收集具有隐私保护的匹配文档的编码特征。为了阻止泄漏滥用攻击,我们的设计最大限度地减少了服务器处理未经授权的查询或通过索引检查或跨查询关联识别跨发布列表的文档共享的机会。本文给出了所提出方案的评估和分析结果,以证明其在设计考虑隐私,效率和相关性方面的权衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Information Retrieval Journal
Information Retrieval Journal 工程技术-计算机:信息系统
CiteScore
6.20
自引率
0.00%
发文量
17
审稿时长
13.5 months
期刊介绍: The journal provides an international forum for the publication of theory, algorithms, analysis and experiments across the broad area of information retrieval. Topics of interest include search, indexing, analysis, and evaluation for applications such as the web, social and streaming media, recommender systems, and text archives. This includes research on human factors in search, bridging artificial intelligence and information retrieval, and domain-specific search applications.
期刊最新文献
Searching rooms with top-k passenger flows using indoor trajectories An innovative approach for PCO morphology segmentation using a novel MOT-SF technique A graph residual generation network for node classification based on multi-information aggregation Similarity-based ranking of videos from fixed-size one-dimensional video signature The accessibility of digital technologies for people with visual impairment and blindness: a scoping review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1