Information Retrieval Journal最新文献

英文中文

An in-depth analysis of passage-level label transfer for contextual document ranking 深入分析用于上下文文档排序的段落级标签转移

IF 2.5 3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-12-08 DOI: 10.1007/s10791-023-09430-5

Koustav Rudra, Zeon Trevor Fernando, Avishek Anand

Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the passages and such document ranking models suffer from the token limitation (512) of BERT. Researchers proposed ranking strategies that either truncate the documents beyond the token limit or chunk the documents into units that can fit into the BERT. In the later case, the relevance labels are either directly transferred from the original query-document pair or learned through some external model. In this paper, we conduct a detailed study of the design decisions about splitting and label transfer on retrieval effectiveness and efficiency. We find that direct transfer of relevance labels from documents to passages introduces label noise that strongly affects retrieval effectiveness for large training datasets. We also find that query processing times are adversely affected by fine-grained splitting schemes. As a remedy, we propose a careful passage level labelling scheme using weak supervision that delivers improved performance (3–14% in terms of nDCG score) over most of the recently proposed models for ad-hoc retrieval while maintaining manageable computational complexity on four diverse document retrieval datasets.

预先训练的上下文语言模型（如 BERT、GPT 和 XLnet）在文档检索任务中效果相当不错。这些模型根据查询-文档/查询-段落级别的相关性标签进行微调，以捕捉排序信号。然而，文档比段落长，这类文档排序模型受到 BERT 标记限制（512）的影响。研究人员提出了一些排序策略，要么将超过标记限制的文档截断，要么将文档分块，使其适合 BERT。在后一种情况下，相关性标签要么直接从原始查询-文档对中转移，要么通过一些外部模型学习。在本文中，我们详细研究了拆分和标签转移的设计决策对检索效果和效率的影响。我们发现，将相关性标签从文档直接转移到段落会引入标签噪声，从而严重影响大型训练数据集的检索效果。我们还发现，细粒度分割方案会对查询处理时间产生不利影响。作为一种补救措施，我们提出了一种使用弱监督的谨慎的段落级标签方案，与最近提出的大多数临时检索模型相比，该方案提高了性能（在 nDCG 分数方面提高了 3-14%），同时在四个不同的文档检索数据集上保持了可管理的计算复杂度。

{"title":"An in-depth analysis of passage-level label transfer for contextual document ranking","authors":"Koustav Rudra, Zeon Trevor Fernando, Avishek Anand","doi":"10.1007/s10791-023-09430-5","DOIUrl":"https://doi.org/10.1007/s10791-023-09430-5","url":null,"abstract":"<p>Pre-trained contextual language models such as BERT, GPT, and XLnet work quite well for document retrieval tasks. Such models are fine-tuned based on the query-document/query-passage level relevance labels to capture the ranking signals. However, the documents are longer than the passages and such document ranking models suffer from the token limitation (512) of BERT. Researchers proposed ranking strategies that either truncate the documents beyond the token limit or chunk the documents into units that can fit into the BERT. In the later case, the relevance labels are either directly transferred from the original query-document pair or learned through some external model. In this paper, we conduct a detailed study of the design decisions about splitting and label transfer on retrieval effectiveness and efficiency. We find that direct transfer of relevance labels from documents to passages introduces <i>label noise</i> that strongly affects retrieval effectiveness for large training datasets. We also find that query processing times are adversely affected by fine-grained splitting schemes. As a remedy, we propose a careful passage level labelling scheme using weak supervision that delivers improved performance (3–14% in terms of nDCG score) over most of the recently proposed models for ad-hoc retrieval while maintaining manageable computational complexity on four diverse document retrieval datasets.</p>","PeriodicalId":54352,"journal":{"name":"Information Retrieval Journal","volume":"64 1","pages":""},"PeriodicalIF":2.5,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138556906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Privacy-aware document retrieval with two-level inverted indexing 具有两级倒排索引的隐私感知文档检索

IF 2.5 3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-11-17 DOI: 10.1007/s10791-023-09428-z

Yifan Qiao, Shiyu Ji, Changhai Wang, Jinjin Shao, Tao Yang

Previous work on privacy-aware ranking has addressed the minimization of information leakage when scoring top k documents, and has not studied on how to retrieve these top documents and their features for ranking. This paper proposes a privacy-aware document retrieval scheme with a two-level inverted index structure. In this scheme, posting records are grouped with bucket tags and runtime query processing produces query-specific tags in order to gather encoded features of matched documents with a privacy protection during index traversal. To thwart leakage-abuse attacks, our design minimizes the chance that a server processes unauthorized queries or identifies document sharing across posting lists through index inspection or across-query association. This paper presents the evaluation and analytic results of the proposed scheme to demonstrate the tradeoffs in its design considerations for privacy, efficiency, and relevance.

之前关于隐私感知排序的工作主要是在对前k个文档进行评分时最小化信息泄漏，而没有研究如何检索这些顶级文档及其特征进行排序。提出了一种具有两级倒排索引结构的感知隐私的文档检索方案。在此方案中，张贴记录与桶标记分组，运行时查询处理生成特定于查询的标记，以便在索引遍历期间收集具有隐私保护的匹配文档的编码特征。为了阻止泄漏滥用攻击，我们的设计最大限度地减少了服务器处理未经授权的查询或通过索引检查或跨查询关联识别跨发布列表的文档共享的机会。本文给出了所提出方案的评估和分析结果，以证明其在设计考虑隐私，效率和相关性方面的权衡。

引用次数: 0

Heterogeneous graph attention networks for passage retrieval 文章检索的异构图注意网络

IF 2.5 3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-11-16 DOI: 10.1007/s10791-023-09424-3

Lucas Albarede, Philippe Mulhem, Lorraine Goeuriot, Sylvain Marié, Claude Le Pape-Gardeux, Trinidad Chardin-Segui

This paper presents an exploration of the usage of Heterogeneous Graph Attention Networks, or HGATs, for the task of Passage Retrieval. More precisely, we study how these models perform to alleviate the problem of passage contextualization, that is incorporating information about the context of a passage (its containing document, neighbouring passages, etc.) in its relevance estimation. We first propose several configurations to compute contextualized passage representations, including a document graph representation composed of contextualizing signals and judiciously modified HGAT architectures. We then present how we integrate these configurations in a neural passage ranking model. We evaluate our approach on a Passage Retrieval task on patent documents: CLEF-IP2013, as these documents possess several different contextualizing signals fully exploited in our models. Our results show that some HGAT architecture modifications allow for a better context representation leading to improved performances and stability.

本文提出了使用异构图注意网络(HGATs)来完成文章检索任务的探索。更准确地说，我们研究了这些模型如何缓解段落语境化的问题，即在相关性估计中纳入关于段落上下文的信息(其包含的文档，邻近的段落等)。我们首先提出了几种计算上下文化通道表示的配置，包括由上下文化信号和明智修改的HGAT架构组成的文档图表示。然后，我们介绍了如何将这些配置整合到神经通道排序模型中。我们在专利文档的段落检索任务中评估了我们的方法:CLEF-IP2013，因为这些文档具有几种不同的上下文化信号，这些信号在我们的模型中得到了充分利用。我们的结果表明，一些HGAT架构修改允许更好的上下文表示，从而提高了性能和稳定性。

引用次数: 0

Constructing and meta-evaluating state-aware evaluation metrics for interactive search systems 构建和元评估交互式搜索系统的状态感知评估指标

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-10-31 DOI: 10.1007/s10791-023-09426-1

Marco Markwald, Jiqun Liu, Ran Yu

Abstract Evaluation metrics such as precision, recall and normalized discounted cumulative gain have been widely applied in ad hoc retrieval experiments. They have facilitated the assessment of system performance in various topics over the past decade. However, the effectiveness of such metrics in capturing users’ in-situ search experience, especially in complex search tasks that trigger interactive search sessions, is limited. To address this challenge, it is necessary to adaptively adjust the evaluation strategies of search systems to better respond to users’ changing information needs and evaluation criteria. In this work, we adopt a taxonomy of search task states that a user goes through in different scenarios and moments of search sessions, and perform a meta-evaluation of existing metrics to better understand their effectiveness in measuring user satisfaction. We then built models for predicting task states behind queries based on in-session signals. Furthermore, we constructed and meta-evaluated new state-aware evaluation metrics. Our analysis and experimental evaluation are performed on two datasets collected from a field study and a laboratory study, respectively. Results demonstrate that the effectiveness of individual evaluation metrics varies across task states. Meanwhile, task states can be detected from in-session signals. Our new state-aware evaluation metrics could better reflect in-situ user satisfaction than an extensive list of the widely used measures we analyzed in this work in certain states. Findings of our research can inspire the design and meta-evaluation of user-centered adaptive evaluation metrics, and also shed light on the development of state-aware interactive search systems.

摘要精密度、查全率和归一化折现累积增益等评价指标在自组织检索实验中得到了广泛应用。在过去十年中，它们促进了对不同主题的系统性能的评估。然而，这些指标在捕捉用户的原位搜索体验方面的有效性是有限的，特别是在触发交互式搜索会话的复杂搜索任务中。为了应对这一挑战，有必要自适应地调整搜索系统的评价策略，以更好地响应用户不断变化的信息需求和评价标准。在这项工作中，我们采用了用户在搜索会话的不同场景和时刻经历的搜索任务状态的分类法，并对现有指标进行了元评估，以更好地了解它们在衡量用户满意度方面的有效性。然后，我们建立了基于会话内信号的模型来预测查询背后的任务状态。此外，我们构建并元评估了新的状态感知评估指标。我们的分析和实验评估分别对从实地研究和实验室研究中收集的两个数据集进行。结果表明，单个评估指标的有效性因任务状态而异。同时，可以从会话信号中检测任务状态。我们新的状态感知评估指标可以更好地反映现场用户满意度，而不是我们在某些状态下分析的广泛使用的测量方法的广泛列表。我们的研究结果可以启发以用户为中心的自适应评价指标的设计和元评价，也可以为状态感知交互式搜索系统的开发提供启示。

{"title":"Constructing and meta-evaluating state-aware evaluation metrics for interactive search systems","authors":"Marco Markwald, Jiqun Liu, Ran Yu","doi":"10.1007/s10791-023-09426-1","DOIUrl":"https://doi.org/10.1007/s10791-023-09426-1","url":null,"abstract":"Abstract Evaluation metrics such as precision, recall and normalized discounted cumulative gain have been widely applied in ad hoc retrieval experiments. They have facilitated the assessment of system performance in various topics over the past decade. However, the effectiveness of such metrics in capturing users’ in-situ search experience, especially in complex search tasks that trigger interactive search sessions, is limited. To address this challenge, it is necessary to adaptively adjust the evaluation strategies of search systems to better respond to users’ changing information needs and evaluation criteria. In this work, we adopt a taxonomy of search task states that a user goes through in different scenarios and moments of search sessions, and perform a meta-evaluation of existing metrics to better understand their effectiveness in measuring user satisfaction. We then built models for predicting task states behind queries based on in-session signals. Furthermore, we constructed and meta-evaluated new state-aware evaluation metrics. Our analysis and experimental evaluation are performed on two datasets collected from a field study and a laboratory study, respectively. Results demonstrate that the effectiveness of individual evaluation metrics varies across task states. Meanwhile, task states can be detected from in-session signals. Our new state-aware evaluation metrics could better reflect in-situ user satisfaction than an extensive list of the widely used measures we analyzed in this work in certain states. Findings of our research can inspire the design and meta-evaluation of user-centered adaptive evaluation metrics, and also shed light on the development of state-aware interactive search systems.","PeriodicalId":54352,"journal":{"name":"Information Retrieval Journal","volume":"2013 24","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135813980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DeepQFM: a deep learning based query facets mining method DeepQFM:基于深度学习的查询面挖掘方法

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-10-30 DOI: 10.1007/s10791-023-09427-0

Zhirui Deng, Zhicheng Dou, Ji-Rong Wen

引用次数: 0

Learning heterogeneous subgraph representations for team discovery 学习异构子图表示用于团队发现

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-10-09 DOI: 10.1007/s10791-023-09421-6

Radin Hamidi Rad, Hoang Nguyen, Feras Al-Obeidat, Ebrahim Bagheri, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta, Fattane Zarrinkalam

引用次数: 0

Investigating better context representations for generative question answering 为生成式问答研究更好的上下文表示

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-10-02 DOI: 10.1007/s10791-023-09420-7

Sumam Francis, Marie-Francine Moens

引用次数: 0

Multimodal video retrieval with CLIP: a user study 多模式视频检索与剪辑:一个用户研究

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-09-29 DOI: 10.1007/s10791-023-09425-2

Tayfun Alpay, Sven Magg, Philipp Broze, Daniel Speck

Abstract Recent machine learning advances demonstrate the effectiveness of zero-shot models trained on large amounts of data collected from the internet. Among these, CLIP (Contrastive Language-Image Pre-training) has been introduced as a multimodal model with high accuracy on a number of different tasks and domains. However, the unconstrained nature of the model begs the question whether it can be deployed in open-domain real-word applications effectively in front of non-technical users. In this paper, we evaluate whether CLIP can be used for multimodal video retrieval in a real-world environment. For this purpose, we implemented impa , an efficient shot-based retrieval system powered by CLIP. We additionally implemented advanced query functionality in a unified graphical user interface to facilitate an intuitive and efficient usage of CLIP for video retrieval tasks. Finally, we empirically evaluated our retrieval system by performing a user study with video editing professionals and journalists working in the TV news media industry. After having the participants solve open-domain video retrieval tasks, we collected data via questionnaires, interviews, and UI interaction logs. Our evaluation focused on the perceived intuitiveness of retrieval using natural language, retrieval accuracy, and how users interacted with the system’s UI. We found that our advanced features yield higher task accuracy, user ratings, and more efficient queries. Overall, our results show the importance of designing intuitive and efficient user interfaces to be able to deploy large models such as CLIP effectively in real-world scenarios.

最近的机器学习进展证明了零射击模型在从互联网收集的大量数据上训练的有效性。其中，CLIP(对比语言-图像预训练)作为一种多模态模型被引入，在许多不同的任务和领域上具有很高的准确性。然而，该模型不受约束的特性引出了一个问题，即它是否能够有效地部署在面向非技术用户的开放域实际应用程序中。在本文中，我们评估CLIP是否可以用于现实世界环境中的多模态视频检索。为此，我们实现了impa，这是一个由CLIP驱动的高效的基于镜头的检索系统。我们还在统一的图形用户界面中实现了高级查询功能，以方便直观和有效地使用CLIP进行视频检索任务。最后，我们通过对电视新闻媒体行业的视频编辑专业人员和记者进行用户研究，对我们的检索系统进行了实证评估。在参与者完成开放域视频检索任务后，我们通过问卷调查、访谈和UI交互日志收集数据。我们的评估集中在使用自然语言检索的感知直观性、检索准确性以及用户如何与系统UI交互。我们发现我们的高级特性产生了更高的任务准确性、用户评分和更高效的查询。总的来说，我们的结果显示了设计直观和高效的用户界面的重要性，以便能够在现实场景中有效地部署大型模型，如CLIP。

{"title":"Multimodal video retrieval with CLIP: a user study","authors":"Tayfun Alpay, Sven Magg, Philipp Broze, Daniel Speck","doi":"10.1007/s10791-023-09425-2","DOIUrl":"https://doi.org/10.1007/s10791-023-09425-2","url":null,"abstract":"Abstract Recent machine learning advances demonstrate the effectiveness of zero-shot models trained on large amounts of data collected from the internet. Among these, CLIP (Contrastive Language-Image Pre-training) has been introduced as a multimodal model with high accuracy on a number of different tasks and domains. However, the unconstrained nature of the model begs the question whether it can be deployed in open-domain real-word applications effectively in front of non-technical users. In this paper, we evaluate whether CLIP can be used for multimodal video retrieval in a real-world environment. For this purpose, we implemented impa , an efficient shot-based retrieval system powered by CLIP. We additionally implemented advanced query functionality in a unified graphical user interface to facilitate an intuitive and efficient usage of CLIP for video retrieval tasks. Finally, we empirically evaluated our retrieval system by performing a user study with video editing professionals and journalists working in the TV news media industry. After having the participants solve open-domain video retrieval tasks, we collected data via questionnaires, interviews, and UI interaction logs. Our evaluation focused on the perceived intuitiveness of retrieval using natural language, retrieval accuracy, and how users interacted with the system’s UI. We found that our advanced features yield higher task accuracy, user ratings, and more efficient queries. Overall, our results show the importance of designing intuitive and efficient user interfaces to be able to deploy large models such as CLIP effectively in real-world scenarios.","PeriodicalId":54352,"journal":{"name":"Information Retrieval Journal","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135194990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MuMUR: Multilingual Multimodal Universal Retrieval MuMUR:多语言多模态通用检索

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-09-25 DOI: 10.1007/s10791-023-09422-5

Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shachar Rosenman, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal

引用次数: 0

Temporal information retrieval using bitwise operators 使用位运算符的时态信息检索

3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Retrieval Journal

Pub Date : 2023-09-23 DOI: 10.1007/s10791-023-09423-4

Prasanna Koirala, Ramazan Aygun, Tathagata Mukherjee, Haeyong Chung

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Information Retrieval Journal

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀