首页 > 最新文献

European Conference on Information Retrieval最新文献

英文 中文
DESIRE-ME: Domain-Enhanced Supervised Information Retrieval Using Mixture-of-Experts DESIRE-ME:使用专家混合物进行领域增强型监督信息检索
Pub Date : 2024-03-20 DOI: 10.1007/978-3-031-56060-6_8
Pranav Kasela, Gabriella Pasi, Raffaele Perego, N. Tonellotto
{"title":"DESIRE-ME: Domain-Enhanced Supervised Information Retrieval Using Mixture-of-Experts","authors":"Pranav Kasela, Gabriella Pasi, Raffaele Perego, N. Tonellotto","doi":"10.1007/978-3-031-56060-6_8","DOIUrl":"https://doi.org/10.1007/978-3-031-56060-6_8","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":" 38","pages":"111-125"},"PeriodicalIF":0.0,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140388453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the Robustness of Dense Retrievers Against Typos via Multi-Positive Contrastive Learning 通过多正向对比学习提高密集检索器对错别字的稳健性
Pub Date : 2024-03-16 DOI: 10.1007/978-3-031-56063-7_21
Georgios Sidiropoulos, E. Kanoulas
{"title":"Improving the Robustness of Dense Retrievers Against Typos via Multi-Positive Contrastive Learning","authors":"Georgios Sidiropoulos, E. Kanoulas","doi":"10.1007/978-3-031-56063-7_21","DOIUrl":"https://doi.org/10.1007/978-3-031-56063-7_21","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":" 44","pages":"297-305"},"PeriodicalIF":0.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140391260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring Bias in a Ranked List Using Term-Based Representations 使用基于术语的表示法测量排名列表中的偏差
Pub Date : 2024-03-09 DOI: 10.1007/978-3-031-56069-9_1
Amin Abolghasemi, Leif Azzopardi, Arian Askari, X. −832, M. D. Rijke, Suzan Verberne
{"title":"Measuring Bias in a Ranked List Using Term-Based Representations","authors":"Amin Abolghasemi, Leif Azzopardi, Arian Askari, X. −832, M. D. Rijke, Suzan Verberne","doi":"10.1007/978-3-031-56069-9_1","DOIUrl":"https://doi.org/10.1007/978-3-031-56069-9_1","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"14 6","pages":"3-19"},"PeriodicalIF":0.0,"publicationDate":"2024-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140396335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Second Look on BASS - Boosting Abstractive Summarization with Unified Semantic Graphs - A Replication Study 再看 BASS--利用统一语义图促进抽象总结--复制研究
Pub Date : 2024-03-05 DOI: 10.1007/978-3-031-56066-8_11
Osman Alperen Koras, Jörg Schlötterer, Christin Seifert
{"title":"A Second Look on BASS - Boosting Abstractive Summarization with Unified Semantic Graphs - A Replication Study","authors":"Osman Alperen Koras, Jörg Schlötterer, Christin Seifert","doi":"10.1007/978-3-031-56066-8_11","DOIUrl":"https://doi.org/10.1007/978-3-031-56066-8_11","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"343 8","pages":"99-114"},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140397756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control 利用概率扩展控制进行多模态学习稀疏检索
Pub Date : 2024-02-27 DOI: 10.48550/arXiv.2402.17535
Thong Nguyen, Mariya Hendriksen, Andrew Yates, M. D. Rijke
Learned sparse retrieval (LSR) is a family of neural methods that encode queries and documents into sparse lexical vectors that can be indexed and retrieved efficiently with an inverted index. We explore the application of LSR to the multi-modal domain, with a focus on text-image retrieval. While LSR has seen success in text retrieval, its application in multimodal retrieval remains underexplored. Current approaches like LexLIP and STAIR require complex multi-step training on massive datasets. Our proposed approach efficiently transforms dense vectors from a frozen dense model into sparse lexical vectors. We address issues of high dimension co-activation and semantic deviation through a new training algorithm, using Bernoulli random variables to control query expansion. Experiments with two dense models (BLIP, ALBEF) and two datasets (MSCOCO, Flickr30k) show that our proposed algorithm effectively reduces co-activation and semantic deviation. Our best-performing sparsified model outperforms state-of-the-art text-image LSR models with a shorter training time and lower GPU memory requirements. Our approach offers an effective solution for training LSR retrieval models in multimodal settings. Our code and model checkpoints are available at github.com/thongnt99/lsr-multimodal
学习稀疏检索(LSR)是一系列神经方法,可将查询和文档编码为稀疏词性向量,并通过倒排索引进行高效检索。我们探讨了 LSR 在多模态领域的应用,重点是文本-图像检索。虽然 LSR 在文本检索中取得了成功,但其在多模态检索中的应用仍未得到充分探索。LexLIP 和 STAIR 等当前方法需要在海量数据集上进行复杂的多步骤训练。我们提出的方法能有效地将稠密向量从冻结的稠密模型转换为稀疏词向量。我们通过一种新的训练算法,使用伯努利随机变量来控制查询扩展,从而解决了高维共激活和语义偏差的问题。对两个密集模型(BLIP、ALBEF)和两个数据集(MSCOCO、Flickr30k)的实验表明,我们提出的算法能有效减少共激活和语义偏差。我们性能最佳的稀疏化模型优于最先进的文本-图像 LSR 模型,而且训练时间更短,对 GPU 内存的要求更低。我们的方法为在多模态环境中训练 LSR 检索模型提供了有效的解决方案。我们的代码和模型检查点见 github.com/thongnt99/lsr-multimodal。
{"title":"Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control","authors":"Thong Nguyen, Mariya Hendriksen, Andrew Yates, M. D. Rijke","doi":"10.48550/arXiv.2402.17535","DOIUrl":"https://doi.org/10.48550/arXiv.2402.17535","url":null,"abstract":"Learned sparse retrieval (LSR) is a family of neural methods that encode queries and documents into sparse lexical vectors that can be indexed and retrieved efficiently with an inverted index. We explore the application of LSR to the multi-modal domain, with a focus on text-image retrieval. While LSR has seen success in text retrieval, its application in multimodal retrieval remains underexplored. Current approaches like LexLIP and STAIR require complex multi-step training on massive datasets. Our proposed approach efficiently transforms dense vectors from a frozen dense model into sparse lexical vectors. We address issues of high dimension co-activation and semantic deviation through a new training algorithm, using Bernoulli random variables to control query expansion. Experiments with two dense models (BLIP, ALBEF) and two datasets (MSCOCO, Flickr30k) show that our proposed algorithm effectively reduces co-activation and semantic deviation. Our best-performing sparsified model outperforms state-of-the-art text-image LSR models with a shorter training time and lower GPU memory requirements. Our approach offers an effective solution for training LSR retrieval models in multimodal settings. Our code and model checkpoints are available at github.com/thongnt99/lsr-multimodal","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"41 4","pages":"448-464"},"PeriodicalIF":0.0,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140425562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
SoftQE: Learned Representations of Queries Expanded by LLMs SoftQE:通过 LLM 扩展的查询学习表示法
Pub Date : 2024-02-20 DOI: 10.48550/arXiv.2402.12663
Varad Pimpalkhute, John Heyer, Xusen Yin, Sameer Gupta
We investigate the integration of Large Language Models (LLMs) into query encoders to improve dense retrieval without increasing latency and cost, by circumventing the dependency on LLMs at inference time. SoftQE incorporates knowledge from LLMs by mapping embeddings of input queries to those of the LLM-expanded queries. While improvements over various strong baselines on in-domain MS-MARCO metrics are marginal, SoftQE improves performance by 2.83 absolute percentage points on average on five out-of-domain BEIR tasks.
我们研究了将大型语言模型(LLM)集成到查询编码器中的问题,通过在推理时避免对 LLM 的依赖,在不增加延迟和成本的情况下改进密集检索。SoftQE 通过将输入查询的嵌入映射到 LLM 扩展查询的嵌入,将 LLM 的知识融入其中。与各种强基线相比,SoftQE 在域内 MS-MARCO 指标上的改进微乎其微,但在五项域外 BEIR 任务上,其性能平均提高了 2.83 个绝对百分点。
{"title":"SoftQE: Learned Representations of Queries Expanded by LLMs","authors":"Varad Pimpalkhute, John Heyer, Xusen Yin, Sameer Gupta","doi":"10.48550/arXiv.2402.12663","DOIUrl":"https://doi.org/10.48550/arXiv.2402.12663","url":null,"abstract":"We investigate the integration of Large Language Models (LLMs) into query encoders to improve dense retrieval without increasing latency and cost, by circumventing the dependency on LLMs at inference time. SoftQE incorporates knowledge from LLMs by mapping embeddings of input queries to those of the LLM-expanded queries. While improvements over various strong baselines on in-domain MS-MARCO metrics are marginal, SoftQE improves performance by 2.83 absolute percentage points on average on five out-of-domain BEIR tasks.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"1024 ","pages":"68-77"},"PeriodicalIF":0.0,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140445758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Can we predict QPP? An approach based on multivariate outliers 我们能预测 QPP 吗?基于多元离群值的方法
Pub Date : 2024-02-07 DOI: 10.48550/arXiv.2402.16875
Adrian-Gabriel Chifu, S'ebastien D'ejean, Moncef Garouani, Josiane Mothe, Di'ego Ortiz, Md. Zia Ullah
Query performance prediction (QPP) aims to forecast the effectiveness of a search engine across a range of queries and documents. While state-of-the-art predictors offer a certain level of precision, their accuracy is not flawless. Prior research has recognized the challenges inherent in QPP but often lacks a thorough qualitative analysis. In this paper, we delve into QPP by examining the factors that influence the predictability of query performance accuracy. We propose the working hypothesis that while some queries are readily predictable, others present significant challenges. By focusing on outliers, we aim to identify the queries that are particularly challenging to predict. To this end, we employ multivariate outlier detection method. Our results demonstrate the effectiveness of this approach in identifying queries on which QPP do not perform well, yielding less reliable predictions. Moreover, we provide evidence that excluding these hard-to-predict queries from the analysis significantly enhances the overall accuracy of QPP.
查询性能预测(QPP)旨在预测搜索引擎在一系列查询和文档中的有效性。虽然最先进的预测器具有一定的精确度,但其准确性并非完美无瑕。先前的研究已经认识到 QPP 所固有的挑战,但往往缺乏全面的定性分析。在本文中,我们通过研究影响查询性能准确性可预测性的因素来深入探讨 QPP。我们提出的工作假设是,有些查询是容易预测的,而有些查询则会带来重大挑战。通过关注异常值,我们旨在确定哪些查询特别难以预测。为此,我们采用了多元离群值检测方法。我们的结果表明,这种方法在识别 QPP 表现不佳、预测可靠性较低的查询方面非常有效。此外,我们还提供证据表明,将这些难以预测的查询排除在分析之外可显著提高 QPP 的整体准确性。
{"title":"Can we predict QPP? An approach based on multivariate outliers","authors":"Adrian-Gabriel Chifu, S'ebastien D'ejean, Moncef Garouani, Josiane Mothe, Di'ego Ortiz, Md. Zia Ullah","doi":"10.48550/arXiv.2402.16875","DOIUrl":"https://doi.org/10.48550/arXiv.2402.16875","url":null,"abstract":"Query performance prediction (QPP) aims to forecast the effectiveness of a search engine across a range of queries and documents. While state-of-the-art predictors offer a certain level of precision, their accuracy is not flawless. Prior research has recognized the challenges inherent in QPP but often lacks a thorough qualitative analysis. In this paper, we delve into QPP by examining the factors that influence the predictability of query performance accuracy. We propose the working hypothesis that while some queries are readily predictable, others present significant challenges. By focusing on outliers, we aim to identify the queries that are particularly challenging to predict. To this end, we employ multivariate outlier detection method. Our results demonstrate the effectiveness of this approach in identifying queries on which QPP do not perform well, yielding less reliable predictions. Moreover, we provide evidence that excluding these hard-to-predict queries from the analysis significantly enhances the overall accuracy of QPP.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"78 ","pages":"458-467"},"PeriodicalIF":0.0,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140460779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FakeClaim: A Multiple Platform-driven Dataset for Identification of Fake News on 2023 Israel-Hamas War FakeClaim:用于识别 2023 年以色列-哈马斯战争假新闻的多平台驱动数据集
Pub Date : 2024-01-29 DOI: 10.48550/arXiv.2401.16625
Gautam Kishore Shahi, Amit Kumar Jaiswal, Thomas Mandl
We contribute the first publicly available dataset of factual claims from different platforms and fake YouTube videos on the 2023 Israel-Hamas war for automatic fake YouTube video classification. The FakeClaim data is collected from 60 fact-checking organizations in 30 languages and enriched with metadata from the fact-checking organizations curated by trained journalists specialized in fact-checking. Further, we classify fake videos within the subset of YouTube videos using textual information and user comments. We used a pre-trained model to classify each video with different feature combinations. Our best-performing fine-tuned language model, Universal Sentence Encoder (USE), achieves a Macro F1 of 87%, which shows that the trained model can be helpful for debunking fake videos using the comments from the user discussion. The dataset is available on Githubfootnote{https://github.com/Gautamshahi/FakeClaim}
我们提供了第一个公开可用的数据集,其中包含来自不同平台的事实陈述和关于 2023 年以色列-哈马斯战争的虚假 YouTube 视频,用于对虚假 YouTube 视频进行自动分类。FakeClaim 数据来自 60 个事实核查组织,使用 30 种语言,并由经过培训的事实核查专业记者从事实核查组织中收集元数据加以充实。此外,我们还利用文本信息和用户评论对 YouTube 视频子集中的虚假视频进行分类。我们使用预先训练好的模型,以不同的特征组合对每个视频进行分类。我们表现最好的微调语言模型--通用句子编码器(USE)--达到了 87% 的 Macro F1,这表明训练有素的模型有助于利用用户讨论中的评论来揭穿虚假视频。该数据集可在 Githubfootnote{https://github.com/Gautamshahi/FakeClaim} 上获取。
{"title":"FakeClaim: A Multiple Platform-driven Dataset for Identification of Fake News on 2023 Israel-Hamas War","authors":"Gautam Kishore Shahi, Amit Kumar Jaiswal, Thomas Mandl","doi":"10.48550/arXiv.2401.16625","DOIUrl":"https://doi.org/10.48550/arXiv.2401.16625","url":null,"abstract":"We contribute the first publicly available dataset of factual claims from different platforms and fake YouTube videos on the 2023 Israel-Hamas war for automatic fake YouTube video classification. The FakeClaim data is collected from 60 fact-checking organizations in 30 languages and enriched with metadata from the fact-checking organizations curated by trained journalists specialized in fact-checking. Further, we classify fake videos within the subset of YouTube videos using textual information and user comments. We used a pre-trained model to classify each video with different feature combinations. Our best-performing fine-tuned language model, Universal Sentence Encoder (USE), achieves a Macro F1 of 87%, which shows that the trained model can be helpful for debunking fake videos using the comments from the user discussion. The dataset is available on Githubfootnote{https://github.com/Gautamshahi/FakeClaim}","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"70 5","pages":"66-74"},"PeriodicalIF":0.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140485828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Cost-Sensitive Meta-Learning Strategy for Fair Provider Exposure in Recommendation 在推荐中公平曝光提供商的成本敏感元学习策略
Pub Date : 2024-01-24 DOI: 10.48550/arXiv.2401.13566
Ludovico Boratto, Giulia Cerniglia, M. Marras, Alessandra Perniciano, Barbara Pes
When devising recommendation services, it is important to account for the interests of all content providers, encompassing not only newcomers but also minority demographic groups. In various instances, certain provider groups find themselves underrepresented in the item catalog, a situation that can influence recommendation results. Hence, platform owners often seek to regulate the exposure of these provider groups in the recommended lists. In this paper, we propose a novel cost-sensitive approach designed to guarantee these target exposure levels in pairwise recommendation models. This approach quantifies, and consequently mitigate, the discrepancies between the volume of recommendations allocated to groups and their contribution in the item catalog, under the principle of equity. Our results show that this approach, while aligning groups exposure with their assigned levels, does not compromise to the original recommendation utility. Source code and pre-processed data can be retrieved at https://github.com/alessandraperniciano/meta-learning-strategy-fair-provider-exposure.
在设计推荐服务时,必须考虑到所有内容提供者的利益,不仅包括新用户,也包括少数人口群体。在各种情况下,某些提供商群体发现自己在项目目录中的代表性不足,这种情况会影响推荐结果。因此,平台所有者通常会设法调节这些提供商群体在推荐列表中的曝光率。在本文中,我们提出了一种新颖的成本敏感型方法,旨在保证成对推荐模型中的目标曝光水平。在公平原则下,该方法量化了分配给群体的推荐量与他们在项目目录中的贡献之间的差异,并因此减轻了这种差异。我们的研究结果表明,这种方法在使各组的曝光率与其分配的等级保持一致的同时,并不会损害原有的推荐效用。源代码和预处理数据可在 https://github.com/alessandraperniciano/meta-learning-strategy-fair-provider-exposure 上检索。
{"title":"A Cost-Sensitive Meta-Learning Strategy for Fair Provider Exposure in Recommendation","authors":"Ludovico Boratto, Giulia Cerniglia, M. Marras, Alessandra Perniciano, Barbara Pes","doi":"10.48550/arXiv.2401.13566","DOIUrl":"https://doi.org/10.48550/arXiv.2401.13566","url":null,"abstract":"When devising recommendation services, it is important to account for the interests of all content providers, encompassing not only newcomers but also minority demographic groups. In various instances, certain provider groups find themselves underrepresented in the item catalog, a situation that can influence recommendation results. Hence, platform owners often seek to regulate the exposure of these provider groups in the recommended lists. In this paper, we propose a novel cost-sensitive approach designed to guarantee these target exposure levels in pairwise recommendation models. This approach quantifies, and consequently mitigate, the discrepancies between the volume of recommendations allocated to groups and their contribution in the item catalog, under the principle of equity. Our results show that this approach, while aligning groups exposure with their assigned levels, does not compromise to the original recommendation utility. Source code and pre-processed data can be retrieved at https://github.com/alessandraperniciano/meta-learning-strategy-fair-provider-exposure.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"1 1","pages":"440-448"},"PeriodicalIF":0.0,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140497673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the Usefulness of Clarifying Questions and Answers for Conversational Search 估算说明性问答对对话搜索的有用性
Pub Date : 2024-01-21 DOI: 10.48550/arXiv.2401.11463
Ivan Sekuli'c, Weronika Lajewska, K. Balog, Fabio Crestani
While the body of research directed towards constructing and generating clarifying questions in mixed-initiative conversational search systems is vast, research aimed at processing and comprehending users' answers to such questions is scarce. To this end, we present a simple yet effective method for processing answers to clarifying questions, moving away from previous work that simply appends answers to the original query and thus potentially degrades retrieval performance. Specifically, we propose a classifier for assessing usefulness of the prompted clarifying question and an answer given by the user. Useful questions or answers are further appended to the conversation history and passed to a transformer-based query rewriting module. Results demonstrate significant improvements over strong non-mixed-initiative baselines. Furthermore, the proposed approach mitigates the performance drops when non useful questions and answers are utilized.
在混合式会话搜索系统中,针对构建和生成澄清问题的研究成果非常丰富,但针对处理和理解用户对此类问题的回答的研究却很少。为此,我们提出了一种简单而有效的方法来处理澄清问题的答案,而不再像以前的研究那样简单地将答案附加到原始查询中,从而降低检索性能。具体来说,我们提出了一种分类器,用于评估提示的澄清问题和用户给出的答案是否有用。有用的问题或答案会被进一步添加到对话历史记录中,并传递给基于转换器的查询重写模块。结果表明,与强大的非混合诱导基线相比,该方法有了明显的改进。此外,当使用非有用的问题和答案时,所提出的方法还能缓解性能下降的问题。
{"title":"Estimating the Usefulness of Clarifying Questions and Answers for Conversational Search","authors":"Ivan Sekuli'c, Weronika Lajewska, K. Balog, Fabio Crestani","doi":"10.48550/arXiv.2401.11463","DOIUrl":"https://doi.org/10.48550/arXiv.2401.11463","url":null,"abstract":"While the body of research directed towards constructing and generating clarifying questions in mixed-initiative conversational search systems is vast, research aimed at processing and comprehending users' answers to such questions is scarce. To this end, we present a simple yet effective method for processing answers to clarifying questions, moving away from previous work that simply appends answers to the original query and thus potentially degrades retrieval performance. Specifically, we propose a classifier for assessing usefulness of the prompted clarifying question and an answer given by the user. Useful questions or answers are further appended to the conversation history and passed to a transformer-based query rewriting module. Results demonstrate significant improvements over strong non-mixed-initiative baselines. Furthermore, the proposed approach mitigates the performance drops when non useful questions and answers are utilized.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"92 3","pages":"384-392"},"PeriodicalIF":0.0,"publicationDate":"2024-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140501229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
European Conference on Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1