首页 > 最新文献

European Conference on Information Retrieval最新文献

英文 中文
A Deep Learning Approach for Selective Relevance Feedback 选择性相关性反馈的深度学习方法
Pub Date : 2024-01-20 DOI: 10.48550/arXiv.2401.11198
S. Datta, Debasis Ganguly, Sean MacAvaney, Derek Greene
Pseudo-relevance feedback (PRF) can enhance average retrieval effectiveness over a sufficiently large number of queries. However, PRF often introduces a drift into the original information need, thus hurting the retrieval effectiveness of several queries. While a selective application of PRF can potentially alleviate this issue, previous approaches have largely relied on unsupervised or feature-based learning to determine whether a query should be expanded. In contrast, we revisit the problem of selective PRF from a deep learning perspective, presenting a model that is entirely data-driven and trained in an end-to-end manner. The proposed model leverages a transformer-based bi-encoder architecture. Additionally, to further improve retrieval effectiveness with this selective PRF approach, we make use of the model's confidence estimates to combine the information from the original and expanded queries. In our experiments, we apply this selective feedback on a number of different combinations of ranking and feedback models, and show that our proposed approach consistently improves retrieval effectiveness for both sparse and dense ranking models, with the feedback models being either sparse, dense or generative.
伪相关性反馈(PRF)可以提高足够多查询的平均检索效率。然而,伪相关性反馈往往会使原始信息需求发生偏移,从而损害若干查询的检索效果。虽然有选择地应用 PRF 有可能缓解这一问题,但以前的方法主要依赖于无监督或基于特征的学习来确定是否应该扩展查询。相比之下,我们从深度学习的角度重新审视了选择性 PRF 问题,提出了一种完全由数据驱动并以端到端方式进行训练的模型。所提出的模型利用了基于变换器的双编码器架构。此外,为了进一步提高这种选择性 PRF 方法的检索效率,我们还利用模型的置信度估计来结合来自原始查询和扩展查询的信息。在实验中,我们将这种选择性反馈应用于排名和反馈模型的多种不同组合,结果表明,我们提出的方法可以持续提高稀疏和密集排名模型的检索效率,反馈模型可以是稀疏模型、密集模型或生成模型。
{"title":"A Deep Learning Approach for Selective Relevance Feedback","authors":"S. Datta, Debasis Ganguly, Sean MacAvaney, Derek Greene","doi":"10.48550/arXiv.2401.11198","DOIUrl":"https://doi.org/10.48550/arXiv.2401.11198","url":null,"abstract":"Pseudo-relevance feedback (PRF) can enhance average retrieval effectiveness over a sufficiently large number of queries. However, PRF often introduces a drift into the original information need, thus hurting the retrieval effectiveness of several queries. While a selective application of PRF can potentially alleviate this issue, previous approaches have largely relied on unsupervised or feature-based learning to determine whether a query should be expanded. In contrast, we revisit the problem of selective PRF from a deep learning perspective, presenting a model that is entirely data-driven and trained in an end-to-end manner. The proposed model leverages a transformer-based bi-encoder architecture. Additionally, to further improve retrieval effectiveness with this selective PRF approach, we make use of the model's confidence estimates to combine the information from the original and expanded queries. In our experiments, we apply this selective feedback on a number of different combinations of ranking and feedback models, and show that our proposed approach consistently improves retrieval effectiveness for both sparse and dense ranking models, with the feedback models being either sparse, dense or generative.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"56 1","pages":"189-204"},"PeriodicalIF":0.0,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140502180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ranking Heterogeneous Search Result Pages using the Interactive Probability Ranking Principle 利用交互式概率排序原则对异构搜索结果页面进行排序
Pub Date : 2024-01-16 DOI: 10.1007/978-3-031-56060-6_7
Kanaad Pathak, Leif Azzopardi, Martin Halvey
{"title":"Ranking Heterogeneous Search Result Pages using the Interactive Probability Ranking Principle","authors":"Kanaad Pathak, Leif Azzopardi, Martin Halvey","doi":"10.1007/978-3-031-56060-6_7","DOIUrl":"https://doi.org/10.1007/978-3-031-56060-6_7","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"14 10","pages":"96-110"},"PeriodicalIF":0.0,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140506104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Reproducibility Study of Goldilocks: Just-Right Tuning of BERT for TAR 金发姑娘的可重复性研究:为 TAR 对 BERT 进行恰到好处的调整
Pub Date : 2024-01-16 DOI: 10.48550/arXiv.2401.08104
Xinyu Mao, B. Koopman, G. Zuccon
Screening documents is a tedious and time-consuming aspect of high-recall retrieval tasks, such as compiling a systematic literature review, where the goal is to identify all relevant documents for a topic. To help streamline this process, many Technology-Assisted Review (TAR) methods leverage active learning techniques to reduce the number of documents requiring review. BERT-based models have shown high effectiveness in text classification, leading to interest in their potential use in TAR workflows. In this paper, we investigate recent work that examined the impact of further pre-training epochs on the effectiveness and efficiency of a BERT-based active learning pipeline. We first report that we could replicate the original experiments on two specific TAR datasets, confirming some of the findings: importantly, that further pre-training is critical to high effectiveness, but requires attention in terms of selecting the correct training epoch. We then investigate the generalisability of the pipeline on a different TAR task, that of medical systematic reviews. In this context, we show that there is no need for further pre-training if a domain-specific BERT backbone is used within the active learning pipeline. This finding provides practical implications for using the studied active learning pipeline within domain-specific TAR tasks.
筛选文档是高检索任务(如编纂系统性文献综述)的一个繁琐而耗时的环节,其目标是识别某一主题的所有相关文档。为了帮助简化这一过程,许多技术辅助审查(TAR)方法利用主动学习技术来减少需要审查的文档数量。基于 BERT 的模型在文本分类中表现出了很高的效率,从而引起了人们对其在 TAR 工作流中的潜在应用的兴趣。在本文中,我们研究了最近的一项工作,该工作考察了进一步预训练历时对基于 BERT 的主动学习管道的有效性和效率的影响。我们首先报告说,我们可以在两个特定的 TAR 数据集上重复原来的实验,从而证实了一些发现:重要的是,进一步的预训练对高效率至关重要,但需要注意选择正确的训练历元。然后,我们在不同的 TAR 任务(即医学系统综述)上研究了该管道的通用性。在这种情况下,我们发现,如果在主动学习管道中使用特定领域的 BERT 骨干,就不需要进一步的预训练。这一发现为在特定领域的 TAR 任务中使用所研究的主动学习管道提供了实际意义。
{"title":"A Reproducibility Study of Goldilocks: Just-Right Tuning of BERT for TAR","authors":"Xinyu Mao, B. Koopman, G. Zuccon","doi":"10.48550/arXiv.2401.08104","DOIUrl":"https://doi.org/10.48550/arXiv.2401.08104","url":null,"abstract":"Screening documents is a tedious and time-consuming aspect of high-recall retrieval tasks, such as compiling a systematic literature review, where the goal is to identify all relevant documents for a topic. To help streamline this process, many Technology-Assisted Review (TAR) methods leverage active learning techniques to reduce the number of documents requiring review. BERT-based models have shown high effectiveness in text classification, leading to interest in their potential use in TAR workflows. In this paper, we investigate recent work that examined the impact of further pre-training epochs on the effectiveness and efficiency of a BERT-based active learning pipeline. We first report that we could replicate the original experiments on two specific TAR datasets, confirming some of the findings: importantly, that further pre-training is critical to high effectiveness, but requires attention in terms of selecting the correct training epoch. We then investigate the generalisability of the pipeline on a different TAR task, that of medical systematic reviews. In this context, we show that there is no need for further pre-training if a domain-specific BERT backbone is used within the active learning pipeline. This finding provides practical implications for using the studied active learning pipeline within domain-specific TAR tasks.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"78 1","pages":"132-146"},"PeriodicalIF":0.0,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140505786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revealing the Hidden Impact of Top-N Metrics on Optimization in Recommender Systems 揭示 Top-N 指标对推荐系统优化的隐性影响
Pub Date : 2024-01-16 DOI: 10.48550/arXiv.2401.08444
Lukas Wegmeth, Tobias Vente, Lennart Purucker
The hyperparameters of recommender systems for top-n predictions are typically optimized to enhance the predictive performance of algorithms. Thereby, the optimization algorithm, e.g., grid search or random search, searches for the best hyperparameter configuration according to an optimization-target metric, like nDCG or Precision. In contrast, the optimized algorithm, internally optimizes a different loss function during training, like squared error or cross-entropy. To tackle this discrepancy, recent work focused on generating loss functions better suited for recommender systems. Yet, when evaluating an algorithm using a top-n metric during optimization, another discrepancy between the optimization-target metric and the training loss has so far been ignored. During optimization, the top-n items are selected for computing a top-n metric; ignoring that the top-n items are selected from the recommendations of a model trained with an entirely different loss function. Item recommendations suitable for optimization-target metrics could be outside the top-n recommended items; hiddenly impacting the optimization performance. Therefore, we were motivated to analyze whether the top-n items are optimal for optimization-target top-n metrics. In pursuit of an answer, we exhaustively evaluate the predictive performance of 250 selection strategies besides selecting the top-n. We extensively evaluate each selection strategy over twelve implicit feedback and eight explicit feedback data sets with eleven recommender systems algorithms. Our results show that there exist selection strategies other than top-n that increase predictive performance for various algorithms and recommendation domains. However, the performance of the top ~43% of selection strategies is not significantly different. We discuss the impact of our findings on optimization and re-ranking in recommender systems and feasible solutions.
顶级预测推荐系统的超参数一般都经过优化,以提高算法的预测性能。因此,优化算法(如网格搜索或随机搜索)会根据优化目标指标(如 nDCG 或精度)搜索最佳超参数配置。相比之下,优化算法在训练过程中会对不同的损失函数(如平方误差或交叉熵)进行内部优化。为了解决这一差异,最近的工作重点是生成更适合推荐系统的损失函数。然而,在优化过程中使用前 N 项指标对算法进行评估时,优化目标指标与训练损失之间的另一个差异至今仍被忽视。在优化过程中,前 n 个项目是为了计算前 n 个指标而选择的;而忽略了前 n 个项目是从用完全不同的损失函数训练的模型的推荐中选择的。适合优化目标指标的项目推荐可能在前 n 个推荐项目之外,从而对优化性能产生隐性影响。因此,我们有动力分析前 n 个项目是否是优化目标前 n 个指标的最优项目。为了找到答案,我们对 250 种选择策略的预测性能进行了详尽评估。我们使用 11 种推荐系统算法,对 12 个隐式反馈数据集和 8 个显式反馈数据集的每种选择策略进行了广泛评估。我们的结果表明,除了 top-n 之外,还有其他选择策略可以提高各种算法和推荐领域的预测性能。然而,前约 43% 的选择策略的性能并无显著差异。我们讨论了我们的发现对推荐系统优化和重新排序的影响以及可行的解决方案。
{"title":"Revealing the Hidden Impact of Top-N Metrics on Optimization in Recommender Systems","authors":"Lukas Wegmeth, Tobias Vente, Lennart Purucker","doi":"10.48550/arXiv.2401.08444","DOIUrl":"https://doi.org/10.48550/arXiv.2401.08444","url":null,"abstract":"The hyperparameters of recommender systems for top-n predictions are typically optimized to enhance the predictive performance of algorithms. Thereby, the optimization algorithm, e.g., grid search or random search, searches for the best hyperparameter configuration according to an optimization-target metric, like nDCG or Precision. In contrast, the optimized algorithm, internally optimizes a different loss function during training, like squared error or cross-entropy. To tackle this discrepancy, recent work focused on generating loss functions better suited for recommender systems. Yet, when evaluating an algorithm using a top-n metric during optimization, another discrepancy between the optimization-target metric and the training loss has so far been ignored. During optimization, the top-n items are selected for computing a top-n metric; ignoring that the top-n items are selected from the recommendations of a model trained with an entirely different loss function. Item recommendations suitable for optimization-target metrics could be outside the top-n recommended items; hiddenly impacting the optimization performance. Therefore, we were motivated to analyze whether the top-n items are optimal for optimization-target top-n metrics. In pursuit of an answer, we exhaustively evaluate the predictive performance of 250 selection strategies besides selecting the top-n. We extensively evaluate each selection strategy over twelve implicit feedback and eight explicit feedback data sets with eleven recommender systems algorithms. Our results show that there exist selection strategies other than top-n that increase predictive performance for various algorithms and recommendation domains. However, the performance of the top ~43% of selection strategies is not significantly different. We discuss the impact of our findings on optimization and re-ranking in recommender systems and feasible solutions.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"51 5","pages":"140-156"},"PeriodicalIF":0.0,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140506371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPT 使用 GPT 生成与语言无关的 MCQ 的新型多阶段提示方法
Pub Date : 2024-01-13 DOI: 10.48550/arXiv.2401.07098
S. Maity, Aniket Deroy, Sudeshna Sarkar
We introduce a multi-stage prompting approach (MSP) for the generation of multiple choice questions (MCQs), harnessing the capabilities of GPT models such as text-davinci-003 and GPT-4, renowned for their excellence across various NLP tasks. Our approach incorporates the innovative concept of chain-of-thought prompting, a progressive technique in which the GPT model is provided with a series of interconnected cues to guide the MCQ generation process. Automated evaluations consistently demonstrate the superiority of our proposed MSP method over the traditional single-stage prompting (SSP) baseline, resulting in the production of high-quality distractors. Furthermore, the one-shot MSP technique enhances automatic evaluation results, contributing to improved distractor generation in multiple languages, including English, German, Bengali, and Hindi. In human evaluations, questions generated using our approach exhibit superior levels of grammaticality, answerability, and difficulty, highlighting its efficacy in various languages.
我们介绍了一种用于生成多选题(MCQ)的多阶段提示方法(MSP),它利用了文本-davinci-003 和 GPT-4 等 GPT 模型的能力,这些模型因其在各种 NLP 任务中的卓越表现而闻名。我们的方法采用了思维链提示的创新概念,这是一种渐进技术,为 GPT 模型提供了一系列相互关联的提示,以指导 MCQ 生成过程。自动评估结果一致表明,我们提出的 MSP 方法优于传统的单阶段提示 (SSP),能生成高质量的干扰项。此外,一次性 MSP 技术还增强了自动评估结果,有助于改进多种语言(包括英语、德语、孟加拉语和印地语)的干扰项生成。在人工评估中,使用我们的方法生成的问题在语法、可回答性和难度方面都表现出了卓越的水平,突出了它在各种语言中的功效。
{"title":"A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPT","authors":"S. Maity, Aniket Deroy, Sudeshna Sarkar","doi":"10.48550/arXiv.2401.07098","DOIUrl":"https://doi.org/10.48550/arXiv.2401.07098","url":null,"abstract":"We introduce a multi-stage prompting approach (MSP) for the generation of multiple choice questions (MCQs), harnessing the capabilities of GPT models such as text-davinci-003 and GPT-4, renowned for their excellence across various NLP tasks. Our approach incorporates the innovative concept of chain-of-thought prompting, a progressive technique in which the GPT model is provided with a series of interconnected cues to guide the MCQ generation process. Automated evaluations consistently demonstrate the superiority of our proposed MSP method over the traditional single-stage prompting (SSP) baseline, resulting in the production of high-quality distractors. Furthermore, the one-shot MSP technique enhances automatic evaluation results, contributing to improved distractor generation in multiple languages, including English, German, Bengali, and Hindi. In human evaluations, questions generated using our approach exhibit superior levels of grammaticality, answerability, and difficulty, highlighting its efficacy in various languages.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"2 4","pages":"268-277"},"PeriodicalIF":0.0,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140509123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CrisisKAN: Knowledge-infused and Explainable Multimodal Attention Network for Crisis Event Classification CrisisKAN:用于危机事件分类的注入知识且可解释的多模态注意力网络
Pub Date : 2024-01-11 DOI: 10.48550/arXiv.2401.06194
Shubham Gupta, Nandini Saini, Suman Kundu, Debasis Das
Pervasive use of social media has become the emerging source for real-time information (like images, text, or both) to identify various events. Despite the rapid growth of image and text-based event classification, the state-of-the-art (SOTA) models find it challenging to bridge the semantic gap between features of image and text modalities due to inconsistent encoding. Also, the black-box nature of models fails to explain the model's outcomes for building trust in high-stakes situations such as disasters, pandemic. Additionally, the word limit imposed on social media posts can potentially introduce bias towards specific events. To address these issues, we proposed CrisisKAN, a novel Knowledge-infused and Explainable Multimodal Attention Network that entails images and texts in conjunction with external knowledge from Wikipedia to classify crisis events. To enrich the context-specific understanding of textual information, we integrated Wikipedia knowledge using proposed wiki extraction algorithm. Along with this, a guided cross-attention module is implemented to fill the semantic gap in integrating visual and textual data. In order to ensure reliability, we employ a model-specific approach called Gradient-weighted Class Activation Mapping (Grad-CAM) that provides a robust explanation of the predictions of the proposed model. The comprehensive experiments conducted on the CrisisMMD dataset yield in-depth analysis across various crisis-specific tasks and settings. As a result, CrisisKAN outperforms existing SOTA methodologies and provides a novel view in the domain of explainable multimodal event classification.
社交媒体的广泛使用已成为识别各种事件的实时信息(如图像、文本或两者)的新兴来源。尽管基于图像和文本的事件分类迅速发展,但由于编码不一致,最先进的(SOTA)模型在弥合图像和文本模式特征之间的语义鸿沟方面面临挑战。此外,模型的黑箱性质也无法解释模型在灾难、大流行等高风险情况下建立信任的结果。此外,社交媒体帖子的字数限制可能会对特定事件产生偏见。为了解决这些问题,我们提出了 CrisisKAN,这是一种新颖的知识注入和可解释的多模态注意力网络,它将图像和文本与维基百科的外部知识相结合,对危机事件进行分类。为了丰富对文本信息的特定语境理解,我们利用提出的维基提取算法整合了维基百科知识。与此同时,我们还实施了一个引导式交叉关注模块,以填补整合视觉和文本数据时的语义空白。为了确保可靠性,我们采用了一种称为 "梯度加权类激活映射"(Gradient-weighted Class Activation Mapping,Grad-CAM)的特定模型方法,该方法可对所提模型的预测结果进行稳健的解释。在 CrisisMMD 数据集上进行的综合实验对各种危机特定任务和设置进行了深入分析。因此,CrisisKAN 优于现有的 SOTA 方法,并在可解释多模态事件分类领域提供了一种新的视角。
{"title":"CrisisKAN: Knowledge-infused and Explainable Multimodal Attention Network for Crisis Event Classification","authors":"Shubham Gupta, Nandini Saini, Suman Kundu, Debasis Das","doi":"10.48550/arXiv.2401.06194","DOIUrl":"https://doi.org/10.48550/arXiv.2401.06194","url":null,"abstract":"Pervasive use of social media has become the emerging source for real-time information (like images, text, or both) to identify various events. Despite the rapid growth of image and text-based event classification, the state-of-the-art (SOTA) models find it challenging to bridge the semantic gap between features of image and text modalities due to inconsistent encoding. Also, the black-box nature of models fails to explain the model's outcomes for building trust in high-stakes situations such as disasters, pandemic. Additionally, the word limit imposed on social media posts can potentially introduce bias towards specific events. To address these issues, we proposed CrisisKAN, a novel Knowledge-infused and Explainable Multimodal Attention Network that entails images and texts in conjunction with external knowledge from Wikipedia to classify crisis events. To enrich the context-specific understanding of textual information, we integrated Wikipedia knowledge using proposed wiki extraction algorithm. Along with this, a guided cross-attention module is implemented to fill the semantic gap in integrating visual and textual data. In order to ensure reliability, we employ a model-specific approach called Gradient-weighted Class Activation Mapping (Grad-CAM) that provides a robust explanation of the predictions of the proposed model. The comprehensive experiments conducted on the CrisisMMD dataset yield in-depth analysis across various crisis-specific tasks and settings. As a result, CrisisKAN outperforms existing SOTA methodologies and provides a novel view in the domain of explainable multimodal event classification.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"6 2","pages":"18-33"},"PeriodicalIF":0.0,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140509922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant 生态贤者助手:打造多模式植物护理对话助手
Pub Date : 2024-01-10 DOI: 10.48550/arXiv.2401.06807
Mohit Tomar, Abhisek Tiwari, Tulika Saha, Prince Jha, Sriparna Saha
In recent times, there has been an increasing awareness about imminent environmental challenges, resulting in people showing a stronger dedication to taking care of the environment and nurturing green life. The current $19.6 billion indoor gardening industry, reflective of this growing sentiment, not only signifies a monetary value but also speaks of a profound human desire to reconnect with the natural world. However, several recent surveys cast a revealing light on the fate of plants within our care, with more than half succumbing primarily due to the silent menace of improper care. Thus, the need for accessible expertise capable of assisting and guiding individuals through the intricacies of plant care has become paramount more than ever. In this work, we make the very first attempt at building a plant care assistant, which aims to assist people with plant(-ing) concerns through conversations. We propose a plant care conversational dataset named Plantational, which contains around 1K dialogues between users and plant care experts. Our end-to-end proposed approach is two-fold : (i) We first benchmark the dataset with the help of various large language models (LLMs) and visual language model (VLM) by studying the impact of instruction tuning (zero-shot and few-shot prompting) and fine-tuning techniques on this task; (ii) finally, we build EcoSage, a multi-modal plant care assisting dialogue generation framework, incorporating an adapter-based modality infusion using a gated mechanism. We performed an extensive examination (both automated and manual evaluation) of the performance exhibited by various LLMs and VLM in the generation of the domain-specific dialogue responses to underscore the respective strengths and weaknesses of these diverse models.
近来,人们越来越意识到迫在眉睫的环境挑战,从而表现出更强烈的爱护环境和培育绿色生活的意愿。目前,价值 196 亿美元的室内园艺产业反映了这种日益增长的情感,它不仅象征着金钱价值,也表达了人类重新与自然世界建立联系的深切愿望。然而,最近的几项调查揭示了我们所照料的植物的命运,一半以上的植物主要由于照料不当而无声无息地死去。因此,我们比以往任何时候都更需要能够帮助和指导人们了解植物养护的复杂性的专业知识。在这项工作中,我们首次尝试建立一个植物护理助手,旨在通过对话帮助人们解决植物护理问题。我们提出了一个名为 Plantational 的植物护理对话数据集,其中包含用户与植物护理专家之间的约 1K 条对话。我们提出的端到端方法包括两个方面:(i) 我们首先借助各种大型语言模型(LLM)和视觉语言模型(VLM)对数据集进行基准测试,研究指令调整(零镜头和少镜头提示)和微调技术对这项任务的影响;(ii) 最后,我们建立了一个多模式植物护理辅助对话生成框架 EcoSage,该框架利用门控机制整合了基于适配器的模态注入。我们对各种 LLM 和 VLM 在生成特定领域对话回复时的表现进行了广泛检查(包括自动和人工评估),以强调这些不同模型各自的优缺点。
{"title":"An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant","authors":"Mohit Tomar, Abhisek Tiwari, Tulika Saha, Prince Jha, Sriparna Saha","doi":"10.48550/arXiv.2401.06807","DOIUrl":"https://doi.org/10.48550/arXiv.2401.06807","url":null,"abstract":"In recent times, there has been an increasing awareness about imminent environmental challenges, resulting in people showing a stronger dedication to taking care of the environment and nurturing green life. The current $19.6 billion indoor gardening industry, reflective of this growing sentiment, not only signifies a monetary value but also speaks of a profound human desire to reconnect with the natural world. However, several recent surveys cast a revealing light on the fate of plants within our care, with more than half succumbing primarily due to the silent menace of improper care. Thus, the need for accessible expertise capable of assisting and guiding individuals through the intricacies of plant care has become paramount more than ever. In this work, we make the very first attempt at building a plant care assistant, which aims to assist people with plant(-ing) concerns through conversations. We propose a plant care conversational dataset named Plantational, which contains around 1K dialogues between users and plant care experts. Our end-to-end proposed approach is two-fold : (i) We first benchmark the dataset with the help of various large language models (LLMs) and visual language model (VLM) by studying the impact of instruction tuning (zero-shot and few-shot prompting) and fine-tuning techniques on this task; (ii) finally, we build EcoSage, a multi-modal plant care assisting dialogue generation framework, incorporating an adapter-based modality infusion using a gated mechanism. We performed an extensive examination (both automated and manual evaluation) of the performance exhibited by various LLMs and VLM in the generation of the domain-specific dialogue responses to underscore the respective strengths and weaknesses of these diverse models.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"47 15","pages":"318-332"},"PeriodicalIF":0.0,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Living Lab Evaluation for Life and Social Sciences Search Platforms - LiLAS at CLEF 2021 生活实验室评估生命和社会科学搜索平台- LiLAS在CLEF 2021
Pub Date : 2023-10-05 DOI: 10.1007/978-3-030-72240-1_77
Philipp Schaer, Johann Schaible, Leyla Jael García Castro
{"title":"Living Lab Evaluation for Life and Social Sciences Search Platforms - LiLAS at CLEF 2021","authors":"Philipp Schaer, Johann Schaible, Leyla Jael García Castro","doi":"10.1007/978-3-030-72240-1_77","DOIUrl":"https://doi.org/10.1007/978-3-030-72240-1_77","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128775726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Theoretical Analysis on the Efficiency of Interleaved Comparisons 交错比较效率的理论分析
Pub Date : 2023-05-31 DOI: 10.1007/978-3-031-28244-7_29
Kojiro Iizuka, Hajime Morita, Makoto P. Kato
{"title":"Theoretical Analysis on the Efficiency of Interleaved Comparisons","authors":"Kojiro Iizuka, Hajime Morita, Makoto P. Kato","doi":"10.1007/978-3-031-28244-7_29","DOIUrl":"https://doi.org/10.1007/978-3-031-28244-7_29","url":null,"abstract":"","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125559096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-Preserving Fair Item Ranking 隐私保护公平项目排名
Pub Date : 2023-03-06 DOI: 10.48550/arXiv.2303.02916
Jiajun Sun, Sikha Pentyala, Martine De Cock, G. Farnadi
Users worldwide access massive amounts of curated data in the form of rankings on a daily basis. The societal impact of this ease of access has been studied and work has been done to propose and enforce various notions of fairness in rankings. Current computational methods for fair item ranking rely on disclosing user data to a centralized server, which gives rise to privacy concerns for the users. This work is the first to advance research at the conjunction of producer (item) fairness and consumer (user) privacy in rankings by exploring the incorporation of privacy-preserving techniques; specifically, differential privacy and secure multi-party computation. Our work extends the equity of amortized attention ranking mechanism to be privacy-preserving, and we evaluate its effects with respect to privacy, fairness, and ranking quality. Our results using real-world datasets show that we are able to effectively preserve the privacy of users and mitigate unfairness of items without making additional sacrifices to the quality of rankings in comparison to the ranking mechanism in the clear.
世界各地的用户每天都会以排名的形式访问大量的精心策划的数据。人们已经研究了这种便利的社会影响,并提出并实施了各种公平排名的概念。目前公平项目排名的计算方法依赖于将用户数据披露给一个集中的服务器,这给用户带来了隐私问题。这项工作是第一个通过探索隐私保护技术的结合来推进生产者(项目)公平和消费者(用户)隐私在排名中的结合研究;具体来说,差分隐私和安全多方计算。我们的工作将平摊注意力排序机制的公平性扩展到隐私保护,并从隐私、公平性和排序质量方面评估其效果。我们使用真实世界数据集的结果表明,我们能够有效地保护用户的隐私,减轻项目的不公平,而不会对排名的质量做出额外的牺牲,与清晰的排名机制相比。
{"title":"Privacy-Preserving Fair Item Ranking","authors":"Jiajun Sun, Sikha Pentyala, Martine De Cock, G. Farnadi","doi":"10.48550/arXiv.2303.02916","DOIUrl":"https://doi.org/10.48550/arXiv.2303.02916","url":null,"abstract":"Users worldwide access massive amounts of curated data in the form of rankings on a daily basis. The societal impact of this ease of access has been studied and work has been done to propose and enforce various notions of fairness in rankings. Current computational methods for fair item ranking rely on disclosing user data to a centralized server, which gives rise to privacy concerns for the users. This work is the first to advance research at the conjunction of producer (item) fairness and consumer (user) privacy in rankings by exploring the incorporation of privacy-preserving techniques; specifically, differential privacy and secure multi-party computation. Our work extends the equity of amortized attention ranking mechanism to be privacy-preserving, and we evaluate its effects with respect to privacy, fairness, and ranking quality. Our results using real-world datasets show that we are able to effectively preserve the privacy of users and mitigate unfairness of items without making additional sacrifices to the quality of rankings in comparison to the ranking mechanism in the clear.","PeriodicalId":126309,"journal":{"name":"European Conference on Information Retrieval","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124232358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
European Conference on Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1