arXiv - CS - Information Retrieval最新文献_第2页

Designing Interfaces for Multimodal Vector Search Applications 为多模式矢量搜索应用设计界面

arXiv - CS - Information Retrieval

Pub Date : 2024-09-18 DOI: arxiv-2409.11629

Owen Pendrigh Elliott, Tom Hamer, Jesse Clark

Multimodal vector search offers a new paradigm for information retrieval byexposing numerous pieces of functionality which are not possible in traditionallexical search engines. While multimodal vector search can be treated as a dropin replacement for these traditional systems, the experience can besignificantly enhanced by leveraging the unique capabilities of multimodalsearch. Central to any information retrieval system is a user who expresses aninformation need, traditional user interfaces with a single search bar allowusers to interact with lexical search systems effectively however are notnecessarily optimal for multimodal vector search. In this paper we explorenovel capabilities of multimodal vector search applications utilising CLIPmodels and present implementations and design patterns which better allow usersto express their information needs and effectively interact with these systemsin an information retrieval context.

多模态矢量搜索为信息检索提供了一种新的范式，它展示了许多传统传统搜索引擎无法实现的功能。虽然多模态矢量搜索可以被视为这些传统系统的直接替代品，但利用多模态搜索的独特功能可以显著增强用户体验。任何信息检索系统的核心都是表达信息需求的用户，传统的用户界面只有一个搜索栏，允许用户与词法搜索系统进行有效交互，但对于多模态矢量搜索来说，这并不一定是最佳的。在本文中，我们探讨了利用 CLIP 模型的多模态矢量搜索应用程序的新功能，并介绍了实现方法和设计模式，这些方法和模式能让用户更好地表达他们的信息需求，并在信息检索环境中与这些系统进行有效交互。

引用次数: 0

GenCRF: Generative Clustering and Reformulation Framework for Enhanced Intent-Driven Information Retrieval GenCRF：用于增强型意图驱动信息检索的生成聚类和重构框架

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.10909

Wonduk Seo, Haojie Zhang, Yueyang Zhang, Changhao Zhang, Songyao Duan, Lixin Su, Daiting Shi, Jiashu Zhao, Dawei Yin

Query reformulation is a well-known problem in Information Retrieval (IR)aimed at enhancing single search successful completion rate by automaticallymodifying user's input query. Recent methods leverage Large Language Models(LLMs) to improve query reformulation, but often generate limited and redundantexpansions, potentially constraining their effectiveness in capturing diverseintents. In this paper, we propose GenCRF: a Generative Clustering andReformulation Framework to capture diverse intentions adaptively based onmultiple differentiated, well-generated queries in the retrieval phase for thefirst time. GenCRF leverages LLMs to generate variable queries from the initialquery using customized prompts, then clusters them into groups to distinctlyrepresent diverse intents. Furthermore, the framework explores to combinediverse intents query with innovative weighted aggregation strategies tooptimize retrieval performance and crucially integrates a novel QueryEvaluation Rewarding Model (QERM) to refine the process through feedback loops.Empirical experiments on the BEIR benchmark demonstrate that GenCRF achievesstate-of-the-art performance, surpassing previous query reformulation SOTAs byup to 12% on nDCG@10. These techniques can be adapted to various LLMs,significantly boosting retriever performance and advancing the field ofInformation Retrieval.

查询重拟是信息检索（IR）领域的一个著名问题，旨在通过自动修改用户的输入查询来提高单次搜索的成功完成率。最近的方法利用大语言模型（LLMs）来改进查询重构，但通常会生成有限的冗余扩展，这可能会限制其捕捉不同意图的有效性。在本文中，我们首次提出了 GenCRF：一种生成聚类和重构框架，可在检索阶段根据多个有区别的、生成良好的查询自适应地捕捉多样化意图。GenCRF 利用 LLM 从使用定制提示的初始查询中生成可变查询，然后将它们聚类成组，以鲜明地代表不同的意图。此外，该框架还利用创新的加权聚合策略来组合不同的意图查询，以优化检索性能，并集成了新颖的查询评估奖励模型（QERM），通过反馈循环来完善这一过程。在 BEIR 基准上进行的实证实验证明，GenCRF 实现了最先进的性能，在 nDCG@10 上超越了之前的查询重构 SOTAs 多达 12%。这些技术可适用于各种 LLM，大大提高了检索器的性能，推动了信息检索领域的发展。

{"title":"GenCRF: Generative Clustering and Reformulation Framework for Enhanced Intent-Driven Information Retrieval","authors":"Wonduk Seo, Haojie Zhang, Yueyang Zhang, Changhao Zhang, Songyao Duan, Lixin Su, Daiting Shi, Jiashu Zhao, Dawei Yin","doi":"arxiv-2409.10909","DOIUrl":"https://doi.org/arxiv-2409.10909","url":null,"abstract":"Query reformulation is a well-known problem in Information Retrieval (IR)\u0000aimed at enhancing single search successful completion rate by automatically\u0000modifying user's input query. Recent methods leverage Large Language Models\u0000(LLMs) to improve query reformulation, but often generate limited and redundant\u0000expansions, potentially constraining their effectiveness in capturing diverse\u0000intents. In this paper, we propose GenCRF: a Generative Clustering and\u0000Reformulation Framework to capture diverse intentions adaptively based on\u0000multiple differentiated, well-generated queries in the retrieval phase for the\u0000first time. GenCRF leverages LLMs to generate variable queries from the initial\u0000query using customized prompts, then clusters them into groups to distinctly\u0000represent diverse intents. Furthermore, the framework explores to combine\u0000diverse intents query with innovative weighted aggregation strategies to\u0000optimize retrieval performance and crucially integrates a novel Query\u0000Evaluation Rewarding Model (QERM) to refine the process through feedback loops.\u0000Empirical experiments on the BEIR benchmark demonstrate that GenCRF achieves\u0000state-of-the-art performance, surpassing previous query reformulation SOTAs by\u0000up to 12% on nDCG@10. These techniques can be adapted to various LLMs,\u0000significantly boosting retriever performance and advancing the field of\u0000Information Retrieval.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search 改善求职匹配预测和互惠推荐的最佳方法

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.10992

Shuhei Goda, Yudai Hayashi, Yuta Saito

Matching users with mutual preferences is a critical aspect of servicesdriven by reciprocal recommendations, such as job search. To producerecommendations in such scenarios, one can predict match probabilities andconstruct rankings based on these predictions. However, this direct matchprediction approach often underperforms due to the extreme sparsity of matchlabels. Therefore, most existing methods predict preferences separately foreach direction (e.g., job seeker to employer and employer to job seeker) andthen aggregate the predictions to generate overall matching scores and producerecommendations. However, this typical approach often leads to practicalissues, such as biased error propagation between the two models. This paperintroduces and demonstrates a novel and practical solution to improvereciprocal recommendations in production by leveraging textit{pseudo-matchscores}. Specifically, our approach generates dense and more directly relevantpseudo-match scores by combining the true match labels, which are accurate butsparse, with relatively inaccurate but dense match predictions. We then train ameta-model to output the final match predictions by minimizing the predictionloss against the pseudo-match scores. Our method can be seen as atextbf{best-of-both (BoB) approach}, as it combines the high-level ideas ofboth direct match prediction and the two separate models approach. It alsoallows for user-specific weights to construct textit{personalized}pseudo-match scores, achieving even better matching performance throughappropriate tuning of the weights. Offline experiments on real-world job searchdata demonstrate the superior performance of our BoB method, particularly withpersonalized pseudo-match scores, compared to existing approaches in terms offinding potential matches.

匹配具有共同偏好的用户是求职等以互惠推荐为驱动的服务的一个重要方面。为了在这种情况下生成推荐，我们可以预测匹配概率，并根据这些预测构建排名。然而，由于匹配标签极其稀少，这种直接的匹配预测方法往往表现不佳。因此，现有的大多数方法都是分别预测每个方向（例如求职者对雇主和雇主对求职者）的偏好，然后汇总预测结果，生成总体匹配分数并提出建议。然而，这种典型的方法往往会导致一些实际问题，比如两个模型之间有偏差的误差传播。本文介绍并演示了一种新颖实用的解决方案，即利用文本{伪匹配分数}来改进生产中的互惠推荐。具体来说，我们的方法通过将准确但稀疏的真实匹配标签与相对不准确但密集的匹配预测相结合，生成密集且更直接相关的伪匹配分数。然后，我们训练一个模型，通过最小化与伪匹配分数的预测损失来输出最终的匹配预测结果。我们的方法可以看作是一种文本方法，因为它结合了直接匹配预测和两个独立模型方法的高层次思想。它还允许使用用户特定的权重来构建 "文本{个性化}伪匹配分数"，从而通过对权重的适当调整获得更好的匹配性能。在真实世界的求职数据上进行的离线实验证明，与现有方法相比，我们的BoB方法（尤其是使用个性化伪匹配分数的方法）在寻找潜在匹配者方面具有更优越的性能。

{"title":"A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search","authors":"Shuhei Goda, Yudai Hayashi, Yuta Saito","doi":"arxiv-2409.10992","DOIUrl":"https://doi.org/arxiv-2409.10992","url":null,"abstract":"Matching users with mutual preferences is a critical aspect of services\u0000driven by reciprocal recommendations, such as job search. To produce\u0000recommendations in such scenarios, one can predict match probabilities and\u0000construct rankings based on these predictions. However, this direct match\u0000prediction approach often underperforms due to the extreme sparsity of match\u0000labels. Therefore, most existing methods predict preferences separately for\u0000each direction (e.g., job seeker to employer and employer to job seeker) and\u0000then aggregate the predictions to generate overall matching scores and produce\u0000recommendations. However, this typical approach often leads to practical\u0000issues, such as biased error propagation between the two models. This paper\u0000introduces and demonstrates a novel and practical solution to improve\u0000reciprocal recommendations in production by leveraging textit{pseudo-match\u0000scores}. Specifically, our approach generates dense and more directly relevant\u0000pseudo-match scores by combining the true match labels, which are accurate but\u0000sparse, with relatively inaccurate but dense match predictions. We then train a\u0000meta-model to output the final match predictions by minimizing the prediction\u0000loss against the pseudo-match scores. Our method can be seen as a\u0000textbf{best-of-both (BoB) approach}, as it combines the high-level ideas of\u0000both direct match prediction and the two separate models approach. It also\u0000allows for user-specific weights to construct textit{personalized}\u0000pseudo-match scores, achieving even better matching performance through\u0000appropriate tuning of the weights. Offline experiments on real-world job search\u0000data demonstrate the superior performance of our BoB method, particularly with\u0000personalized pseudo-match scores, compared to existing approaches in terms of\u0000finding potential matches.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction Attention-Seeker：无监督关键词提取的动态自我注意力评分

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.10907

Erwin D. López Z., Cheng Tang, Atsushi Shimada

This paper proposes Attention-Seeker, an unsupervised keyphrase extractionmethod that leverages self-attention maps from a Large Language Model toestimate the importance of candidate phrases. Our approach identifies specificcomponents - such as layers, heads, and attention vectors - where the modelpays significant attention to the key topics of the text. The attention weightsprovided by these components are then used to score the candidate phrases.Unlike previous models that require manual tuning of parameters (e.g.,selection of heads, prompts, hyperparameters), Attention-Seeker dynamicallyadapts to the input text without any manual adjustments, enhancing itspractical applicability. We evaluate Attention-Seeker on four publiclyavailable datasets: Inspec, SemEval2010, SemEval2017, and Krapivin. Our resultsdemonstrate that, even without parameter tuning, Attention-Seeker outperformsmost baseline models, achieving state-of-the-art performance on three out offour datasets, particularly excelling in extracting keyphrases from longdocuments.

本文提出的 Attention-Seeker 是一种无监督关键词提取方法，它利用大语言模型的自我注意力图来估计候选短语的重要性。我们的方法可以识别特定的组件，如层、头和注意力向量，在这些组件中，模型对文本的关键主题给予了极大的关注。与以往需要手动调整参数（如选择头部、提示、超参数）的模型不同，Attention-Seeker 可动态适应输入文本，无需任何手动调整，从而增强了其实用性。我们在四个公开数据集上对 Attention-Seeker 进行了评估：Inspec、SemEval2010、SemEval2017 和 Krapivin。我们的结果表明，即使不调整参数，Attention-Seeker 的表现也优于大多数基线模型，在四个数据集中的三个数据集上取得了最先进的性能，尤其是在从长文档中提取关键词方面表现出色。

引用次数: 0

Challenging Fairness: A Comprehensive Exploration of Bias in LLM-Based Recommendations 挑战公平：全面探讨基于法律硕士的建议中的偏见

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.10825

Shahnewaz Karim Sakib, Anindya Bijoy Das

Large Language Model (LLM)-based recommendation systems provide morecomprehensive recommendations than traditional systems by deeply analyzingcontent and user behavior. However, these systems often exhibit biases,favoring mainstream content while marginalizing non-traditional options due toskewed training data. This study investigates the intricate relationshipbetween bias and LLM-based recommendation systems, with a focus on music, song,and book recommendations across diverse demographic and cultural groups.Through a comprehensive analysis conducted over different LLM-models, thispaper evaluates the impact of bias on recommendation outcomes. Our findingsreveal that bias is so deeply ingrained within these systems that even asimpler intervention like prompt engineering can significantly reduce bias,underscoring the pervasive nature of the issue. Moreover, factors likeintersecting identities and contextual information, such as socioeconomicstatus, further amplify these biases, demonstrating the complexity and depth ofthe challenges faced in creating fair recommendations across different groups.

基于大型语言模型（LLM）的推荐系统通过深入分析内容和用户行为，提供比传统系统更全面的推荐。然而，由于训练数据有偏差，这些系统往往会表现出偏差，偏向于主流内容，而将非传统选项边缘化。本研究调查了偏见与基于 LLM 的推荐系统之间错综复杂的关系，重点关注不同人口和文化群体的音乐、歌曲和书籍推荐。我们的研究结果表明，偏见在这些系统中根深蒂固，即使是提示工程这样简单的干预措施也能显著减少偏见，这凸显了问题的普遍性。此外，交叉身份和背景信息（如社会经济地位）等因素进一步扩大了这些偏见，这表明在不同群体中创建公平推荐所面临的挑战的复杂性和深度。

{"title":"Challenging Fairness: A Comprehensive Exploration of Bias in LLM-Based Recommendations","authors":"Shahnewaz Karim Sakib, Anindya Bijoy Das","doi":"arxiv-2409.10825","DOIUrl":"https://doi.org/arxiv-2409.10825","url":null,"abstract":"Large Language Model (LLM)-based recommendation systems provide more\u0000comprehensive recommendations than traditional systems by deeply analyzing\u0000content and user behavior. However, these systems often exhibit biases,\u0000favoring mainstream content while marginalizing non-traditional options due to\u0000skewed training data. This study investigates the intricate relationship\u0000between bias and LLM-based recommendation systems, with a focus on music, song,\u0000and book recommendations across diverse demographic and cultural groups.\u0000Through a comprehensive analysis conducted over different LLM-models, this\u0000paper evaluates the impact of bias on recommendation outcomes. Our findings\u0000reveal that bias is so deeply ingrained within these systems that even a\u0000simpler intervention like prompt engineering can significantly reduce bias,\u0000underscoring the pervasive nature of the issue. Moreover, factors like\u0000intersecting identities and contextual information, such as socioeconomic\u0000status, further amplify these biases, demonstrating the complexity and depth of\u0000the challenges faced in creating fair recommendations across different groups.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation 走向公平的 RAG：论检索增强生成中公平排序的影响

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.11598

To Eun Kim, Fernando Diaz

Many language models now enhance their responses with retrieval capabilities,leading to the widespread adoption of retrieval-augmented generation (RAG)systems. However, despite retrieval being a core component of RAG, much of theresearch in this area overlooks the extensive body of work on fair ranking,neglecting the importance of considering all stakeholders involved. This paperpresents the first systematic evaluation of RAG systems integrated with fairrankings. We focus specifically on measuring the fair exposure of each relevantitem across the rankings utilized by RAG systems (i.e., item-side fairness),aiming to promote equitable growth for relevant item providers. To gain a deepunderstanding of the relationship between item-fairness, ranking quality, andgeneration quality in the context of RAG, we analyze nine different RAG systemsthat incorporate fair rankings across seven distinct datasets. Our findingsindicate that RAG systems with fair rankings can maintain a high level ofgeneration quality and, in many cases, even outperform traditional RAG systems,despite the general trend of a tradeoff between ensuring fairness andmaintaining system-effectiveness. We believe our insights lay the groundworkfor responsible and equitable RAG systems and open new avenues for futureresearch. We publicly release our codebase and dataset athttps://github.com/kimdanny/Fair-RAG.

现在，许多语言模型都通过检索功能来增强其响应能力，这导致了检索增强生成系统（RAG）的广泛采用。然而，尽管检索是 RAG 的核心组成部分，该领域的许多研究却忽视了公平排名方面的大量工作，忽略了考虑所有利益相关者的重要性。本文首次系统地评估了与公平排名相结合的 RAG 系统。我们特别关注测量每个相关项目在 RAG 系统使用的排名中的公平曝光率（即项目方公平性），旨在促进相关项目提供者的公平增长。为了深入理解 RAG 中项目公平性、排名质量和生成质量之间的关系，我们分析了七个不同数据集中九种不同的 RAG 系统，这些系统都包含公平排名。我们的研究结果表明，具有公平排名的 RAG 系统可以保持较高的生成质量，在很多情况下甚至优于传统的 RAG 系统，尽管在确保公平性和保持系统有效性之间存在权衡的普遍趋势。我们相信，我们的见解为负责任和公平的 RAG 系统奠定了基础，并为未来的研究开辟了新途径。我们公开发布了我们的代码库和数据集，网址是：https://github.com/kimdanny/Fair-RAG。

{"title":"Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation","authors":"To Eun Kim, Fernando Diaz","doi":"arxiv-2409.11598","DOIUrl":"https://doi.org/arxiv-2409.11598","url":null,"abstract":"Many language models now enhance their responses with retrieval capabilities,\u0000leading to the widespread adoption of retrieval-augmented generation (RAG)\u0000systems. However, despite retrieval being a core component of RAG, much of the\u0000research in this area overlooks the extensive body of work on fair ranking,\u0000neglecting the importance of considering all stakeholders involved. This paper\u0000presents the first systematic evaluation of RAG systems integrated with fair\u0000rankings. We focus specifically on measuring the fair exposure of each relevant\u0000item across the rankings utilized by RAG systems (i.e., item-side fairness),\u0000aiming to promote equitable growth for relevant item providers. To gain a deep\u0000understanding of the relationship between item-fairness, ranking quality, and\u0000generation quality in the context of RAG, we analyze nine different RAG systems\u0000that incorporate fair rankings across seven distinct datasets. Our findings\u0000indicate that RAG systems with fair rankings can maintain a high level of\u0000generation quality and, in many cases, even outperform traditional RAG systems,\u0000despite the general trend of a tradeoff between ensuring fairness and\u0000maintaining system-effectiveness. We believe our insights lay the groundwork\u0000for responsible and equitable RAG systems and open new avenues for future\u0000research. We publicly release our codebase and dataset at\u0000https://github.com/kimdanny/Fair-RAG.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-modal Generative Models in Recommendation System 推荐系统中的多模式生成模型

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.10993

Arnau Ramisa, Rene Vidal, Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Mahesh Sathiamoorthy, Atoosa Kasrizadeh, Silvia Milano, Francesco Ricci

Many recommendation systems limit user inputs to text strings or behaviorsignals such as clicks and purchases, and system outputs to a list of productssorted by relevance. With the advent of generative AI, users have come toexpect richer levels of interactions. In visual search, for example, a user mayprovide a picture of their desired product along with a natural languagemodification of the content of the picture (e.g., a dress like the one shown inthe picture but in red color). Moreover, users may want to better understandthe recommendations they receive by visualizing how the product fits their usecase, e.g., with a representation of how a garment might look on them, or how afurniture item might look in their room. Such advanced levels of interactionrequire recommendation systems that are able to discover both shared andcomplementary information about the product across modalities, and visualizethe product in a realistic and informative way. However, existing systems oftentreat multiple modalities independently: text search is usually done bycomparing the user query to product titles and descriptions, while visualsearch is typically done by comparing an image provided by the customer toproduct images. We argue that future recommendation systems will benefit from amulti-modal understanding of the products that leverages the rich informationretailers have about both customers and products to come up with the bestrecommendations. In this chapter we review recommendation systems that usemultiple data modalities simultaneously.

许多推荐系统将用户输入限制为文本字符串或行为信号（如点击和购买），系统输出为按相关性排序的产品列表。随着生成式人工智能的出现，用户开始期待更丰富的交互。例如，在视觉搜索中，用户可能会提供一张所需的产品图片，并用自然语言对图片内容进行修改（例如，提供一件与图片中相似的红色连衣裙）。此外，用户可能希望通过可视化方式更好地理解所收到的推荐，例如，展示服装穿在身上的效果，或家具摆放在房间里的效果。这种高级别的交互要求推荐系统能够发现跨模式的产品共享信息和互补信息，并以逼真和信息丰富的方式将产品可视化。然而，现有的系统往往将多种模式分开处理：文本搜索通常是通过将用户查询与产品标题和描述进行比较来完成的，而视觉搜索通常是通过将客户提供的图片与产品图片进行比较来完成的。我们认为，未来的推荐系统将受益于对产品的多模式理解，这种理解可以利用零售商所拥有的关于顾客和产品的丰富信息，从而提出最佳推荐。在本章中，我们将回顾同时使用多种数据模式的推荐系统。

{"title":"Multi-modal Generative Models in Recommendation System","authors":"Arnau Ramisa, Rene Vidal, Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Mahesh Sathiamoorthy, Atoosa Kasrizadeh, Silvia Milano, Francesco Ricci","doi":"arxiv-2409.10993","DOIUrl":"https://doi.org/arxiv-2409.10993","url":null,"abstract":"Many recommendation systems limit user inputs to text strings or behavior\u0000signals such as clicks and purchases, and system outputs to a list of products\u0000sorted by relevance. With the advent of generative AI, users have come to\u0000expect richer levels of interactions. In visual search, for example, a user may\u0000provide a picture of their desired product along with a natural language\u0000modification of the content of the picture (e.g., a dress like the one shown in\u0000the picture but in red color). Moreover, users may want to better understand\u0000the recommendations they receive by visualizing how the product fits their use\u0000case, e.g., with a representation of how a garment might look on them, or how a\u0000furniture item might look in their room. Such advanced levels of interaction\u0000require recommendation systems that are able to discover both shared and\u0000complementary information about the product across modalities, and visualize\u0000the product in a realistic and informative way. However, existing systems often\u0000treat multiple modalities independently: text search is usually done by\u0000comparing the user query to product titles and descriptions, while visual\u0000search is typically done by comparing an image provided by the customer to\u0000product images. We argue that future recommendation systems will benefit from a\u0000multi-modal understanding of the products that leverages the rich information\u0000retailers have about both customers and products to come up with the best\u0000recommendations. In this chapter we review recommendation systems that use\u0000multiple data modalities simultaneously.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Promptriever：经过指令训练的检索器可以像语言模型一样接受提示

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.11136

Orion Weller, Benjamin Van Durme, Dawn Lawrie, Ashwin Paranjape, Yuhao Zhang, Jack Hessel

Instruction-tuned language models (LM) are able to respond to imperativecommands, providing a more natural user interface compared to their basecounterparts. In this work, we present Promptriever, the first retrieval modelable to be prompted like an LM. To train Promptriever, we curate and release anew instance-level instruction training set from MS MARCO, spanning nearly 500kinstances. Promptriever not only achieves strong performance on standardretrieval tasks, but also follows instructions. We observe: (1) large gains(reaching SoTA) on following detailed relevance instructions (+14.3 p-MRR /+3.1 nDCG on FollowIR), (2) significantly increased robustness to lexicalchoices/phrasing in the query+instruction (+12.9 Robustness@10 on InstructIR),and (3) the ability to perform hyperparameter search via prompting to reliablyimprove retrieval performance (+1.4 average increase on BEIR). Promptrieverdemonstrates that retrieval models can be controlled with prompts on aper-query basis, setting the stage for future work aligning LM promptingtechniques with information retrieval.

经过指令调整的语言模型（LM）能够响应指令性命令，从而提供比基本模型更自然的用户界面。在这项工作中，我们提出了 Promptriever，它是第一个可以像 LM 一样进行提示的检索模型。为了训练 Promptriever，我们从 MS MARCO 收集并发布了一个新的实例级指令训练集，涵盖近 500 个实例。Promptriever 不仅在标准检索任务中表现出色，而且还能按照指令进行检索。我们观察到：(1) 在遵循详细的相关性指令方面取得了巨大进步（达到 SoTA）（FollowIR 上+14.3 p-MRR /+3.1 nDCG），(2) 对查询+指令中的词性选择/措辞的鲁棒性显著提高（InstructIR 上+12.9 Robustness@10），(3) 通过提示执行超参数搜索的能力可靠地提高了检索性能（BEIR 上+1.4 平均增长）。Promptriever 演示了检索模型可以在每次查询的基础上通过提示进行控制，为今后将 LM 提示技术与信息检索相结合的工作奠定了基础。

{"title":"Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models","authors":"Orion Weller, Benjamin Van Durme, Dawn Lawrie, Ashwin Paranjape, Yuhao Zhang, Jack Hessel","doi":"arxiv-2409.11136","DOIUrl":"https://doi.org/arxiv-2409.11136","url":null,"abstract":"Instruction-tuned language models (LM) are able to respond to imperative\u0000commands, providing a more natural user interface compared to their base\u0000counterparts. In this work, we present Promptriever, the first retrieval model\u0000able to be prompted like an LM. To train Promptriever, we curate and release a\u0000new instance-level instruction training set from MS MARCO, spanning nearly 500k\u0000instances. Promptriever not only achieves strong performance on standard\u0000retrieval tasks, but also follows instructions. We observe: (1) large gains\u0000(reaching SoTA) on following detailed relevance instructions (+14.3 p-MRR /\u0000+3.1 nDCG on FollowIR), (2) significantly increased robustness to lexical\u0000choices/phrasing in the query+instruction (+12.9 Robustness@10 on InstructIR),\u0000and (3) the ability to perform hyperparameter search via prompting to reliably\u0000improve retrieval performance (+1.4 average increase on BEIR). Promptriever\u0000demonstrates that retrieval models can be controlled with prompts on a\u0000per-query basis, setting the stage for future work aligning LM prompting\u0000techniques with information retrieval.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Framework for Ranking Content Providers Using Prompt Engineering and Self-Attention Network 利用提示工程和自我关注网络对内容提供商进行排名的框架

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.11511

Gosuddin Kamaruddin Siddiqi, Deven Santhosh Shah, Radhika Bansal, Askar Kamalov

This paper addresses the problem of ranking Content Providers for ContentRecommendation System. Content Providers are the sources of news and othertypes of content, such as lifestyle, travel, gardening. We propose a frameworkthat leverages explicit user feedback, such as clicks and reactions, andcontent-based features, such as writing style and frequency of publishing, torank Content Providers for a given topic. We also use language models toengineer prompts that help us create a ground truth dataset for the previousunsupervised ranking problem. Using this ground truth, we expand with aself-attention based network to train on Learning to Rank ListWise task. Weevaluate our framework using online experiments and show that it can improvethe quality, credibility, and diversity of the content recommended to users.

本文探讨了内容推荐系统中的内容提供商排名问题。内容提供商是新闻和其他类型内容的来源，如生活方式、旅游、园艺等。我们提出了一个框架，利用明确的用户反馈（如点击和反应）和基于内容的特征（如写作风格和发布频率），对给定主题的内容提供商进行排名。我们还利用语言模型设计提示，帮助我们为之前的无监督排名问题创建一个基本事实数据集。利用这一基本事实，我们扩展了基于自我关注的网络，以训练 "学会明智排名"（Learning to Rank ListWise）任务。我们通过在线实验对我们的框架进行了评估，结果表明它可以提高向用户推荐内容的质量、可信度和多样性。

引用次数: 0

Beyond Relevance: Improving User Engagement by Personalization for Short-Video Search 超越相关性：通过个性化短视频搜索提高用户参与度

arXiv - CS - Information Retrieval

Pub Date : 2024-09-17 DOI: arxiv-2409.11281

Wentian Bao, Hu Liu, Kai Zheng, Chao Zhang, Shunyu Zhang, Enyun Yu, Wenwu Ou, Yang Song

Personalized search has been extensively studied in various applications,including web search, e-commerce, social networks, etc. With the soaringpopularity of short-video platforms, exemplified by TikTok and Kuaishou, thequestion arises: can personalization elevate the realm of short-video search,and if so, which techniques hold the key? In this work, we introduce $text{PR}^2$, a novel and comprehensive solutionfor personalizing short-video search, where $text{PR}^2$ stands for thePersonalized Retrieval and Ranking augmented search system. Specifically,$text{PR}^2$ leverages query-relevant collaborative filtering and personalizeddense retrieval to extract relevant and individually tailored content from alarge-scale video corpus. Furthermore, it utilizes the QIN (Query-Dominate UserInterest Network) ranking model, to effectively harness user long-termpreferences and real-time behaviors, and efficiently learn from user variousimplicit feedback through a multi-task learning framework. By deploying the$text{PR}^2$ in production system, we have achieved the most remarkable userengagement improvements in recent years: a 10.2% increase in CTR@10, a notable20% surge in video watch time, and a 1.6% uplift of search DAU. We believe thepractical insights presented in this work are valuable especially for buildingand improving personalized search systems for the short video platforms.

个性化搜索已在网络搜索、电子商务、社交网络等各种应用中得到广泛研究。随着以嘀嗒和快手为代表的短视频平台的迅速普及，人们不禁要问：个性化能否提升短视频搜索的境界？在这项工作中，我们介绍了$text{PR}^2$--一种新颖而全面的短视频搜索个性化解决方案，其中$text{PR}^2$代表个性化检索和排名增强搜索系统。具体来说，$text{PR}^2$ 利用查询相关协同过滤和个性化密集检索，从大规模视频语料库中提取相关的、个性化的内容。此外，它还利用 QIN（Query-Dominate UserInterest Network，查询主导用户兴趣网络）排名模型，有效利用用户的长期偏好和实时行为，并通过多任务学习框架从用户的各种隐性反馈中高效地学习。通过在生产系统中部署$text{PR}^2$，我们实现了近年来最显著的用户管理改进：CTR@10提高了10.2%，视频观看时间显著增加了20%，搜索DAU提高了1.6%。我们相信，这项工作中提出的实用见解对于构建和改进短视频平台的个性化搜索系统尤其有价值。

{"title":"Beyond Relevance: Improving User Engagement by Personalization for Short-Video Search","authors":"Wentian Bao, Hu Liu, Kai Zheng, Chao Zhang, Shunyu Zhang, Enyun Yu, Wenwu Ou, Yang Song","doi":"arxiv-2409.11281","DOIUrl":"https://doi.org/arxiv-2409.11281","url":null,"abstract":"Personalized search has been extensively studied in various applications,\u0000including web search, e-commerce, social networks, etc. With the soaring\u0000popularity of short-video platforms, exemplified by TikTok and Kuaishou, the\u0000question arises: can personalization elevate the realm of short-video search,\u0000and if so, which techniques hold the key? In this work, we introduce $text{PR}^2$, a novel and comprehensive solution\u0000for personalizing short-video search, where $text{PR}^2$ stands for the\u0000Personalized Retrieval and Ranking augmented search system. Specifically,\u0000$text{PR}^2$ leverages query-relevant collaborative filtering and personalized\u0000dense retrieval to extract relevant and individually tailored content from a\u0000large-scale video corpus. Furthermore, it utilizes the QIN (Query-Dominate User\u0000Interest Network) ranking model, to effectively harness user long-term\u0000preferences and real-time behaviors, and efficiently learn from user various\u0000implicit feedback through a multi-task learning framework. By deploying the\u0000$text{PR}^2$ in production system, we have achieved the most remarkable user\u0000engagement improvements in recent years: a 10.2% increase in CTR@10, a notable\u000020% surge in video watch time, and a 1.6% uplift of search DAU. We believe the\u0000practical insights presented in this work are valuable especially for building\u0000and improving personalized search systems for the short video platforms.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0