首页 > 最新文献

arXiv - CS - Information Retrieval最新文献

英文 中文
E-commerce Webpage Recommendation Scheme Base on Semantic Mining and Neural Networks 基于语义挖掘和神经网络的电子商务网页推荐方案
Pub Date : 2024-09-11 DOI: arxiv-2409.07033
Wenchao Zhao, Xiaoyi Liu, Ruilin Xu, Lingxi Xiao, Muqing Li
In e-commerce websites, web mining web page recommendation technology hasbeen widely used. However, recommendation solutions often cannot meet theactual application needs of online shopping users. To address this problem,this paper proposes an e-commerce web page recommendation solution thatcombines semantic web mining and BP neural networks. First, the web logs ofuser searches are processed, and 5 features are extracted: content priority,time consumption priority, online shopping users' explicit/implicit feedback onthe website, recommendation semantics and input deviation amount. Then, thesefeatures are used as input features of the BP neural network to classify andidentify the priority of the final output web page. Finally, the web pages aresorted according to priority and recommended to users. This project uses booksales webpages as samples for experiments. The results show that this solutioncan quickly and accurately identify the webpages required by users.
在电子商务网站中,网络挖掘网页推荐技术得到了广泛应用。然而,推荐解决方案往往不能满足在线购物用户的实际应用需求。针对这一问题,本文提出了一种结合语义网络挖掘和 BP 神经网络的电子商务网页推荐解决方案。首先,处理用户搜索的网络日志,提取 5 个特征:内容优先级、时间消费优先级、网购用户对网站的显性/隐性反馈、推荐语义和输入偏差量。然后,将这些特征作为 BP 神经网络的输入特征,对最终输出的网页进行分类和优先级识别。最后,根据优先级对网页进行排序并推荐给用户。本项目使用图书销售网页作为实验样本。结果表明,该解决方案能够快速、准确地识别用户所需的网页。
{"title":"E-commerce Webpage Recommendation Scheme Base on Semantic Mining and Neural Networks","authors":"Wenchao Zhao, Xiaoyi Liu, Ruilin Xu, Lingxi Xiao, Muqing Li","doi":"arxiv-2409.07033","DOIUrl":"https://doi.org/arxiv-2409.07033","url":null,"abstract":"In e-commerce websites, web mining web page recommendation technology has\u0000been widely used. However, recommendation solutions often cannot meet the\u0000actual application needs of online shopping users. To address this problem,\u0000this paper proposes an e-commerce web page recommendation solution that\u0000combines semantic web mining and BP neural networks. First, the web logs of\u0000user searches are processed, and 5 features are extracted: content priority,\u0000time consumption priority, online shopping users' explicit/implicit feedback on\u0000the website, recommendation semantics and input deviation amount. Then, these\u0000features are used as input features of the BP neural network to classify and\u0000identify the priority of the final output web page. Finally, the web pages are\u0000sorted according to priority and recommended to users. This project uses book\u0000sales webpages as samples for experiments. The results show that this solution\u0000can quickly and accurately identify the webpages required by users.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"55 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging User-Generated Reviews for Recommender Systems with Dynamic Headers 利用动态标题为推荐系统提供用户评论
Pub Date : 2024-09-11 DOI: arxiv-2409.07627
Shanu Vashishtha, Abhay Kumar, Lalitesh Morishetti, Kaushiki Nag, Kannan Achan
E-commerce platforms have a vast catalog of items to cater to theircustomers' shopping interests. Most of these platforms assist their customersin the shopping process by offering optimized recommendation carousels,designed to help customers quickly locate their desired items. Many models havebeen proposed in academic literature to generate and enhance the ranking andrecall set of items in these carousels. Conventionally, the accompanyingcarousel title text (header) of these carousels remains static. In mostinstances, a generic text such as "Items similar to your current viewing" isutilized. Fixed variations such as the inclusion of specific attributes "Otheritems from a similar seller" or "Items from a similar brand" in addition to"frequently bought together" or "considered together" are observed as well.This work proposes a novel approach to customize the header generation processof these carousels. Our work leverages user-generated reviews that lay focus onspecific attributes (aspects) of an item that were favorably perceived by usersduring their interaction with the given item. We extract these aspects fromreviews and train a graph neural network-based model under the framework of aconditional ranking task. We refer to our innovative methodology as DynamicText Snippets (DTS) which generates multiple header texts for an anchor itemand its recall set. Our approach demonstrates the potential of utilizinguser-generated reviews and presents a unique paradigm for exploringincreasingly context-aware recommendation systems.
电子商务平台拥有庞大的商品目录,可以满足客户的购物兴趣。这些平台大多通过提供优化的推荐转盘来帮助客户完成购物流程,从而帮助客户快速找到所需商品。学术文献中提出了许多模型来生成和增强这些旋转木马中商品的排名和检索集。传统上,这些旋转传送带的标题文本(页眉)保持不变。在大多数情况下,使用的是通用文本,如 "与您当前浏览的项目相似"。除了 "经常一起购买 "或 "一起考虑 "之外,我们还观察到一些固定的变化,例如包含特定属性 "来自类似卖家的其他商品 "或 "来自类似品牌的商品"。我们的工作利用了用户生成的评论,这些评论关注的是用户在与特定商品的互动过程中对该商品的特定属性(方面)的好感。我们从评论中提取这些方面,并在条件排名任务框架下训练基于图神经网络的模型。我们将我们的创新方法称为动态文本片段(DTS),它能为锚点项目及其召回集生成多个标题文本。我们的方法展示了利用用户生成的评论的潜力,并为探索日益增强的上下文感知推荐系统提供了一个独特的范例。
{"title":"Leveraging User-Generated Reviews for Recommender Systems with Dynamic Headers","authors":"Shanu Vashishtha, Abhay Kumar, Lalitesh Morishetti, Kaushiki Nag, Kannan Achan","doi":"arxiv-2409.07627","DOIUrl":"https://doi.org/arxiv-2409.07627","url":null,"abstract":"E-commerce platforms have a vast catalog of items to cater to their\u0000customers' shopping interests. Most of these platforms assist their customers\u0000in the shopping process by offering optimized recommendation carousels,\u0000designed to help customers quickly locate their desired items. Many models have\u0000been proposed in academic literature to generate and enhance the ranking and\u0000recall set of items in these carousels. Conventionally, the accompanying\u0000carousel title text (header) of these carousels remains static. In most\u0000instances, a generic text such as \"Items similar to your current viewing\" is\u0000utilized. Fixed variations such as the inclusion of specific attributes \"Other\u0000items from a similar seller\" or \"Items from a similar brand\" in addition to\u0000\"frequently bought together\" or \"considered together\" are observed as well.\u0000This work proposes a novel approach to customize the header generation process\u0000of these carousels. Our work leverages user-generated reviews that lay focus on\u0000specific attributes (aspects) of an item that were favorably perceived by users\u0000during their interaction with the given item. We extract these aspects from\u0000reviews and train a graph neural network-based model under the framework of a\u0000conditional ranking task. We refer to our innovative methodology as Dynamic\u0000Text Snippets (DTS) which generates multiple header texts for an anchor item\u0000and its recall set. Our approach demonstrates the potential of utilizing\u0000user-generated reviews and presents a unique paradigm for exploring\u0000increasingly context-aware recommendation systems.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dot Product is All You Need: Bridging the Gap Between Item Recommendation and Link Prediction 点产品就是你所需要的一切:缩小项目推荐与链接预测之间的差距
Pub Date : 2024-09-11 DOI: arxiv-2409.07433
Daniele Malitesta, Alberto Carlo Maria Mancino, Pasquale Minervini, Tommaso Di Noia
Item recommendation (the task of predicting if a user may interact with newitems from the catalogue in a recommendation system) and link prediction (thetask of identifying missing links in a knowledge graph) have long been regardedas distinct problems. In this work, we show that the item recommendationproblem can be seen as an instance of the link prediction problem, whereentities in the graph represent users and items, and the task consists ofpredicting missing instances of the relation type <>. In apreliminary attempt to demonstrate the assumption, we decide to test threepopular factorisation-based link prediction models on the item recommendationtask, showing that their predictive accuracy is competitive with tenstate-of-the-art recommendation models. The purpose is to show how the formermay be seamlessly and effectively applied to the recommendation task withoutany specific modification to their architectures. Finally, while beginning tounveil the key reasons behind the recommendation performance of the selectedlink prediction models, we explore different settings for their hyper-parametervalues, paving the way for future directions.
长期以来,项目推荐(预测用户是否可能与推荐系统目录中的新项目进行交互的任务)和链接预测(识别知识图谱中缺失的链接的任务)一直被视为不同的问题。在这项工作中,我们证明了物品推荐问题可以看作是链接预测问题的一个实例,图中的实体代表用户和物品,任务包括预测关系类型 > 的缺失实例。为了初步证明这一假设,我们决定在物品推荐任务中测试三种流行的基于因子化的链接预测模型,结果表明它们的预测准确率与十种最先进的推荐模型相比具有竞争力。这样做的目的是展示如何在不对其架构进行任何特定修改的情况下,将前者无缝、有效地应用到推荐任务中。最后,在开始揭示所选链接预测模型的推荐性能背后的关键原因的同时,我们探索了它们的超参数值的不同设置,为未来的发展方向铺平了道路。
{"title":"Dot Product is All You Need: Bridging the Gap Between Item Recommendation and Link Prediction","authors":"Daniele Malitesta, Alberto Carlo Maria Mancino, Pasquale Minervini, Tommaso Di Noia","doi":"arxiv-2409.07433","DOIUrl":"https://doi.org/arxiv-2409.07433","url":null,"abstract":"Item recommendation (the task of predicting if a user may interact with new\u0000items from the catalogue in a recommendation system) and link prediction (the\u0000task of identifying missing links in a knowledge graph) have long been regarded\u0000as distinct problems. In this work, we show that the item recommendation\u0000problem can be seen as an instance of the link prediction problem, where\u0000entities in the graph represent users and items, and the task consists of\u0000predicting missing instances of the relation type <<interactsWith>>. In a\u0000preliminary attempt to demonstrate the assumption, we decide to test three\u0000popular factorisation-based link prediction models on the item recommendation\u0000task, showing that their predictive accuracy is competitive with ten\u0000state-of-the-art recommendation models. The purpose is to show how the former\u0000may be seamlessly and effectively applied to the recommendation task without\u0000any specific modification to their architectures. Finally, while beginning to\u0000unveil the key reasons behind the recommendation performance of the selected\u0000link prediction models, we explore different settings for their hyper-parameter\u0000values, paving the way for future directions.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multilingual Prompts in LLM-Based Recommenders: Performance Across Languages 基于 LLM 的推荐器中的多语言提示:跨语言性能
Pub Date : 2024-09-11 DOI: arxiv-2409.07604
Makbule Gulcin Ozsoy
Large language models (LLMs) are increasingly used in natural languageprocessing tasks. Recommender systems traditionally use methods such ascollaborative filtering and matrix factorization, as well as advancedtechniques like deep learning and reinforcement learning. Although languagemodels have been applied in recommendation, the recent trend have focused onleveraging the generative capabilities of LLMs for more personalizedsuggestions. While current research focuses on English due to its resourcerichness, this work explores the impact of non-English prompts onrecommendation performance. Using OpenP5, a platform for developing andevaluating LLM-based recommendations, we expanded its English prompt templatesto include Spanish and Turkish. Evaluation on three real-world datasets, namelyML1M, LastFM, and Amazon-Beauty, showed that usage of non-English promptsgenerally reduce performance, especially in less-resourced languages likeTurkish. We also retrained an LLM-based recommender model with multilingualprompts to analyze performance variations. Retraining with multilingual promptsresulted in more balanced performance across languages, but slightly reducedEnglish performance. This work highlights the need for diverse language supportin LLM-based recommenders and suggests future research on creating evaluationdatasets, using newer models and additional languages.
大型语言模型(LLM)越来越多地用于自然语言处理任务。推荐系统传统上使用协同过滤和矩阵因式分解等方法,以及深度学习和强化学习等先进技术。虽然语言模型已被应用于推荐中,但最近的趋势侧重于利用 LLM 的生成能力来提供更个性化的建议。由于英语资源丰富,目前的研究主要集中在英语上,而本研究则探索了非英语提示对推荐性能的影响。OpenP5 是一个用于开发和评估基于 LLM 的推荐的平台,我们使用 OpenP5 将其英语提示模板扩展到西班牙语和土耳其语。在ML1M、LastFM 和 Amazon-Beauty 这三个真实数据集上进行的评估表明,使用非英语提示通常会降低推荐性能,尤其是像土耳其语这样资源较少的语言。我们还使用多语言提示重新训练了一个基于 LLM 的推荐模型,以分析性能变化。使用多语言提示重新训练的结果是,不同语言的性能更加均衡,但英语性能略有下降。这项工作强调了在基于 LLM 的推荐器中提供多种语言支持的必要性,并建议今后在创建评估数据集、使用更新的模型和更多语言方面开展研究。
{"title":"Multilingual Prompts in LLM-Based Recommenders: Performance Across Languages","authors":"Makbule Gulcin Ozsoy","doi":"arxiv-2409.07604","DOIUrl":"https://doi.org/arxiv-2409.07604","url":null,"abstract":"Large language models (LLMs) are increasingly used in natural language\u0000processing tasks. Recommender systems traditionally use methods such as\u0000collaborative filtering and matrix factorization, as well as advanced\u0000techniques like deep learning and reinforcement learning. Although language\u0000models have been applied in recommendation, the recent trend have focused on\u0000leveraging the generative capabilities of LLMs for more personalized\u0000suggestions. While current research focuses on English due to its resource\u0000richness, this work explores the impact of non-English prompts on\u0000recommendation performance. Using OpenP5, a platform for developing and\u0000evaluating LLM-based recommendations, we expanded its English prompt templates\u0000to include Spanish and Turkish. Evaluation on three real-world datasets, namely\u0000ML1M, LastFM, and Amazon-Beauty, showed that usage of non-English prompts\u0000generally reduce performance, especially in less-resourced languages like\u0000Turkish. We also retrained an LLM-based recommender model with multilingual\u0000prompts to analyze performance variations. Retraining with multilingual prompts\u0000resulted in more balanced performance across languages, but slightly reduced\u0000English performance. This work highlights the need for diverse language support\u0000in LLM-based recommenders and suggests future research on creating evaluation\u0000datasets, using newer models and additional languages.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM STORE:利用单一 LLM 精简语义标记化和生成式推荐
Pub Date : 2024-09-11 DOI: arxiv-2409.07276
Qijiong Liu, Jieming Zhu, Lu Fan, Zhou Zhao, Xiao-Ming Wu
Traditional recommendation models often rely on unique item identifiers (IDs)to distinguish between items, which can hinder their ability to effectivelyleverage item content information and generalize to long-tail or cold-startitems. Recently, semantic tokenization has been proposed as a promisingsolution that aims to tokenize each item's semantic representation into asequence of discrete tokens. In this way, it preserves the item's semanticswithin these tokens and ensures that semantically similar items are representedby similar tokens. These semantic tokens have become fundamental in traininggenerative recommendation models. However, existing generative recommendationmethods typically involve multiple sub-models for embedding, quantization, andrecommendation, leading to an overly complex system. In this paper, we proposeto streamline the semantic tokenization and generative recommendation processwith a unified framework, dubbed STORE, which leverages a single large languagemodel (LLM) for both tasks. Specifically, we formulate semantic tokenization asa text-to-token task and generative recommendation as a token-to-token task,supplemented by a token-to-text reconstruction task and a text-to-tokenauxiliary task. All these tasks are framed in a generative manner and trainedusing a single LLM backbone. Extensive experiments have been conducted tovalidate the effectiveness of our STORE framework across various recommendationtasks and datasets. We will release the source code and configurations forreproducible research.
传统的推荐模型通常依赖于唯一的项目标识符(ID)来区分项目,这可能会妨碍它们有效利用项目内容信息和概括长尾项目或冷启动项目的能力。最近,有人提出了语义标记化这一有前途的解决方案,其目的是将每个条目的语义表示标记化为一系列离散的标记。这样,它就能在这些标记中保留项目的语义,并确保语义相似的项目由相似的标记来表示。这些语义标记已成为训练生成式推荐模型的基础。然而,现有的生成式推荐方法通常涉及嵌入、量化和推荐等多个子模型,导致系统过于复杂。在本文中,我们建议使用一个统一的框架来简化语义标记化和生成式推荐过程,该框架被称为 STORE,它利用单一的大型语言模型(LLM)来完成这两项任务。具体来说,我们将语义标记化视为文本到标记的任务,将生成式推荐视为标记到标记的任务,并辅以标记到文本的重构任务和文本到标记的辅助任务。所有这些任务都是以生成方式构建的,并使用单个 LLM 骨干进行训练。我们进行了广泛的实验,以验证 STORE 框架在各种推荐任务和数据集上的有效性。我们将发布源代码和配置,以便进行可重复的研究。
{"title":"STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM","authors":"Qijiong Liu, Jieming Zhu, Lu Fan, Zhou Zhao, Xiao-Ming Wu","doi":"arxiv-2409.07276","DOIUrl":"https://doi.org/arxiv-2409.07276","url":null,"abstract":"Traditional recommendation models often rely on unique item identifiers (IDs)\u0000to distinguish between items, which can hinder their ability to effectively\u0000leverage item content information and generalize to long-tail or cold-start\u0000items. Recently, semantic tokenization has been proposed as a promising\u0000solution that aims to tokenize each item's semantic representation into a\u0000sequence of discrete tokens. In this way, it preserves the item's semantics\u0000within these tokens and ensures that semantically similar items are represented\u0000by similar tokens. These semantic tokens have become fundamental in training\u0000generative recommendation models. However, existing generative recommendation\u0000methods typically involve multiple sub-models for embedding, quantization, and\u0000recommendation, leading to an overly complex system. In this paper, we propose\u0000to streamline the semantic tokenization and generative recommendation process\u0000with a unified framework, dubbed STORE, which leverages a single large language\u0000model (LLM) for both tasks. Specifically, we formulate semantic tokenization as\u0000a text-to-token task and generative recommendation as a token-to-token task,\u0000supplemented by a token-to-text reconstruction task and a text-to-token\u0000auxiliary task. All these tasks are framed in a generative manner and trained\u0000using a single LLM backbone. Extensive experiments have been conducted to\u0000validate the effectiveness of our STORE framework across various recommendation\u0000tasks and datasets. We will release the source code and configurations for\u0000reproducible research.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning Diff-VPS:通过多任务扩散网络与对抗性时态推理进行视频息肉分割
Pub Date : 2024-09-11 DOI: arxiv-2409.07238
Yingling Lu, Yijun Yang, Zhaohu Xing, Qiong Wang, Lei Zhu
Diffusion Probabilistic Models have recently attracted significant attentionin the community of computer vision due to their outstanding performance.However, while a substantial amount of diffusion-based research has focused ongenerative tasks, no work introduces diffusion models to advance the results ofpolyp segmentation in videos, which is frequently challenged by polyps' highcamouflage and redundant temporal cues.In this paper, we present a noveldiffusion-based network for video polyp segmentation task, dubbed as Diff-VPS.We incorporate multi-task supervision into diffusion models to promote thediscrimination of diffusion models on pixel-by-pixel segmentation. Thisintegrates the contextual high-level information achieved by the jointclassification and detection tasks. To explore the temporal dependency,Temporal Reasoning Module (TRM) is devised via reasoning and reconstructing thetarget frame from the previous frames. We further equip TRM with a generativeadversarial self-supervised strategy to produce more realistic frames and thuscapture better dynamic cues. Extensive experiments are conducted on SUN-SEG,and the results indicate that our proposed Diff-VPS significantly achievesstate-of-the-art performance. Code is available athttps://github.com/lydia-yllu/Diff-VPS.
然而,尽管大量基于扩散的研究都集中在生成任务上,却没有任何研究引入扩散模型来推进视频中息肉分割的结果,而息肉的高遮蔽性和冗余时间线索经常给视频息肉分割带来挑战。我们在扩散模型中加入了多任务监督,以促进扩散模型在逐像素分割上的辨别能力。这整合了联合分类和检测任务所实现的上下文高级信息。为了探索时间依赖性,我们设计了时间推理模块(Temporal Reasoning Module,TRM),通过推理和重建前一帧的目标帧。我们进一步为 TRM 配备了生成式对抗自监督策略,以生成更逼真的帧,从而捕捉到更好的动态线索。我们在 SUN-SEG 上进行了广泛的实验,结果表明我们提出的 Diff-VPS 显著达到了最先进的性能。代码见:https://github.com/lydia-yllu/Diff-VPS。
{"title":"Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning","authors":"Yingling Lu, Yijun Yang, Zhaohu Xing, Qiong Wang, Lei Zhu","doi":"arxiv-2409.07238","DOIUrl":"https://doi.org/arxiv-2409.07238","url":null,"abstract":"Diffusion Probabilistic Models have recently attracted significant attention\u0000in the community of computer vision due to their outstanding performance.\u0000However, while a substantial amount of diffusion-based research has focused on\u0000generative tasks, no work introduces diffusion models to advance the results of\u0000polyp segmentation in videos, which is frequently challenged by polyps' high\u0000camouflage and redundant temporal cues.In this paper, we present a novel\u0000diffusion-based network for video polyp segmentation task, dubbed as Diff-VPS.\u0000We incorporate multi-task supervision into diffusion models to promote the\u0000discrimination of diffusion models on pixel-by-pixel segmentation. This\u0000integrates the contextual high-level information achieved by the joint\u0000classification and detection tasks. To explore the temporal dependency,\u0000Temporal Reasoning Module (TRM) is devised via reasoning and reconstructing the\u0000target frame from the previous frames. We further equip TRM with a generative\u0000adversarial self-supervised strategy to produce more realistic frames and thus\u0000capture better dynamic cues. Extensive experiments are conducted on SUN-SEG,\u0000and the results indicate that our proposed Diff-VPS significantly achieves\u0000state-of-the-art performance. Code is available at\u0000https://github.com/lydia-yllu/Diff-VPS.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Negative Sampling in Recommendation: A Survey and Future Directions 建议书中的负抽样:调查与未来方向
Pub Date : 2024-09-11 DOI: arxiv-2409.07237
Haokai Ma, Ruobing Xie, Lei Meng, Fuli Feng, Xiaoyu Du, Xingwu Sun, Zhanhui Kang, Xiangxu Meng
Recommender systems aim to capture users' personalized preferences from thecast amount of user behaviors, making them pivotal in the era of informationexplosion. However, the presence of the dynamic preference, the "informationcocoons", and the inherent feedback loops in recommendation make users interactwith a limited number of items. Conventional recommendation algorithmstypically focus on the positive historical behaviors, while neglecting theessential role of negative feedback in user interest understanding. As apromising but easy-to-ignored area, negative sampling is proficients inrevealing the genuine negative aspect inherent in user behaviors, emerging asan inescapable procedure in recommendation. In this survey, we first discussthe role of negative sampling in recommendation and thoroughly analyzechallenges that consistently impede its progress. Then, we conduct an extensiveliterature review on the existing negative sampling strategies inrecommendation and classify them into five categories with their discrepanttechniques. Finally, we detail the insights of the tailored negative samplingstrategies in diverse recommendation scenarios and outline an overview of theprospective research directions toward which the community may engage andbenefit.
推荐系统旨在从大量的用户行为中捕捉用户的个性化偏好,因此在信息爆炸的时代具有举足轻重的地位。然而,由于动态偏好、"信息茧 "以及推荐中固有的反馈回路的存在,用户只能与有限的项目进行交互。传统的推荐算法通常只关注积极的历史行为,而忽视了消极反馈在用户兴趣理解中的重要作用。负面采样是一个令人兴奋但又容易被忽视的领域,它能有效揭示用户行为中固有的真正负面因素,是推荐中不可避免的程序。在本研究中,我们首先讨论了负抽样在推荐中的作用,并深入分析了一直阻碍其发展的挑战。然后,我们对推荐中现有的负面取样策略进行了广泛的文献综述,并将其分为五类,其中包含了各自不同的技术。最后,我们详细介绍了量身定制的负面取样策略在不同推荐方案中的应用,并概述了社区可能参与并从中受益的前瞻性研究方向。
{"title":"Negative Sampling in Recommendation: A Survey and Future Directions","authors":"Haokai Ma, Ruobing Xie, Lei Meng, Fuli Feng, Xiaoyu Du, Xingwu Sun, Zhanhui Kang, Xiangxu Meng","doi":"arxiv-2409.07237","DOIUrl":"https://doi.org/arxiv-2409.07237","url":null,"abstract":"Recommender systems aim to capture users' personalized preferences from the\u0000cast amount of user behaviors, making them pivotal in the era of information\u0000explosion. However, the presence of the dynamic preference, the \"information\u0000cocoons\", and the inherent feedback loops in recommendation make users interact\u0000with a limited number of items. Conventional recommendation algorithms\u0000typically focus on the positive historical behaviors, while neglecting the\u0000essential role of negative feedback in user interest understanding. As a\u0000promising but easy-to-ignored area, negative sampling is proficients in\u0000revealing the genuine negative aspect inherent in user behaviors, emerging as\u0000an inescapable procedure in recommendation. In this survey, we first discuss\u0000the role of negative sampling in recommendation and thoroughly analyze\u0000challenges that consistently impede its progress. Then, we conduct an extensive\u0000literature review on the existing negative sampling strategies in\u0000recommendation and classify them into five categories with their discrepant\u0000techniques. Finally, we detail the insights of the tailored negative sampling\u0000strategies in diverse recommendation scenarios and outline an overview of the\u0000prospective research directions toward which the community may engage and\u0000benefit.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation 用于时空抽象的列表式推荐的分层强化学习
Pub Date : 2024-09-11 DOI: arxiv-2409.07416
Luo Ji, Gao Liu, Mingyang Yin, Hongxia Yang, Jingren Zhou
Modern listwise recommendation systems need to consider both long-term userperceptions and short-term interest shifts. Reinforcement learning can beapplied on recommendation to study such a problem but is also subject to largesearch space, sparse user feedback and long interactive latency. Motivated byrecent progress in hierarchical reinforcement learning, we propose a novelframework called mccHRL to provide different levels of temporal abstraction onlistwise recommendation. Within the hierarchical framework, the high-levelagent studies the evolution of user perception, while the low-level agentproduces the item selection policy by modeling the process as a sequentialdecision-making problem. We argue that such framework has a well-defineddecomposition of the outra-session context and the intra-session context, whichare encoded by the high-level and low-level agents, respectively. To verifythis argument, we implement both a simulator-based environment and anindustrial dataset-based experiment. Results observe significant performanceimprovement by our method, compared with several well-known baselines. Data andcodes have been made public.
现代列表式推荐系统需要同时考虑用户的长期看法和短期兴趣转移。强化学习可以应用于推荐来研究这样的问题,但它也受到搜索空间大、用户反馈稀少和交互延迟长的限制。受分层强化学习最新进展的启发,我们提出了一个名为 mccHRL 的新框架,为列表式推荐提供不同层次的时间抽象。在分层框架内,高层代理研究用户感知的演化,而低层代理则通过将用户感知的演化过程建模为一个顺序决策问题来生成项目选择策略。我们认为,这种框架对会话外上下文和会话内上下文有明确的分解,分别由高层代理和低层代理编码。为了验证这一论点,我们实施了基于模拟器的环境和基于工业数据集的实验。结果表明,与几种著名的基线方法相比,我们的方法在性能上有明显提高。数据和代码已公开。
{"title":"Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation","authors":"Luo Ji, Gao Liu, Mingyang Yin, Hongxia Yang, Jingren Zhou","doi":"arxiv-2409.07416","DOIUrl":"https://doi.org/arxiv-2409.07416","url":null,"abstract":"Modern listwise recommendation systems need to consider both long-term user\u0000perceptions and short-term interest shifts. Reinforcement learning can be\u0000applied on recommendation to study such a problem but is also subject to large\u0000search space, sparse user feedback and long interactive latency. Motivated by\u0000recent progress in hierarchical reinforcement learning, we propose a novel\u0000framework called mccHRL to provide different levels of temporal abstraction on\u0000listwise recommendation. Within the hierarchical framework, the high-level\u0000agent studies the evolution of user perception, while the low-level agent\u0000produces the item selection policy by modeling the process as a sequential\u0000decision-making problem. We argue that such framework has a well-defined\u0000decomposition of the outra-session context and the intra-session context, which\u0000are encoded by the high-level and low-level agents, respectively. To verify\u0000this argument, we implement both a simulator-based environment and an\u0000industrial dataset-based experiment. Results observe significant performance\u0000improvement by our method, compared with several well-known baselines. Data and\u0000codes have been made public.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Sequential Music Recommendation with Negative Feedback-informed Contrastive Learning 利用负反馈信息对比学习改进序列音乐推荐
Pub Date : 2024-09-11 DOI: arxiv-2409.07367
Pavan Seshadri, Shahrzad Shashaani, Peter Knees
Modern music streaming services are heavily based on recommendation enginesto serve content to users. Sequential recommendation -- continuously providingnew items within a single session in a contextually coherent manner -- has beenan emerging topic in current literature. User feedback -- a positive ornegative response to the item presented -- is used to drive contentrecommendations by learning user preferences. We extend this idea tosession-based recommendation to provide context-coherent music recommendationsby modelling negative user feedback, i.e., skips, in the loss function. Wepropose a sequence-aware contrastive sub-task to structure item embeddings insession-based music recommendation, such that true next-positive items(ignoring skipped items) are structured closer in the session embedding space,while skipped tracks are structured farther away from all items in the session.This directly affects item rankings using a K-nearest-neighbors search fornext-item recommendations, while also promoting the rank of the true next item.Experiments incorporating this task into SoTA methods for sequential itemrecommendation show consistent performance gains in terms of next-item hitrate, item ranking, and skip down-ranking on three music recommendationdatasets, strongly benefiting from the increasing presence of user feedback.
现代音乐流媒体服务在很大程度上是基于向用户提供内容的推荐引擎。顺序推荐--在一个会话中以上下文连贯的方式持续提供新项目--一直是当前文献中的一个新兴话题。用户反馈--对所提供项目的积极或消极反应--被用来通过学习用户偏好来驱动内容推荐。我们将这一想法扩展到基于会话的推荐,通过在损失函数中模拟用户的负面反馈(即跳过)来提供上下文一致的音乐推荐。我们提出了一个序列感知对比子任务,用于在基于会话的音乐推荐中构建项目嵌入,这样真正的下一个积极项目(忽略跳过的项目)在会话嵌入空间中的结构更接近,而跳过的曲目在会话中的结构则远离所有项目。实验表明,在三个音乐推荐数据集上,下一项目命中率、项目排名和跳过降级方面的性能都得到了持续提升,用户反馈的不断增加使其受益匪浅。
{"title":"Enhancing Sequential Music Recommendation with Negative Feedback-informed Contrastive Learning","authors":"Pavan Seshadri, Shahrzad Shashaani, Peter Knees","doi":"arxiv-2409.07367","DOIUrl":"https://doi.org/arxiv-2409.07367","url":null,"abstract":"Modern music streaming services are heavily based on recommendation engines\u0000to serve content to users. Sequential recommendation -- continuously providing\u0000new items within a single session in a contextually coherent manner -- has been\u0000an emerging topic in current literature. User feedback -- a positive or\u0000negative response to the item presented -- is used to drive content\u0000recommendations by learning user preferences. We extend this idea to\u0000session-based recommendation to provide context-coherent music recommendations\u0000by modelling negative user feedback, i.e., skips, in the loss function. We\u0000propose a sequence-aware contrastive sub-task to structure item embeddings in\u0000session-based music recommendation, such that true next-positive items\u0000(ignoring skipped items) are structured closer in the session embedding space,\u0000while skipped tracks are structured farther away from all items in the session.\u0000This directly affects item rankings using a K-nearest-neighbors search for\u0000next-item recommendations, while also promoting the rank of the true next item.\u0000Experiments incorporating this task into SoTA methods for sequential item\u0000recommendation show consistent performance gains in terms of next-item hit\u0000rate, item ranking, and skip down-ranking on three music recommendation\u0000datasets, strongly benefiting from the increasing presence of user feedback.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Attacks to Multi-Modal Models 多模态模型的对抗性攻击
Pub Date : 2024-09-10 DOI: arxiv-2409.06793
Zhihao Dou, Xin Hu, Haibo Yang, Zhuqing Liu, Minghong Fang
Multi-modal models have gained significant attention due to their powerfulcapabilities. These models effectively align embeddings across diverse datamodalities, showcasing superior performance in downstream tasks compared totheir unimodal counterparts. Recent study showed that the attacker canmanipulate an image or audio file by altering it in such a way that itsembedding matches that of an attacker-chosen targeted input, thereby deceivingdownstream models. However, this method often underperforms due to inherentdisparities in data from different modalities. In this paper, we introduceCrossFire, an innovative approach to attack multi-modal models. CrossFirebegins by transforming the targeted input chosen by the attacker into a formatthat matches the modality of the original image or audio file. We thenformulate our attack as an optimization problem, aiming to minimize the angulardeviation between the embeddings of the transformed input and the modifiedimage or audio file. Solving this problem determines the perturbations to beadded to the original media. Our extensive experiments on six real-worldbenchmark datasets reveal that CrossFire can significantly manipulatedownstream tasks, surpassing existing attacks. Additionally, we evaluate sixdefensive strategies against CrossFire, finding that current defenses areinsufficient to counteract our CrossFire.
多模态模型因其强大的功能而备受关注。与单模态模型相比,这些模型能有效地调整不同数据模态的嵌入,在下游任务中表现出卓越的性能。最近的研究表明,攻击者可以通过改变图像或音频文件,使其嵌入与攻击者选择的目标输入相匹配,从而欺骗下游模型。然而,由于来自不同模态的数据存在固有差异,这种方法往往效果不佳。本文介绍了一种攻击多模态模型的创新方法--CrossFire。CrossFire 首先将攻击者选择的目标输入转换成与原始图像或音频文件的模态相匹配的格式。然后,我们将攻击表述为一个优化问题,旨在最小化转换后的输入与修改后的图像或音频文件的嵌入之间的角度偏差。解决这个问题就能确定要添加到原始媒体中的扰动。我们在六个真实世界基准数据集上进行了广泛的实验,结果表明 CrossFire 能够显著操纵下游任务,超越了现有的攻击。此外,我们还评估了针对 CrossFire 的六种防御策略,发现当前的防御措施不足以对抗我们的 CrossFire。
{"title":"Adversarial Attacks to Multi-Modal Models","authors":"Zhihao Dou, Xin Hu, Haibo Yang, Zhuqing Liu, Minghong Fang","doi":"arxiv-2409.06793","DOIUrl":"https://doi.org/arxiv-2409.06793","url":null,"abstract":"Multi-modal models have gained significant attention due to their powerful\u0000capabilities. These models effectively align embeddings across diverse data\u0000modalities, showcasing superior performance in downstream tasks compared to\u0000their unimodal counterparts. Recent study showed that the attacker can\u0000manipulate an image or audio file by altering it in such a way that its\u0000embedding matches that of an attacker-chosen targeted input, thereby deceiving\u0000downstream models. However, this method often underperforms due to inherent\u0000disparities in data from different modalities. In this paper, we introduce\u0000CrossFire, an innovative approach to attack multi-modal models. CrossFire\u0000begins by transforming the targeted input chosen by the attacker into a format\u0000that matches the modality of the original image or audio file. We then\u0000formulate our attack as an optimization problem, aiming to minimize the angular\u0000deviation between the embeddings of the transformed input and the modified\u0000image or audio file. Solving this problem determines the perturbations to be\u0000added to the original media. Our extensive experiments on six real-world\u0000benchmark datasets reveal that CrossFire can significantly manipulate\u0000downstream tasks, surpassing existing attacks. Additionally, we evaluate six\u0000defensive strategies against CrossFire, finding that current defenses are\u0000insufficient to counteract our CrossFire.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"44 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142205323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1