G. Adomavicius, Konstantin Bauman, B. Mobasher, Francesco Ricci, Alexander Tuzhilin, Moshe Unger
Contextual information has been widely recognized as an important modeling dimension in the social sciences and in computing. In particular, context plays an acknowledged role in enhancing recommendation results and retrieval performance. While a substantial amount of existing research has focused on context-aware recommender systems (CARS), many interesting problems remain under-explored. The CARS 2022 workshop provides a venue for presenting and discussing the important features of the next generation of CARS, as well as application domains that may require novel types of contextual information and the ability to cope with their dynamic properties in group recommendations and in online environments.
{"title":"CARS: Workshop on Context-Aware Recommender Systems 2022","authors":"G. Adomavicius, Konstantin Bauman, B. Mobasher, Francesco Ricci, Alexander Tuzhilin, Moshe Unger","doi":"10.1145/3523227.3547421","DOIUrl":"https://doi.org/10.1145/3523227.3547421","url":null,"abstract":"Contextual information has been widely recognized as an important modeling dimension in social sciences and in computing. In particular, the role of context has been recognized in enhancing recommendation results and retrieval performance. While a substantial amount of existing research has focused on context-aware recommender systems (CARS), many interesting problems remain under-explored. The CARS 2022 workshop provides a venue for presenting and discussing: the important features of the next generation of CARS; and application domains that may require the use of novel types of contextual information and cope with their dynamic properties in group recommendations and in online environments.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127007846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The main source of knowledge utilized in recommender systems (RS) is users’ feedback. While the usage of implicit feedback (i.e., statistics of user behavior) is gaining in prominence, explicit feedback (i.e., user ratings) remains an important data source. This is especially true for domains where evaluating an object does not require extensive usage and users are well motivated to provide ratings (e.g., video-on-demand services or library archives). Numerous rating schemes for explicit feedback have been proposed, varying in both granularity and presentation style. Several works study the effect of the rating scale and its presentation on users’ rating behavior, e.g., their willingness to provide feedback or various biases in rating behavior. Nonetheless, the effect of rating granularity on RS performance remains largely under-researched. In this paper, we study the combined effect of rating granularity and the assumed probability that feedback is provided on various performance statistics of recommender systems. Results indicate that decreasing feedback granularity may change RS performance w.r.t. nDCG for some recommending algorithms. Nonetheless, in most cases the effect of feedback granularity is outweighed by even a small decrease in feedback quantity. Therefore, our results corroborate the policy of many major real-world applications, i.e., preferring simpler rating schemes with a higher chance of receiving feedback over finer-grained rating scenarios.
{"title":"The Effect of Feedback Granularity on Recommender Systems Performance","authors":"Ladislav Peška, Stepán Balcar","doi":"10.1145/3523227.3551479","DOIUrl":"https://doi.org/10.1145/3523227.3551479","url":null,"abstract":"The main source of knowledge utilized in recommender systems (RS) is users’ feedback. While the usage of implicit feedback (i.e. user’s behavior statistics) is gaining in prominence, the explicit feedback (i.e. user’s ratings) remain an important data source. This is true especially for domains, where evaluation of an object does not require an extensive usage and users are well motivated to do so (e.g., video-on-demand services or library archives). So far, numerous rating schemes for explicit feedback have been proposed, ranging both in granularity and presentation style. There are several works studying the effect of rating’s scale and presentation on user’s rating behavior, e.g. willingness to provide feedback or various biases in rating behavior. Nonetheless, the effect of ratings granularity on RS performance remain largely under-researched. In this paper, we studied the combined effect of ratings granularity and supposed probability of feedback existence on various performance statistics of recommender systems. Results indicate that decreasing feedback granularity may lead to changes in RS’s performance w.r.t. nDCG for some recommending algorithms. Nonetheless, in most cases the effect of feedback granularity is surpassed by even a small decrease in feedback’s quantity. Therefore, our results corroborate the policy of many major real-world applications, i.e. preference of simpler rating schemes with the higher chance of feedback reception instead of finer-grained rating scenarios.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"139 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126898195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Minmin Chen, Can Xu, Vince Gatto, Devanshu Jain, Aviral Kumar, Ed H. Chi
Industrial recommendation platforms are increasingly concerned with how to make recommendations that cause users to enjoy their long-term experience on the platform. Reinforcement learning (RL) emerged naturally as an appealing approach for its promise in 1) combating the feedback-loop effect resulting from myopic system behaviors; and 2) sequential planning to optimize long-term outcomes. Scaling RL algorithms to production recommender systems serving billions of users and contents, however, remains challenging. Sample inefficiency and instability of online RL hinder its widespread adoption in production. Offline RL enables the use of off-policy data and batch learning; on the other hand, it faces significant challenges in learning due to distribution shift. A REINFORCE agent [3] was successfully tested for YouTube recommendation, significantly outperforming a sophisticated supervised-learning production system. Off-policy correction was employed to learn from logged data; the algorithm partially mitigates the distribution shift by employing a one-step importance weighting. We resort to off-policy actor-critic algorithms to address the distribution shift to a greater extent. Here we share the key designs in setting up an off-policy actor-critic agent for production recommender systems. It extends [3] with a critic network that estimates the value of any state-action pair under the target learned policy through temporal-difference learning. We demonstrate in offline and live experiments that the new framework outperforms the baseline and improves long-term user experience. An interesting discovery along our investigation is that recommendation agents that employ a softmax policy parameterization can end up being too pessimistic about out-of-distribution (OOD) actions. Finding the right balance between pessimism and optimism on OOD actions is critical to the success of offline RL for recommender systems.
{"title":"Off-Policy Actor-critic for Recommender Systems","authors":"Minmin Chen, Can Xu, Vince Gatto, Devanshu Jain, Aviral Kumar, Ed H. Chi","doi":"10.1145/3523227.3546758","DOIUrl":"https://doi.org/10.1145/3523227.3546758","url":null,"abstract":"Industrial recommendation platforms are increasingly concerned with how to make recommendations that cause users to enjoy their long term experience on the platform. Reinforcement learning emerged naturally as an appealing approach for its promise in 1) combating feedback loop effect resulted from myopic system behaviors; and 2) sequential planning to optimize long term outcome. Scaling RL algorithms to production recommender systems serving billions of users and contents, however remain challenging. Sample inefficiency and instability of online RL hinder its widespread adoption in production. Offline RL enables usage of off-policy data and batch learning. It on the other hand faces significant challenges in learning due to the distribution shift. A REINFORCE agent [3] was successfully tested for YouTube recommendation, significantly outperforming a sophisticated supervised learning production system. Off-policy correction was employed to learn from logged data. The algorithm partially mitigates the distribution shift by employing a one-step importance weighting. We resort to the off-policy actor critic algorithms to addresses the distribution shift to a better extent. Here we share the key designs in setting up an off-policy actor-critic agent for production recommender systems. It extends [3] with a critic network that estimates the value of any state-action pairs under the target learned policy through temporal difference learning. We demonstrate in offline and live experiments that the new framework out-performs baseline and improves long term user experience. An interesting discovery along our investigation is that recommendation agents that employ a softmax policy parameterization, can end up being too pessimistic about out-of-distribution (OOD) actions. Finding the right balance between pessimism and optimism on OOD actions is critical to the success of offline RL for recommender systems.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125379971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Konstan, A. Muralidharan, Ankan Saha, Shilad Sen, Mengting Wan, Longqi Yang
As organizations increasingly digitize their business processes, the role of recommender systems in work environments is expanding. The goal of the RecWork workshop is to close the gap in recommender systems research for work environments in areas such as calendaring, productivity, community building, space planning, workforce development, and information routing. RecWork will bring together experts who will collaboratively synthesize a forward-looking research agenda for recommender systems in the workplace. The outcome will be captured in a white paper that will serve as the foundation for future RecWork workshops. These steps will help advance research on workplace recommenders and broaden the reach of the RecSys conference.
{"title":"RecWork: Workshop on Recommender Systems for the Future of Work","authors":"J. Konstan, A. Muralidharan, Ankan Saha, Shilad Sen, Mengting Wan, Longqi Yang","doi":"10.1145/3523227.3547415","DOIUrl":"https://doi.org/10.1145/3523227.3547415","url":null,"abstract":"As organizations increasingly digitize their business processes, the role of recommender systems in work environments is expanding. The goal of the RecWork workshop is closing the gap in recommender systems research for work environments in areas such as calendaring, productivity, community building, space planning, workforce development, and information routing. RecWork will bring together experts who will collaboratively synthesize a forward-looking research agenda for recommender systems in the workplace. The outcome will be captured through a white paper that will serve as the foundation for future RecWork workshops. These steps will help advance research in workplace recommenders and broaden the reach of the RecSys conference.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121495837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Giacomo Balloccu, Ludovico Boratto, G. Fenu, M. Marras
The goal of this tutorial is to present to the RecSys community recent advances in explainable recommender systems with knowledge graphs. We will first introduce conceptual foundations, surveying the state of the art and describing real-world examples of how knowledge graphs are being integrated into the recommendation pipeline, also for the purpose of providing explanations. The tutorial will continue with a systematic presentation of algorithmic solutions to model, integrate, train, and assess a recommender system with knowledge graphs, with particular attention to the explainability perspective. A practical part will then provide attendees with concrete implementations of recommender systems with knowledge graphs, leveraging open-source tools and public datasets; in this part, tutorial participants will be engaged in designing the explanations that accompany the recommendations and in articulating their impact. We conclude the tutorial by analyzing emerging open issues and future directions. Website: https://explainablerecsys.github.io/recsys2022/.
{"title":"Hands on Explainable Recommender Systems with Knowledge Graphs","authors":"Giacomo Balloccu, Ludovico Boratto, G. Fenu, M. Marras","doi":"10.1145/3523227.3547374","DOIUrl":"https://doi.org/10.1145/3523227.3547374","url":null,"abstract":"The goal of this tutorial is to present the RecSys community with recent advances on explainable recommender systems with knowledge graphs. We will first introduce conceptual foundations, by surveying the state of the art and describing real-world examples of how knowledge graphs are being integrated into the recommendation pipeline, also for the purpose of providing explanations. This tutorial will continue with a systematic presentation of algorithmic solutions to model, integrate, train, and assess a recommender system with knowledge graphs, with particular attention to the explainability perspective. A practical part will then provide attendees with concrete implementations of recommender systems with knowledge graphs, leveraging open-source tools and public datasets; in this part, tutorial participants will be engaged in the design of explanations accompanying the recommendations and in articulating their impact. We conclude the tutorial by analyzing emerging open issues and future directions. Website: https://explainablerecsys.github.io/recsys2022/.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114684640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In recent years, many methods for machine learning on tabular data have been introduced that use factorization machines, neural networks, or both. This has created a great variety of methods, making it non-obvious which one should be used in practice. We begin by extending the previously established theoretical connection between polynomial neural networks and factorization machines (FM) to recently introduced FM techniques. This allows us to propose a single neural-network-based framework that can switch between the deep learning and FM paradigms by a simple change of an activation function. We further show that an activation function exists which can adaptively learn to select the optimal paradigm. Another key element in our framework is its ability to learn high-dimensional embeddings by low-rank factorization. Our framework can handle numeric and categorical data as well as multiclass outputs. Extensive empirical experiments verify our analytical claims. Source code is available at https://github.com/ChenAlmagor/FiFa
{"title":"You Say Factorization Machine, I Say Neural Network - It’s All in the Activation","authors":"Chen Almagor, Yedid Hoshen","doi":"10.1145/3523227.3551499","DOIUrl":"https://doi.org/10.1145/3523227.3551499","url":null,"abstract":"In recent years, many methods for machine learning on tabular data were introduced that use either factorization machines, neural networks or both. This created a great variety of methods making it non-obvious which method should be used in practice. We begin by extending the previously established theoretical connection between polynomial neural networks and factorization machines (FM) to recently introduced FM techniques. This allows us to propose a single neural-network-based framework that can switch between the deep learning and FM paradigms by a simple change of an activation function. We further show that an activation function exists which can adaptively learn to select the optimal paradigm. Another key element in our framework is its ability to learn high-dimensional embeddings by low-rank factorization. Our framework can handle numeric and categorical data as well as multiclass outputs. Extensive empirical experiments verify our analytical claims. Source code is available at https://github.com/ChenAlmagor/FiFa","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115387742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Karl Higley, Even Oldridge, Ronay Ak, Sara Rabhi, G. Moreira
Newcomers to recommender systems often face challenges related to their lack of understanding of how these systems operate in real life. Most online content on this topic focuses on models and algorithms that score items based on the user’s preferences. However, the recommender model alone does not provide everything needed to serve optimized recommender systems that meet the company’s business objectives. An industry-standard recommender system involves a number of steps, including data preprocessing, defining and training recommender models, as well as filtering and business logic for serving. In this work, we propose the four-stage recommender system, an industry-wide design pattern we have identified for production recommender systems. The four-stage pipeline includes an item retrieval step that prepares a small subset of relevant items for scoring. The filtering stage then cleans up this subset based on business logic, such as removing out-of-stock or previously seen items. The ranking component uses a recommender model to score each item in the candidate list based on the preferences of the user. In the final step, the scores are re-ordered to produce a final recommendation list aligned with other business needs or constraints, such as diversity. The presented demo demonstrates how easy it is to build and deploy a four-stage recommender system pipeline using the NVIDIA Merlin open-source framework.
{"title":"Building and Deploying a Multi-Stage Recommender System with Merlin","authors":"Karl Higley, Even Oldridge, Ronay Ak, Sara Rabhi, G. Moreira","doi":"10.1145/3523227.3551468","DOIUrl":"https://doi.org/10.1145/3523227.3551468","url":null,"abstract":"Newcomers to recommender systems often face challenges related to their lack of understanding of how these systems operate in real life. In most online content related to this topic, the focus is on models and algorithms that score items based on the user’s preferences. However, the recommender model alone does not comprise everything needed for serving optimized recommender systems that meet the company’s business objectives. An industry-standard recommender system involves a number of steps, including data preprocessing, defining and training recommender models, as well as filtering and business logic for serving. In this work, we propose the four-stage recommender system, an industry-wide design pattern we have identified for production recommender systems. The four-stage pipeline includes an item retrieval step that prepares a small subset of relevant items for scoring. The filtering stage then cleans up the subset of items based on business logic such as removing out-of-stock or previously seen items. As for the ranking component, it uses a recommender model to score each item in the presented list based on the preferences of the user. In the final step, the scores are re-ordered to provide a final recommendation list aligned with other business needs or constraints such as diversity. In particular, the presented demo demonstrates how easy it is to build and deploy a four-stage recommender system pipeline using the NVIDIA Merlin open-source framework.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115279616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ziyang Tang, Yiheng Duan, Steven H. Zhu, Stephanie S. Zhang, Lihong Li
A/B testing is a powerful tool for a company to make informed decisions about its services and products. A limitation of A/B tests is that they do not easily extend to measuring post-experiment (long-term) differences. In this talk, we study a different approach inspired by recent advances in off-policy evaluation in reinforcement learning (RL). The basic RL approach assumes customer behavior follows a stationary Markovian process and estimates the average engagement metric once the process reaches its steady state. However, in realistic scenarios, the stationarity assumption is often violated due to weekly variations and seasonality effects. To tackle this challenge, we propose a variation that relaxes the stationarity assumption. We empirically tested both the stationary and nonstationary approaches on a synthetic dataset and an online store dataset.
{"title":"Estimating Long-term Effects from Experimental Data","authors":"Ziyang Tang, Yiheng Duan, Steven H. Zhu, Stephanie S. Zhang, Lihong Li","doi":"10.1145/3523227.3547398","DOIUrl":"https://doi.org/10.1145/3523227.3547398","url":null,"abstract":"A/B testing is a powerful tool for a company to make informed decisions about their services and products. A limitation of A/B tests is that they do not easily extend to measure post-experiment (long-term) differences. In this talk, we study a different approach inspired by recent advances in off-policy evaluation in reinforcement learning (RL). The basic RL approach assumes customer behavior follows a stationary Markovian process, and estimates the average engagement metric when the process reaches the steady state. However, in realistic scenarios, the stationary assumption is often violated due to weekly variations and seasonality effects. To tackle this challenge, we propose a variation by relaxing the stationary assumption. We empirically tested both stationary and nonstationary approaches in a synthetic dataset and an online store dataset.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122655347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weixin Chen, Mingkai He, Yongxin Ni, Weike Pan, L. Chen, Zhong Ming
Heterogeneous sequential recommendation (HSR) is an important recommendation problem that aims to predict the next item a user will interact with under a target behavior type (e.g., purchase in e-commerce sites), based on his/her historical interactions with different behaviors. Though existing sequential methods have achieved strong performance by considering the varied impacts of interactions with sequential information, a large body of them still have two major shortcomings. Firstly, they usually model different behaviors separately without considering the correlations between them, even though the transitions from item to item under diverse behaviors indicate users’ potential behavior patterns. Secondly, though the behavior information contains a user’s fine-grained interests, insufficient consideration of the local context limits their ability to understand user intentions; utilizing the adjacent interactions to better understand a user’s behavior could improve the certainty of prediction. To address these two issues, we propose a novel solution utilizing global and personalized graphs for HSR (GPG4HSR) to learn behavior transitions and user intentions. Specifically, our GPG4HSR consists of two graphs, i.e., a global graph to capture the transitions between different behaviors, and a personalized graph to model items with behaviors by further considering the distinct user intentions of the adjacent contextually relevant nodes. Extensive experiments on four public datasets with state-of-the-art baselines demonstrate the effectiveness and general applicability of our method GPG4HSR.
{"title":"Global and Personalized Graphs for Heterogeneous Sequential Recommendation by Learning Behavior Transitions and User Intentions","authors":"Weixin Chen, Mingkai He, Yongxin Ni, Weike Pan, L. Chen, Zhong Ming","doi":"10.1145/3523227.3546761","DOIUrl":"https://doi.org/10.1145/3523227.3546761","url":null,"abstract":"Heterogeneous sequential recommendation (HSR) is a very important recommendation problem, which aims to predict a user’s next interacted item under a target behavior type (e.g., purchase in e-commerce sites) based on his/her historical interactions with different behaviors. Though existing sequential methods have achieved advanced performance by considering the varied impacts of interactions with sequential information, a large body of them still have two major shortcomings. Firstly, they usually model different behaviors separately without considering the correlations between them. The transitions from item to item under diverse behaviors indicate some users’ potential behavior manner. Secondly, though the behavior information contains a user’s fine-grained interests, the insufficient consideration of the local context information limits them from well understanding user intentions. Utilizing the adjacent interactions to better understand a user’s behavior could improve the certainty of prediction. To address these two issues, we propose a novel solution utilizing global and personalized graphs for HSR (GPG4HSR) to learn behavior transitions and user intentions. Specifically, our GPG4HSR consists of two graphs, i.e., a global graph to capture the transitions between different behaviors, and a personalized graph to model items with behaviors by further considering the distinct user intentions of the adjacent contextually relevant nodes. Extensive experiments on four public datasets with the state-of-the-art baselines demonstrate the effectiveness and general applicability of our method GPG4HSR.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121157522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reinforcement learning (RL) is gaining traction as a complementary approach to supervised learning for RecSys due to its ability to solve sequential decision-making processes with delayed rewards. Recent advances in offline reinforcement learning, off-policy evaluation, and more scalable, performant system design with the ability to run code in parallel have made RL more tractable for real-time RecSys use cases. This tutorial introduces RLlib [9], a comprehensive open-source Python RL framework built for production workloads. RLlib is built on top of open-source Ray [8], an easy-to-use, distributed computing framework for Python that can handle complex, heterogeneous applications. Ray and RLlib run on compute clusters on any cloud without vendor lock-in. Using Colab notebooks, you will leave this tutorial with a complete, working example of parallelized Python RL code using RLlib for RecSys in a GitHub repo.
{"title":"Hands-on Reinforcement Learning for Recommender Systems - From Bandits to SlateQ to Offline RL with Ray RLlib","authors":"Christy D. Bergman, Kourosh Hakhamaneshi","doi":"10.1145/3523227.3547370","DOIUrl":"https://doi.org/10.1145/3523227.3547370","url":null,"abstract":"Reinforcement learning (RL) is gaining traction as a complementary approach to supervised learning for RecSys due to its ability to solve sequential decision-making processes for delayed rewards. Recent advances in offline reinforcement learning, off-policy evaluation, and more scalable, performant system design with the ability to run code in parallel, have made RL more tractable for the RecSys real time use cases. This tutorial introduces RLlib [9], a comprehensive open-source Python RL framework built for production workloads. RLlib is built on top of open-source Ray [8], an easy-to-use, distributed computing framework for Python that can handle complex, heterogeneous applications. Ray and RLlib run on compute clusters on any cloud without vendor lock. Using Colab notebooks, you will leave this tutorial with a complete, working example of parallelized Python RL code using RLlib for RecSys on a github repo.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125151247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}