首页 > 最新文献

Proceedings of the 16th ACM Conference on Recommender Systems最新文献

英文 中文
Self-Supervised Bot Play for Transcript-Free Conversational Recommendation with Rationales 基于基本原理的无转录会话推荐的自我监督Bot游戏
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546783
Shuyang Li, Bodhisattwa Prasad Majumder, Julian McAuley
Conversational recommender systems offer a way for users to engage in multi-turn conversations to find items they enjoy. For users to trust an agent and give effective feedback, the recommender system must be able to explain its suggestions and rationales. We develop a two-part framework for training multi-turn conversational recommenders that provide recommendation rationales that users can effectively interact with to receive better recommendations. First, we train a recommender system to jointly suggest items and explain its reasoning via subjective rationales. We then fine-tune this model to incorporate iterative user feedback via self-supervised bot-play. Experiments on three real-world datasets demonstrate that our system can be applied to different recommendation models across diverse domains to achieve state-of-the-art performance in multi-turn recommendation. Human studies show that systems trained with our framework provide more useful, helpful, and knowledgeable suggestions in warm- and cold-start settings. Our framework allows us to use only product reviews during training, avoiding the need for expensive dialog transcript datasets that limit the applicability of previous conversational recommender agents.
会话推荐系统为用户提供了一种参与多回合对话的方式,以找到他们喜欢的物品。为了让用户信任代理并给出有效的反馈,推荐系统必须能够解释它的建议和理由。我们开发了一个由两部分组成的框架来训练多回合会话推荐器,该框架提供了用户可以有效交互以获得更好推荐的推荐原理。首先,我们训练一个推荐系统来联合推荐项目,并通过主观理由解释其推理。然后我们微调这个模型,通过自我监督的机器人游戏来整合迭代用户反馈。在三个真实数据集上的实验表明,我们的系统可以应用于不同领域的不同推荐模型,以达到最先进的多回合推荐性能。人类研究表明,用我们的框架训练的系统在热启动和冷启动设置中提供更有用、更有帮助和更有知识的建议。我们的框架允许我们在训练期间只使用产品评论,避免了对昂贵的对话记录数据集的需求,这些数据集限制了以前会话推荐代理的适用性。
{"title":"Self-Supervised Bot Play for Transcript-Free Conversational Recommendation with Rationales","authors":"Shuyang Li, Bodhisattwa Prasad Majumder, Julian McAuley","doi":"10.1145/3523227.3546783","DOIUrl":"https://doi.org/10.1145/3523227.3546783","url":null,"abstract":"Conversational recommender systems offer a way for users to engage in multi-turn conversations to find items they enjoy. For users to trust an agent and give effective feedback, the recommender system must be able to explain its suggestions and rationales. We develop a two-part framework for training multi-turn conversational recommenders that provide recommendation rationales that users can effectively interact with to receive better recommendations. First, we train a recommender system to jointly suggest items and explain its reasoning via subjective rationales. We then fine-tune this model to incorporate iterative user feedback via self-supervised bot-play. Experiments on three real-world datasets demonstrate that our system can be applied to different recommendation models across diverse domains to achieve state-of-the-art performance in multi-turn recommendation. Human studies show that systems trained with our framework provide more useful, helpful, and knowledgeable suggestions in warm- and cold-start settings. Our framework allows us to use only product reviews during training, avoiding the need for expensive dialog transcript datasets that limit the applicability of previous conversational recommender agents.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116414532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reducing Cross-Topic Political Homogenization in Content-Based News Recommendation 减少基于内容的新闻推荐中的跨话题政治同质化
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546782
K. Shivaram, Ping Liu, Matthew Shapiro, M. Bilgic, A. Culotta
Content-based news recommenders learn words that correlate with user engagement and recommend articles accordingly. This can be problematic for users with diverse political preferences by topic — e.g., users that prefer conservative articles on one topic but liberal articles on another. In such instances, recommenders can have a homogenizing effect by recommending articles with the same political lean on both topics, particularly if both topics share salient, politically polarized terms like “far right” or “radical left.” In this paper, we propose attention-based neural network models to reduce this homogenization effect by increasing attention on words that are topic specific while decreasing attention on polarized, topic-general terms. We find that the proposed approach results in more accurate recommendations for simulated users with such diverse preferences.
基于内容的新闻推荐器学习与用户参与度相关的单词,并相应地推荐文章。这对于根据主题有不同政治偏好的用户来说可能是有问题的——例如,用户在一个主题上喜欢保守的文章,但在另一个主题上喜欢自由的文章。在这种情况下,推荐器可以通过推荐在两个主题上具有相同政治倾向的文章来产生同质化效果,特别是如果两个主题都有明显的、政治上两极分化的术语,如“极右翼”或“激进左翼”。在本文中,我们提出了基于注意力的神经网络模型,通过增加对主题特定词的关注,同时减少对极化、主题一般术语的关注,来减少这种同质化效应。我们发现所提出的方法可以为具有不同偏好的模拟用户提供更准确的推荐。
{"title":"Reducing Cross-Topic Political Homogenization in Content-Based News Recommendation","authors":"K. Shivaram, Ping Liu, Matthew Shapiro, M. Bilgic, A. Culotta","doi":"10.1145/3523227.3546782","DOIUrl":"https://doi.org/10.1145/3523227.3546782","url":null,"abstract":"Content-based news recommenders learn words that correlate with user engagement and recommend articles accordingly. This can be problematic for users with diverse political preferences by topic — e.g., users that prefer conservative articles on one topic but liberal articles on another. In such instances, recommenders can have a homogenizing effect by recommending articles with the same political lean on both topics, particularly if both topics share salient, politically polarized terms like “far right” or “radical left.” In this paper, we propose attention-based neural network models to reduce this homogenization effect by increasing attention on words that are topic specific while decreasing attention on polarized, topic-general terms. We find that the proposed approach results in more accurate recommendations for simulated users with such diverse preferences.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128357074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiobjective Evaluation of Reinforcement Learning Based Recommender Systems 基于强化学习的推荐系统多目标评价
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3551485
A. Grishanov, A. Ianina, K. Vorontsov
Movielens dataset has become a default choice for recommender systems evaluation. In this paper we analyze the best strategies of a Reinforcement Learning agent on Movielens (1M) dataset studying the balance between precision and diversity of recommendations. We found that trivial strategies are able to maximize ranking quality criteria, but useless for users of the recommendation system due to the lack of diversity in final predictions. Our proposed method stimulates the agent to explore the environment using the stochasticity of Ornstein-Uhlenbeck processes. Experiments show that optimization of the Ornstein-Uhlenbeck process drift coefficient improves the diversity of recommendations while maintaining high nDCG and HR criteria. To the best of our knowledge, the analysis of agent strategies in recommendation environments has not been studied excessively in previous works.
Movielens数据集已经成为推荐系统评估的默认选择。本文在Movielens (1M)数据集上分析了强化学习代理的最佳策略,研究了推荐的精度和多样性之间的平衡。我们发现,琐碎策略能够最大化排名质量标准,但由于最终预测缺乏多样性,对推荐系统的用户无用。我们提出的方法利用Ornstein-Uhlenbeck过程的随机性来激励智能体探索环境。实验表明,优化Ornstein-Uhlenbeck过程漂移系数可以提高推荐的多样性,同时保持较高的nDCG和HR标准。据我们所知,在以前的工作中,对推荐环境中的智能体策略的分析并没有过多的研究。
{"title":"Multiobjective Evaluation of Reinforcement Learning Based Recommender Systems","authors":"A. Grishanov, A. Ianina, K. Vorontsov","doi":"10.1145/3523227.3551485","DOIUrl":"https://doi.org/10.1145/3523227.3551485","url":null,"abstract":"Movielens dataset has become a default choice for recommender systems evaluation. In this paper we analyze the best strategies of a Reinforcement Learning agent on Movielens (1M) dataset studying the balance between precision and diversity of recommendations. We found that trivial strategies are able to maximize ranking quality criteria, but useless for users of the recommendation system due to the lack of diversity in final predictions. Our proposed method stimulates the agent to explore the environment using the stochasticity of Ornstein-Uhlenbeck processes. Experiments show that optimization of the Ornstein-Uhlenbeck process drift coefficient improves the diversity of recommendations while maintaining high nDCG and HR criteria. To the best of our knowledge, the analysis of agent strategies in recommendation environments has not been studied excessively in previous works.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132689968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Lightweight Transformer for Next-Item Product Recommendation 用于下一项产品推荐的轻量级变压器
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547491
M. J. Mei, Cole Zuber, Y. Khazaeni
We apply a transformer using sequential browse history to generate next-item product recommendations. Interpreting the learned item embeddings, we show that the model is able to implicitly learn price, popularity, style and functionality attributes without being explicitly passed these features during training. Our real-life test of this model on Wayfair’s different international stores show mixed results (but overall win). Diagnosing the cause, we identify a useful metric (average number of customers browsing each product) to ensure good model convergence. We also find limitations of using standard metrics like recall and nDCG, which do not correctly account for the positional effects of showing items on the Wayfair website, and empirically determine a more accurate discount factor.
我们使用顺序浏览历史应用一个转换器来生成下一项产品推荐。通过解释学习到的物品嵌入,我们表明该模型能够隐式地学习价格、流行度、风格和功能属性,而无需在训练期间显式地传递这些特征。我们在Wayfair不同的国际门店对这一模式进行了现实测试,结果好坏参半(但总体上是胜利的)。通过诊断原因,我们确定了一个有用的度量(浏览每个产品的平均客户数量),以确保良好的模型收敛性。我们还发现了使用召回率和nDCG等标准指标的局限性,它们不能正确地解释在Wayfair网站上展示商品的位置效应,并根据经验确定更准确的折扣系数。
{"title":"A Lightweight Transformer for Next-Item Product Recommendation","authors":"M. J. Mei, Cole Zuber, Y. Khazaeni","doi":"10.1145/3523227.3547491","DOIUrl":"https://doi.org/10.1145/3523227.3547491","url":null,"abstract":"We apply a transformer using sequential browse history to generate next-item product recommendations. Interpreting the learned item embeddings, we show that the model is able to implicitly learn price, popularity, style and functionality attributes without being explicitly passed these features during training. Our real-life test of this model on Wayfair’s different international stores show mixed results (but overall win). Diagnosing the cause, we identify a useful metric (average number of customers browsing each product) to ensure good model convergence. We also find limitations of using standard metrics like recall and nDCG, which do not correctly account for the positional effects of showing items on the Wayfair website, and empirically determine a more accurate discount factor.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133056979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Hands on Explainable Recommender Systems with Knowledge Graphs 使用知识图谱的可解释推荐系统
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547374
Giacomo Balloccu, Ludovico Boratto, G. Fenu, M. Marras
The goal of this tutorial is to present the RecSys community with recent advances on explainable recommender systems with knowledge graphs. We will first introduce conceptual foundations, by surveying the state of the art and describing real-world examples of how knowledge graphs are being integrated into the recommendation pipeline, also for the purpose of providing explanations. This tutorial will continue with a systematic presentation of algorithmic solutions to model, integrate, train, and assess a recommender system with knowledge graphs, with particular attention to the explainability perspective. A practical part will then provide attendees with concrete implementations of recommender systems with knowledge graphs, leveraging open-source tools and public datasets; in this part, tutorial participants will be engaged in the design of explanations accompanying the recommendations and in articulating their impact. We conclude the tutorial by analyzing emerging open issues and future directions. Website: https://explainablerecsys.github.io/recsys2022/.
本教程的目的是向RecSys社区介绍使用知识图的可解释推荐系统的最新进展。我们将首先介绍概念基础,通过调查目前的技术状况并描述如何将知识图集成到推荐管道中的现实世界示例,也是为了提供解释。本教程将继续系统地介绍使用知识图建模、集成、训练和评估推荐系统的算法解决方案,并特别关注可解释性视角。然后,实践部分将向与会者提供利用开源工具和公共数据集的知识图谱的推荐系统的具体实现;在这一部分中,教程参与者将参与设计伴随建议的解释并阐明其影响。我们通过分析新出现的开放问题和未来方向来结束本教程。网站:https://explainablerecsys.github.io/recsys2022/。
{"title":"Hands on Explainable Recommender Systems with Knowledge Graphs","authors":"Giacomo Balloccu, Ludovico Boratto, G. Fenu, M. Marras","doi":"10.1145/3523227.3547374","DOIUrl":"https://doi.org/10.1145/3523227.3547374","url":null,"abstract":"The goal of this tutorial is to present the RecSys community with recent advances on explainable recommender systems with knowledge graphs. We will first introduce conceptual foundations, by surveying the state of the art and describing real-world examples of how knowledge graphs are being integrated into the recommendation pipeline, also for the purpose of providing explanations. This tutorial will continue with a systematic presentation of algorithmic solutions to model, integrate, train, and assess a recommender system with knowledge graphs, with particular attention to the explainability perspective. A practical part will then provide attendees with concrete implementations of recommender systems with knowledge graphs, leveraging open-source tools and public datasets; in this part, tutorial participants will be engaged in the design of explanations accompanying the recommendations and in articulating their impact. We conclude the tutorial by analyzing emerging open issues and future directions. Website: https://explainablerecsys.github.io/recsys2022/.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114684640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
You Say Factorization Machine, I Say Neural Network - It’s All in the Activation 你说分解机器,我说神经网络——都在激活中
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3551499
Chen Almagor, Yedid Hoshen
In recent years, many methods for machine learning on tabular data were introduced that use either factorization machines, neural networks or both. This created a great variety of methods making it non-obvious which method should be used in practice. We begin by extending the previously established theoretical connection between polynomial neural networks and factorization machines (FM) to recently introduced FM techniques. This allows us to propose a single neural-network-based framework that can switch between the deep learning and FM paradigms by a simple change of an activation function. We further show that an activation function exists which can adaptively learn to select the optimal paradigm. Another key element in our framework is its ability to learn high-dimensional embeddings by low-rank factorization. Our framework can handle numeric and categorical data as well as multiclass outputs. Extensive empirical experiments verify our analytical claims. Source code is available at https://github.com/ChenAlmagor/FiFa
近年来,引入了许多基于表格数据的机器学习方法,这些方法要么使用分解机器,要么使用神经网络,要么两者兼而有之。这创造了各种各样的方法,使得在实践中应该使用哪种方法变得不明显。我们首先将先前建立的多项式神经网络和因子分解机(FM)之间的理论联系扩展到最近引入的FM技术。这允许我们提出一个单一的基于神经网络的框架,它可以通过简单地改变激活函数在深度学习和FM范式之间切换。我们进一步证明了存在一个可以自适应学习选择最优范式的激活函数。我们框架中的另一个关键元素是它通过低秩分解学习高维嵌入的能力。我们的框架可以处理数字和分类数据以及多类输出。大量的实证实验证实了我们的分析结论。源代码可从https://github.com/ChenAlmagor/FiFa获得
{"title":"You Say Factorization Machine, I Say Neural Network - It’s All in the Activation","authors":"Chen Almagor, Yedid Hoshen","doi":"10.1145/3523227.3551499","DOIUrl":"https://doi.org/10.1145/3523227.3551499","url":null,"abstract":"In recent years, many methods for machine learning on tabular data were introduced that use either factorization machines, neural networks or both. This created a great variety of methods making it non-obvious which method should be used in practice. We begin by extending the previously established theoretical connection between polynomial neural networks and factorization machines (FM) to recently introduced FM techniques. This allows us to propose a single neural-network-based framework that can switch between the deep learning and FM paradigms by a simple change of an activation function. We further show that an activation function exists which can adaptively learn to select the optimal paradigm. Another key element in our framework is its ability to learn high-dimensional embeddings by low-rank factorization. Our framework can handle numeric and categorical data as well as multiclass outputs. Extensive empirical experiments verify our analytical claims. Source code is available at https://github.com/ChenAlmagor/FiFa","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115387742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Building and Deploying a Multi-Stage Recommender System with Merlin 用Merlin构建和部署一个多级推荐系统
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3551468
Karl Higley, Even Oldridge, Ronay Ak, Sara Rabhi, G. Moreira
Newcomers to recommender systems often face challenges related to their lack of understanding of how these systems operate in real life. In most online content related to this topic, the focus is on models and algorithms that score items based on the user’s preferences. However, the recommender model alone does not comprise everything needed for serving optimized recommender systems that meet the company’s business objectives. An industry-standard recommender system involves a number of steps, including data preprocessing, defining and training recommender models, as well as filtering and business logic for serving. In this work, we propose the four-stage recommender system, an industry-wide design pattern we have identified for production recommender systems. The four-stage pipeline includes an item retrieval step that prepares a small subset of relevant items for scoring. The filtering stage then cleans up the subset of items based on business logic such as removing out-of-stock or previously seen items. As for the ranking component, it uses a recommender model to score each item in the presented list based on the preferences of the user. In the final step, the scores are re-ordered to provide a final recommendation list aligned with other business needs or constraints such as diversity. In particular, the presented demo demonstrates how easy it is to build and deploy a four-stage recommender system pipeline using the NVIDIA Merlin open-source framework.
刚接触推荐系统的人经常面临挑战,因为他们不了解这些系统在现实生活中的运作方式。在大多数与此主题相关的在线内容中,重点是基于用户偏好对项目进行评分的模型和算法。然而,推荐模型本身并不能包含满足公司业务目标的优化推荐系统所需要的一切。行业标准的推荐系统涉及许多步骤,包括数据预处理、定义和训练推荐模型,以及用于服务的过滤和业务逻辑。在这项工作中,我们提出了四阶段推荐系统,这是我们为生产推荐系统确定的全行业设计模式。这个四阶段的管道包括一个项目检索步骤,该步骤为评分准备一小部分相关项目。然后,过滤阶段根据业务逻辑清理项目子集,例如删除缺货或以前见过的项目。对于排名组件,它使用推荐模型根据用户的偏好对呈现列表中的每个项目进行评分。在最后一步中,对分数进行重新排序,以提供与其他业务需求或限制(如多样性)一致的最终推荐列表。特别地,演示演示了使用NVIDIA Merlin开源框架构建和部署一个四阶段推荐系统管道是多么容易。
{"title":"Building and Deploying a Multi-Stage Recommender System with Merlin","authors":"Karl Higley, Even Oldridge, Ronay Ak, Sara Rabhi, G. Moreira","doi":"10.1145/3523227.3551468","DOIUrl":"https://doi.org/10.1145/3523227.3551468","url":null,"abstract":"Newcomers to recommender systems often face challenges related to their lack of understanding of how these systems operate in real life. In most online content related to this topic, the focus is on models and algorithms that score items based on the user’s preferences. However, the recommender model alone does not comprise everything needed for serving optimized recommender systems that meet the company’s business objectives. An industry-standard recommender system involves a number of steps, including data preprocessing, defining and training recommender models, as well as filtering and business logic for serving. In this work, we propose the four-stage recommender system, an industry-wide design pattern we have identified for production recommender systems. The four-stage pipeline includes an item retrieval step that prepares a small subset of relevant items for scoring. The filtering stage then cleans up the subset of items based on business logic such as removing out-of-stock or previously seen items. As for the ranking component, it uses a recommender model to score each item in the presented list based on the preferences of the user. In the final step, the scores are re-ordered to provide a final recommendation list aligned with other business needs or constraints such as diversity. In particular, the presented demo demonstrates how easy it is to build and deploy a four-stage recommender system pipeline using the NVIDIA Merlin open-source framework.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115279616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Estimating Long-term Effects from Experimental Data 从实验数据估计长期影响
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547398
Ziyang Tang, Yiheng Duan, Steven H. Zhu, Stephanie S. Zhang, Lihong Li
A/B testing is a powerful tool for a company to make informed decisions about their services and products. A limitation of A/B tests is that they do not easily extend to measure post-experiment (long-term) differences. In this talk, we study a different approach inspired by recent advances in off-policy evaluation in reinforcement learning (RL). The basic RL approach assumes customer behavior follows a stationary Markovian process, and estimates the average engagement metric when the process reaches the steady state. However, in realistic scenarios, the stationary assumption is often violated due to weekly variations and seasonality effects. To tackle this challenge, we propose a variation by relaxing the stationary assumption. We empirically tested both stationary and nonstationary approaches in a synthetic dataset and an online store dataset.
A/B测试是公司对其服务和产品做出明智决策的强大工具。A/B测试的一个限制是,它们不容易扩展到测量实验后(长期)差异。在这次演讲中,我们研究了一种不同的方法,这种方法受到了强化学习(RL)中非政策评估的最新进展的启发。基本的强化学习方法假设客户行为遵循一个平稳的马尔可夫过程,并在该过程达到稳定状态时估计平均粘性指标。然而,在现实情况下,由于每周变化和季节性影响,通常违反平稳假设。为了解决这一挑战,我们通过放松平稳假设提出了一种变化。我们在合成数据集和在线商店数据集中对平稳和非平稳方法进行了实证测试。
{"title":"Estimating Long-term Effects from Experimental Data","authors":"Ziyang Tang, Yiheng Duan, Steven H. Zhu, Stephanie S. Zhang, Lihong Li","doi":"10.1145/3523227.3547398","DOIUrl":"https://doi.org/10.1145/3523227.3547398","url":null,"abstract":"A/B testing is a powerful tool for a company to make informed decisions about their services and products. A limitation of A/B tests is that they do not easily extend to measure post-experiment (long-term) differences. In this talk, we study a different approach inspired by recent advances in off-policy evaluation in reinforcement learning (RL). The basic RL approach assumes customer behavior follows a stationary Markovian process, and estimates the average engagement metric when the process reaches the steady state. However, in realistic scenarios, the stationary assumption is often violated due to weekly variations and seasonality effects. To tackle this challenge, we propose a variation by relaxing the stationary assumption. We empirically tested both stationary and nonstationary approaches in a synthetic dataset and an online store dataset.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122655347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Global and Personalized Graphs for Heterogeneous Sequential Recommendation by Learning Behavior Transitions and User Intentions 基于学习行为转换和用户意图的异构顺序推荐的全局和个性化图
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546761
Weixin Chen, Mingkai He, Yongxin Ni, Weike Pan, L. Chen, Zhong Ming
Heterogeneous sequential recommendation (HSR) is a very important recommendation problem, which aims to predict a user’s next interacted item under a target behavior type (e.g., purchase in e-commerce sites) based on his/her historical interactions with different behaviors. Though existing sequential methods have achieved advanced performance by considering the varied impacts of interactions with sequential information, a large body of them still have two major shortcomings. Firstly, they usually model different behaviors separately without considering the correlations between them. The transitions from item to item under diverse behaviors indicate some users’ potential behavior manner. Secondly, though the behavior information contains a user’s fine-grained interests, the insufficient consideration of the local context information limits them from well understanding user intentions. Utilizing the adjacent interactions to better understand a user’s behavior could improve the certainty of prediction. To address these two issues, we propose a novel solution utilizing global and personalized graphs for HSR (GPG4HSR) to learn behavior transitions and user intentions. Specifically, our GPG4HSR consists of two graphs, i.e., a global graph to capture the transitions between different behaviors, and a personalized graph to model items with behaviors by further considering the distinct user intentions of the adjacent contextually relevant nodes. Extensive experiments on four public datasets with the state-of-the-art baselines demonstrate the effectiveness and general applicability of our method GPG4HSR.
异构顺序推荐(HSR)是一个非常重要的推荐问题,它旨在根据用户与不同行为的历史交互,预测用户在目标行为类型下(如在电子商务网站购买)的下一个交互项目。虽然现有的序列方法通过考虑与序列信息交互的各种影响而取得了先进的性能,但大部分序列方法仍然存在两个主要缺点。首先,它们通常分别对不同的行为进行建模,而不考虑它们之间的相关性。在不同行为下从一个项目到另一个项目的转换表明了用户潜在的行为方式。其次,虽然行为信息包含了用户细粒度的兴趣,但对局部上下文信息的考虑不足,限制了他们对用户意图的理解。利用相邻的交互来更好地理解用户的行为可以提高预测的确定性。为了解决这两个问题,我们提出了一种新的解决方案,利用高铁的全局和个性化图形(GPG4HSR)来学习行为转变和用户意图。具体来说,我们的GPG4HSR由两个图组成,即一个全局图用于捕获不同行为之间的转换,一个个性化图用于通过进一步考虑相邻上下文相关节点的不同用户意图来对具有行为的项目进行建模。在四个公共数据集上进行的大量实验证明了我们的方法GPG4HSR的有效性和普遍适用性。
{"title":"Global and Personalized Graphs for Heterogeneous Sequential Recommendation by Learning Behavior Transitions and User Intentions","authors":"Weixin Chen, Mingkai He, Yongxin Ni, Weike Pan, L. Chen, Zhong Ming","doi":"10.1145/3523227.3546761","DOIUrl":"https://doi.org/10.1145/3523227.3546761","url":null,"abstract":"Heterogeneous sequential recommendation (HSR) is a very important recommendation problem, which aims to predict a user’s next interacted item under a target behavior type (e.g., purchase in e-commerce sites) based on his/her historical interactions with different behaviors. Though existing sequential methods have achieved advanced performance by considering the varied impacts of interactions with sequential information, a large body of them still have two major shortcomings. Firstly, they usually model different behaviors separately without considering the correlations between them. The transitions from item to item under diverse behaviors indicate some users’ potential behavior manner. Secondly, though the behavior information contains a user’s fine-grained interests, the insufficient consideration of the local context information limits them from well understanding user intentions. Utilizing the adjacent interactions to better understand a user’s behavior could improve the certainty of prediction. To address these two issues, we propose a novel solution utilizing global and personalized graphs for HSR (GPG4HSR) to learn behavior transitions and user intentions. Specifically, our GPG4HSR consists of two graphs, i.e., a global graph to capture the transitions between different behaviors, and a personalized graph to model items with behaviors by further considering the distinct user intentions of the adjacent contextually relevant nodes. Extensive experiments on four public datasets with the state-of-the-art baselines demonstrate the effectiveness and general applicability of our method GPG4HSR.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121157522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Hands-on Reinforcement Learning for Recommender Systems - From Bandits to SlateQ to Offline RL with Ray RLlib 推荐系统的动手强化学习-从Bandits到SlateQ到离线RL与Ray RLlib
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547370
Christy D. Bergman, Kourosh Hakhamaneshi
Reinforcement learning (RL) is gaining traction as a complementary approach to supervised learning for RecSys due to its ability to solve sequential decision-making processes for delayed rewards. Recent advances in offline reinforcement learning, off-policy evaluation, and more scalable, performant system design with the ability to run code in parallel, have made RL more tractable for the RecSys real time use cases. This tutorial introduces RLlib [9], a comprehensive open-source Python RL framework built for production workloads. RLlib is built on top of open-source Ray [8], an easy-to-use, distributed computing framework for Python that can handle complex, heterogeneous applications. Ray and RLlib run on compute clusters on any cloud without vendor lock. Using Colab notebooks, you will leave this tutorial with a complete, working example of parallelized Python RL code using RLlib for RecSys on a github repo.
由于强化学习(RL)能够解决延迟奖励的顺序决策过程,因此它作为监督学习的补充方法在RecSys中获得了越来越多的关注。最近在离线强化学习、离线策略评估、更可扩展、性能更好的系统设计以及并行运行代码的能力方面取得的进展,使得强化学习在RecSys实时用例中更容易处理。本教程介绍了RLlib[9],一个为生产工作负载构建的全面的开源Python RL框架。RLlib建立在开源的Ray[8]之上,Ray是一个易于使用的Python分布式计算框架,可以处理复杂的异构应用程序。Ray和RLlib可以在没有供应商锁定的任何云上的计算集群上运行。使用Colab笔记本,您将在本教程中留下一个完整的,使用RLlib for RecSys在github repo上并行化Python RL代码的工作示例。
{"title":"Hands-on Reinforcement Learning for Recommender Systems - From Bandits to SlateQ to Offline RL with Ray RLlib","authors":"Christy D. Bergman, Kourosh Hakhamaneshi","doi":"10.1145/3523227.3547370","DOIUrl":"https://doi.org/10.1145/3523227.3547370","url":null,"abstract":"Reinforcement learning (RL) is gaining traction as a complementary approach to supervised learning for RecSys due to its ability to solve sequential decision-making processes for delayed rewards. Recent advances in offline reinforcement learning, off-policy evaluation, and more scalable, performant system design with the ability to run code in parallel, have made RL more tractable for the RecSys real time use cases. This tutorial introduces RLlib [9], a comprehensive open-source Python RL framework built for production workloads. RLlib is built on top of open-source Ray [8], an easy-to-use, distributed computing framework for Python that can handle complex, heterogeneous applications. Ray and RLlib run on compute clusters on any cloud without vendor lock. Using Colab notebooks, you will leave this tutorial with a complete, working example of parallelized Python RL code using RLlib for RecSys on a github repo.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125151247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 16th ACM Conference on Recommender Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1