首页 > 最新文献

Proceedings of the 16th ACM Conference on Recommender Systems最新文献

英文 中文
FAccTRec 2022: The 5th Workshop on Responsible Recommendation FAccTRec 2022:第五届负责任建议研讨会
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547419
Nasim Sonboli, Toshihiro Kamishima, Amifa Raj, Luca Belli, R. Burke
The 5th Workshop on Responsible Recommendation (FAccTRec 2022) was held in conjunction with the 16th ACM Conference on Recommender Systems on September, 2022 at Seattle, USA, in a hybrid format. This workshop brought together researchers and practitioners to discuss several topics under the banner of social responsibility in recommender systems: fairness, accountability, transparency, privacy, and other ethical and social concerns. It served to advance research and discussion of these topics in the recommender systems space, and incubate ideas for future development and refinement.
第五届负责任推荐研讨会(FAccTRec 2022)与第16届ACM推荐系统会议于2022年9月在美国西雅图以混合形式举行。本次研讨会汇集了研究人员和从业人员,讨论了推荐系统中社会责任的几个主题:公平性、问责制、透明度、隐私以及其他伦理和社会问题。它有助于在推荐系统领域推进这些主题的研究和讨论,并为未来的开发和改进孕育想法。
{"title":"FAccTRec 2022: The 5th Workshop on Responsible Recommendation","authors":"Nasim Sonboli, Toshihiro Kamishima, Amifa Raj, Luca Belli, R. Burke","doi":"10.1145/3523227.3547419","DOIUrl":"https://doi.org/10.1145/3523227.3547419","url":null,"abstract":"The 5th Workshop on Responsible Recommendation (FAccTRec 2022) was held in conjunction with the 16th ACM Conference on Recommender Systems on September, 2022 at Seattle, USA, in a hybrid format. This workshop brought together researchers and practitioners to discuss several topics under the banner of social responsibility in recommender systems: fairness, accountability, transparency, privacy, and other ethical and social concerns. It served to advance research and discussion of these topics in the recommender systems space, and incubate ideas for future development and refinement.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125665847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fast and Accurate User Cold-Start Learning Using Monte Carlo Tree Search 快速和准确的用户冷启动学习使用蒙特卡洛树搜索
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546786
Dilina Chandika Rajapakse, D. Leith
We revisit the cold-start task for new users of a recommender system whereby a new user is asked to rate a few items with the aim of discovering the user’s preferences. This is a combinatorial stochastic learning task, and so difficult in general. In this paper we propose using Monte Carlo Tree Search (MCTS) to dynamically select the sequence of items presented to a new user. We find that this new MCTS-based cold-start approach is able to consistently quickly identify the preferences of a user with significantly higher accuracy than with either a decision-tree or a state of the art bandit-based approach without incurring higher regret i.e the learning performance is fundamentally superior to that of the state of the art. This boost in recommender accuracy is achieved in a computationally lightweight fashion.
我们重新审视了推荐系统新用户的冷启动任务,即要求新用户对几个项目进行评级,目的是发现用户的偏好。这是一个组合随机学习任务,一般来说很难。在本文中,我们提出使用蒙特卡罗树搜索(MCTS)来动态选择呈现给新用户的项目序列。我们发现,这种新的基于mcts的冷启动方法能够持续快速地识别用户的偏好,其准确性明显高于决策树或最先进的基于强盗的方法,而不会产生更高的遗憾,即学习性能从根本上优于最先进的方法。这种推荐精度的提升是以计算轻量级的方式实现的。
{"title":"Fast and Accurate User Cold-Start Learning Using Monte Carlo Tree Search","authors":"Dilina Chandika Rajapakse, D. Leith","doi":"10.1145/3523227.3546786","DOIUrl":"https://doi.org/10.1145/3523227.3546786","url":null,"abstract":"We revisit the cold-start task for new users of a recommender system whereby a new user is asked to rate a few items with the aim of discovering the user’s preferences. This is a combinatorial stochastic learning task, and so difficult in general. In this paper we propose using Monte Carlo Tree Search (MCTS) to dynamically select the sequence of items presented to a new user. We find that this new MCTS-based cold-start approach is able to consistently quickly identify the preferences of a user with significantly higher accuracy than with either a decision-tree or a state of the art bandit-based approach without incurring higher regret i.e the learning performance is fundamentally superior to that of the state of the art. This boost in recommender accuracy is achieved in a computationally lightweight fashion.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127680877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Query Attribute Recommendation at Amazon Search 在亚马逊搜索中查询属性推荐
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547395
Cheng-hsin Luo, William P. Headden, Neela Avudaiappan, Haoming Jiang, Tianyu Cao, Qingyu Yin, Yifan Gao, Zheng Li, R. Goutam, Haiyang Zhang, Bing Yin
Query understanding models extract attributes from search queries, like color, product type, brand, etc. Search engines rely on these attributes for ranking, advertising, and recommendation, etc. However, product search queries are usually short, three or four words on average. This information shortage limits the search engine’s power to provide high-quality services. In this talk, we would like to share our year-long journey in solving the information shortage problem and introduce an end-to-end system for attribute recommendation at Amazon Search. We showcase how the system works and how the system contributes to the long-term user experience through offline and online experiments at Amazon Search. We hope this talk can inspire more follow-up works in understanding and improving attribute recommendations in product search.
查询理解模型从搜索查询中提取属性,如颜色、产品类型、品牌等。搜索引擎依靠这些属性进行排名、广告和推荐等。然而,产品搜索查询通常很短,平均只有三到四个字。这种信息短缺限制了搜索引擎提供高质量服务的能力。在这次演讲中,我们将分享我们一年来解决信息短缺问题的历程,并介绍亚马逊搜索的端到端属性推荐系统。我们通过亚马逊搜索的离线和在线实验,展示了该系统是如何工作的,以及该系统如何为长期用户体验做出贡献。我们希望这次演讲能够启发更多的后续工作来理解和改进产品搜索中的属性推荐。
{"title":"Query Attribute Recommendation at Amazon Search","authors":"Cheng-hsin Luo, William P. Headden, Neela Avudaiappan, Haoming Jiang, Tianyu Cao, Qingyu Yin, Yifan Gao, Zheng Li, R. Goutam, Haiyang Zhang, Bing Yin","doi":"10.1145/3523227.3547395","DOIUrl":"https://doi.org/10.1145/3523227.3547395","url":null,"abstract":"Query understanding models extract attributes from search queries, like color, product type, brand, etc. Search engines rely on these attributes for ranking, advertising, and recommendation, etc. However, product search queries are usually short, three or four words on average. This information shortage limits the search engine’s power to provide high-quality services. In this talk, we would like to share our year-long journey in solving the information shortage problem and introduce an end-to-end system for attribute recommendation at Amazon Search. We showcase how the system works and how the system contributes to the long-term user experience through offline and online experiments at Amazon Search. We hope this talk can inspire more follow-up works in understanding and improving attribute recommendations in product search.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133545880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy 使用纯探索无限武装强盗策略识别具有高普遍吸引力的新播客
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546766
Maryam Aziz, J. Anderton, Kevin G. Jamieson, Alice Wang, Hugues Bouchard, J. Aslam
Podcasting is an increasingly popular medium for entertainment and discourse around the world, with tens of thousands of new podcasts released on a monthly basis. We consider the problem of identifying from these newly-released podcasts those with the largest potential audiences so they can be considered for personalized recommendation to users. We first study and then discard a supervised approach due to the inadequacy of either content or consumption features for this task, and instead propose a novel non-contextual bandit algorithm in the fixed-budget infinitely-armed pure-exploration setting. We demonstrate that our algorithm is well-suited to the best-arm identification task for a broad class of arm reservoir distributions, out-competing a large number of state-of-the-art algorithms. We then apply the algorithm to identifying podcasts with broad appeal in a simulated study, and show that it efficiently sorts podcasts into groups by increasing appeal while avoiding the popularity bias inherent in supervised approaches.
播客是一种在世界范围内日益流行的娱乐和讨论媒介,每月都会发布数以万计的新播客。我们考虑的问题是从这些新发布的播客中识别出那些拥有最大潜在受众的播客,这样就可以考虑向用户进行个性化推荐。我们首先研究并放弃了一种监督方法,因为内容或消费特征不适合该任务,而是在固定预算无限武装纯探索设置中提出了一种新的非上下文强盗算法。我们证明,我们的算法非常适合于广泛类别的臂库分布的最佳臂识别任务,胜过大量最先进的算法。然后,我们在模拟研究中应用该算法来识别具有广泛吸引力的播客,并表明它通过增加吸引力来有效地将播客分类为组,同时避免了监督方法固有的流行偏见。
{"title":"Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy","authors":"Maryam Aziz, J. Anderton, Kevin G. Jamieson, Alice Wang, Hugues Bouchard, J. Aslam","doi":"10.1145/3523227.3546766","DOIUrl":"https://doi.org/10.1145/3523227.3546766","url":null,"abstract":"Podcasting is an increasingly popular medium for entertainment and discourse around the world, with tens of thousands of new podcasts released on a monthly basis. We consider the problem of identifying from these newly-released podcasts those with the largest potential audiences so they can be considered for personalized recommendation to users. We first study and then discard a supervised approach due to the inadequacy of either content or consumption features for this task, and instead propose a novel non-contextual bandit algorithm in the fixed-budget infinitely-armed pure-exploration setting. We demonstrate that our algorithm is well-suited to the best-arm identification task for a broad class of arm reservoir distributions, out-competing a large number of state-of-the-art algorithms. We then apply the algorithm to identifying podcasts with broad appeal in a simulated study, and show that it efficiently sorts podcasts into groups by increasing appeal while avoiding the popularity bias inherent in supervised approaches.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130517465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Recommendation Systems for Ad Creation: A View from the Trenches 广告创作的推荐系统:来自战壕的观点
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547401
Manisha Verma, Shaunak Mishra
Creative design is one of the key components of generating engaging content on the web. E-commerce websites need engaging product descriptions, social networks require user posts to have different types of content such as videos, images and hashtags, and traditional media formats such as blogs require content creators to constantly innovate their writing style, and choice of content they publish to engage with their intended audience. Designing the right content, irrespective of the industry, is a time consuming task, often requires several iterations of content selection and modification. Advertising is one such industry where content is the key to capture user interest and generate revenue. Designing engaging and attention grabbing advertisements requires extensive domain knowledge and market trend awareness. This motivates companies to hire marketing specialists to design specific advertising content, most often tasked to create text, image or video advertisements. This process is tedious and iterative which limits the amount of content that can be produced manually. In this talk, we summarize our work focused on automating ad creative design by leveraging state of the art approaches in text mining, ranking, generation, multimodal (visual-linguistic) representations, multilingual text understanding, and recommendation. We discuss how such approaches can help to reduce the time spent on designing ads, and showcase their impact on real world advertising systems and metrics.
创意设计是在网络上生成引人入胜的内容的关键组成部分之一。电子商务网站需要引人入胜的产品描述,社交网络要求用户帖子包含不同类型的内容,如视频、图像和标签,而博客等传统媒体格式要求内容创作者不断创新写作风格,并选择发布的内容,以吸引目标受众。设计正确的内容,无论在哪个行业,都是一项耗时的任务,通常需要多次的内容选择和修改。广告就是这样一个行业,内容是吸引用户兴趣和产生收入的关键。设计吸引眼球的广告需要广泛的领域知识和市场趋势意识。这促使公司聘请营销专家来设计特定的广告内容,最常见的任务是创建文本、图像或视频广告。这个过程冗长且反复,限制了手工制作的内容数量。在这次演讲中,我们总结了我们在自动化广告创意设计方面的工作,通过利用文本挖掘、排名、生成、多模态(视觉语言)表示、多语言文本理解和推荐等最先进的方法。我们将讨论这些方法如何帮助减少花在设计广告上的时间,并展示它们对现实世界广告系统和指标的影响。
{"title":"Recommendation Systems for Ad Creation: A View from the Trenches","authors":"Manisha Verma, Shaunak Mishra","doi":"10.1145/3523227.3547401","DOIUrl":"https://doi.org/10.1145/3523227.3547401","url":null,"abstract":"Creative design is one of the key components of generating engaging content on the web. E-commerce websites need engaging product descriptions, social networks require user posts to have different types of content such as videos, images and hashtags, and traditional media formats such as blogs require content creators to constantly innovate their writing style, and choice of content they publish to engage with their intended audience. Designing the right content, irrespective of the industry, is a time consuming task, often requires several iterations of content selection and modification. Advertising is one such industry where content is the key to capture user interest and generate revenue. Designing engaging and attention grabbing advertisements requires extensive domain knowledge and market trend awareness. This motivates companies to hire marketing specialists to design specific advertising content, most often tasked to create text, image or video advertisements. This process is tedious and iterative which limits the amount of content that can be produced manually. In this talk, we summarize our work focused on automating ad creative design by leveraging state of the art approaches in text mining, ranking, generation, multimodal (visual-linguistic) representations, multilingual text understanding, and recommendation. We discuss how such approaches can help to reduce the time spent on designing ads, and showcase their impact on real world advertising systems and metrics.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122270505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Challenges in Translating Research to Practice for Evaluating Fairness and Bias in Recommendation Systems 将研究成果转化为评估推荐系统公平性和偏见的实践挑战
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547403
Lex Beattie, D. Taber, H. Cramer
Calls to action to implement evaluation of fairness and bias into industry systems are increasing at a rapid rate. The research community has attempted to meet these demands by producing ethical principles and guidelines for AI, but few of these documents provide guidance on how to implement these principles in real world settings. Without readily available standardized and practice-tested approaches for evaluating fairness in recommendation systems, industry practitioners, who are often not experts, may easily run into challenges or implement metrics that are potentially poorly suited to their specific applications. When evaluating recommendations, practitioners are well aware they should evaluate their systems for unintended algorithmic harms, but the most important, and unanswered question, is how? In this talk, we will present practical challenges we encountered in addressing algorithmic responsibility in recommendation systems, which also present research opportunities for the RecSys community. This talk will focus on the steps that need to happen before bias mitigation can even begin.
呼吁采取行动,在工业系统中实施公平和偏见评估的呼声正在迅速增加。研究界试图通过制定人工智能的伦理原则和指导方针来满足这些需求,但这些文件很少提供如何在现实世界环境中实施这些原则的指导。如果没有现成的标准化和经过实践测试的方法来评估推荐系统的公平性,行业从业者(通常不是专家)可能很容易遇到挑战,或者实施可能不适合其特定应用程序的指标。在评估建议时,从业者很清楚他们应该评估他们的系统是否有意想不到的算法危害,但最重要的问题是,如何评估?在这次演讲中,我们将介绍我们在解决推荐系统中的算法责任时遇到的实际挑战,这也为RecSys社区提供了研究机会。本次演讲将重点讨论在开始减轻偏见之前需要采取的步骤。
{"title":"Challenges in Translating Research to Practice for Evaluating Fairness and Bias in Recommendation Systems","authors":"Lex Beattie, D. Taber, H. Cramer","doi":"10.1145/3523227.3547403","DOIUrl":"https://doi.org/10.1145/3523227.3547403","url":null,"abstract":"Calls to action to implement evaluation of fairness and bias into industry systems are increasing at a rapid rate. The research community has attempted to meet these demands by producing ethical principles and guidelines for AI, but few of these documents provide guidance on how to implement these principles in real world settings. Without readily available standardized and practice-tested approaches for evaluating fairness in recommendation systems, industry practitioners, who are often not experts, may easily run into challenges or implement metrics that are potentially poorly suited to their specific applications. When evaluating recommendations, practitioners are well aware they should evaluate their systems for unintended algorithmic harms, but the most important, and unanswered question, is how? In this talk, we will present practical challenges we encountered in addressing algorithmic responsibility in recommendation systems, which also present research opportunities for the RecSys community. This talk will focus on the steps that need to happen before bias mitigation can even begin.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126769575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Dual Attentional Higher Order Factorization Machines 双注意力高阶分解机
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546789
Arindam Sarkar, Dipankar Das, Vivek Sembium, Prakash Mandayam Comar
Numerous problems of practical significance such as clickthrough rate (CTR) prediction, forecasting, tagging and so on, involve complex interaction of various user, item and context features. Manual feature engineering has been used in the past to model these combinatorial features but it requires domain expertise and becomes prohibitively expensive as the number of features increases. Feedforward neural networks alleviate the need for manual feature engineering to a large extent and have shown impressive performance across multiple domains due to their ability to learn arbitrary functions. Despite multiple layers of non-linear projections, neural networks are limited in their ability to efficiently model functions with higher order interaction terms. In recent years, Factorization Machines and its variants have been proposed to explicitly capture higher order combinatorial interactions. However not all feature interactions are equally important, and in sparse data settings, without a suitable suppression mechanism, this might result into noisy terms during inference and hurt model generalization. In this work we present Dual Attentional Higher Order Factorization Machine (DA-HoFM), a unified attentional higher order factorization machine which leverages a compositional architecture to compute higher order terms with complexity linear in terms of maximum interaction degree. Equipped with sparse dual attention mechanism, DA-HoFM summarizes interaction terms at each layer, and is able to efficiently select important higher order terms. We empirically demonstrate effectiveness of our proposed models on the task of CTR prediction, where our model exhibits superior performance compared to the recent state-of-the-art models, outperforming them by up to 6.7% on the logloss metric.
点击率预测、预测、标注等许多具有现实意义的问题,涉及到各种用户、物品和上下文特征的复杂交互。过去,人工特征工程被用于对这些组合特征进行建模,但它需要领域的专业知识,并且随着特征数量的增加,它变得非常昂贵。前馈神经网络在很大程度上减轻了人工特征工程的需要,并且由于其学习任意函数的能力,在多个领域显示出令人印象深刻的性能。尽管有多层非线性投影,但神经网络在对具有高阶交互项的函数进行有效建模方面的能力有限。近年来,因数分解机及其变体已被提出用于显式捕获高阶组合相互作用。然而,并不是所有的特征交互都同样重要,在稀疏数据设置中,如果没有合适的抑制机制,这可能会导致在推理过程中产生噪声项并损害模型泛化。本文提出了双注意高阶因数分解机(Dual attention高阶因数分解机,DA-HoFM),这是一种统一的注意高阶因数分解机,它利用组合结构来计算最大相互作用程度上具有线性复杂性的高阶项。DA-HoFM采用稀疏双注意机制,对每一层的交互项进行总结,能够高效地选择重要的高阶项。我们通过经验证明了我们提出的模型在CTR预测任务上的有效性,与最近最先进的模型相比,我们的模型表现出卓越的性能,在logloss指标上优于它们高达6.7%。
{"title":"Dual Attentional Higher Order Factorization Machines","authors":"Arindam Sarkar, Dipankar Das, Vivek Sembium, Prakash Mandayam Comar","doi":"10.1145/3523227.3546789","DOIUrl":"https://doi.org/10.1145/3523227.3546789","url":null,"abstract":"Numerous problems of practical significance such as clickthrough rate (CTR) prediction, forecasting, tagging and so on, involve complex interaction of various user, item and context features. Manual feature engineering has been used in the past to model these combinatorial features but it requires domain expertise and becomes prohibitively expensive as the number of features increases. Feedforward neural networks alleviate the need for manual feature engineering to a large extent and have shown impressive performance across multiple domains due to their ability to learn arbitrary functions. Despite multiple layers of non-linear projections, neural networks are limited in their ability to efficiently model functions with higher order interaction terms. In recent years, Factorization Machines and its variants have been proposed to explicitly capture higher order combinatorial interactions. However not all feature interactions are equally important, and in sparse data settings, without a suitable suppression mechanism, this might result into noisy terms during inference and hurt model generalization. In this work we present Dual Attentional Higher Order Factorization Machine (DA-HoFM), a unified attentional higher order factorization machine which leverages a compositional architecture to compute higher order terms with complexity linear in terms of maximum interaction degree. Equipped with sparse dual attention mechanism, DA-HoFM summarizes interaction terms at each layer, and is able to efficiently select important higher order terms. We empirically demonstrate effectiveness of our proposed models on the task of CTR prediction, where our model exhibits superior performance compared to the recent state-of-the-art models, outperforming them by up to 6.7% on the logloss metric.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116768722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural Recommender Systems TinyKG:知识图神经推荐系统的记忆效率训练框架
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3546760
Huiyuan Chen, Xiaoting Li, Kaixiong Zhou, Xia Hu, Chin-Chia Michael Yeh, Yan Zheng, Hao Yang
There has been an explosion of interest in designing various Knowledge Graph Neural Networks (KGNNs), which achieve state-of-the-art performance and provide great explainability for recommendation. The promising performance is mainly resulting from their capability of capturing high-order proximity messages over the knowledge graphs. However, training KGNNs at scale is challenging due to the high memory usage. In the forward pass, the automatic differentiation engines (e.g., TensorFlow/PyTorch) generally need to cache all intermediate activation maps in order to compute gradients in the backward pass, which leads to a large GPU memory footprint. Existing work solves this problem by utilizing multi-GPU distributed frameworks. Nonetheless, this poses a practical challenge when seeking to deploy KGNNs in memory-constrained environments, especially for industry-scale graphs. Here we present TinyKG, a memory-efficient GPU-based training framework for KGNNs for the tasks of recommendation. Specifically, TinyKG uses exact activations in the forward pass while storing a quantized version of activations in the GPU buffers. During the backward pass, these low-precision activations are dequantized back to full-precision tensors, in order to compute gradients. To reduce the quantization errors, TinyKG applies a simple yet effective quantization algorithm to compress the activations, which ensures unbiasedness with low variance. As such, the training memory footprint of KGNNs is largely reduced with negligible accuracy loss. To evaluate the performance of our TinyKG, we conduct comprehensive experiments on real-world datasets. We found that our TinyKG with INT2 quantization aggressively reduces the memory footprint of activation maps with 7 ×, only with 2% loss in accuracy, allowing us to deploy KGNNs on memory-constrained devices.
人们对设计各种知识图神经网络(kgnn)的兴趣激增,这些网络实现了最先进的性能,并为推荐提供了很好的可解释性。有希望的性能主要是由于它们能够捕获知识图上的高阶接近消息。然而,由于高内存使用量,大规模训练kgnn具有挑战性。在前向传递中,自动微分引擎(例如,TensorFlow/PyTorch)通常需要缓存所有中间激活映射,以便在后向传递中计算梯度,这会导致大量GPU内存占用。现有的工作通过使用多gpu分布式框架解决了这个问题。尽管如此,当寻求在内存受限的环境中部署kgnn时,这提出了一个实际的挑战,特别是对于工业规模的图。在这里,我们提出了TinyKG,一个内存高效的基于gpu的kgnn训练框架,用于推荐任务。具体来说,TinyKG在向前传递中使用精确的激活,同时在GPU缓冲区中存储量化版本的激活。在反向传递期间,这些低精度激活被去量化回全精度张量,以便计算梯度。为了减少量化误差,TinyKG采用简单有效的量化算法对激活进行压缩,保证了低方差的无偏性。因此,kgnn的训练内存占用大大减少,精度损失可以忽略不计。为了评估我们的TinyKG的性能,我们在真实的数据集上进行了全面的实验。我们发现,带有INT2量化的TinyKG大幅减少了激活图的内存占用,降低了7倍,准确性仅下降了2%,使我们能够在内存受限的设备上部署kgnn。
{"title":"TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural Recommender Systems","authors":"Huiyuan Chen, Xiaoting Li, Kaixiong Zhou, Xia Hu, Chin-Chia Michael Yeh, Yan Zheng, Hao Yang","doi":"10.1145/3523227.3546760","DOIUrl":"https://doi.org/10.1145/3523227.3546760","url":null,"abstract":"There has been an explosion of interest in designing various Knowledge Graph Neural Networks (KGNNs), which achieve state-of-the-art performance and provide great explainability for recommendation. The promising performance is mainly resulting from their capability of capturing high-order proximity messages over the knowledge graphs. However, training KGNNs at scale is challenging due to the high memory usage. In the forward pass, the automatic differentiation engines (e.g., TensorFlow/PyTorch) generally need to cache all intermediate activation maps in order to compute gradients in the backward pass, which leads to a large GPU memory footprint. Existing work solves this problem by utilizing multi-GPU distributed frameworks. Nonetheless, this poses a practical challenge when seeking to deploy KGNNs in memory-constrained environments, especially for industry-scale graphs. Here we present TinyKG, a memory-efficient GPU-based training framework for KGNNs for the tasks of recommendation. Specifically, TinyKG uses exact activations in the forward pass while storing a quantized version of activations in the GPU buffers. During the backward pass, these low-precision activations are dequantized back to full-precision tensors, in order to compute gradients. To reduce the quantization errors, TinyKG applies a simple yet effective quantization algorithm to compress the activations, which ensures unbiasedness with low variance. As such, the training memory footprint of KGNNs is largely reduced with negligible accuracy loss. To evaluate the performance of our TinyKG, we conduct comprehensive experiments on real-world datasets. We found that our TinyKG with INT2 quantization aggressively reduces the memory footprint of activation maps with 7 ×, only with 2% loss in accuracy, allowing us to deploy KGNNs on memory-constrained devices.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130327199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Optimizing product recommendations for millions of merchants 为数百万商家优化产品推荐
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547393
Kim Falk, Chen Karako
At Shopify, we serve product recommendations to customers across millions of merchants’ online stores. It is a challenge to provide optimized recommendations to all of these independent merchants; one model might lead to an overall improvement in our metrics on aggregate, but significantly degrade recommendations for some stores. To ensure we provide high quality recommendations to all merchant segments, we develop several models that work best in different situations as determined in offline evaluation. Learning which strategy best works for a given segment also allows us to start off new stores with good recommendations, without necessarily needing to rely on an individual store amassing large amounts of traffic. In production, the system will start out with the best strategy for a given merchant, and then adjust to the current environment using multi-armed bandits. Collectively, this methodology allows us to optimize the types of recommendations served on each store.
在Shopify,我们为数百万商家的在线商店的客户提供产品推荐。向所有这些独立商家提供优化的推荐是一个挑战;一个模型可能会导致总体指标的改进,但会显著降低对某些商店的推荐。为了确保我们向所有商家提供高质量的推荐,我们开发了几个模型,这些模型在离线评估中确定的不同情况下效果最好。了解哪种策略最适合特定的细分市场,也使我们能够根据良好的推荐开设新店,而不必依赖于单个商店积累大量流量。在生产中,系统将以给定商人的最佳策略开始,然后使用多武装土匪调整到当前环境。总的来说,这种方法使我们能够优化每个商店提供的推荐类型。
{"title":"Optimizing product recommendations for millions of merchants","authors":"Kim Falk, Chen Karako","doi":"10.1145/3523227.3547393","DOIUrl":"https://doi.org/10.1145/3523227.3547393","url":null,"abstract":"At Shopify, we serve product recommendations to customers across millions of merchants’ online stores. It is a challenge to provide optimized recommendations to all of these independent merchants; one model might lead to an overall improvement in our metrics on aggregate, but significantly degrade recommendations for some stores. To ensure we provide high quality recommendations to all merchant segments, we develop several models that work best in different situations as determined in offline evaluation. Learning which strategy best works for a given segment also allows us to start off new stores with good recommendations, without necessarily needing to rely on an individual store amassing large amounts of traffic. In production, the system will start out with the best strategy for a given merchant, and then adjust to the current environment using multi-armed bandits. Collectively, this methodology allows us to optimize the types of recommendations served on each store.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129657666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Training and Deploying Multi-Stage Recommender Systems 培训和部署多阶段推荐系统
Pub Date : 2022-09-18 DOI: 10.1145/3523227.3547372
Ronay Ak, Benedikt D. Schifferer, Sara Rabhi, G. Moreira
Industrial recommender systems are made up of complex pipelines requiring multiple steps including feature engineering and preprocessing, a retrieval model for candidate generation, filtering, a feature store query, a ranking model for scoring, and an ordering stage. These pipelines need to be carefully deployed as a set, requiring coordination during their development and deployment. Data scientists, ML engineers, and researchers might focus on different stages of recommender systems, however they share a common desire to reduce the time and effort searching for and combining boilerplate code coming from different sources or writing custom code from scratch to create their own RecSys pipelines. This tutorial introduces the Merlin framework which aims to make the development and deployment of recommender systems easier, providing methods for evaluating existing approaches, developing new ideas and deploying them to production. There are many techniques, such as different model architectures (e.g. MF, DLRM, DCN, etc), negative sampling strategies, loss functions or prediction tasks (binary, multi-class, multi-task) that are commonly used in these pipelines. Merlin provides building blocks that allow RecSys practitioners to focus on the “what” question in designing their model pipeline instead of “how”. Supporting research into new ideas within the RecSys spaces is equally important and Merlin supports the addition of custom components and the extension of existing ones to address gaps. In this tutorial, participants will learn: (i) how to easily implement common recommender system techniques for comparison, (ii) how to modify components to evaluate new ideas, and (iii) deploying recommender systems, bringing new ideas to production- using an open source framework Merlin and its libraries.
工业推荐系统由复杂的管道组成,需要多个步骤,包括特征工程和预处理、用于候选生成的检索模型、过滤、特征存储查询、用于评分的排名模型和排序阶段。这些管道需要作为一个整体仔细部署,在开发和部署期间需要进行协调。数据科学家、机器学习工程师和研究人员可能会关注推荐系统的不同阶段,但他们都有一个共同的愿望,即减少搜索和组合来自不同来源的模板代码或从头编写自定义代码以创建自己的RecSys管道的时间和精力。本教程介绍了Merlin框架,该框架旨在简化推荐系统的开发和部署,提供评估现有方法、开发新想法和将其部署到生产环境的方法。有许多技术,例如不同的模型架构(例如MF, DLRM, DCN等),负采样策略,损失函数或预测任务(二进制,多类,多任务),通常在这些管道中使用。Merlin提供的构建块允许RecSys从业者在设计模型管道时关注“是什么”问题,而不是“如何”问题。支持研究RecSys空间中的新想法同样重要,Merlin支持添加定制组件和扩展现有组件以解决差距。在本教程中,参与者将学习:(i)如何轻松实现通用的推荐系统技术进行比较,(ii)如何修改组件以评估新想法,以及(iii)部署推荐系统,将新想法带入生产-使用开源框架Merlin及其库。
{"title":"Training and Deploying Multi-Stage Recommender Systems","authors":"Ronay Ak, Benedikt D. Schifferer, Sara Rabhi, G. Moreira","doi":"10.1145/3523227.3547372","DOIUrl":"https://doi.org/10.1145/3523227.3547372","url":null,"abstract":"Industrial recommender systems are made up of complex pipelines requiring multiple steps including feature engineering and preprocessing, a retrieval model for candidate generation, filtering, a feature store query, a ranking model for scoring, and an ordering stage. These pipelines need to be carefully deployed as a set, requiring coordination during their development and deployment. Data scientists, ML engineers, and researchers might focus on different stages of recommender systems, however they share a common desire to reduce the time and effort searching for and combining boilerplate code coming from different sources or writing custom code from scratch to create their own RecSys pipelines. This tutorial introduces the Merlin framework which aims to make the development and deployment of recommender systems easier, providing methods for evaluating existing approaches, developing new ideas and deploying them to production. There are many techniques, such as different model architectures (e.g. MF, DLRM, DCN, etc), negative sampling strategies, loss functions or prediction tasks (binary, multi-class, multi-task) that are commonly used in these pipelines. Merlin provides building blocks that allow RecSys practitioners to focus on the “what” question in designing their model pipeline instead of “how”. Supporting research into new ideas within the RecSys spaces is equally important and Merlin supports the addition of custom components and the extension of existing ones to address gaps. In this tutorial, participants will learn: (i) how to easily implement common recommender system techniques for comparison, (ii) how to modify components to evaluate new ideas, and (iii) deploying recommender systems, bringing new ideas to production- using an open source framework Merlin and its libraries.","PeriodicalId":443279,"journal":{"name":"Proceedings of the 16th ACM Conference on Recommender Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124962645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 16th ACM Conference on Recommender Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1