New Probabilistic Models for Recommender Systems with Rich Contextual and Content Information

Eliezer de Souza da Silva
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, page 839.
DOI: 10.1145/3018661.3022751 · Published: 2017-02-02 · Citations: 4

Abstract

This project focuses on the design of probabilistic models for recommender systems and collaborative filtering, extending and creating new models to include rich contextual and content information (content, user social network, location, time, user intent, etc.), and developing scalable approximate inference algorithms for these models. The working hypothesis is that big data analytics combined with probabilistic modelling, through the automatic mining of various data sources and the combination of different latent factors that explain how users interact with items, can be used to better infer user behaviour and generate improved recommendations. Fundamentally, we are interested in the following questions: 1) Does additional contextual information improve the quality of recommender systems? 2) What factors (features, models, methods) are relevant in the design of personalized systems? 3) What is the relation between the social network structure, the user model, and the information need of the user? How does the social context interfere with user preferences? How can the evolution of the social network structure explain changes in the user preference model? 4) Does the choice of approximate inference method have a significant impact on the quality of the system (quality-efficiency trade-offs)? To address some of these questions, we started by proposing a model (Figure 1) based on Poisson factorization models [2], combining a social factorization model [1] and a topic-based factorization [3]. The main idea is to combine content latent factors (topics, tags, etc.) and trust between users (trust weights in a social graph) in such a way that both sources of information have additive effects on the observed ratings. In the case of Poisson models, this additive constraint induces non-negative latent factors that are sparser and less prone to overfitting than in comparable Gaussian-based models [2].
The main objective at this point is to compare models that incorporate both sources of information (content and social networks); the next steps will include empirical validation. In conclusion, we are interested in the interplay between large-scale data mining and probabilistic modelling in the design of recommender systems. One initial approach we are pursuing is to model content and social network features in a Poisson latent variable model. Our main objective going forward is the development of methods with competitive computational complexity for performing inference with heterogeneous data in dynamic probabilistic models, as well as exploring the scalability limits of the models we propose.
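The additive structure described above, a content factorization term plus a trust-weighted social term feeding a Poisson observation model, can be sketched as a small generative simulation. This is a minimal illustrative sketch in NumPy, not the authors' implementation: the hyperparameter values, the random trust graph, and the use of content-only base ratings as the neighbours' social signal are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_users, n_items, K = 50, 40, 5

# Non-negative Gamma-distributed latent factors (shape/scale values are illustrative)
theta = rng.gamma(0.3, 1.0, size=(n_users, K))   # user preference factors
beta = rng.gamma(0.3, 1.0, size=(n_items, K))    # item content/topic factors

# Sparse trust weights tau[u, v] >= 0: how much user u trusts user v
trust_mask = rng.random((n_users, n_users)) < 0.05
tau = rng.gamma(0.5, 1.0, size=(n_users, n_users)) * trust_mask
np.fill_diagonal(tau, 0.0)                       # no self-trust

# Base ratings driven by the content factorization alone
base = rng.poisson(theta @ beta.T)               # (n_users, n_items) counts

# Additive Poisson rate: content term plus trust-weighted neighbour ratings
rate = theta @ beta.T + tau @ base

# Observed ratings as Poisson counts; non-negativity is guaranteed by construction
ratings = rng.poisson(rate)
```

Because every term in the rate is non-negative, the social contribution can only raise the expected rating above the content-only baseline, which is the additive-effects property the abstract appeals to.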