ACM Transactions on Information Systems (TOIS)最新文献_第3页

Truncated Models for Probabilistic Weighted Retrieval 概率加权检索的截断模型

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-08 DOI: 10.1145/3476837

Jiaul H. Paik, Yash Agrawal, Sahil Rishi, Vaishal Shah

Existing probabilistic retrieval models do not restrict the domain of the random variables that they deal with. In this article, we show that the upper bound of the normalized term frequency (tf) from the relevant documents is much smaller than the upper bound of the normalized tf from the whole collection. As a result, the existing models suffer from two major problems: (i) the domain mismatch causes data modeling error, (ii) since the outliers have very large magnitude and the retrieval models follow tf hypothesis, the combination of these two factors tends to overestimate the relevance score. In an attempt to address these problems, we propose novel weighted probabilistic models based on truncated distributions. We evaluate our models on a set of large document collections. Significant performance improvement over six existing probabilistic models is demonstrated.

现有的概率检索模型没有对所处理的随机变量的域进行限制。在本文中，我们证明了相关文档的归一化项频率(tf)的上界远小于整个集合的归一化项频率(tf)的上界。因此，现有的模型存在两个主要问题:(1)领域不匹配导致数据建模误差;(2)由于异常值的幅度很大，检索模型遵循tf假设，这两个因素的结合往往会高估相关性评分。为了解决这些问题，我们提出了一种基于截断分布的加权概率模型。我们在一组大型文档集合上评估我们的模型。与现有的六种概率模型相比，证明了显著的性能改进。

引用次数: 2

Personalized and Explainable Employee Training Course Recommendations: A Bayesian Variational Approach 个性化和可解释的员工培训课程建议:贝叶斯变分方法

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-08 DOI: 10.1145/3490476

Chao Wang, Hengshu Zhu, Peng Wang, Chen Zhu, Xi Zhang, Enhong Chen, Hui Xiong

As a major component of strategic talent management, learning and development (L&D) aims at improving the individual and organization performances through planning tailored training for employees to increase and improve their skills and knowledge. While many companies have developed the learning management systems (LMSs) for facilitating the online training of employees, a long-standing important issue is how to achieve personalized training recommendations with the consideration of their needs for future career development. To this end, in this article, we present a focused study on the explainable personalized online course recommender system for enhancing employee training and development. Specifically, we first propose a novel end-to-end hierarchical framework, namely Demand-aware Collaborative Bayesian Variational Network (DCBVN), to jointly model both the employees’ current competencies and their career development preferences in an explainable way. In DCBVN, we first extract the latent interpretable representations of the employees’ competencies from their skill profiles with autoencoding variational inference based topic modeling. Then, we develop an effective demand recognition mechanism for learning the personal demands of career development for employees. In particular, all the above processes are integrated into a unified Bayesian inference view for obtaining both accurate and explainable recommendations. Furthermore, for handling the employees with sparse or missing skill profiles, we develop an improved version of DCBVN, called the Demand-aware Collaborative Competency Attentive Network (DCCAN) framework, by considering the connectivity among employees. In DCCAN, we first build two employee competency graphs from learning and working aspects. Then, we design a graph-attentive network and a multi-head integration mechanism to infer one’s competency information from her neighborhood employees. Finally, we can generate explainable recommendation results based on the competency representations. Extensive experimental results on real-world data clearly demonstrate the effectiveness and the interpretability of both of our frameworks, as well as their robustness on sparse and cold-start scenarios.

作为战略人才管理的重要组成部分，学习与发展(L&D)旨在通过为员工规划量身定制的培训来提高个人和组织的绩效，以增加和改善员工的技能和知识。虽然许多公司已经开发了学习管理系统(lms)来促进员工的在线培训，但一个长期存在的重要问题是如何在考虑员工未来职业发展需求的情况下实现个性化的培训建议。为此，本文重点研究了可解释的个性化在线课程推荐系统，以促进员工的培训和发展。具体而言，我们首先提出了一个新的端到端分层框架，即需求感知协同贝叶斯变分网络(DCBVN)，以一种可解释的方式共同建模员工当前胜任力和职业发展偏好。在DCBVN中，我们首先利用基于自编码变分推理的主题建模从员工的技能概况中提取潜在的可解释表征。然后，建立有效的需求识别机制，了解员工职业发展的个人需求。特别是，所有这些过程都集成到一个统一的贝叶斯推理视图中，以获得准确和可解释的建议。此外，为了处理技能特征稀疏或缺失的员工，我们通过考虑员工之间的连通性，开发了DCBVN的改进版本，称为需求感知协作能力关注网络(DCCAN)框架。在DCCAN中，我们首先从学习和工作两个方面构建了员工胜任力图。然后，我们设计了一个图关注网络和一个多头整合机制，从她的邻居员工中推断出一个人的胜任力信息。最后，我们可以基于能力表征生成可解释的推荐结果。在真实世界数据上的大量实验结果清楚地证明了我们的两个框架的有效性和可解释性，以及它们在稀疏和冷启动场景下的鲁棒性。

{"title":"Personalized and Explainable Employee Training Course Recommendations: A Bayesian Variational Approach","authors":"Chao Wang, Hengshu Zhu, Peng Wang, Chen Zhu, Xi Zhang, Enhong Chen, Hui Xiong","doi":"10.1145/3490476","DOIUrl":"https://doi.org/10.1145/3490476","url":null,"abstract":"As a major component of strategic talent management, learning and development (L&D) aims at improving the individual and organization performances through planning tailored training for employees to increase and improve their skills and knowledge. While many companies have developed the learning management systems (LMSs) for facilitating the online training of employees, a long-standing important issue is how to achieve personalized training recommendations with the consideration of their needs for future career development. To this end, in this article, we present a focused study on the explainable personalized online course recommender system for enhancing employee training and development. Specifically, we first propose a novel end-to-end hierarchical framework, namely Demand-aware Collaborative Bayesian Variational Network (DCBVN), to jointly model both the employees’ current competencies and their career development preferences in an explainable way. In DCBVN, we first extract the latent interpretable representations of the employees’ competencies from their skill profiles with autoencoding variational inference based topic modeling. Then, we develop an effective demand recognition mechanism for learning the personal demands of career development for employees. In particular, all the above processes are integrated into a unified Bayesian inference view for obtaining both accurate and explainable recommendations. Furthermore, for handling the employees with sparse or missing skill profiles, we develop an improved version of DCBVN, called the Demand-aware Collaborative Competency Attentive Network (DCCAN) framework, by considering the connectivity among employees. In DCCAN, we first build two employee competency graphs from learning and working aspects. Then, we design a graph-attentive network and a multi-head integration mechanism to infer one’s competency information from her neighborhood employees. Finally, we can generate explainable recommendation results based on the competency representations. Extensive experimental results on real-world data clearly demonstrate the effectiveness and the interpretability of both of our frameworks, as well as their robustness on sparse and cold-start scenarios.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"42 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81014618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

Hyperspherical Variational Co-embedding for Attributed Networks 属性网络的超球面变分共嵌入

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-08 DOI: 10.1145/3478284

Ji Fang, Shangsong Liang, Zaiqiao Meng, M. de Rijke

Network-based information has been widely explored and exploited in the information retrieval literature. Attributed networks, consisting of nodes, edges as well as attributes describing properties of nodes, are a basic type of network-based data, and are especially useful for many applications. Examples include user profiling in social networks and item recommendation in user-item purchase networks. Learning useful and expressive representations of entities in attributed networks can provide more effective building blocks to down-stream network-based tasks such as link prediction and attribute inference. Practically, input features of attributed networks are normalized as unit directional vectors. However, most network embedding techniques ignore the spherical nature of inputs and focus on learning representations in a Gaussian or Euclidean space, which, we hypothesize, might lead to less effective representations. To obtain more effective representations of attributed networks, we investigate the problem of mapping an attributed network with unit normalized directional features into a non-Gaussian and non-Euclidean space. Specifically, we propose a hyperspherical variational co-embedding for attributed networks (HCAN), which is based on generalized variational auto-encoders for heterogeneous data with multiple types of entities. HCAN jointly learns latent embeddings for both nodes and attributes in a unified hyperspherical space such that the affinities between nodes and attributes can be captured effectively. We argue that this is a crucial feature in many real-world applications of attributed networks. Previous Gaussian network embedding algorithms break the assumption of uninformative prior, which leads to unstable results and poor performance. In contrast, HCAN embeds nodes and attributes as von Mises-Fisher distributions, and allows one to capture the uncertainty of the inferred representations. Experimental results on eight datasets show that HCAN yields better performance in a number of applications compared with nine state-of-the-art baselines.

基于网络的信息在信息检索文献中得到了广泛的探索和利用。属性网络由节点、边以及描述节点属性的属性组成，是基于网络的数据的一种基本类型，对许多应用程序特别有用。示例包括社交网络中的用户分析和用户-物品购买网络中的物品推荐。学习属性网络中实体的有用和富有表现力的表示可以为下游基于网络的任务(如链接预测和属性推理)提供更有效的构建块。实际上，属性网络的输入特征归一化为单位方向向量。然而，大多数网络嵌入技术忽略了输入的球形性质，并专注于在高斯或欧几里得空间中学习表示，我们假设这可能会导致不太有效的表示。为了获得更有效的属性网络表示，我们研究了将具有单位归一化方向特征的属性网络映射到非高斯和非欧几里得空间的问题。具体而言，我们提出了一种基于广义变分自编码器的属性网络超球面变分共嵌入(HCAN)方法，用于具有多种类型实体的异构数据。HCAN在统一的超球面空间中共同学习节点和属性的潜在嵌入，从而有效地捕获节点和属性之间的亲和力。我们认为这是属性网络在许多实际应用中的一个关键特征。以往的高斯网络嵌入算法打破了无信息先验假设，导致结果不稳定，性能不佳。相比之下，HCAN将节点和属性嵌入为von Mises-Fisher分布，并允许捕获推断表示的不确定性。在8个数据集上的实验结果表明，与9个最先进的基线相比，HCAN在许多应用中产生了更好的性能。

{"title":"Hyperspherical Variational Co-embedding for Attributed Networks","authors":"Ji Fang, Shangsong Liang, Zaiqiao Meng, M. de Rijke","doi":"10.1145/3478284","DOIUrl":"https://doi.org/10.1145/3478284","url":null,"abstract":"Network-based information has been widely explored and exploited in the information retrieval literature. Attributed networks, consisting of nodes, edges as well as attributes describing properties of nodes, are a basic type of network-based data, and are especially useful for many applications. Examples include user profiling in social networks and item recommendation in user-item purchase networks. Learning useful and expressive representations of entities in attributed networks can provide more effective building blocks to down-stream network-based tasks such as link prediction and attribute inference. Practically, input features of attributed networks are normalized as unit directional vectors. However, most network embedding techniques ignore the spherical nature of inputs and focus on learning representations in a Gaussian or Euclidean space, which, we hypothesize, might lead to less effective representations. To obtain more effective representations of attributed networks, we investigate the problem of mapping an attributed network with unit normalized directional features into a non-Gaussian and non-Euclidean space. Specifically, we propose a hyperspherical variational co-embedding for attributed networks (HCAN), which is based on generalized variational auto-encoders for heterogeneous data with multiple types of entities. HCAN jointly learns latent embeddings for both nodes and attributes in a unified hyperspherical space such that the affinities between nodes and attributes can be captured effectively. We argue that this is a crucial feature in many real-world applications of attributed networks. Previous Gaussian network embedding algorithms break the assumption of uninformative prior, which leads to unstable results and poor performance. In contrast, HCAN embeds nodes and attributes as von Mises-Fisher distributions, and allows one to capture the uncertainty of the inferred representations. Experimental results on eight datasets show that HCAN yields better performance in a number of applications compared with nine state-of-the-art baselines.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"101 1","pages":"1 - 36"},"PeriodicalIF":0.0,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88584297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

A Comparison between Term-Independence Retrieval Models for Ad Hoc Retrieval 自组织检索中词无关检索模型的比较

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-08 DOI: 10.1145/3483612

E. K. F. Dang, R. Luk, James Allan

In Information Retrieval, numerous retrieval models or document ranking functions have been developed in the quest for better retrieval effectiveness. Apart from some formal retrieval models formulated on a theoretical basis, various recent works have applied heuristic constraints to guide the derivation of document ranking functions. While many recent methods are shown to improve over established and successful models, comparison among these new methods under a common environment is often missing. To address this issue, we perform an extensive and up-to-date comparison of leading term-independence retrieval models implemented in our own retrieval system. Our study focuses on the following questions: (RQ1) Is there a retrieval model that consistently outperforms all other models across multiple collections; (RQ2) What are the important features of an effective document ranking function? Our retrieval experiments performed on several TREC test collections of a wide range of sizes (up to the terabyte-sized Clueweb09 Category B) enable us to answer these research questions. This work also serves as a reproducibility study for leading retrieval models. While our experiments show that no single retrieval model outperforms all others across all tested collections, some recent retrieval models, such as MATF and MVD, consistently perform better than the common baselines.

在信息检索中，为了提高检索效率，开发了许多检索模型或文档排序函数。除了一些在理论基础上制定的正式检索模型外，最近的各种工作都应用启发式约束来指导文档排序函数的推导。虽然许多最近的方法被证明是对已建立和成功的模型的改进，但在共同环境下对这些新方法的比较往往是缺失的。为了解决这个问题，我们对在我们自己的检索系统中实现的主要术语独立检索模型进行了广泛和最新的比较。我们的研究主要集中在以下问题上:(RQ1)是否存在一个检索模型在多个集合中始终优于所有其他模型;(RQ2)有效的文档排序功能的重要特征是什么?我们在几个不同大小的TREC测试集合上进行的检索实验(高达tb大小的Clueweb09 B类)使我们能够回答这些研究问题。这项工作也可作为主要检索模型的可重复性研究。虽然我们的实验表明，在所有被测试的集合中，没有一个检索模型优于所有其他模型，但是一些最近的检索模型，如MATF和MVD，始终比公共基线表现得更好。

{"title":"A Comparison between Term-Independence Retrieval Models for Ad Hoc Retrieval","authors":"E. K. F. Dang, R. Luk, James Allan","doi":"10.1145/3483612","DOIUrl":"https://doi.org/10.1145/3483612","url":null,"abstract":"In Information Retrieval, numerous retrieval models or document ranking functions have been developed in the quest for better retrieval effectiveness. Apart from some formal retrieval models formulated on a theoretical basis, various recent works have applied heuristic constraints to guide the derivation of document ranking functions. While many recent methods are shown to improve over established and successful models, comparison among these new methods under a common environment is often missing. To address this issue, we perform an extensive and up-to-date comparison of leading term-independence retrieval models implemented in our own retrieval system. Our study focuses on the following questions: (RQ1) Is there a retrieval model that consistently outperforms all other models across multiple collections; (RQ2) What are the important features of an effective document ranking function? Our retrieval experiments performed on several TREC test collections of a wide range of sizes (up to the terabyte-sized Clueweb09 Category B) enable us to answer these research questions. This work also serves as a reproducibility study for leading retrieval models. While our experiments show that no single retrieval model outperforms all others across all tested collections, some recent retrieval models, such as MATF and MVD, consistently perform better than the common baselines.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"77 1","pages":"1 - 37"},"PeriodicalIF":0.0,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80788567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Grounded Task Prioritization with Context-Aware Sequential Ranking 基于上下文感知顺序排序的任务优先级

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-08 DOI: 10.1145/3486861

Chuxu Zhang, Julia Kiseleva, S. Jauhar, Ryen W. White

People rely on task management applications and digital assistants to capture and track their tasks, and help with executing them. The burden of organizing and scheduling time for tasks continues to reside with users of these systems, despite the high cognitive load associated with these activities. Users stand to benefit greatly from a task management system capable of prioritizing their pending tasks, thus saving them time and effort. In this article, we make three main contributions. First, we propose the problem of task prioritization, formulating it as a ranking over a user’s pending tasks given a history of previous interactions with a task management system. Second, we perform an extensive analysis on the large-scale anonymized, de-identified logs of a popular task management application, deriving a dataset of grounded, real-world tasks from which to learn and evaluate our proposed system. We also identify patterns in how people record tasks as complete, which vary consistently with the nature of the task. Third, we propose a novel contextual deep learning solution capable of performing personalized task prioritization. In a battery of tests, we show that this approach outperforms several operational baselines and other sequential ranking models from previous work. Our findings have implications for understanding the ways people prioritize and manage tasks with digital tools, and in the design of support for users of task management applications.

人们依靠任务管理应用程序和数字助理来捕获和跟踪他们的任务，并帮助他们执行任务。组织和安排任务时间的负担仍然由这些系统的用户承担，尽管与这些活动相关的认知负荷很高。用户将从任务管理系统中受益匪浅，该系统能够对待处理任务进行优先级排序，从而节省时间和精力。在本文中，我们做出了三个主要贡献。首先，我们提出了任务优先级问题，将其表述为给定用户先前与任务管理系统交互历史的待处理任务的排序。其次，我们对一个流行任务管理应用程序的大规模匿名、去识别日志进行了广泛的分析，得出了一个真实世界任务的数据集，从中学习和评估我们提出的系统。我们还确定了人们如何将任务记录为完成的模式，这些模式与任务的性质一致。第三，我们提出了一种新的上下文深度学习解决方案，能够执行个性化的任务优先级。在一系列测试中，我们表明该方法优于以前工作中的几个操作基线和其他顺序排序模型。我们的研究结果对理解人们使用数字工具对任务进行优先排序和管理的方式，以及对任务管理应用程序用户的支持设计具有启示意义。

{"title":"Grounded Task Prioritization with Context-Aware Sequential Ranking","authors":"Chuxu Zhang, Julia Kiseleva, S. Jauhar, Ryen W. White","doi":"10.1145/3486861","DOIUrl":"https://doi.org/10.1145/3486861","url":null,"abstract":"People rely on task management applications and digital assistants to capture and track their tasks, and help with executing them. The burden of organizing and scheduling time for tasks continues to reside with users of these systems, despite the high cognitive load associated with these activities. Users stand to benefit greatly from a task management system capable of prioritizing their pending tasks, thus saving them time and effort. In this article, we make three main contributions. First, we propose the problem of task prioritization, formulating it as a ranking over a user’s pending tasks given a history of previous interactions with a task management system. Second, we perform an extensive analysis on the large-scale anonymized, de-identified logs of a popular task management application, deriving a dataset of grounded, real-world tasks from which to learn and evaluate our proposed system. We also identify patterns in how people record tasks as complete, which vary consistently with the nature of the task. Third, we propose a novel contextual deep learning solution capable of performing personalized task prioritization. In a battery of tests, we show that this approach outperforms several operational baselines and other sequential ranking models from previous work. Our findings have implications for understanding the ways people prioritize and manage tasks with digital tools, and in the design of support for users of task management applications.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"75 1","pages":"1 - 28"},"PeriodicalIF":0.0,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86428384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Graph Co-Attentive Session-based Recommendation 图表基于共同关注会话的推荐

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-12-01 DOI: 10.1145/3486711

Zhiqiang Pan, Fei Cai, Wanyu Chen, Honghui Chen

Session-based recommendation aims to generate recommendations merely based on the ongoing session, which is a challenging task. Previous methods mainly focus on modeling the sequential signals or the transition relations between items in the current session using RNNs or GNNs to identify user’s intent for recommendation. Such models generally ignore the dynamic connections between the local and global item transition patterns, although the global information is taken into consideration by exploiting the global-level pair-wise item transitions. Moreover, existing methods that mainly adopt the cross-entropy loss with softmax generally face a serious over-fitting problem, harming the recommendation accuracy. Thus, in this article, we propose a Graph Co-Attentive Recommendation Machine (GCARM) for session-based recommendation. In detail, we first design a Graph Co-Attention Network (GCAT) to consider the dynamic correlations between the local and global neighbors of each node during the information propagation. Then, the item-level dynamic connections between the output of the local and global graphs are modeled to generate the final item representations. After that, we produce the prediction scores and design a Max Cross-Entropy (MCE) loss to prevent over-fitting. Extensive experiments are conducted on three benchmark datasets, i.e., Diginetica, Gowalla, and Yoochoose. The experimental results show that GCARM can achieve the state-of-the-art performance in terms of Recall and MRR, especially on boosting the ranking of the target item.

基于会话的推荐旨在仅基于正在进行的会话生成建议，这是一项具有挑战性的任务。以前的方法主要是利用rnn或gnn对当前会话中的顺序信号或项目之间的转换关系进行建模，以识别用户的推荐意图。这种模型通常忽略了局部和全局项目转换模式之间的动态连接，尽管通过利用全局级成对项目转换考虑了全局信息。此外，现有的主要采用交叉熵损失和softmax的推荐方法普遍存在严重的过拟合问题，影响了推荐的准确性。因此，在本文中，我们提出了一个基于会话的推荐图协同关注推荐机(GCARM)。首先，我们设计了一个图协同关注网络(GCAT)来考虑信息传播过程中每个节点的局部邻居和全局邻居之间的动态相关性。然后，对局部图和全局图的输出之间的项目级动态连接进行建模，以生成最终的项目表示。然后，我们生成预测分数并设计一个最大交叉熵(MCE)损失来防止过拟合。在Diginetica、Gowalla和Yoochoose三个基准数据集上进行了大量的实验。实验结果表明，GCARM在查全率和MRR方面取得了较好的效果，特别是在提高目标条目的排名方面。

{"title":"Graph Co-Attentive Session-based Recommendation","authors":"Zhiqiang Pan, Fei Cai, Wanyu Chen, Honghui Chen","doi":"10.1145/3486711","DOIUrl":"https://doi.org/10.1145/3486711","url":null,"abstract":"Session-based recommendation aims to generate recommendations merely based on the ongoing session, which is a challenging task. Previous methods mainly focus on modeling the sequential signals or the transition relations between items in the current session using RNNs or GNNs to identify user’s intent for recommendation. Such models generally ignore the dynamic connections between the local and global item transition patterns, although the global information is taken into consideration by exploiting the global-level pair-wise item transitions. Moreover, existing methods that mainly adopt the cross-entropy loss with softmax generally face a serious over-fitting problem, harming the recommendation accuracy. Thus, in this article, we propose a Graph Co-Attentive Recommendation Machine (GCARM) for session-based recommendation. In detail, we first design a Graph Co-Attention Network (GCAT) to consider the dynamic correlations between the local and global neighbors of each node during the information propagation. Then, the item-level dynamic connections between the output of the local and global graphs are modeled to generate the final item representations. After that, we produce the prediction scores and design a Max Cross-Entropy (MCE) loss to prevent over-fitting. Extensive experiments are conducted on three benchmark datasets, i.e., Diginetica, Gowalla, and Yoochoose. The experimental results show that GCARM can achieve the state-of-the-art performance in terms of Recall and MRR, especially on boosting the ranking of the target item.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"12 1","pages":"1 - 31"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89639125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

The Footprint of Factorization Models and Their Applications in Collaborative Filtering 因子分解模型的足迹及其在协同过滤中的应用

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-11-29 DOI: 10.1145/3490475

Jinze Wang, Yongli Ren, Jie Li, Ke Deng

Factorization models have been successfully applied to the recommendation problems and have significant impact to both academia and industries in the field of Collaborative Filtering (CF). However, the intermediate data generated in factorization models’ decision making process (or training process, footprint) have been overlooked even though they may provide rich information to further improve recommendations. In this article, we introduce the concept of Convergence Pattern, which records how ratings are learned step-by-step in factorization models in the field of CF. We show that the concept of Convergence Patternexists in both the model perspective (e.g., classical Matrix Factorization (MF) and deep-learning factorization) and the training (learning) perspective (e.g., stochastic gradient descent (SGD), alternating least squares (ALS), and Markov Chain Monte Carlo (MCMC)). By utilizing the Convergence Pattern, we propose a prediction model to estimate the prediction reliability of missing ratings and then improve the quality of recommendations. Two applications have been investigated: (1) how to evaluate the reliability of predicted missing ratings and thus recommend those ratings with high reliability. (2) How to explore the estimated reliability to adjust the predicted ratings to further improve the predication accuracy. Extensive experiments have been conducted on several benchmark datasets on three recommendation tasks: decision-aware recommendation, rating predicted, and Top-N recommendation. The experiment results have verified the effectiveness of the proposed methods in various aspects.

因式分解模型已经成功地应用于推荐问题中，在协同过滤(CF)领域产生了重大影响。然而，在分解模型的决策过程(或训练过程，足迹)中生成的中间数据被忽视了，尽管它们可能提供丰富的信息来进一步改进建议。在本文中，我们介绍了收敛模式的概念，它记录了CF领域的分解模型如何逐步学习评级。我们在模型角度(例如，经典矩阵分解(MF)和深度学习分解)和训练(学习)角度(例如，随机梯度下降(SGD)，交替最小二乘法(ALS)和马尔可夫链蒙特卡罗(MCMC))中展示了收敛模式的概念。利用收敛模式，我们提出了一个预测模型来估计缺失评级的预测可靠性，从而提高推荐的质量。研究了两方面的应用:(1)如何评估预测缺失评分的信度，从而推荐高信度的评分。(2)如何探索估计信度来调整预测评级，进一步提高预测精度。在几个基准数据集上对决策感知推荐、评级预测推荐和Top-N推荐三种推荐任务进行了大量的实验。实验结果从各个方面验证了所提方法的有效性。

{"title":"The Footprint of Factorization Models and Their Applications in Collaborative Filtering","authors":"Jinze Wang, Yongli Ren, Jie Li, Ke Deng","doi":"10.1145/3490475","DOIUrl":"https://doi.org/10.1145/3490475","url":null,"abstract":"Factorization models have been successfully applied to the recommendation problems and have significant impact to both academia and industries in the field of Collaborative Filtering (CF). However, the intermediate data generated in factorization models’ decision making process (or training process, footprint) have been overlooked even though they may provide rich information to further improve recommendations. In this article, we introduce the concept of Convergence Pattern, which records how ratings are learned step-by-step in factorization models in the field of CF. We show that the concept of Convergence Patternexists in both the model perspective (e.g., classical Matrix Factorization (MF) and deep-learning factorization) and the training (learning) perspective (e.g., stochastic gradient descent (SGD), alternating least squares (ALS), and Markov Chain Monte Carlo (MCMC)). By utilizing the Convergence Pattern, we propose a prediction model to estimate the prediction reliability of missing ratings and then improve the quality of recommendations. Two applications have been investigated: (1) how to evaluate the reliability of predicted missing ratings and thus recommend those ratings with high reliability. (2) How to explore the estimated reliability to adjust the predicted ratings to further improve the predication accuracy. Extensive experiments have been conducted on several benchmark datasets on three recommendation tasks: decision-aware recommendation, rating predicted, and Top-N recommendation. The experiment results have verified the effectiveness of the proposed methods in various aspects.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"33 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82436598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

An Unsupervised Aspect-Aware Recommendation Model with Explanation Text Generation 具有解释文本生成的无监督方面感知推荐模型

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-11-29 DOI: 10.1145/3483611

Peijie Sun, Le Wu, Kun Zhang, Yuxuan Su, Meng Wang

Review based recommendation utilizes both users’ rating records and the associated reviews for recommendation. Recently, with the rapid demand for explanations of recommendation results, reviews are used to train the encoder–decoder models for explanation text generation. As most of the reviews are general text without detailed evaluation, some researchers leveraged auxiliary information of users or items to enrich the generated explanation text. Nevertheless, the auxiliary data is not available in most scenarios and may suffer from data privacy problems. In this article, we argue that the reviews contain abundant semantic information to express the users’ feelings for various aspects of items, while these information are not fully explored in current explanation text generation task. To this end, we study how to generate more fine-grained explanation text in review based recommendation without any auxiliary data. Though the idea is simple, it is non-trivial since the aspect is hidden and unlabeled. Besides, it is also very challenging to inject aspect information for generating explanation text with noisy review input. To solve these challenges, we first leverage an advanced unsupervised neural aspect extraction model to learn the aspect-aware representation of each review sentence. Thus, users and items can be represented in the aspect space based on their historical associated reviews. After that, we detail how to better predict ratings and generate explanation text with the user and item representations in the aspect space. We further dynamically assign review sentences which contain larger proportion of aspect words with larger weights to control the text generation process, and jointly optimize rating prediction accuracy and explanation text generation quality with a multi-task learning framework. Finally, extensive experimental results on three real-world datasets demonstrate the superiority of our proposed model for both recommendation accuracy and explainability.

基于评论的推荐利用用户的评级记录和相关评论进行推荐。近年来，随着对推荐结果解释的快速需求，评论被用于训练编码器-解码器模型来生成解释文本。由于大多数评论都是一般文本，没有详细的评价，一些研究者利用用户或物品的辅助信息来丰富生成的解释文本。然而，辅助数据在大多数情况下是不可用的，并且可能遭受数据隐私问题。在本文中，我们认为评论包含了丰富的语义信息来表达用户对项目的各个方面的感受，而这些信息在目前的解释性文本生成任务中并没有得到充分的挖掘。为此，我们研究如何在没有任何辅助数据的情况下，在基于评论的推荐中生成更细粒度的解释文本。虽然这个想法很简单，但它不是微不足道的，因为方面是隐藏的和未标记的。此外，注入方面信息来生成带有噪声评审输入的解释文本也是非常具有挑战性的。为了解决这些挑战，我们首先利用一种先进的无监督神经方面提取模型来学习每个复习句子的方面感知表示。因此，用户和项可以基于它们的历史关联评论在方面空间中表示。之后，我们详细介绍了如何更好地预测评分，并使用方面空间中的用户和项目表示生成解释文本。我们进一步动态分配包含更大权重的方面词比例的复习句来控制文本生成过程，并通过多任务学习框架共同优化评级预测精度和解释文本生成质量。最后，在三个真实数据集上的大量实验结果证明了我们提出的模型在推荐准确性和可解释性方面的优越性。

{"title":"An Unsupervised Aspect-Aware Recommendation Model with Explanation Text Generation","authors":"Peijie Sun, Le Wu, Kun Zhang, Yuxuan Su, Meng Wang","doi":"10.1145/3483611","DOIUrl":"https://doi.org/10.1145/3483611","url":null,"abstract":"Review based recommendation utilizes both users’ rating records and the associated reviews for recommendation. Recently, with the rapid demand for explanations of recommendation results, reviews are used to train the encoder–decoder models for explanation text generation. As most of the reviews are general text without detailed evaluation, some researchers leveraged auxiliary information of users or items to enrich the generated explanation text. Nevertheless, the auxiliary data is not available in most scenarios and may suffer from data privacy problems. In this article, we argue that the reviews contain abundant semantic information to express the users’ feelings for various aspects of items, while these information are not fully explored in current explanation text generation task. To this end, we study how to generate more fine-grained explanation text in review based recommendation without any auxiliary data. Though the idea is simple, it is non-trivial since the aspect is hidden and unlabeled. Besides, it is also very challenging to inject aspect information for generating explanation text with noisy review input. To solve these challenges, we first leverage an advanced unsupervised neural aspect extraction model to learn the aspect-aware representation of each review sentence. Thus, users and items can be represented in the aspect space based on their historical associated reviews. After that, we detail how to better predict ratings and generate explanation text with the user and item representations in the aspect space. We further dynamically assign review sentences which contain larger proportion of aspect words with larger weights to control the text generation process, and jointly optimize rating prediction accuracy and explanation text generation quality with a multi-task learning framework. Finally, extensive experimental results on three real-world datasets demonstrate the superiority of our proposed model for both recommendation accuracy and explainability.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"218 1","pages":"1 - 29"},"PeriodicalIF":0.0,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85540746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Combining Graph Convolutional Neural Networks and Label Propagation 图卷积神经网络与标签传播的结合

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-11-29 DOI: 10.1145/3490478

Hongwei Wang, J. Leskovec

Label Propagation Algorithm (LPA) and Graph Convolutional Neural Networks (GCN) are both message passing algorithms on graphs. Both solve the task of node classification, but LPA propagates node label information across the edges of the graph, while GCN propagates and transforms node feature information. However, while conceptually similar, theoretical relationship between LPA and GCN has not yet been systematically investigated. Moreover, it is unclear how LPA and GCN can be combined under a unified framework to improve the performance. Here we study the relationship between LPA and GCN in terms of feature/label influence, in which we characterize how much the initial feature/label of one node influences the final feature/label of another node in GCN/LPA. Based on our theoretical analysis, we propose an end-to-end model that combines GCN and LPA. In our unified model, edge weights are learnable, and the LPA serves as regularization to assist the GCN in learning proper edge weights that lead to improved performance. Our model can also be seen as learning the weights of edges based on node labels, which is more direct and efficient than existing feature-based attention models or topology-based diffusion models. In a number of experiments for semi-supervised node classification and knowledge-graph-aware recommendation, our model shows superiority over state-of-the-art baselines.

标签传播算法(LPA)和图卷积神经网络(GCN)都是基于图的消息传递算法。两者都解决了节点分类的任务，但LPA在图的边缘传播节点标签信息，而GCN传播和转换节点特征信息。然而，虽然概念上相似，但LPA和GCN之间的理论关系尚未得到系统的研究。此外，如何将LPA和GCN结合在一个统一的框架下以提高性能还不清楚。在这里，我们从特征/标签影响的角度研究了LPA和GCN之间的关系，其中我们表征了GCN/LPA中一个节点的初始特征/标签对另一个节点的最终特征/标签的影响程度。在理论分析的基础上，提出了一种结合GCN和LPA的端到端模型。在我们的统一模型中，边权是可学习的，LPA作为正则化来帮助GCN学习适当的边权，从而提高性能。我们的模型也可以看作是基于节点标签学习边的权重，这比现有的基于特征的注意力模型或基于拓扑的扩散模型更直接和有效。在半监督节点分类和知识图感知推荐的大量实验中，我们的模型显示出优于最先进基线的优势。

{"title":"Combining Graph Convolutional Neural Networks and Label Propagation","authors":"Hongwei Wang, J. Leskovec","doi":"10.1145/3490478","DOIUrl":"https://doi.org/10.1145/3490478","url":null,"abstract":"Label Propagation Algorithm (LPA) and Graph Convolutional Neural Networks (GCN) are both message passing algorithms on graphs. Both solve the task of node classification, but LPA propagates node label information across the edges of the graph, while GCN propagates and transforms node feature information. However, while conceptually similar, theoretical relationship between LPA and GCN has not yet been systematically investigated. Moreover, it is unclear how LPA and GCN can be combined under a unified framework to improve the performance. Here we study the relationship between LPA and GCN in terms of feature/label influence, in which we characterize how much the initial feature/label of one node influences the final feature/label of another node in GCN/LPA. Based on our theoretical analysis, we propose an end-to-end model that combines GCN and LPA. In our unified model, edge weights are learnable, and the LPA serves as regularization to assist the GCN in learning proper edge weights that lead to improved performance. Our model can also be seen as learning the weights of edges based on node labels, which is more direct and efficient than existing feature-based attention models or topology-based diffusion models. In a number of experiments for semi-supervised node classification and knowledge-graph-aware recommendation, our model shows superiority over state-of-the-art baselines.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"13 1","pages":"1 - 27"},"PeriodicalIF":0.0,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75803302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 26

STARec: Adaptive Learning with Spatiotemporal and Activity Influence for POI Recommendation 自适应学习对POI推荐的时空和活动影响

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-11-29 DOI: 10.1145/3485631

Weiyun Ji, Xiang-wu Meng, Yujie Zhang

POI recommendation has become an essential means to help people discover attractive places. Intuitively, activities have an important impact on users’ decision-making, because users select POIs to attend corresponding activities. However, many existing studies ignore the social motivation of user behaviors and regard all check-ins as influenced only by individual user interests. As a result, they cannot model user preferences accurately, which degrades recommendation effectiveness. In this article, from the perspective of activities, this study proposes a probabilistic generative model called STARec. Specifically, based on the social effect of activities, STARec defines users’ social preferences as distinct from their individual interests and combines these with individual user activity interests to effectively depict user preferences. Moreover, the inconsistency between users’ social preferences and their decisions is modeled. An activity frequency feature is introduced to acquire accurate user social preferences because of close correlation between these and the key impact factor of corresponding check-ins. An alias sampling-based training method was used to accelerate training. Extensive experiments were conducted on two real-world datasets. Experimental results demonstrated that the proposed STARec model achieves superior performance in terms of high recommendation accuracy, robustness to data sparsity, effectiveness in handling cold-start problems, efficiency, and interpretability.

POI推荐已经成为帮助人们发现有吸引力的地方的重要手段。直观上，活动对用户的决策有重要影响，因为用户选择poi参加相应的活动。然而，现有的许多研究忽略了用户行为的社会动机，认为所有签到都只受用户个人兴趣的影响。因此，他们不能准确地模拟用户偏好，这降低了推荐的有效性。在本文中，本研究从活动的角度提出了一个概率生成模型，称为STARec。具体而言，基于活动的社会效应，STARec将用户的社会偏好与个人兴趣区分开来，并将其与个人用户的活动兴趣结合起来，有效地描述用户的偏好。此外，还对用户的社会偏好与决策之间的不一致性进行了建模。由于用户社交偏好与相应签到的关键影响因子密切相关，引入活动频率特征来获取准确的用户社交偏好。采用基于别名采样的训练方法，提高了训练速度。在两个真实世界的数据集上进行了广泛的实验。实验结果表明，提出的STARec模型在推荐精度高、对数据稀疏性的鲁棒性、处理冷启动问题的有效性、效率和可解释性等方面都取得了优异的性能。

{"title":"STARec: Adaptive Learning with Spatiotemporal and Activity Influence for POI Recommendation","authors":"Weiyun Ji, Xiang-wu Meng, Yujie Zhang","doi":"10.1145/3485631","DOIUrl":"https://doi.org/10.1145/3485631","url":null,"abstract":"POI recommendation has become an essential means to help people discover attractive places. Intuitively, activities have an important impact on users’ decision-making, because users select POIs to attend corresponding activities. However, many existing studies ignore the social motivation of user behaviors and regard all check-ins as influenced only by individual user interests. As a result, they cannot model user preferences accurately, which degrades recommendation effectiveness. In this article, from the perspective of activities, this study proposes a probabilistic generative model called STARec. Specifically, based on the social effect of activities, STARec defines users’ social preferences as distinct from their individual interests and combines these with individual user activity interests to effectively depict user preferences. Moreover, the inconsistency between users’ social preferences and their decisions is modeled. An activity frequency feature is introduced to acquire accurate user social preferences because of close correlation between these and the key impact factor of corresponding check-ins. An alias sampling-based training method was used to accelerate training. Extensive experiments were conducted on two real-world datasets. Experimental results demonstrated that the proposed STARec model achieves superior performance in terms of high recommendation accuracy, robustness to data sparsity, effectiveness in handling cold-start problems, efficiency, and interpretability.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"66 1","pages":"1 - 40"},"PeriodicalIF":0.0,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89324699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13