ACM Transactions on Information Systems最新文献_第6页

Causal Inference in Recommender Systems: A Survey and Future Directions 推荐系统中的因果推理：调查与未来方向

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2024-01-02 DOI: 10.1145/3639048

Chen Gao, Yu Zheng, Wenjie Wang, Fuli Feng, Xiangnan He, Yong Li

Recommender systems have become crucial in information filtering nowadays. Existing recommender systems extract user preferences based on the correlation in data, such as behavioral correlation in collaborative filtering, feature-feature, or feature-behavior correlation in click-through rate prediction. However, unfortunately, the real world is driven by causality, not just correlation, and correlation does not imply causation. For instance, recommender systems might recommend a battery charger to a user after buying a phone, where the latter can serve as the cause of the former; such a causal relation cannot be reversed. Recently, to address this, researchers in recommender systems have begun utilizing causal inference to extract causality, thereby enhancing the recommender system. In this survey, we offer a comprehensive review of the literature on causal inference-based recommendation. Initially, we introduce the fundamental concepts of both recommender system and causal inference as the foundation for subsequent content. We then highlight the typical issues faced by non-causality recommender system. Following that, we thoroughly review the existing work on causal inference-based recommender systems, based on a taxonomy of three-aspect challenges that causal inference can address. Finally, we discuss the open problems in this critical research area and suggest important potential future works.

如今，推荐系统已成为信息过滤的关键。现有的推荐系统根据数据的相关性来提取用户偏好，如协同过滤中的行为相关性，点击率预测中的特征-特征或特征-行为相关性。然而，不幸的是，现实世界是由因果关系驱动的，而不仅仅是相关性，相关性并不意味着因果关系。例如，推荐系统可能会在用户购买手机后向其推荐电池充电器，而后者可能是前者的原因；这种因果关系无法逆转。最近，为了解决这个问题，推荐系统的研究人员开始利用因果推理来提取因果关系，从而增强推荐系统的功能。在本调查中，我们将对基于因果推理的推荐文献进行全面回顾。首先，我们介绍了推荐系统和因果推理的基本概念，作为后续内容的基础。然后，我们强调了非因果关系推荐系统所面临的典型问题。随后，我们根据因果推理可应对的三方面挑战的分类法，全面回顾了基于因果推理的推荐系统方面的现有工作。最后，我们讨论了这一关键研究领域的未决问题，并提出了未来可能开展的重要工作。

{"title":"Causal Inference in Recommender Systems: A Survey and Future Directions","authors":"Chen Gao, Yu Zheng, Wenjie Wang, Fuli Feng, Xiangnan He, Yong Li","doi":"10.1145/3639048","DOIUrl":"https://doi.org/10.1145/3639048","url":null,"abstract":"Recommender systems have become crucial in information filtering nowadays. Existing recommender systems extract user preferences based on the correlation in data, such as behavioral correlation in collaborative filtering, feature-feature, or feature-behavior correlation in click-through rate prediction. However, unfortunately, the real world is driven by causality, not just correlation, and correlation does not imply causation. For instance, recommender systems might recommend a battery charger to a user after buying a phone, where the latter can serve as the cause of the former; such a causal relation cannot be reversed. Recently, to address this, researchers in recommender systems have begun utilizing causal inference to extract causality, thereby enhancing the recommender system. In this survey, we offer a comprehensive review of the literature on causal inference-based recommendation. Initially, we introduce the fundamental concepts of both recommender system and causal inference as the foundation for subsequent content. We then highlight the typical issues faced by non-causality recommender system. Following that, we thoroughly review the existing work on causal inference-based recommender systems, based on a taxonomy of three-aspect challenges that causal inference can address. Finally, we discuss the open problems in this critical research area and suggest important potential future works.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"21 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139082213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 27

DiffuRec: A Diffusion Model for Sequential Recommendation DiffuRec：顺序推荐的扩散模型

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-29 DOI: 10.1145/3631116

Zihao Li, Aixin Sun, Chenliang Li

Mainstream solutions to sequential recommendation represent items with fixed vectors. These vectors have limited capability in capturing items’ latent aspects and users’ diverse preferences. As a new generative paradigm, diffusion models have achieved excellent performance in areas like computer vision and natural language processing. To our understanding, its unique merit in representation generation well fits the problem setting of sequential recommendation. In this article, we make the very first attempt to adapt the diffusion model to sequential recommendation and propose DiffuRec for item representation construction and uncertainty injection. Rather than modeling item representations as fixed vectors, we represent them as distributions in DiffuRec, which reflect a user’s multiple interests and an item’s various aspects adaptively. In the diffusion phase, DiffuRec corrupts the target item embedding into a Gaussian distribution via noise adding, which is further applied for sequential item distribution representation generation and uncertainty injection. Afterward, the item representation is fed into an approximator for target item representation reconstruction. In the reverse phase, based on a user’s historical interaction behaviors, we reverse a Gaussian noise into the target item representation, then apply a rounding operation for target item prediction. Experiments over four datasets show that DiffuRec outperforms strong baselines by a large margin.¹

顺序推荐的主流解决方案是用固定向量表示项目。这些向量在捕捉项目的潜在方面和用户的不同偏好方面能力有限。作为一种新的生成范式，扩散模型在计算机视觉和自然语言处理等领域取得了优异的表现。据我们了解，它在表征生成方面的独特优点非常适合顺序推荐的问题设置。在本文中，我们首次尝试将扩散模型应用于顺序推荐，并提出了用于项目表示构建和不确定性注入的 DiffuRec。在 DiffuRec 中，我们不再将项目表示建模为固定向量，而是将其表示为分布，从而自适应地反映用户的多种兴趣和项目的各个方面。在扩散阶段，DiffuRec 通过添加噪声将目标项目嵌入破坏为高斯分布，并进一步应用于顺序项目分布表示的生成和不确定性注入。然后，将项目表示输入近似器，以重建目标项目表示。在反向阶段，根据用户的历史交互行为，我们将高斯噪声反向引入目标项目表示，然后应用舍入操作进行目标项目预测。在四个数据集上进行的实验表明，DiffuRec 的性能远远优于强基线1。

{"title":"DiffuRec: A Diffusion Model for Sequential Recommendation","authors":"Zihao Li, Aixin Sun, Chenliang Li","doi":"10.1145/3631116","DOIUrl":"https://doi.org/10.1145/3631116","url":null,"abstract":"Mainstream solutions to sequential recommendation represent items with fixed vectors. These vectors have limited capability in capturing items’ latent aspects and users’ diverse preferences. As a new generative paradigm, diffusion models have achieved excellent performance in areas like computer vision and natural language processing. To our understanding, its unique merit in representation generation well fits the problem setting of sequential recommendation. In this article, we make the very first attempt to adapt the diffusion model to sequential recommendation and propose DiffuRec for item representation construction and uncertainty injection. Rather than modeling item representations as fixed vectors, we represent them as distributions in DiffuRec, which reflect a user’s multiple interests and an item’s various aspects adaptively. In the diffusion phase, DiffuRec corrupts the target item embedding into a Gaussian distribution via noise adding, which is further applied for sequential item distribution representation generation and uncertainty injection. Afterward, the item representation is fed into an approximator for target item representation reconstruction. In the reverse phase, based on a user’s historical interaction behaviors, we reverse a Gaussian noise into the target item representation, then apply a rounding operation for target item prediction. Experiments over four datasets show that DiffuRec outperforms strong baselines by a large margin.1","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"1 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139063803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FairGap: Fairness-aware Recommendation via Generating Counterfactual Graph FairGap：通过生成反事实图进行公平感知推荐

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-22 DOI: 10.1145/3638352

Wei Chen, Yiqing Wu, Zhao Zhang, Fuzhen Zhuang, Zhongshi He, Ruobing Xie, Feng xia

The emergence of Graph Neural Networks (GNNs) has greatly advanced the development of recommendation systems. Recently, many researchers have leveraged GNN-based models to learn fair representations for users and items. However, current GNN-based models suffer from biased user-item interaction data, which negatively impacts recommendation fairness. Although there have been several studies employed adversarial learning to mitigate this issue in recommendation systems, they mostly focus on modifying the model training approach with fairness regularization and neglect direct intervention of biased interaction. Different from these models, this paper introduces a novel perspective by directly intervening in observed interactions to generate a counterfactual graph (called FairGap) that is not influenced by sensitive node attributes, enabling us to learn fair representations for users and items easily. We design the FairGap to answer the key counterfactual question: “ Would interactions with an item remain unchanged if user’s sensitive attributes were concealed? ”. We also provide theoretical proofs to show that our learning strategy via the counterfactual graph is unbiased in expectation. Moreover, we propose a fairness-enhancing mechanism to continuously improve user fairness in the graph-based recommendation. Extensive experimental results against state-of-the-art competitors and base models on three real-world datasets validate the effectiveness of our proposed model.

图神经网络（GNN）的出现极大地推动了推荐系统的发展。最近，许多研究人员利用基于 GNN 的模型来学习用户和项目的公平表征。然而，目前基于 GNN 的模型存在用户与项目交互数据偏差的问题，这对推荐的公平性产生了负面影响。虽然已有一些研究采用对抗学习来缓解推荐系统中的这一问题，但它们大多侧重于通过公平正则化来修改模型训练方法，而忽视了对有偏差的交互的直接干预。与这些模型不同，本文引入了一个新的视角，即直接干预观察到的交互，生成一个不受敏感节点属性影响的反事实图（称为 FairGap），使我们能够轻松地学习用户和项目的公平表征。我们设计公平差距来回答关键的反事实问题："如果用户的敏感属性被隐藏，与物品的交互会保持不变吗？".我们还提供了理论证明，表明我们通过反事实图的学习策略在预期上是无偏的。此外，我们还提出了一种公平性增强机制，以持续改善基于图的推荐中的用户公平性。在三个真实数据集上与最先进的竞争对手和基础模型进行的大量实验结果验证了我们提出的模型的有效性。

{"title":"FairGap: Fairness-aware Recommendation via Generating Counterfactual Graph","authors":"Wei Chen, Yiqing Wu, Zhao Zhang, Fuzhen Zhuang, Zhongshi He, Ruobing Xie, Feng xia","doi":"10.1145/3638352","DOIUrl":"https://doi.org/10.1145/3638352","url":null,"abstract":"The emergence of Graph Neural Networks (GNNs) has greatly advanced the development of recommendation systems. Recently, many researchers have leveraged GNN-based models to learn fair representations for users and items. However, current GNN-based models suffer from biased user-item interaction data, which negatively impacts recommendation fairness. Although there have been several studies employed adversarial learning to mitigate this issue in recommendation systems, they mostly focus on modifying the model training approach with fairness regularization and neglect direct intervention of biased interaction. Different from these models, this paper introduces a novel perspective by directly intervening in observed interactions to generate a counterfactual graph (called FairGap) that is not influenced by sensitive node attributes, enabling us to learn fair representations for users and items easily. We design the FairGap to answer the key counterfactual question: “ Would interactions with an item remain unchanged if user’s sensitive attributes were concealed? ”. We also provide theoretical proofs to show that our learning strategy via the counterfactual graph is unbiased in expectation. Moreover, we propose a fairness-enhancing mechanism to continuously improve user fairness in the graph-based recommendation. Extensive experimental results against state-of-the-art competitors and base models on three real-world datasets validate the effectiveness of our proposed model.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"17 11","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138945901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Triple Sequence Learning for Cross-domain Recommendation 跨域推荐的三重序列学习

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-22 DOI: 10.1145/3638351

Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Jie Zhou

Cross-domain recommendation (CDR) aims to leverage the correlation of users’ behaviors in both the source and target domains to improve the user preference modeling in the target domain. Conventional CDR methods typically explore the dual-relations between the source and target domains’ behaviors. However, this may ignore the informative mixed behaviors that naturally reflect the user’s global preference. To address this issue, we present a novel framework, termed triple sequence learning for cross-domain recommendation (Tri-CDR), which jointly models the source, target, and mixed behavior sequences to highlight the global and target preference and precisely model the triple correlation in CDR. Specifically, Tri-CDR independently models the hidden representations for the triple behavior sequences and proposes a triple cross-domain attention (TCA) method to emphasize the informative knowledge related to both user’s global and target-domain preference. To comprehensively explore the cross-domain correlations, we design a triple contrastive learning (TCL) strategy that simultaneously considers the coarse-grained similarities and fine-grained distinctions among the triple sequences, ensuring the alignment while preserving information diversity in multi-domain. We conduct extensive experiments and analyses on six cross-domain settings. The significant improvements of Tri-CDR with different sequential encoders verify its effectiveness and universality. The source code is avaliable in https://github.com/hulkima/Tri-CDR.

跨域推荐（CDR）旨在利用源域和目标域中用户行为的相关性来改进目标域中的用户偏好建模。传统的 CDR 方法通常会探索源域和目标域行为之间的双重关系。然而，这可能会忽略自然反映用户全局偏好的信息混合行为。为了解决这个问题，我们提出了一个新颖的框架，称为跨域推荐的三重序列学习（Tri-CDR），它可以对源域、目标域和混合行为序列进行联合建模，以突出全局和目标偏好，并对 CDR 中的三重相关性进行精确建模。具体来说，Tri-CDR 对三重行为序列的隐藏表示进行独立建模，并提出了一种三重跨域关注（TCA）方法，以强调与用户全域和目标域偏好相关的信息知识。为了全面探索跨域相关性，我们设计了一种三重对比学习（TCL）策略，该策略同时考虑了三重序列之间的粗粒度相似性和细粒度区别，在确保一致性的同时保留了多域信息的多样性。我们在六个跨域设置中进行了广泛的实验和分析。不同顺序编码器对 Tri-CDR 的明显改善验证了它的有效性和普遍性。源代码见 https://github.com/hulkima/Tri-CDR。

{"title":"Triple Sequence Learning for Cross-domain Recommendation","authors":"Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Jie Zhou","doi":"10.1145/3638351","DOIUrl":"https://doi.org/10.1145/3638351","url":null,"abstract":"Cross-domain recommendation (CDR) aims to leverage the correlation of users’ behaviors in both the source and target domains to improve the user preference modeling in the target domain. Conventional CDR methods typically explore the dual-relations between the source and target domains’ behaviors. However, this may ignore the informative mixed behaviors that naturally reflect the user’s global preference. To address this issue, we present a novel framework, termed triple sequence learning for cross-domain recommendation (Tri-CDR), which jointly models the source, target, and mixed behavior sequences to highlight the global and target preference and precisely model the triple correlation in CDR. Specifically, Tri-CDR independently models the hidden representations for the triple behavior sequences and proposes a triple cross-domain attention (TCA) method to emphasize the informative knowledge related to both user’s global and target-domain preference. To comprehensively explore the cross-domain correlations, we design a triple contrastive learning (TCL) strategy that simultaneously considers the coarse-grained similarities and fine-grained distinctions among the triple sequences, ensuring the alignment while preserving information diversity in multi-domain. We conduct extensive experiments and analyses on six cross-domain settings. The significant improvements of Tri-CDR with different sequential encoders verify its effectiveness and universality. The source code is avaliable in https://github.com/hulkima/Tri-CDR.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"4 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139020342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DGEKT: A Dual Graph Ensemble Learning Method for Knowledge Tracing DGEKT：知识追踪的双图集合学习法

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-22 DOI: 10.1145/3638350

Chaoran Cui, Yumo Yao, Chunyun Zhang, Hebo Ma, Yuling Ma, Zhaochun Ren, Chen Zhang, James Ko

Knowledge tracing aims to trace students’ evolving knowledge states by predicting their future performance on concept-related exercises. Recently, some graph-based models have been developed to incorporate the relationships between exercises to improve knowledge tracing, but only a single type of relationship information is generally explored. In this paper, we present a novel Dual Graph Ensemble learning method for Knowledge Tracing (DGEKT), which establishes a dual graph structure of students’ learning interactions to capture the heterogeneous exercise-concept associations and interaction transitions by hypergraph modeling and directed graph modeling, respectively. To combine the dual graph models, we introduce the technique of online knowledge distillation. This choice arises from the observation that, while the knowledge tracing model is designed to predict students’ responses to the exercises related to different concepts, it is optimized merely with respect to the prediction accuracy on a single exercise at each step. With online knowledge distillation, the dual graph models are adaptively combined to form a stronger ensemble teacher model, which provides its predictions on all exercises as extra supervision for better modeling ability. In the experiments, we compare DGEKT against eight knowledge tracing baselines on three benchmark datasets, and the results demonstrate that DGEKT achieves state-of-the-art performance.

知识追踪的目的是通过预测学生未来在与概念相关的练习中的表现来追踪他们不断变化的知识状态。最近，一些基于图的模型被开发出来，以结合练习之间的关系来改进知识追踪，但一般只探讨单一类型的关系信息。本文提出了一种新颖的知识追踪双图集合学习方法（DGEKT），通过超图建模和有向图建模，分别建立学生学习互动的双图结构，以捕捉异质的练习-概念关联和互动转换。为了结合双图模型，我们引入了在线知识提炼技术。我们之所以选择这种方法，是因为我们发现，虽然知识追踪模型旨在预测学生对不同概念相关练习的反应，但它仅仅是针对每一步单个练习的预测准确性进行了优化。通过在线知识提炼，双图模型被自适应地组合在一起，形成一个更强的集合教师模型，它对所有练习的预测作为额外的监督，以获得更好的建模能力。在实验中，我们将 DGEKT 与三个基准数据集上的八个知识追踪基线进行了比较，结果表明 DGEKT 达到了最先进的性能。

{"title":"DGEKT: A Dual Graph Ensemble Learning Method for Knowledge Tracing","authors":"Chaoran Cui, Yumo Yao, Chunyun Zhang, Hebo Ma, Yuling Ma, Zhaochun Ren, Chen Zhang, James Ko","doi":"10.1145/3638350","DOIUrl":"https://doi.org/10.1145/3638350","url":null,"abstract":"Knowledge tracing aims to trace students’ evolving knowledge states by predicting their future performance on concept-related exercises. Recently, some graph-based models have been developed to incorporate the relationships between exercises to improve knowledge tracing, but only a single type of relationship information is generally explored. In this paper, we present a novel Dual Graph Ensemble learning method for Knowledge Tracing (DGEKT), which establishes a dual graph structure of students’ learning interactions to capture the heterogeneous exercise-concept associations and interaction transitions by hypergraph modeling and directed graph modeling, respectively. To combine the dual graph models, we introduce the technique of online knowledge distillation. This choice arises from the observation that, while the knowledge tracing model is designed to predict students’ responses to the exercises related to different concepts, it is optimized merely with respect to the prediction accuracy on a single exercise at each step. With online knowledge distillation, the dual graph models are adaptively combined to form a stronger ensemble teacher model, which provides its predictions on all exercises as extra supervision for better modeling ability. In the experiments, we compare DGEKT against eight knowledge tracing baselines on three benchmark datasets, and the results demonstrate that DGEKT achieves state-of-the-art performance.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"54 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139020371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using Neural and Graph Neural Recommender systems to Overcome Choice Overload: Evidence from a Music Education Platform 使用神经和图神经推荐系统克服选择过载：来自音乐教育平台的证据

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-20 DOI: 10.1145/3637873

Hédi Razgallah, Michalis Vlachos, Ahmad Ajalloeian, Ninghao Liu, Johannes Schneider, Alexis Steinmann

The application of recommendation technologies has been crucial in the promotion of physical and digital content across numerous global platforms such as Amazon, Apple, and Netflix. Our study aims to investigate the advantages of employing recommendation technologies on educational platforms, with a particular focus on an educational platform for learning and practicing music.

Our research is based on data from Tomplay, a music platform that offers sheet music with professional audio recordings, enabling users to discover and practice music content at varying levels of difficulty. Through our analysis, we emphasize the distinct interaction patterns on educational platforms like Tomplay, which we compare with other commonly used recommendation datasets. We find that interactions are comparatively sparse on educational platforms, with users often focusing on specific content as they learn, rather than interacting with a broader range of material. Therefore, our primary goal is to address the issue of data sparsity. We achieve this through entity resolution principles and propose a neural network (NN) based recommendation model. Further, we improve this model by utilizing graph neural networks (GNNs), which provide superior predictive accuracy compared to NNs. Notably, our study demonstrates that GNNs are highly effective even for users with little or no historical preferences (cold-start problem).

Our cold-start experiments also provide valuable insights into an independent issue, namely the number of historical interactions needed by a recommendation model to gain a comprehensive understanding of a user. Our findings demonstrate that a platform acquires a solid knowledge of a user’s general preferences and characteristics with 50 past interactions. Overall, our study makes significant contributions to information systems research on business analytics and prescriptive analytics. Moreover, our framework and evaluation results offer implications for various stakeholders, including online educational institutions, education policymakers, and learning platform users.

在亚马逊、苹果和 Netflix 等众多全球平台上推广实体和数字内容时，推荐技术的应用至关重要。我们的研究旨在探讨在教育平台上应用推荐技术的优势，尤其关注音乐学习和练习的教育平台。我们的研究基于 Tomplay 的数据，Tomplay 是一个音乐平台，提供带有专业录音的乐谱，使用户能够发现并练习不同难度的音乐内容。通过分析，我们强调了 Tomplay 等教育平台上独特的交互模式，并将其与其他常用的推荐数据集进行了比较。我们发现，教育平台上的交互相对稀少，用户在学习过程中往往只关注特定内容，而不是与更广泛的材料进行交互。因此，我们的首要目标是解决数据稀少的问题。我们通过实体解析原则来实现这一目标，并提出了一个基于神经网络 (NN) 的推荐模型。此外，我们还利用图神经网络（GNN）改进了这一模型，与神经网络相比，图神经网络具有更高的预测准确性。值得注意的是，我们的研究表明，即使用户很少或没有历史偏好（冷启动问题），图神经网络也非常有效。我们的冷启动实验还为一个独立问题提供了有价值的见解，即推荐模型全面了解用户所需的历史交互数量。我们的研究结果表明，一个平台通过过去 50 次互动就能获得关于用户一般偏好和特征的可靠知识。总之，我们的研究为商业分析和描述性分析方面的信息系统研究做出了重大贡献。此外，我们的框架和评估结果对在线教育机构、教育政策制定者和学习平台用户等各利益相关方都有借鉴意义。

{"title":"Using Neural and Graph Neural Recommender systems to Overcome Choice Overload: Evidence from a Music Education Platform","authors":"Hédi Razgallah, Michalis Vlachos, Ahmad Ajalloeian, Ninghao Liu, Johannes Schneider, Alexis Steinmann","doi":"10.1145/3637873","DOIUrl":"https://doi.org/10.1145/3637873","url":null,"abstract":"The application of recommendation technologies has been crucial in the promotion of physical and digital content across numerous global platforms such as Amazon, Apple, and Netflix. Our study aims to investigate the advantages of employing recommendation technologies on educational platforms, with a particular focus on an educational platform for learning and practicing music. Our research is based on data from Tomplay, a music platform that offers sheet music with professional audio recordings, enabling users to discover and practice music content at varying levels of difficulty. Through our analysis, we emphasize the distinct interaction patterns on educational platforms like Tomplay, which we compare with other commonly used recommendation datasets. We find that interactions are comparatively sparse on educational platforms, with users often focusing on specific content as they learn, rather than interacting with a broader range of material. Therefore, our primary goal is to address the issue of data sparsity. We achieve this through entity resolution principles and propose a neural network (NN) based recommendation model. Further, we improve this model by utilizing graph neural networks (GNNs), which provide superior predictive accuracy compared to NNs. Notably, our study demonstrates that GNNs are highly effective even for users with little or no historical preferences (cold-start problem). Our cold-start experiments also provide valuable insights into an independent issue, namely the number of historical interactions needed by a recommendation model to gain a comprehensive understanding of a user. Our findings demonstrate that a platform acquires a solid knowledge of a user’s general preferences and characteristics with 50 past interactions. Overall, our study makes significant contributions to information systems research on business analytics and prescriptive analytics. Moreover, our framework and evaluation results offer implications for various stakeholders, including online educational institutions, education policymakers, and learning platform users.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"73 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138817699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On the Impact of Showing Evidence from Peers in Crowdsourced Truthfulness Assessments 论在众包真实性评估中展示同行证据的影响

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-19 DOI: 10.1145/3637872

Jiechen Xu, Lei Han, Shazia Sadiq, Gianluca Demartini

Misinformation has been rapidly spreading online. The common approach to deal with it is deploying expert fact-checkers that follow forensic processes to identify the veracity of statements. Unfortunately, such an approach does not scale well. To deal with this, crowdsourcing has been looked at as an opportunity to complement the work done by trained journalists. In this paper, we look at the effect of presenting the crowd with evidence from others while judging the veracity of statements. We implement various variants of the judgment task design to understand if and how the presented evidence may or may not affect the way crowd workers judge truthfulness and their performance. Our results show that, in certain cases, the presented evidence and the way in which it is presented may mislead crowd workers who would otherwise be more accurate if judging independently from others. Those who make appropriate use of the provided evidence, however, can benefit from it and generate better judgments.

错误信息在网上迅速传播。常见的应对方法是部署专家事实核查人员，按照取证流程识别言论的真实性。遗憾的是，这种方法不能很好地扩展。为了解决这个问题，众包被视为补充训练有素的记者工作的一个机会。在本文中，我们研究了在判断言论的真实性时，向人群展示他人证据的效果。我们实施了各种变体的判断任务设计，以了解呈现的证据是否会影响或如何影响人群工作者判断真实性的方式及其表现。我们的结果表明，在某些情况下，提供的证据和提供证据的方式可能会误导人群工作者，否则他们在独立于他人进行判断时会更加准确。然而，那些适当利用所提供证据的人却能从中受益，做出更好的判断。

引用次数: 0

SMLP4Rec: An Efficient all-MLP Architecture for Sequential Recommendations SMLP4Rec：顺序推荐的高效全 MLP 架构

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-18 DOI: 10.1145/3637871

Jingtong Gao, Xiangyu Zhao, Muyang Li, Minghao Zhao, Runze Wu, Ruocheng Guo, Yiding Liu, Dawei Yin

Self-attention models have achieved the state-of-the-art performance in sequential recommender systems by capturing the sequential dependencies among user-item interactions. However, they rely on adding positional embeddings to the item sequence to retain the sequential information, which may break the semantics of item embeddings due to the heterogeneity between these two types of embeddings. In addition, most existing works assume that such dependencies exist solely in the item embeddings, but neglect their existence among the item features. In our previous study, we proposed a novel sequential recommendation model, i.e., MLP4Rec, based on the recent advances of MLP-Mixer architectures, which is naturally sensitive to the order of items in a sequence because matrix elements related to different positions of a sequence will be given different weights in training. We developed a tri-directional fusion scheme to coherently capture sequential, cross-channel, and cross-feature correlations with linear computational complexity as well as much fewer model parameters than existing self-attention methods. However, the cascading mixer structure, the large number of normalization layers between different mixer layers, and the noise generated by these operations limit the efficiency of information extraction and the effectiveness of MLP4Rec. In this extended version, we propose a novel framework – SMLP4Rec for sequential recommendation to address the aforementioned issues. The new framework changes the flawed cascading structure to a parallel mode, and integrates normalization layers to minimize their impact on the model’s efficiency while maximizing their effectiveness. As a result, the training speed and prediction accuracy of SMLP4Rec are vastly improved in comparison to MLP4Rec. Extensive experimental results demonstrate that the proposed method is significantly superior to the state-of-the-art approaches. The implementation code is available online to ease reproducibility.

自我关注模型通过捕捉用户与项目交互之间的顺序依赖关系，在顺序推荐系统中取得了最先进的性能。然而，它们依赖于在项目序列中添加位置嵌入来保留序列信息，这可能会破坏项目嵌入的语义，因为这两种类型的嵌入之间存在异质性。此外，现有的大多数研究都假定这种依赖关系只存在于项目嵌入中，而忽略了它们在项目特征中的存在。在之前的研究中，我们基于 MLP-Mixer 体系结构的最新进展，提出了一种新颖的序列推荐模型，即 MLP4Rec，它对序列中项目的顺序具有天然的敏感性，因为与序列中不同位置相关的矩阵元素在训练中会被赋予不同的权重。我们开发了一种三向融合方案，以线性计算复杂度和比现有自注意方法更少的模型参数，连贯地捕捉序列、跨信道和跨特征相关性。然而，级联混频器结构、不同混频器层之间的大量归一化层以及这些操作产生的噪声限制了信息提取的效率和 MLP4Rec 的有效性。在本扩展版本中，我们提出了一种用于顺序推荐的新型框架--SMLP4Rec，以解决上述问题。新框架将有缺陷的级联结构改为并行模式，并整合了归一化层，以尽量减少其对模型效率的影响，同时最大限度地提高其有效性。因此，与 MLP4Rec 相比，SMLP4Rec 的训练速度和预测准确性都有了大幅提高。广泛的实验结果表明，所提出的方法明显优于最先进的方法。实现代码可在线获取，以方便重现。

{"title":"SMLP4Rec: An Efficient all-MLP Architecture for Sequential Recommendations","authors":"Jingtong Gao, Xiangyu Zhao, Muyang Li, Minghao Zhao, Runze Wu, Ruocheng Guo, Yiding Liu, Dawei Yin","doi":"10.1145/3637871","DOIUrl":"https://doi.org/10.1145/3637871","url":null,"abstract":"Self-attention models have achieved the state-of-the-art performance in sequential recommender systems by capturing the sequential dependencies among user-item interactions. However, they rely on adding positional embeddings to the item sequence to retain the sequential information, which may break the semantics of item embeddings due to the heterogeneity between these two types of embeddings. In addition, most existing works assume that such dependencies exist solely in the item embeddings, but neglect their existence among the item features. In our previous study, we proposed a novel sequential recommendation model, i.e., MLP4Rec, based on the recent advances of MLP-Mixer architectures, which is naturally sensitive to the order of items in a sequence because matrix elements related to different positions of a sequence will be given different weights in training. We developed a tri-directional fusion scheme to coherently capture sequential, cross-channel, and cross-feature correlations with linear computational complexity as well as much fewer model parameters than existing self-attention methods. However, the cascading mixer structure, the large number of normalization layers between different mixer layers, and the noise generated by these operations limit the efficiency of information extraction and the effectiveness of MLP4Rec. In this extended version, we propose a novel framework – SMLP4Rec for sequential recommendation to address the aforementioned issues. The new framework changes the flawed cascading structure to a parallel mode, and integrates normalization layers to minimize their impact on the model’s efficiency while maximizing their effectiveness. As a result, the training speed and prediction accuracy of SMLP4Rec are vastly improved in comparison to MLP4Rec. Extensive experimental results demonstrate that the proposed method is significantly superior to the state-of-the-art approaches. The implementation code is available online to ease reproducibility.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"16 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138717249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dense Text Retrieval based on Pretrained Language Models: A Survey 基于预训练语言模型的密集文本检索：调查

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-18 DOI: 10.1145/3637870

Wayne Xin Zhao, Jing Liu, Ruiyang Ren, Ji-Rong Wen

Text retrieval is a long-standing research topic on information seeking, where a system is required to return relevant information resources to user’s queries in natural language. From heuristic-based retrieval methods to learning-based ranking functions, the underlying retrieval models have been continually evolved with the ever-lasting technical innovation. To design effective retrieval models, a key point lies in how to learn text representations and model the relevance matching. The recent success of pretrained language models (PLM) sheds light on developing more capable text retrieval approaches by leveraging the excellent modeling capacity of PLMs. With powerful PLMs, we can effectively learn the semantic representations of queries and texts in the latent representation space, and further construct the semantic matching function between the dense vectors for relevance modeling. Such a retrieval approach is called dense retrieval, since it employs dense vectors to represent the texts. Considering the rapid progress on dense retrieval, this survey systematically reviews the recent progress on PLM-based dense retrieval. Different from previous surveys on dense retrieval, we take a new perspective to organize the related studies by four major aspects, including architecture, training, indexing and integration, and thoroughly summarize the mainstream techniques for each aspect. We extensively collect the recent advances on this topic, and include 300+ reference papers. To support our survey, we create a website for providing useful resources, and release a code repository for dense retrieval. This survey aims to provide a comprehensive, practical reference focused on the major progress for dense text retrieval.

文本检索是信息搜索领域的一个长期研究课题，系统需要根据用户的自然语言查询返回相关的信息资源。从基于启发式的检索方法到基于学习的排序功能，随着技术的不断创新，基础检索模型也在不断发展。要设计有效的检索模型，关键在于如何学习文本表征和建立相关性匹配模型。最近，预训练语言模型（PLM）取得了成功，这为我们利用 PLM 的出色建模能力开发更强大的文本检索方法提供了启示。利用功能强大的 PLM，我们可以有效地学习潜在表征空间中查询和文本的语义表征，并进一步构建密集向量之间的语义匹配函数，从而建立相关性模型。这种检索方法采用密集向量来表示文本，因此被称为密集检索。考虑到高密度检索的快速发展，本调查系统地回顾了基于 PLM 的高密度检索的最新进展。与以往的密集检索研究不同，我们从一个全新的视角出发，从架构、训练、索引和集成四个主要方面对相关研究进行了梳理，并对每个方面的主流技术进行了全面总结。我们广泛收集了该主题的最新进展，并收录了 300 多篇参考文献。为了支持我们的调查，我们创建了一个提供有用资源的网站，并发布了一个用于密集检索的代码库。本调查旨在为密集文本检索的主要进展提供全面、实用的参考。

{"title":"Dense Text Retrieval based on Pretrained Language Models: A Survey","authors":"Wayne Xin Zhao, Jing Liu, Ruiyang Ren, Ji-Rong Wen","doi":"10.1145/3637870","DOIUrl":"https://doi.org/10.1145/3637870","url":null,"abstract":"Text retrieval is a long-standing research topic on information seeking, where a system is required to return relevant information resources to user’s queries in natural language. From heuristic-based retrieval methods to learning-based ranking functions, the underlying retrieval models have been continually evolved with the ever-lasting technical innovation. To design effective retrieval models, a key point lies in how to learn text representations and model the relevance matching. The recent success of pretrained language models (PLM) sheds light on developing more capable text retrieval approaches by leveraging the excellent modeling capacity of PLMs. With powerful PLMs, we can effectively learn the semantic representations of queries and texts in the latent representation space, and further construct the semantic matching function between the dense vectors for relevance modeling. Such a retrieval approach is called dense retrieval, since it employs dense vectors to represent the texts. Considering the rapid progress on dense retrieval, this survey systematically reviews the recent progress on PLM-based dense retrieval. Different from previous surveys on dense retrieval, we take a new perspective to organize the related studies by four major aspects, including architecture, training, indexing and integration, and thoroughly summarize the mainstream techniques for each aspect. We extensively collect the recent advances on this topic, and include 300+ reference papers. To support our survey, we create a website for providing useful resources, and release a code repository for dense retrieval. This survey aims to provide a comprehensive, practical reference focused on the major progress for dense text retrieval.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"70 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138716968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Relevance Feedback with Brain Signals 利用大脑信号进行相关性反馈

IF 5.6 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Information Systems

Pub Date : 2023-12-18 DOI: 10.1145/3637874

Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su, Min Zhang

The Relevance Feedback (RF) process relies on accurate and real-time relevance estimation of feedback documents to improve retrieval performance. Since collecting explicit relevance annotations imposes an extra burden on the user, extensive studies have explored using pseudo-relevance signals and implicit feedback signals as substitutes. However, such signals are indirect indicators of relevance and suffer from complex search scenarios where user interactions are absent or biased.

Recently, the advances in portable and high-precision brain-computer interface (BCI) devices have shown the possibility to monitor user’s brain activities during search process. Brain signals can directly reflect user’s psychological responses to search results and thus it can act as additional and unbiased RF signals. To explore the effectiveness of brain signals in the context of RF, we propose a novel RF framework that combines BCI-based relevance feedback with pseudo-relevance signals and implicit signals to improve the performance of document re-ranking. The experimental results on the user study dataset show that incorporating brain signals leads to significant performance improvement in our RF framework. Besides, we observe that brain signals perform particularly well in several hard search scenarios, especially when implicit signals as feedback are missing or noisy. This reveals when and how to exploit brain signals in the context of RF.

相关性反馈（RF）过程依赖于对反馈文档进行准确和实时的相关性估计，以提高检索性能。由于收集明确的相关性注释会给用户带来额外负担，因此大量研究都在探索使用伪相关性信号和隐式反馈信号作为替代。然而，这些信号都是相关性的间接指标，在用户互动缺失或有偏差的复杂搜索场景中会受到影响。最近，便携式高精度脑机接口（BCI）设备的发展为监测用户在搜索过程中的大脑活动提供了可能。脑信号可以直接反映用户对搜索结果的心理反应，因此可以作为额外的、无偏见的射频信号。为了探索大脑信号在搜索相关性方面的有效性，我们提出了一个新颖的搜索相关性框架，该框架将基于 BCI 的相关性反馈与伪相关性信号和隐式信号相结合，以提高文档重新排序的性能。在用户研究数据集上的实验结果表明，在我们的 RF 框架中，结合大脑信号可显著提高性能。此外，我们还观察到大脑信号在几种困难搜索场景中表现尤为出色，尤其是在作为反馈的隐含信号缺失或存在噪声的情况下。这揭示了何时以及如何在射频范围内利用大脑信号。

{"title":"Relevance Feedback with Brain Signals","authors":"Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su, Min Zhang","doi":"10.1145/3637874","DOIUrl":"https://doi.org/10.1145/3637874","url":null,"abstract":"The Relevance Feedback (RF) process relies on accurate and real-time relevance estimation of feedback documents to improve retrieval performance. Since collecting explicit relevance annotations imposes an extra burden on the user, extensive studies have explored using pseudo-relevance signals and implicit feedback signals as substitutes. However, such signals are indirect indicators of relevance and suffer from complex search scenarios where user interactions are absent or biased. Recently, the advances in portable and high-precision brain-computer interface (BCI) devices have shown the possibility to monitor user’s brain activities during search process. Brain signals can directly reflect user’s psychological responses to search results and thus it can act as additional and unbiased RF signals. To explore the effectiveness of brain signals in the context of RF, we propose a novel RF framework that combines BCI-based relevance feedback with pseudo-relevance signals and implicit signals to improve the performance of document re-ranking. The experimental results on the user study dataset show that incorporating brain signals leads to significant performance improvement in our RF framework. Besides, we observe that brain signals perform particularly well in several hard search scenarios, especially when implicit signals as feedback are missing or noisy. This reveals when and how to exploit brain signals in the context of RF.","PeriodicalId":50936,"journal":{"name":"ACM Transactions on Information Systems","volume":"9 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138717032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0