A multi-agent reinforcement learning framework for cross-domain sequential recommendation

IF 6.3 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neural Networks Pub Date : 2025-05-01 Epub Date: 2025-01-22 DOI:10.1016/j.neunet.2025.107192

Huiting Liu , Junyi Wei , Kaiwen Zhu , Peipei Li , Peng Zhao , Xindong Wu

{"title":"A multi-agent reinforcement learning framework for cross-domain sequential recommendation","authors":"Huiting Liu , Junyi Wei , Kaiwen Zhu , Peipei Li , Peng Zhao , Xindong Wu","doi":"10.1016/j.neunet.2025.107192","DOIUrl":null,"url":null,"abstract":"<div><div>Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential recommendation, where users’ interaction data across multiple source domains are leveraged to enhance recommendations in data-sparse target domains. Despite this, users’ interests in the target and source domains may not align perfectly. Additionally, current research often neglects the collaboration between different transfer strategies across source domains, leading to suboptimal performance. To address these challenges, we propose a multi-agent reinforcement learning framework for cross-domain sequential recommendation (MARL4CDSR). Unlike traditional approaches that transfer knowledge from the entire source domain sequence, MARL4CDSR uses agents to select relevant items from source domain sequences for transfer. This approach optimizes the transfer process by coordinating agents’ strategies within each source domain through a multi-agent reinforcement learning framework. Additionally, we introduce an information fusion module with a cross-attention mechanism to align the embedding representations of selected source domain items with target domain items. A reward function based on score differences for the next item optimizes the multi-agent system. We evaluate the method on three Amazon domains: Movies_and_TV, Toys_and_Games, and Books. Our proposed model MARL4CDSR outperforms all baselines on all metrics. Specifically, for the Movies&Books<span><math><mo>→</mo></math></span>Toys task, where the target domain interaction sequence is relatively sparse, MARL4CDSR improves NDCG@10 and HR@10 by 14.76% and 10.25%, respectively.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"185 ","pages":"Article 107192"},"PeriodicalIF":6.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025000711","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/22 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential recommendation, where users’ interaction data across multiple source domains are leveraged to enhance recommendations in data-sparse target domains. Despite this, users’ interests in the target and source domains may not align perfectly. Additionally, current research often neglects the collaboration between different transfer strategies across source domains, leading to suboptimal performance. To address these challenges, we propose a multi-agent reinforcement learning framework for cross-domain sequential recommendation (MARL4CDSR). Unlike traditional approaches that transfer knowledge from the entire source domain sequence, MARL4CDSR uses agents to select relevant items from source domain sequences for transfer. This approach optimizes the transfer process by coordinating agents’ strategies within each source domain through a multi-agent reinforcement learning framework. Additionally, we introduce an information fusion module with a cross-attention mechanism to align the embedding representations of selected source domain items with target domain items. A reward function based on score differences for the next item optimizes the multi-agent system. We evaluate the method on three Amazon domains: Movies_and_TV, Toys_and_Games, and Books. Our proposed model MARL4CDSR outperforms all baselines on all metrics. Specifically, for the Movies&Books

\to

Toys task, where the target domain interaction sequence is relatively sparse, MARL4CDSR improves NDCG@10 and HR@10 by 14.76% and 10.25%, respectively.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

跨领域顺序推荐的多智能体强化学习框架。

顺序推荐模型旨在根据用户与之交互的项目顺序（按时间顺序排序）预测下一个项目。然而，这些模型面临着数据稀疏性的挑战。最近的研究探索了跨域顺序推荐，其中利用用户跨多个源域的交互数据来增强数据稀疏目标域中的推荐。尽管如此，用户在目标领域和源领域的兴趣可能并不完全一致。此外，目前的研究往往忽略了跨源域的不同迁移策略之间的协作，导致性能不佳。为了解决这些挑战，我们提出了一个跨域顺序推荐的多智能体强化学习框架（MARL4CDSR）。与从整个源域序列中转移知识的传统方法不同，MARL4CDSR使用代理从源域序列中选择相关项目进行转移。该方法通过多智能体强化学习框架协调每个源域内智能体的策略，从而优化迁移过程。此外，我们还引入了一个具有交叉注意机制的信息融合模块，以使所选源领域项目的嵌入表示与目标领域项目对齐。基于下一个项目得分差异的奖励函数优化了多智能体系统。我们在三个Amazon域上评估该方法：Movies_and_TV， Toys_and_Games和Books。我们提出的模型MARL4CDSR在所有指标上都优于所有基线。具体来说，对于目标域交互序列相对稀疏的Movies&Books→Toys任务，MARL4CDSR分别提高了NDCG@10和HR@10 14.76%和10.25%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Neural Networks 工程技术-计算机：人工智能

CiteScore

13.90

自引率

7.70%

发文量

425

审稿时长

67 days

期刊介绍： Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.