Huiting Liu , Junyi Wei , Kaiwen Zhu , Peipei Li , Peng Zhao , Xindong Wu
{"title":"A multi-agent reinforcement learning framework for cross-domain sequential recommendation","authors":"Huiting Liu , Junyi Wei , Kaiwen Zhu , Peipei Li , Peng Zhao , Xindong Wu","doi":"10.1016/j.neunet.2025.107192","DOIUrl":null,"url":null,"abstract":"<div><div>Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential recommendation, where users’ interaction data across multiple source domains are leveraged to enhance recommendations in data-sparse target domains. Despite this, users’ interests in the target and source domains may not align perfectly. Additionally, current research often neglects the collaboration between different transfer strategies across source domains, leading to suboptimal performance. To address these challenges, we propose a multi-agent reinforcement learning framework for cross-domain sequential recommendation (MARL4CDSR). Unlike traditional approaches that transfer knowledge from the entire source domain sequence, MARL4CDSR uses agents to select relevant items from source domain sequences for transfer. This approach optimizes the transfer process by coordinating agents’ strategies within each source domain through a multi-agent reinforcement learning framework. Additionally, we introduce an information fusion module with a cross-attention mechanism to align the embedding representations of selected source domain items with target domain items. A reward function based on score differences for the next item optimizes the multi-agent system. We evaluate the method on three Amazon domains: Movies_and_TV, Toys_and_Games, and Books. Our proposed model MARL4CDSR outperforms all baselines on all metrics. Specifically, for the Movies&Books<span><math><mo>→</mo></math></span>Toys task, where the target domain interaction sequence is relatively sparse, MARL4CDSR improves NDCG@10 and HR@10 by 14.76% and 10.25%, respectively.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"185 ","pages":"Article 107192"},"PeriodicalIF":6.0000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025000711","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential recommendation, where users’ interaction data across multiple source domains are leveraged to enhance recommendations in data-sparse target domains. Despite this, users’ interests in the target and source domains may not align perfectly. Additionally, current research often neglects the collaboration between different transfer strategies across source domains, leading to suboptimal performance. To address these challenges, we propose a multi-agent reinforcement learning framework for cross-domain sequential recommendation (MARL4CDSR). Unlike traditional approaches that transfer knowledge from the entire source domain sequence, MARL4CDSR uses agents to select relevant items from source domain sequences for transfer. This approach optimizes the transfer process by coordinating agents’ strategies within each source domain through a multi-agent reinforcement learning framework. Additionally, we introduce an information fusion module with a cross-attention mechanism to align the embedding representations of selected source domain items with target domain items. A reward function based on score differences for the next item optimizes the multi-agent system. We evaluate the method on three Amazon domains: Movies_and_TV, Toys_and_Games, and Books. Our proposed model MARL4CDSR outperforms all baselines on all metrics. Specifically, for the Movies&BooksToys task, where the target domain interaction sequence is relatively sparse, MARL4CDSR improves NDCG@10 and HR@10 by 14.76% and 10.25%, respectively.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.