Jun Yin, Zhengxin Zeng, Mingzheng Li, Hao Yan, Chaozhuo Li, Weihao Han, Jianjin Zhang, Ruochen Liu, Allen Sun, Denvy Deng, Feng Sun, Qi Zhang, Shirui Pan, Senzhang Wang
{"title":"Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator","authors":"Jun Yin, Zhengxin Zeng, Mingzheng Li, Hao Yan, Chaozhuo Li, Weihao Han, Jianjin Zhang, Ruochen Liu, Allen Sun, Denvy Deng, Feng Sun, Qi Zhang, Shirui Pan, Senzhang Wang","doi":"arxiv-2409.09253","DOIUrl":null,"url":null,"abstract":"Owing to the unprecedented capability in semantic understanding and logical\nreasoning, the pre-trained large language models (LLMs) have shown fantastic\npotential in developing the next-generation recommender systems (RSs). However,\nthe static index paradigm adopted by current methods greatly restricts the\nutilization of LLMs capacity for recommendation, leading to not only the\ninsufficient alignment between semantic and collaborative knowledge, but also\nthe neglect of high-order user-item interaction patterns. In this paper, we\npropose Twin-Tower Dynamic Semantic Recommender (TTDS), the first generative RS\nwhich adopts dynamic semantic index paradigm, targeting at resolving the above\nproblems simultaneously. To be more specific, we for the first time contrive a\ndynamic knowledge fusion framework which integrates a twin-tower semantic token\ngenerator into the LLM-based recommender, hierarchically allocating meaningful\nsemantic index for items and users, and accordingly predicting the semantic\nindex of target item. Furthermore, a dual-modality variational auto-encoder is\nproposed to facilitate multi-grained alignment between semantic and\ncollaborative knowledge. Eventually, a series of novel tuning tasks specially\ncustomized for capturing high-order user-item interaction patterns are proposed\nto take advantages of user historical behavior. Extensive experiments across\nthree public datasets demonstrate the superiority of the proposed methodology\nin developing LLM-based generative RSs. The proposed TTDS recommender achieves\nan average improvement of 19.41% in Hit-Rate and 20.84% in NDCG metric,\ncompared with the leading baseline methods.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Owing to the unprecedented capability in semantic understanding and logical
reasoning, the pre-trained large language models (LLMs) have shown fantastic
potential in developing the next-generation recommender systems (RSs). However,
the static index paradigm adopted by current methods greatly restricts the
utilization of LLMs capacity for recommendation, leading to not only the
insufficient alignment between semantic and collaborative knowledge, but also
the neglect of high-order user-item interaction patterns. In this paper, we
propose Twin-Tower Dynamic Semantic Recommender (TTDS), the first generative RS
which adopts dynamic semantic index paradigm, targeting at resolving the above
problems simultaneously. To be more specific, we for the first time contrive a
dynamic knowledge fusion framework which integrates a twin-tower semantic token
generator into the LLM-based recommender, hierarchically allocating meaningful
semantic index for items and users, and accordingly predicting the semantic
index of target item. Furthermore, a dual-modality variational auto-encoder is
proposed to facilitate multi-grained alignment between semantic and
collaborative knowledge. Eventually, a series of novel tuning tasks specially
customized for capturing high-order user-item interaction patterns are proposed
to take advantages of user historical behavior. Extensive experiments across
three public datasets demonstrate the superiority of the proposed methodology
in developing LLM-based generative RSs. The proposed TTDS recommender achieves
an average improvement of 19.41% in Hit-Rate and 20.84% in NDCG metric,
compared with the leading baseline methods.