首页 > 最新文献

ACM Transactions on Intelligent Systems and Technology最新文献

英文 中文
DNSRF: Deep Network-based Semi-NMF Representation Framework DNSRF:基于深度网络的半 NMF 表示框架
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-06-03 DOI: 10.1145/3670408
Dexian Wang, Tianrui Li, Ping Deng, Zhipeng Luo, Pengfei Zhang, Keyu Liu, Wei Huang

Representation learning is an important topic in machine learning, pattern recognition, and data mining research. Among many representation learning approaches, semi-nonnegative matrix factorization (SNMF) is a frequently-used one. However, a typical problem of SNMF is that usually there is no learning rate guidance during the optimization process, which often leads to a poor representation ability. To overcome this limitation, we propose a very general representation learning framework (DNSRF) that is based on deep neural net. Essentially, the parameters of the deep net used to construct the DNSRF algorithms are obtained by matrix element update. In combination with different activation functions, DNSRF can be implemented in various ways. In our experiments, we tested nine instances of our DNSRF framework on six benchmark datasets. In comparison with other state-of-the-art methods, the results demonstrate superior performance of our framework, which is thus shown to have a great representation ability.

表示学习是机器学习、模式识别和数据挖掘研究中的一个重要课题。在众多表示学习方法中,半负矩阵因式分解(SNMF)是一种常用的方法。然而,SNMF 的一个典型问题是在优化过程中通常没有学习率的指导,这往往会导致表示能力较差。为了克服这一局限,我们提出了一种基于深度神经网络的通用表示学习框架(DNSRF)。从本质上讲,用于构建 DNSRF 算法的深度网参数是通过矩阵元素更新获得的。结合不同的激活函数,DNSRF 可以以多种方式实现。在实验中,我们在六个基准数据集上测试了 DNSRF 框架的九个实例。与其他最先进的方法相比,结果表明我们的框架性能优越,因此具有很强的表示能力。
{"title":"DNSRF: Deep Network-based Semi-NMF Representation Framework","authors":"Dexian Wang, Tianrui Li, Ping Deng, Zhipeng Luo, Pengfei Zhang, Keyu Liu, Wei Huang","doi":"10.1145/3670408","DOIUrl":"https://doi.org/10.1145/3670408","url":null,"abstract":"<p>Representation learning is an important topic in machine learning, pattern recognition, and data mining research. Among many representation learning approaches, semi-nonnegative matrix factorization (SNMF) is a frequently-used one. However, a typical problem of SNMF is that usually there is no learning rate guidance during the optimization process, which often leads to a poor representation ability. To overcome this limitation, we propose a very general representation learning framework (DNSRF) that is based on deep neural net. Essentially, the parameters of the deep net used to construct the DNSRF algorithms are obtained by matrix element update. In combination with different activation functions, DNSRF can be implemented in various ways. In our experiments, we tested nine instances of our DNSRF framework on six benchmark datasets. In comparison with other state-of-the-art methods, the results demonstrate superior performance of our framework, which is thus shown to have a great representation ability.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141256071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Faithfulness and Factuality with Contrastive Learning in Explainable Recommendation 在可解释的推荐中通过对比学习提高忠实性和事实性
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-25 DOI: 10.1145/3653984
Haojie Zhuang, Wei Zhang, Weitong Chen, Jian Yang, Quan Z. Sheng

Recommender systems have become increasingly important in navigating the vast amount of information and options available in various domains. By tailoring and personalizing recommendations to user preferences and interests, these systems improve the user experience, efficiency and satisfaction. With a growing demand for transparency and understanding of recommendation outputs, explainable recommender systems have gained growing attention in recent years. Additionally, as user reviews could be considered the rationales behind why the user likes (or dislikes) the products, generating informative and reliable reviews alongside recommendations has thus emerged as a research focus in explainable recommendation. However, the model-generated reviews might contain factual inconsistent contents (i.e., the hallucination issue), which would thus compromise the recommendation rationales. To address this issue, we propose a contrastive learning framework to improve the faithfulness and factuality in explainable recommendation in this paper. We further develop different strategies of generating positive and negative examples for contrastive learning, such as back-translation or synonym substitution for positive examples, and editing positive examples or utilizing model-generated texts for negative examples. Our proposed method optimizes the model to distinguish faithful explanations (i.e., positive examples) and unfaithful ones with factual errors (i.e., negative examples), which thus drives the model to generate faithful reviews as explanations while avoiding inconsistent contents. Extensive experiments and analysis on three benchmark datasets show that our proposed model outperforms other review generation baselines in faithfulness and factuality. In addition, the proposed contrastive learning component could be easily incorporated into other explainable recommender systems in a plug-and-play manner.

在浏览各领域的大量信息和选项时,推荐系统变得越来越重要。通过根据用户偏好和兴趣定制个性化推荐,这些系统可以改善用户体验,提高效率和满意度。随着人们对推荐结果的透明度和理解力的要求越来越高,可解释的推荐系统近年来受到越来越多的关注。此外,由于用户评论可被视为用户喜欢(或不喜欢)产品背后的理由,因此在推荐的同时生成信息丰富且可靠的评论已成为可解释推荐的研究重点。然而,模型生成的评论可能会包含与事实不符的内容(即幻觉问题),从而影响推荐的合理性。为了解决这个问题,我们在本文中提出了一个对比学习框架,以提高可解释推荐的忠实性和事实性。我们进一步开发了用于对比学习的生成正面和负面示例的不同策略,例如正面示例的回译或同义词替换,负面示例的编辑正面示例或利用模型生成的文本。我们提出的方法优化了模型,以区分忠实的解释(即正面例子)和有事实错误的不忠实解释(即负面例子),从而促使模型生成忠实的评论作为解释,同时避免不一致的内容。在三个基准数据集上进行的大量实验和分析表明,我们提出的模型在忠实性和事实性方面优于其他评论生成基线。此外,我们提出的对比学习组件可以即插即用的方式轻松地集成到其他可解释的推荐系统中。
{"title":"Improving Faithfulness and Factuality with Contrastive Learning in Explainable Recommendation","authors":"Haojie Zhuang, Wei Zhang, Weitong Chen, Jian Yang, Quan Z. Sheng","doi":"10.1145/3653984","DOIUrl":"https://doi.org/10.1145/3653984","url":null,"abstract":"<p>Recommender systems have become increasingly important in navigating the vast amount of information and options available in various domains. By tailoring and personalizing recommendations to user preferences and interests, these systems improve the user experience, efficiency and satisfaction. With a growing demand for transparency and understanding of recommendation outputs, explainable recommender systems have gained growing attention in recent years. Additionally, as user reviews could be considered the rationales behind why the user likes (or dislikes) the products, generating informative and reliable reviews alongside recommendations has thus emerged as a research focus in explainable recommendation. However, the model-generated reviews might contain factual inconsistent contents (i.e., the hallucination issue), which would thus compromise the recommendation rationales. To address this issue, we propose a contrastive learning framework to improve the faithfulness and factuality in explainable recommendation in this paper. We further develop different strategies of generating positive and negative examples for contrastive learning, such as back-translation or synonym substitution for positive examples, and editing positive examples or utilizing model-generated texts for negative examples. Our proposed method optimizes the model to distinguish faithful explanations (i.e., positive examples) and unfaithful ones with factual errors (i.e., negative examples), which thus drives the model to generate faithful reviews as explanations while avoiding inconsistent contents. Extensive experiments and analysis on three benchmark datasets show that our proposed model outperforms other review generation baselines in faithfulness and factuality. In addition, the proposed contrastive learning component could be easily incorporated into other explainable recommender systems in a plug-and-play manner.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Federated Social Recommendation Approach with Enhanced Hypergraph Neural Network 使用增强超图神经网络的联合社交推荐方法
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-24 DOI: 10.1145/3665931
Hongliang Sun, Zhiying Tu, Dianbo Sui, Bolin Zhang, Xiaofei Xu

In recent years, the development of online social network platforms has led to increased research efforts in social recommendation systems. Unlike traditional recommendation systems, social recommendation systems utilize both user-item interactions and user-user social relations to recommend relevant items, taking into account social homophily and social influence. Graph neural network (GNN) based social recommendation methods have been proposed to model these item interactions and social relations effectively. However, existing GNN-based methods rely on centralized training, which raises privacy concerns and faces challenges in data collection due to regulations and privacy restrictions. Federated learning has emerged as a privacy-preserving alternative. Combining federated learning with GNN-based methods for social recommendation can leverage their respective advantages, but it also introduces new challenges: 1) existing federated recommendation systems often lack the capability to process heterogeneous data, such as user-item interactions and social relations; 2) due to the sparsity of data distributed across different clients, capturing the higher-order relationship information among users becomes challenging and is often overlooked by most federated recommendation systems. To overcome these challenges, we propose a federated social recommendation approach with enhanced hypergraph neural network. We introduce hypergraph graph neural networks (HGNN) to learn user and item embeddings in federated recommendation systems, leveraging the hypergraph structure to address the heterogeneity of data. Based on carefully crafted triangular motifs, we merge user and item nodes to construct hypergraphs on local clients, capturing specific triangular relations. Multiple HGNN channels are used to encode different categories of high-order relations, and an attention mechanism is applied to aggregate the embedded information from these channels. Our experiments on real-world social recommendation datasets demonstrate the effectiveness of the proposed approach. Extensive experiment results on three publicly available datasets validate the effectiveness of the proposed method.

近年来,随着在线社交网络平台的发展,社交推荐系统的研究工作日益增多。与传统的推荐系统不同,社交推荐系统既利用用户与项目之间的互动,也利用用户与用户之间的社交关系来推荐相关项目,同时考虑到社交同质性和社交影响力。基于图神经网络(GNN)的社交推荐方法已被提出,以有效地模拟这些项目交互和社交关系。然而,现有的基于图神经网络的方法依赖于集中式训练,这会引发隐私问题,并且由于法规和隐私限制,在数据收集方面面临挑战。联盟学习作为一种保护隐私的替代方法应运而生。将联合学习与基于 GNN 的社交推荐方法相结合,可以发挥各自的优势,但也带来了新的挑战:1)现有的联合推荐系统往往缺乏处理异构数据的能力,如用户-项目交互和社会关系;2)由于分布在不同客户端的数据稀少,捕捉用户之间的高阶关系信息变得具有挑战性,而且往往被大多数联合推荐系统所忽视。为了克服这些挑战,我们提出了一种使用增强超图神经网络的联合社交推荐方法。我们引入超图神经网络(HGNN)来学习联合推荐系统中的用户和项目嵌入,利用超图结构来解决数据的异质性问题。基于精心制作的三角形图案,我们合并用户和项目节点,在本地客户端上构建超图,捕捉特定的三角形关系。我们使用多个 HGNN 通道来编码不同类别的高阶关系,并采用注意力机制来聚合这些通道中的嵌入信息。我们在真实世界社交推荐数据集上的实验证明了所提方法的有效性。在三个公开数据集上的广泛实验结果验证了所提方法的有效性。
{"title":"A Federated Social Recommendation Approach with Enhanced Hypergraph Neural Network","authors":"Hongliang Sun, Zhiying Tu, Dianbo Sui, Bolin Zhang, Xiaofei Xu","doi":"10.1145/3665931","DOIUrl":"https://doi.org/10.1145/3665931","url":null,"abstract":"<p>In recent years, the development of online social network platforms has led to increased research efforts in social recommendation systems. Unlike traditional recommendation systems, social recommendation systems utilize both user-item interactions and user-user social relations to recommend relevant items, taking into account social homophily and social influence. Graph neural network (GNN) based social recommendation methods have been proposed to model these item interactions and social relations effectively. However, existing GNN-based methods rely on centralized training, which raises privacy concerns and faces challenges in data collection due to regulations and privacy restrictions. Federated learning has emerged as a privacy-preserving alternative. Combining federated learning with GNN-based methods for social recommendation can leverage their respective advantages, but it also introduces new challenges: 1) existing federated recommendation systems often lack the capability to process heterogeneous data, such as user-item interactions and social relations; 2) due to the sparsity of data distributed across different clients, capturing the higher-order relationship information among users becomes challenging and is often overlooked by most federated recommendation systems. To overcome these challenges, we propose a federated social recommendation approach with enhanced hypergraph neural network. We introduce hypergraph graph neural networks (HGNN) to learn user and item embeddings in federated recommendation systems, leveraging the hypergraph structure to address the heterogeneity of data. Based on carefully crafted triangular motifs, we merge user and item nodes to construct hypergraphs on local clients, capturing specific triangular relations. Multiple HGNN channels are used to encode different categories of high-order relations, and an attention mechanism is applied to aggregate the embedded information from these channels. Our experiments on real-world social recommendation datasets demonstrate the effectiveness of the proposed approach. Extensive experiment results on three publicly available datasets validate the effectiveness of the proposed method.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incremental Data Drifting: Evaluation Metrics, Data Generation, and Approach Comparison 增量数据漂移:评估指标、数据生成和方法比较
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-24 DOI: 10.1145/3655630
Yu-Tung Pai, Nien-En Sun, Cheng-Te Li, Shou-de Lin
Incremental data drifting is a common problem when employing a machine-learning model in industrial applications. The underlying data distribution evolves gradually, e.g., users change their buying preferences on an E-commerce website over time. The problem needs to be addressed to obtain high performance. Right now, studies regarding incremental data drifting suffer from several issues. For one thing, there is a lack of clear-defined incremental drift datasets for examination. Existing efforts use either collected real datasets or synthetic datasets that show two obvious limitations. One is in particular when and of which type of drifts the distribution undergoes is unknown, and the other is that a simple synthesized dataset cannot reflect the complex representation we would normally face in the real world. For another, there lacks of a well-defined protocol to evaluate a learner’s knowledge transfer capability on an incremental drift dataset. To provide a holistic discussion on these issues, we create approaches to generate datasets with specific drift types, and define a novel protocol for evaluation. Besides, we investigate recent advances in the transfer learning field, including Domain Adaptation and Lifelong Learning, and examine how they perform in the presence of incremental data drifting. The results unfold the relationships among drift types, knowledge preservation, and learning approaches.
在工业应用中使用机器学习模型时,增量数据漂移是一个常见问题。底层数据分布会逐渐变化,例如,用户在电子商务网站上的购买偏好会随着时间的推移而改变。要想获得高性能,就必须解决这个问题。目前,有关增量数据漂移的研究存在几个问题。首先,缺乏明确定义的增量漂移数据集来进行研究。现有的研究要么使用收集的真实数据集,要么使用合成数据集,这两种数据集都有两个明显的局限性。一个是分布何时发生漂移以及发生哪种漂移尚不可知,另一个是简单的合成数据集无法反映我们在现实世界中通常会遇到的复杂情况。此外,还缺乏一个定义明确的协议来评估学习者在增量漂移数据集上的知识迁移能力。为了对这些问题进行全面讨论,我们创建了生成特定漂移类型数据集的方法,并定义了新颖的评估协议。此外,我们还研究了迁移学习领域的最新进展,包括领域适应和终身学习,并考察了它们在增量数据漂移情况下的表现。研究结果揭示了漂移类型、知识保存和学习方法之间的关系。
{"title":"Incremental Data Drifting: Evaluation Metrics, Data Generation, and Approach Comparison","authors":"Yu-Tung Pai, Nien-En Sun, Cheng-Te Li, Shou-de Lin","doi":"10.1145/3655630","DOIUrl":"https://doi.org/10.1145/3655630","url":null,"abstract":"Incremental data drifting is a common problem when employing a machine-learning model in industrial applications. The underlying data distribution evolves gradually, e.g., users change their buying preferences on an E-commerce website over time. The problem needs to be addressed to obtain high performance. Right now, studies regarding incremental data drifting suffer from several issues. For one thing, there is a lack of clear-defined incremental drift datasets for examination. Existing efforts use either collected real datasets or synthetic datasets that show two obvious limitations. One is in particular when and of which type of drifts the distribution undergoes is unknown, and the other is that a simple synthesized dataset cannot reflect the complex representation we would normally face in the real world. For another, there lacks of a well-defined protocol to evaluate a learner’s knowledge transfer capability on an incremental drift dataset. To provide a holistic discussion on these issues, we create approaches to generate datasets with specific drift types, and define a novel protocol for evaluation. Besides, we investigate recent advances in the transfer learning field, including Domain Adaptation and Lifelong Learning, and examine how they perform in the presence of incremental data drifting. The results unfold the relationships among drift types, knowledge preservation, and learning approaches.","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141098931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-Agnostic Adaptive Testing for Intelligent Education Systems via Meta-learned Gradient Embeddings 通过元学习梯度嵌入实现智能教育系统的模型诊断自适应测试
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-23 DOI: 10.1145/3660642
Haoyang Bi, Qi Liu, Han Wu, Weidong He, Zhenya Huang, Yu Yin, Haiping Ma, Yu Su, Shijin Wang, Enhong Chen
The field of education has undergone a significant revolution with the advent of intelligent systems and technology, which aim to personalize the learning experience, catering to the unique needs and abilities of individual learners. In this pursuit, a fundamental challenge is designing proper test for assessing the students’ cognitive status on knowledge and skills accurately and efficiently. One promising approach, referred to as Computerized Adaptive Testing (CAT), is to administrate computer-automated tests that alternately select the next item for each examinee and estimate their cognitive states given their responses to the selected items. Nevertheless, existing CAT systems suffer from inflexibility in item selection and ineffectiveness in cognitive state estimation, respectively. In this paper, we propose a Model-Agnostic adaptive testing framework via Meta-leaned Gradient Embeddings, MAMGE for short, improving both item selection and cognitive state estimation simultaneously. For item selection, we design a Gradient Embedding based Item Selector (GEIS) which incorporates the concept of gradient embeddings to represent items and selects the best ones that are both informative and representative. For cognitive state estimation, we propose a Meta-learned Cognitive State Estimator (MCSE) to automatically control the estimation process by learning to learn a proper initialization and dynamically inferred updates. Both MCSE and GEIS are inherently model-agnostic, and the two modules have an ingenious connection via meta-learned gradient embeddings. Finally, extensive experiments evaluate the effectiveness and flexibility of MAMGE.
随着智能系统和技术的出现,教育领域经历了一场重大变革,其目的是提供个性化的学习体验,满足学习者的独特需求和能力。在这一过程中,一个基本挑战是设计适当的测试,以准确有效地评估学生对知识和技能的认知状况。计算机化自适应测试(Computerized Adaptive Testing,简称 CAT)是一种很有前途的方法,它通过计算机自动测试,交替为每个考生选择下一个测试项目,并根据他们对所选项目的反应来估计他们的认知状态。然而,现有的计算机辅助测试系统在项目选择方面缺乏灵活性,在认知状态估计方面效果不佳。在本文中,我们通过元倾斜梯度嵌入(简称 MAMGE)提出了一种模型诊断自适应测试框架,可同时改进项目选择和认知状态估计。在项目选择方面,我们设计了一种基于梯度嵌入的项目选择器(GEIS),它结合了梯度嵌入的概念来表示项目,并选择信息量大且具有代表性的最佳项目。在认知状态估计方面,我们提出了元学习认知状态估计器(MCSE),通过学习适当的初始化和动态推断更新来自动控制估计过程。MCSE 和 GEIS 本身与模型无关,这两个模块通过元学习梯度嵌入巧妙地联系在一起。最后,大量实验评估了 MAMGE 的有效性和灵活性。
{"title":"Model-Agnostic Adaptive Testing for Intelligent Education Systems via Meta-learned Gradient Embeddings","authors":"Haoyang Bi, Qi Liu, Han Wu, Weidong He, Zhenya Huang, Yu Yin, Haiping Ma, Yu Su, Shijin Wang, Enhong Chen","doi":"10.1145/3660642","DOIUrl":"https://doi.org/10.1145/3660642","url":null,"abstract":"\u0000 The field of education has undergone a significant revolution with the advent of intelligent systems and technology, which aim to personalize the learning experience, catering to the unique needs and abilities of individual learners. In this pursuit, a fundamental challenge is designing proper test for assessing the students’ cognitive status on knowledge and skills accurately and efficiently. One promising approach, referred to as\u0000 Computerized Adaptive Testing\u0000 (CAT), is to administrate computer-automated tests that alternately select the next item for each examinee and estimate their cognitive states given their responses to the selected items. Nevertheless, existing CAT systems suffer from inflexibility in item selection and ineffectiveness in cognitive state estimation, respectively. In this paper, we propose a Model-Agnostic adaptive testing framework via Meta-leaned Gradient Embeddings, MAMGE for short, improving both item selection and cognitive state estimation simultaneously. For item selection, we design a Gradient Embedding based Item Selector (GEIS) which incorporates the concept of gradient embeddings to represent items and selects the best ones that are both informative and representative. For cognitive state estimation, we propose a Meta-learned Cognitive State Estimator (MCSE) to automatically control the estimation process by learning to learn a proper initialization and dynamically inferred updates. Both MCSE and GEIS are inherently model-agnostic, and the two modules have an ingenious connection via meta-learned gradient embeddings. Finally, extensive experiments evaluate the effectiveness and flexibility of MAMGE.\u0000","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141105249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fairness and Diversity in Recommender Systems: A Survey 推荐系统的公平性和多样性:调查
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-21 DOI: 10.1145/3664928
Yuying Zhao, Yu Wang, Yunchao Liu, Xueqi Cheng, Charu C. Aggarwal, Tyler Derr

Recommender systems (RS) are effective tools for mitigating information overload and have seen extensive applications across various domains. However, the single focus on utility goals proves to be inadequate in addressing real-world concerns, leading to increasing attention to fairness-aware and diversity-aware RS. While most existing studies explore fairness and diversity independently, we identify strong connections between these two domains. In this survey, we first discuss each of them individually and then dive into their connections. Additionally, motivated by the concepts of user-level and item-level fairness, we broaden the understanding of diversity to encompass not only the item level but also the user level. With this expanded perspective on user and item-level diversity, we re-interpret fairness studies from the viewpoint of diversity. This fresh perspective enhances our understanding of fairness-related work and paves the way for potential future research directions. Papers discussed in this survey along with public code links are available at: https://github.com/YuyingZhao/Awesome-Fairness-and-Diversity-Papers-in-Recommender-Systems.

推荐系统(RS)是减轻信息过载的有效工具,已在各个领域得到广泛应用。然而,事实证明,只关注效用目标不足以解决现实世界中的问题,因此公平感知和多样性感知的推荐系统越来越受到关注。虽然现有的大多数研究都是单独探讨公平性和多样性的,但我们发现这两个领域之间存在紧密联系。在本调查中,我们首先单独讨论这两个领域,然后深入探讨它们之间的联系。此外,受用户层面和项目层面公平性概念的启发,我们拓宽了对多样性的理解,使其不仅包括项目层面,还包括用户层面。有了这种对用户和项目层面多样性的扩展视角,我们就能从多样性的角度重新诠释公平性研究。这种全新的视角增强了我们对公平性相关工作的理解,并为未来潜在的研究方向铺平了道路。本调查中讨论的论文以及公共代码链接可在以下网址获取:https://github.com/YuyingZhao/Awesome-Fairness-and-Diversity-Papers-in-Recommender-Systems。
{"title":"Fairness and Diversity in Recommender Systems: A Survey","authors":"Yuying Zhao, Yu Wang, Yunchao Liu, Xueqi Cheng, Charu C. Aggarwal, Tyler Derr","doi":"10.1145/3664928","DOIUrl":"https://doi.org/10.1145/3664928","url":null,"abstract":"<p>Recommender systems (RS) are effective tools for mitigating information overload and have seen extensive applications across various domains. However, the single focus on utility goals proves to be inadequate in addressing real-world concerns, leading to increasing attention to fairness-aware and diversity-aware RS. While most existing studies explore fairness and diversity independently, we identify strong connections between these two domains. In this survey, we first discuss each of them individually and then dive into their connections. Additionally, motivated by the concepts of user-level and item-level fairness, we broaden the understanding of diversity to encompass not only the item level but also the user level. With this expanded perspective on user and item-level diversity, we re-interpret fairness studies from the viewpoint of diversity. This fresh perspective enhances our understanding of fairness-related work and paves the way for potential future research directions. Papers discussed in this survey along with public code links are available at: https://github.com/YuyingZhao/Awesome-Fairness-and-Diversity-Papers-in-Recommender-Systems.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PerFedRec++: Enhancing Personalized Federated Recommendation with Self-Supervised Pre-Training PerFedRec++:利用自监督预培训增强个性化联合推荐
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-14 DOI: 10.1145/3664927
Sichun Luo, Yuanzhang Xiao, Xinyi Zhang, Yang Liu, Wenbo Ding, Linqi Song

Federated recommendation systems employ federated learning techniques to safeguard user privacy by transmitting model parameters instead of raw user data between user devices and the central server. Nevertheless, the current federated recommender system faces three significant challenges: (1) data heterogeneity: the heterogeneity of users’ attributes and local data necessitates the acquisition of personalized models to improve the performance of federated recommendation; (2) model performance degradation: the privacy-preserving protocol design in the federated recommendation, such as pseudo item labeling and differential privacy, would deteriorate the model performance; (3) communication bottleneck: the standard federated recommendation algorithm can have a high communication overhead. Previous studies have attempted to address these issues, but none have been able to solve them simultaneously.

In this paper, we propose a novel framework, named PerFedRec++, to enhance the personalized federated recommendation with self-supervised pre-training. Specifically, we utilize the privacy-preserving mechanism of federated recommender systems to generate two augmented graph views, which are used as contrastive tasks in self-supervised graph learning to pre-train the model. Pre-training enhances the performance of federated models by improving the uniformity of representation learning. Also, by providing a better initial state for federated training, pre-training makes the overall training converge faster, thus alleviating the heavy communication burden. We then construct a collaborative graph to learn the client representation through a federated graph neural network. Based on these learned representations, we cluster users into different user groups and learn personalized models for each cluster. Each user learns a personalized model by combining the global federated model, the cluster-level federated model, and its own fine-tuned local model. Experiments on three real-world datasets show that our proposed method achieves superior performance over existing methods.

联盟推荐系统采用联盟学习技术,通过在用户设备和中央服务器之间传输模型参数而非原始用户数据来保护用户隐私。然而,当前的联合推荐系统面临三个重大挑战:(1)数据异构:用户属性和本地数据的异构要求获取个性化模型,以提高联合推荐的性能;(2)模型性能下降:联合推荐中的隐私保护协议设计,如伪项目标签和差分隐私,会使模型性能下降;(3)通信瓶颈:标准的联合推荐算法会有很高的通信开销。在本文中,我们提出了一个名为 PerFedRec++ 的新框架,通过自监督预训练来增强个性化联合推荐。具体来说,我们利用联合推荐系统的隐私保护机制来生成两个增强图视图,并将其作为自监督图学习中的对比任务来预训练模型。预训练可以提高表征学习的一致性,从而增强联合模型的性能。同时,通过为联合训练提供更好的初始状态,预训练使整体训练收敛得更快,从而减轻了沉重的通信负担。然后,我们构建一个协作图,通过联合图神经网络学习客户端表示。基于这些学习到的表征,我们将用户聚类为不同的用户组,并为每个聚类学习个性化模型。每个用户通过结合全局联合模型、集群级联合模型和自己的微调本地模型来学习个性化模型。在三个真实数据集上的实验表明,我们提出的方法比现有方法性能更优越。
{"title":"PerFedRec++: Enhancing Personalized Federated Recommendation with Self-Supervised Pre-Training","authors":"Sichun Luo, Yuanzhang Xiao, Xinyi Zhang, Yang Liu, Wenbo Ding, Linqi Song","doi":"10.1145/3664927","DOIUrl":"https://doi.org/10.1145/3664927","url":null,"abstract":"<p>Federated recommendation systems employ federated learning techniques to safeguard user privacy by transmitting model parameters instead of raw user data between user devices and the central server. Nevertheless, the current federated recommender system faces three significant challenges: (1) <i>data heterogeneity:</i> the heterogeneity of users’ attributes and local data necessitates the acquisition of personalized models to improve the performance of federated recommendation; (2) <i>model performance degradation:</i> the privacy-preserving protocol design in the federated recommendation, such as pseudo item labeling and differential privacy, would deteriorate the model performance; (3) <i>communication bottleneck:</i> the standard federated recommendation algorithm can have a high communication overhead. Previous studies have attempted to address these issues, but none have been able to solve them simultaneously.</p><p>In this paper, we propose a novel framework, named <monospace>PerFedRec++</monospace>, to enhance the personalized federated recommendation with self-supervised pre-training. Specifically, we utilize the privacy-preserving mechanism of federated recommender systems to generate two augmented graph views, which are used as contrastive tasks in self-supervised graph learning to pre-train the model. Pre-training enhances the performance of federated models by improving the uniformity of representation learning. Also, by providing a better initial state for federated training, pre-training makes the overall training converge faster, thus alleviating the heavy communication burden. We then construct a collaborative graph to learn the client representation through a federated graph neural network. Based on these learned representations, we cluster users into different user groups and learn personalized models for each cluster. Each user learns a personalized model by combining the global federated model, the cluster-level federated model, and its own fine-tuned local model. Experiments on three real-world datasets show that our proposed method achieves superior performance over existing methods.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Recommendation Biases via Group-Alignment and Global-Uniformity in Representation Learning 通过表征学习中的组对齐和全局一致性减轻推荐偏差
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-14 DOI: 10.1145/3664931
Miaomiao Cai, Min Hou, Lei Chen, Le Wu, Haoyue Bai, Yong Li, Meng Wang

Collaborative Filtering (CF) plays a crucial role in modern recommender systems, leveraging historical user-item interactions to provide personalized suggestions. However, CF-based methods often encounter biases due to imbalances in training data. This phenomenon makes CF-based methods tend to prioritize recommending popular items and performing unsatisfactorily on inactive users. Existing works address this issue by rebalancing training samples, reranking recommendation results, or making the modeling process robust to the bias. Despite their effectiveness, these approaches can compromise accuracy or be sensitive to weighting strategies, making them challenging to train. Therefore, exploring how to mitigate these biases remains in urgent demand.

In this paper, we deeply analyze the causes and effects of the biases and propose a framework to alleviate biases in recommendation from the perspective of representation distribution, namely Group-Alignment and Global-Uniformity Enhanced Representation Learning for Debiasing Recommendation (AURL). Specifically, we identify two significant problems in the representation distribution of users and items, namely group-discrepancy and global-collapse. These two problems directly lead to biases in the recommendation results. To this end, we propose two simple but effective regularizers in the representation space, respectively named group-alignment and global-uniformity. The goal of group-alignment is to bring the representation distribution of long-tail entities closer to that of popular entities, while global-uniformity aims to preserve the information of entities as much as possible by evenly distributing representations. Our method directly optimizes both the group-alignment and global-uniformity regularization terms to mitigate recommendation biases. Please note that AURL applies to arbitrary CF-based recommendation backbones. Extensive experiments on three real datasets and various recommendation backbones verify the superiority of our proposed framework. The results show that AURL not only outperforms existing debiasing models in mitigating biases but also improves recommendation performance to some extent.

协同过滤(CF)在现代推荐系统中发挥着至关重要的作用,它利用用户与项目之间的历史互动来提供个性化建议。然而,由于训练数据的不平衡,基于协同过滤的方法经常会遇到偏差。这种现象使得基于 CF 的方法倾向于优先推荐热门项目,而对不活跃用户的推荐效果则不尽人意。现有的工作通过重新平衡训练样本、重新排序推荐结果或使建模过程对偏差具有鲁棒性来解决这一问题。尽管这些方法很有效,但它们可能会影响准确性或对加权策略很敏感,从而给训练带来挑战。在本文中,我们深入分析了偏差的原因和影响,并从表征分布的角度提出了一个减轻推荐偏差的框架,即用于去偏差推荐的组对齐和全局均匀性增强表征学习(AURL)。具体来说,我们发现用户和项目的表征分布存在两个重要问题,即群体差异和全局塌陷。这两个问题会直接导致推荐结果出现偏差。为此,我们在表征空间中提出了两个简单而有效的正则,分别称为组对齐(group-alignment)和全局均匀性(global-uniformity)。组对齐的目的是使长尾实体的表示分布更接近流行实体的表示分布,而全局均匀性的目的是通过均匀分布表示来尽可能保留实体的信息。我们的方法直接优化了组对齐和全局均匀性正则化项,以减少推荐偏差。请注意,AURL 适用于任意基于 CF 的推荐骨干网。在三个真实数据集和各种推荐骨干网上进行的广泛实验验证了我们提出的框架的优越性。结果表明,AURL 不仅在减轻偏差方面优于现有的去除法模型,还在一定程度上提高了推荐性能。
{"title":"Mitigating Recommendation Biases via Group-Alignment and Global-Uniformity in Representation Learning","authors":"Miaomiao Cai, Min Hou, Lei Chen, Le Wu, Haoyue Bai, Yong Li, Meng Wang","doi":"10.1145/3664931","DOIUrl":"https://doi.org/10.1145/3664931","url":null,"abstract":"<p>Collaborative Filtering (CF) plays a crucial role in modern recommender systems, leveraging historical user-item interactions to provide personalized suggestions. However, CF-based methods often encounter biases due to imbalances in training data. This phenomenon makes CF-based methods tend to prioritize recommending popular items and performing unsatisfactorily on inactive users. Existing works address this issue by rebalancing training samples, reranking recommendation results, or making the modeling process robust to the bias. Despite their effectiveness, these approaches can compromise accuracy or be sensitive to weighting strategies, making them challenging to train. Therefore, exploring how to mitigate these biases remains in urgent demand.</p><p>In this paper, we deeply analyze the causes and effects of the biases and propose a framework to alleviate biases in recommendation from the perspective of representation distribution, namely <b><i>Group-<underline>A</underline>lignment and Global-<underline>U</underline>niformity Enhanced <underline>R</underline>epresentation <underline>L</underline>earning for Debiasing Recommendation (AURL)</i></b>. Specifically, we identify two significant problems in the representation distribution of users and items, namely group-discrepancy and global-collapse. These two problems directly lead to biases in the recommendation results. To this end, we propose two simple but effective regularizers in the representation space, respectively named group-alignment and global-uniformity. The goal of group-alignment is to bring the representation distribution of long-tail entities closer to that of popular entities, while global-uniformity aims to preserve the information of entities as much as possible by evenly distributing representations. Our method directly optimizes both the group-alignment and global-uniformity regularization terms to mitigate recommendation biases. Please note that <i>AURL</i> applies to arbitrary CF-based recommendation backbones. Extensive experiments on three real datasets and various recommendation backbones verify the superiority of our proposed framework. The results show that <i>AURL</i> not only outperforms existing debiasing models in mitigating biases but also improves recommendation performance to some extent.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fair Projections as a Means Towards Balanced Recommendations 将公平预测作为平衡建议的手段
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-14 DOI: 10.1145/3664929
Aris Anagnostopoulos, Luca Becchetti, Matteo Böhm, Adriano Fazzone, Stefano Leonardi, Cristina Menghini, Chris Schwiegelshohn

The goal of recommender systems is to provide to users suggestions that match their interests, with the eventual goal of increasing their satisfaction, as measured by the number of transactions (clicks, purchases, etc.). Often, this leads to providing recommendations that are of a particular type. For some contexts (e.g., browsing videos for information) this may be undesirable, as it may enforce the creation of filter bubbles. This is because of the existence of underlying bias in the input data of prior user actions.

Reducing hidden bias in the data and ensuring fairness in algorithmic data analysis has recently received significant attention. In this paper, we consider both the densest subgraph and the (k)-clustering problem, two primitives that are being used by some recommender systems. We are given a coloring on the nodes, respectively the points, and aim to compute a fair solution (S), consisting of a subgraph or a clustering, such that none of the colors is disparately impacted by the solution.

Unfortunately, introducing fair solutions typically makes these problems substantially more difficult. Unlike the unconstrained densest subgraph problem, which is solvable in polynomial time, the fair densest subgraph problem is NP-hard even to approximate. For (k)-clustering, the fairness constraints make the problem very similar to capacitated clustering, which is a notoriously hard problem to even approximate.

Despite such negative premises, we are able to provide positive results in important use cases. In particular, we are able to prove that a suitable spectral embedding allows recovery of an almost optimal, fair, dense subgraph hidden in the input data, whenever one is present, a result that is further supported by experimental evidence.

We also show a polynomial-time, (2)-approximation algorithm to the problem of fair densest subgraph, assuming that there exist only two colors and both colors occur equally often in the graph. This result turns out to be optimal assuming the small set expansion hypothesis. For fair (k)-clustering, we show that we can recover high quality fair clusterings effectively and efficiently. For the special case of (k)-median and (k)-center, we offer additional, fast and simple approximation algorithms as well as new hardness results.

The above theoretical findings drive the design of heuristics, which we experimentally evaluate on a scenario based on real data, in which our aim is to strike a good balance between diversity and highly correlated items from Amazon co-purchasing graphs and facebook contacts. We additionally evaluated our algorithmic solutions for the fair (k)-median problem through experiments on various real-world datasets.

推荐系统的目标是向用户提供符合他们兴趣的建议,最终目的是提高他们的满意度,这可以用交易次数(点击、购买等)来衡量。这通常会导致提供特定类型的推荐。在某些情况下(如浏览视频获取信息),这种做法可能并不可取,因为它可能会强制产生过滤气泡。这是因为先前用户操作的输入数据中存在潜在的偏差。减少数据中隐藏的偏差并确保算法数据分析的公平性最近受到了广泛关注。在本文中,我们同时考虑了最密子图和(k)聚类问题,这是一些推荐系统正在使用的两个基本原理。我们分别给定了节点和点的着色,目的是计算出一个公平的解决方案,包括一个子图或一个聚类,使得没有任何一种颜色受到该解决方案的影响。与可在多项式时间内求解的无约束最密子图问题不同,公平最密子图问题甚至连近似都很难。对于(k)聚类来说,公平性约束使得这个问题非常类似于容纳聚类,而容纳聚类是一个臭名昭著的甚至难以近似的问题。我们还展示了一种多项式时间、(2)-近似算法来解决公平最密子图问题,假设只有两种颜色,并且这两种颜色在图中出现的频率相同。结果证明,假设小集扩展假设,这一结果是最优的。对于公平聚类,我们证明可以有效地恢复高质量的公平聚类。上述理论发现推动了启发式算法的设计,我们在一个基于真实数据的场景中对其进行了实验评估,在该场景中,我们的目标是在亚马逊共同购买图和Facebook联系人中的多样性和高度相关的项目之间取得良好的平衡。此外,我们还通过在各种真实世界数据集上的实验,评估了我们针对公平(k)-中值问题的算法解决方案。
{"title":"Fair Projections as a Means Towards Balanced Recommendations","authors":"Aris Anagnostopoulos, Luca Becchetti, Matteo Böhm, Adriano Fazzone, Stefano Leonardi, Cristina Menghini, Chris Schwiegelshohn","doi":"10.1145/3664929","DOIUrl":"https://doi.org/10.1145/3664929","url":null,"abstract":"<p>The goal of recommender systems is to provide to users suggestions that match their interests, with the eventual goal of increasing their satisfaction, as measured by the number of transactions (clicks, purchases, etc.). Often, this leads to providing recommendations that are of a particular type. For some contexts (e.g., browsing videos for information) this may be undesirable, as it may enforce the creation of filter bubbles. This is because of the existence of underlying bias in the input data of prior user actions.</p><p>Reducing hidden bias in the data and ensuring fairness in algorithmic data analysis has recently received significant attention. In this paper, we consider both the densest subgraph and the (k)-clustering problem, two primitives that are being used by some recommender systems. We are given a coloring on the nodes, respectively the points, and aim to compute a <i>fair</i> solution (S), consisting of a subgraph or a clustering, such that none of the colors is disparately impacted by the solution.</p><p>Unfortunately, introducing fair solutions typically makes these problems substantially more difficult. Unlike the unconstrained densest subgraph problem, which is solvable in polynomial time, the fair densest subgraph problem is NP-hard even to approximate. For (k)-clustering, the fairness constraints make the problem very similar to capacitated clustering, which is a notoriously hard problem to even approximate.</p><p>Despite such negative premises, we are able to provide positive results in important use cases. In particular, we are able to prove that a suitable spectral embedding allows recovery of an almost optimal, fair, dense subgraph hidden in the input data, whenever one is present, a result that is further supported by experimental evidence.</p><p>We also show a polynomial-time, (2)-approximation algorithm to the problem of fair densest subgraph, assuming that there exist only two colors and both colors occur equally often in the graph. This result turns out to be optimal assuming the small set expansion hypothesis. For fair (k)-clustering, we show that we can recover high quality fair clusterings effectively and efficiently. For the special case of (k)-median and (k)-center, we offer additional, fast and simple approximation algorithms as well as new hardness results.</p><p>The above theoretical findings drive the design of heuristics, which we experimentally evaluate on a scenario based on real data, in which our aim is to strike a good balance between diversity and highly correlated items from Amazon co-purchasing graphs and facebook contacts. We additionally evaluated our algorithmic solutions for the fair (k)-median problem through experiments on various real-world datasets.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bibliometric Review of Large Language Models Research from 2017 to 2023 2017 至 2023 年大语言模型研究文献计量学回顾
IF 5 4区 计算机科学 Q1 Mathematics Pub Date : 2024-05-13 DOI: 10.1145/3664930
Lizhou Fan, Lingyao Li, Zihui Ma, Sanggyu Lee, Huizi Yu, Libby Hemphill

Large language models (LLMs), such as OpenAI’s Generative Pre-trained Transformer (GPT), are a class of language models that have demonstrated outstanding performance across a range of natural language processing (NLP) tasks. LLMs have become a highly sought-after research area because of their ability to generate human-like language and their potential to revolutionize science and technology. In this study, we conduct bibliometric and discourse analyses of scholarly literature on LLMs. Synthesizing over 5,000 publications, this paper serves as a roadmap for researchers, practitioners, and policymakers to navigate the current landscape of LLMs research. We present the research trends from 2017 to early 2023, identifying patterns in research paradigms and collaborations. We start with analyzing the core algorithm developments and NLP tasks that are fundamental in LLMs research. We then investigate the applications of LLMs in various fields and domains, including medicine, engineering, social science, and humanities. Our review also reveals the dynamic, fast-paced evolution of LLMs research. Overall, this paper offers valuable insights into the current state, impact, and potential of LLMs research and its applications.

大型语言模型(LLM),如 OpenAI 的生成预训练转换器(GPT),是一类在一系列自然语言处理(NLP)任务中表现出卓越性能的语言模型。由于 LLM 具备生成类人语言的能力,并具有彻底改变科学技术的潜力,因此已成为备受追捧的研究领域。在本研究中,我们对有关 LLM 的学术文献进行了文献计量学和话语分析。本文综合了 5000 多篇文献,为研究人员、从业人员和政策制定者提供了一个路线图,帮助他们了解 LLMs 研究的现状。我们介绍了从2017年到2023年初的研究趋势,确定了研究范式和合作模式。我们首先分析了 LLMs 研究的核心算法开发和 NLP 任务。然后,我们研究了 LLMs 在各个领域的应用,包括医学、工程学、社会科学和人文学科。我们的回顾还揭示了 LLMs 研究的动态和快速发展。总之,本文为了解 LLMs 研究及其应用的现状、影响和潜力提供了宝贵的见解。
{"title":"A Bibliometric Review of Large Language Models Research from 2017 to 2023","authors":"Lizhou Fan, Lingyao Li, Zihui Ma, Sanggyu Lee, Huizi Yu, Libby Hemphill","doi":"10.1145/3664930","DOIUrl":"https://doi.org/10.1145/3664930","url":null,"abstract":"<p>Large language models (LLMs), such as OpenAI’s Generative Pre-trained Transformer (GPT), are a class of language models that have demonstrated outstanding performance across a range of natural language processing (NLP) tasks. LLMs have become a highly sought-after research area because of their ability to generate human-like language and their potential to revolutionize science and technology. In this study, we conduct bibliometric and discourse analyses of scholarly literature on LLMs. Synthesizing over 5,000 publications, this paper serves as a roadmap for researchers, practitioners, and policymakers to navigate the current landscape of LLMs research. We present the research trends from 2017 to early 2023, identifying patterns in research paradigms and collaborations. We start with analyzing the core algorithm developments and NLP tasks that are fundamental in LLMs research. We then investigate the applications of LLMs in various fields and domains, including medicine, engineering, social science, and humanities. Our review also reveals the dynamic, fast-paced evolution of LLMs research. Overall, this paper offers valuable insights into the current state, impact, and potential of LLMs research and its applications.</p>","PeriodicalId":48967,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology","volume":null,"pages":null},"PeriodicalIF":5.0,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on Intelligent Systems and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1