Psycholinguistic knowledge-guided graph network for personality detection of silent users

IF 6.9 1区管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Information Processing & Management Pub Date : 2025-05-01 Epub Date: 2025-01-13 DOI:10.1016/j.ipm.2025.104064

Houjie Qiu , Xingkong Ma , Bo Liu , Yiqing Cai , Xinyi Chen , Zhaoyun Ding

{"title":"Psycholinguistic knowledge-guided graph network for personality detection of silent users","authors":"Houjie Qiu , Xingkong Ma , Bo Liu , Yiqing Cai , Xinyi Chen , Zhaoyun Ding","doi":"10.1016/j.ipm.2025.104064","DOIUrl":null,"url":null,"abstract":"<div><div>Personality detection is an emerging task benefiting numerous fields. The existing studies on text-based personality detection rarely concern silent users who never publish social texts due to the lack of their posts. Simultaneously, the mainstream methods lack an effective pathway for silent user representation to detect the personality. To solve the silent user problems, we propose a psycholinguistic knowledge-guided graph network, <em>PKGN</em>. Our method is composed of neighbor post metric, graph initialization & learning, and classification. Under the guidance of psychological knowledge, our model first selects high-quality posts from neighbors as the posts of silent users through the neighbor post metric. In the graph initialization & learning, psychologically relevant categories are introduced to build the bipartite graph for each silent user and obtain the user representation via GATv2. Then, we utilize linear classifiers for personality classification. We conducted extensive experiments on a new real-world dataset, including 1581 samples. To conduct a baseline benchmark for the silent user personality detection task, we apply the neighbor post metric to combine with the existing work. From the experimental results, our model achieves 64.11% average accuracy and 63.21% average macro-F1, outperforming mainstream methods in most individual personality traits and comprehensive comparisons. Furthermore, the introduction of psycholinguistic knowledge benefits the model performance. In neighbor post metric comparison, the psycholinguistic knowledge from LIWC reduces the standard variances of psychological category count and improves the detection results (3.01% <span><math><mi>↑</mi></math></span> for average accuracy and 2.34% <span><math><mi>↑</mi></math></span> for average macro-F1). In the ablation study, all the psychologically relevant categories contribute to the model performance (ranging from 0.02% <span><math><mi>↑</mi></math></span> to 3.02% <span><math><mi>↑</mi></math></span> for average macro-F1).</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 3","pages":"Article 104064"},"PeriodicalIF":6.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000068","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Personality detection is an emerging task benefiting numerous fields. The existing studies on text-based personality detection rarely concern silent users who never publish social texts due to the lack of their posts. Simultaneously, the mainstream methods lack an effective pathway for silent user representation to detect the personality. To solve the silent user problems, we propose a psycholinguistic knowledge-guided graph network, PKGN. Our method is composed of neighbor post metric, graph initialization & learning, and classification. Under the guidance of psychological knowledge, our model first selects high-quality posts from neighbors as the posts of silent users through the neighbor post metric. In the graph initialization & learning, psychologically relevant categories are introduced to build the bipartite graph for each silent user and obtain the user representation via GATv2. Then, we utilize linear classifiers for personality classification. We conducted extensive experiments on a new real-world dataset, including 1581 samples. To conduct a baseline benchmark for the silent user personality detection task, we apply the neighbor post metric to combine with the existing work. From the experimental results, our model achieves 64.11% average accuracy and 63.21% average macro-F1, outperforming mainstream methods in most individual personality traits and comprehensive comparisons. Furthermore, the introduction of psycholinguistic knowledge benefits the model performance. In neighbor post metric comparison, the psycholinguistic knowledge from LIWC reduces the standard variances of psychological category count and improves the detection results (3.01%

↑

for average accuracy and 2.34%

↑

for average macro-F1). In the ablation study, all the psychologically relevant categories contribute to the model performance (ranging from 0.02%

↑

to 3.02%

↑

for average macro-F1).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

沉默用户性格检测的心理语言知识引导图网络

人格检测是一项新兴的任务，对许多领域都有好处。现有的基于文本的人格检测研究很少涉及沉默用户，他们因为缺少帖子而从不发布社交文本。与此同时，主流方法缺乏一种有效的途径来检测沉默用户表征的个性。为了解决沉默用户问题，我们提出了一种心理语言知识引导图网络PKGN。该方法由相邻柱度量、图初始化和图初始化组成；学习，分类。在心理学知识的指导下，我们的模型首先通过邻居帖子度量从邻居中选择高质量的帖子作为沉默用户的帖子。在图初始化&；在学习过程中，引入心理相关类别，对每个沉默用户建立二部图，并通过GATv2获得用户表示。然后，我们利用线性分类器进行人格分类。我们在一个新的真实世界数据集上进行了广泛的实验，包括1581个样本。为了对沉默用户个性检测任务进行基准基准测试，我们将邻居帖子度量与现有工作相结合。从实验结果来看，我们的模型平均准确率达到64.11%，平均宏观f1达到63.21%，在大多数个体人格特征和综合比较上都优于主流方法。此外，心理语言学知识的引入有利于模型的性能。在相邻后度量比较中，来自LIWC的心理语言学知识降低了心理类别计数的标准方差，提高了检测结果（平均准确度为3.01%↑，平均宏观f1为2.34%↑）。在消融研究中，所有心理相关类别都对模型性能有贡献（平均宏观f1从0.02%↑到3.02%↑）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Information Processing & Management 工程技术-计算机：信息系统

CiteScore

17.00

自引率

11.60%

发文量

276

审稿时长

39 days

期刊介绍： Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.