Psycholinguistic knowledge-guided graph network for personality detection of silent users

IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Information Processing & Management Pub Date : 2025-05-01 Epub Date: 2025-01-13 DOI:10.1016/j.ipm.2025.104064
Houjie Qiu , Xingkong Ma , Bo Liu , Yiqing Cai , Xinyi Chen , Zhaoyun Ding
{"title":"Psycholinguistic knowledge-guided graph network for personality detection of silent users","authors":"Houjie Qiu ,&nbsp;Xingkong Ma ,&nbsp;Bo Liu ,&nbsp;Yiqing Cai ,&nbsp;Xinyi Chen ,&nbsp;Zhaoyun Ding","doi":"10.1016/j.ipm.2025.104064","DOIUrl":null,"url":null,"abstract":"<div><div>Personality detection is an emerging task benefiting numerous fields. The existing studies on text-based personality detection rarely concern silent users who never publish social texts due to the lack of their posts. Simultaneously, the mainstream methods lack an effective pathway for silent user representation to detect the personality. To solve the silent user problems, we propose a psycholinguistic knowledge-guided graph network, <em>PKGN</em>. Our method is composed of neighbor post metric, graph initialization &amp; learning, and classification. Under the guidance of psychological knowledge, our model first selects high-quality posts from neighbors as the posts of silent users through the neighbor post metric. In the graph initialization &amp; learning, psychologically relevant categories are introduced to build the bipartite graph for each silent user and obtain the user representation via GATv2. Then, we utilize linear classifiers for personality classification. We conducted extensive experiments on a new real-world dataset, including 1581 samples. To conduct a baseline benchmark for the silent user personality detection task, we apply the neighbor post metric to combine with the existing work. From the experimental results, our model achieves 64.11% average accuracy and 63.21% average macro-F1, outperforming mainstream methods in most individual personality traits and comprehensive comparisons. Furthermore, the introduction of psycholinguistic knowledge benefits the model performance. In neighbor post metric comparison, the psycholinguistic knowledge from LIWC reduces the standard variances of psychological category count and improves the detection results (3.01% <span><math><mi>↑</mi></math></span> for average accuracy and 2.34% <span><math><mi>↑</mi></math></span> for average macro-F1). In the ablation study, all the psychologically relevant categories contribute to the model performance (ranging from 0.02% <span><math><mi>↑</mi></math></span> to 3.02% <span><math><mi>↑</mi></math></span> for average macro-F1).</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 3","pages":"Article 104064"},"PeriodicalIF":6.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000068","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Personality detection is an emerging task benefiting numerous fields. The existing studies on text-based personality detection rarely concern silent users who never publish social texts due to the lack of their posts. Simultaneously, the mainstream methods lack an effective pathway for silent user representation to detect the personality. To solve the silent user problems, we propose a psycholinguistic knowledge-guided graph network, PKGN. Our method is composed of neighbor post metric, graph initialization & learning, and classification. Under the guidance of psychological knowledge, our model first selects high-quality posts from neighbors as the posts of silent users through the neighbor post metric. In the graph initialization & learning, psychologically relevant categories are introduced to build the bipartite graph for each silent user and obtain the user representation via GATv2. Then, we utilize linear classifiers for personality classification. We conducted extensive experiments on a new real-world dataset, including 1581 samples. To conduct a baseline benchmark for the silent user personality detection task, we apply the neighbor post metric to combine with the existing work. From the experimental results, our model achieves 64.11% average accuracy and 63.21% average macro-F1, outperforming mainstream methods in most individual personality traits and comprehensive comparisons. Furthermore, the introduction of psycholinguistic knowledge benefits the model performance. In neighbor post metric comparison, the psycholinguistic knowledge from LIWC reduces the standard variances of psychological category count and improves the detection results (3.01% for average accuracy and 2.34% for average macro-F1). In the ablation study, all the psychologically relevant categories contribute to the model performance (ranging from 0.02% to 3.02% for average macro-F1).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
沉默用户性格检测的心理语言知识引导图网络
人格检测是一项新兴的任务,对许多领域都有好处。现有的基于文本的人格检测研究很少涉及沉默用户,他们因为缺少帖子而从不发布社交文本。与此同时,主流方法缺乏一种有效的途径来检测沉默用户表征的个性。为了解决沉默用户问题,我们提出了一种心理语言知识引导图网络PKGN。该方法由相邻柱度量、图初始化和图初始化组成;学习,分类。在心理学知识的指导下,我们的模型首先通过邻居帖子度量从邻居中选择高质量的帖子作为沉默用户的帖子。在图初始化&;在学习过程中,引入心理相关类别,对每个沉默用户建立二部图,并通过GATv2获得用户表示。然后,我们利用线性分类器进行人格分类。我们在一个新的真实世界数据集上进行了广泛的实验,包括1581个样本。为了对沉默用户个性检测任务进行基准基准测试,我们将邻居帖子度量与现有工作相结合。从实验结果来看,我们的模型平均准确率达到64.11%,平均宏观f1达到63.21%,在大多数个体人格特征和综合比较上都优于主流方法。此外,心理语言学知识的引入有利于模型的性能。在相邻后度量比较中,来自LIWC的心理语言学知识降低了心理类别计数的标准方差,提高了检测结果(平均准确度为3.01%↑,平均宏观f1为2.34%↑)。在消融研究中,所有心理相关类别都对模型性能有贡献(平均宏观f1从0.02%↑到3.02%↑)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Information Processing & Management
Information Processing & Management 工程技术-计算机:信息系统
CiteScore
17.00
自引率
11.60%
发文量
276
审稿时长
39 days
期刊介绍: Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.
期刊最新文献
PhiMark: watermarking relational data robustly with zero distortion A self-guided few-shot semantic segmentation model based on query foreground-background similarity Emotion and noise-robust speaker identification via filter-free self-supervised learning TemFRC: Enterprise financial risk prediction with temporal folding and risk contrast A dual-source knowledge distillation framework for hate speech detection based on cognitive distortion awareness
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1