改进的深度学习网络,解决社交媒体谣言源检测中的图节点失衡问题

IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE New Generation Computing Pub Date : 2024-07-26 DOI:10.1007/s00354-024-00270-5
Greeshma N. Gopal, Binsu C. Kovoor, S. Shailesh
{"title":"改进的深度学习网络,解决社交媒体谣言源检测中的图节点失衡问题","authors":"Greeshma N. Gopal, Binsu C. Kovoor, S. Shailesh","doi":"10.1007/s00354-024-00270-5","DOIUrl":null,"url":null,"abstract":"<p>Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.</p>","PeriodicalId":54726,"journal":{"name":"New Generation Computing","volume":"13 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved Deep Learning Network, Addressing Graph Node Imbalance in Social Media Rumor Source Detection\",\"authors\":\"Greeshma N. Gopal, Binsu C. Kovoor, S. Shailesh\",\"doi\":\"10.1007/s00354-024-00270-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.</p>\",\"PeriodicalId\":54726,\"journal\":{\"name\":\"New Generation Computing\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Generation Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s00354-024-00270-5\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Generation Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00354-024-00270-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

在过去几十年里,研究人员在复杂网络分析中使用最大似然估计等概率模型来寻找社交网络中的谣言来源。然而,最有希望的结果平均只能达到 2 跳的邻域。随着图神经网络的出现,这一问题被作为大型网络中的经典节点分类问题加以解决。如果有适当的节点属性作为特征,并且节点类别平衡,节点分类问题就能取得最佳结果。然而,与其他节点分类情况不同,由于时间限制,用于源识别的数据通常缺乏节点属性。此外,在成千上万的用户中检测信源通常是一个类不平衡问题。如果我们能巧妙地处理这些问题,那么在多谣言源检测中,深度学习方法相对于其他传统概率方法的优势就会凸显出来。在此,我们提出了一种基于深度学习的多谣言源节点分类框架,该框架可以准确预测谣言源。通过生成捕捉网络结构特征和传播模式的特征向量,可以克服非归属网络分类中的主要障碍。利用图形嵌入技术,结合邻域特征,进一步巩固了这些特征。通过使用合适的数学模型生成合成节点,我们克服了节点类别不平衡的难题。在该框架中使用的深度学习分类网络中,通过选择性采样和加权损失估计,进一步解决了这一问题。我们研究了基于标签传播的特征构建中的免疫扩散似然参数及其对准确预测的影响。与现有方法相比,我们的方法在公共资料库中的数据集上表现出了卓越的性能,使其成为在复杂的社交媒体环境中进行谣言源检测的可靠、稳健的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Improved Deep Learning Network, Addressing Graph Node Imbalance in Social Media Rumor Source Detection

Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
New Generation Computing
New Generation Computing 工程技术-计算机:理论方法
CiteScore
5.90
自引率
15.40%
发文量
47
审稿时长
>12 weeks
期刊介绍: The journal is specially intended to support the development of new computational and cognitive paradigms stemming from the cross-fertilization of various research fields. These fields include, but are not limited to, programming (logic, constraint, functional, object-oriented), distributed/parallel computing, knowledge-based systems, agent-oriented systems, and cognitive aspects of human embodied knowledge. It also encourages theoretical and/or practical papers concerning all types of learning, knowledge discovery, evolutionary mechanisms, human cognition and learning, and emergent systems that can lead to key technologies enabling us to build more complex and intelligent systems. The editorial board hopes that New Generation Computing will work as a catalyst among active researchers with broad interests by ensuring a smooth publication process.
期刊最新文献
Infant Walking and Everyday Experience: Unraveling the Development of Behavior from Motor Development Improvement and Analysis of Peak Shift Demand Response Scenarios of Industrial Consumers Using an Electricity Market Model Chaotic Satin Bowerbird Optimizer Based Advanced AI Techniques for Detection of COVID-19 Diseases from CT Scans Images Dance Information Processing: Computational Approaches for Assisting Dance Composition Intelligent Bayesian Inference for Multiclass Lung Infection Diagnosis: Network Analysis of Ranked Gray Level Co-occurrence (GLCM) Features
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1