{"title":"改进的深度学习网络,解决社交媒体谣言源检测中的图节点失衡问题","authors":"Greeshma N. Gopal, Binsu C. Kovoor, S. Shailesh","doi":"10.1007/s00354-024-00270-5","DOIUrl":null,"url":null,"abstract":"<p>Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.</p>","PeriodicalId":54726,"journal":{"name":"New Generation Computing","volume":"13 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved Deep Learning Network, Addressing Graph Node Imbalance in Social Media Rumor Source Detection\",\"authors\":\"Greeshma N. Gopal, Binsu C. Kovoor, S. Shailesh\",\"doi\":\"10.1007/s00354-024-00270-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.</p>\",\"PeriodicalId\":54726,\"journal\":{\"name\":\"New Generation Computing\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Generation Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s00354-024-00270-5\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Generation Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00354-024-00270-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
An Improved Deep Learning Network, Addressing Graph Node Imbalance in Social Media Rumor Source Detection
Finding the source of rumors in the social network was addressed by researchers with probabilistic models like Maximum Likelihood Estimation in complex network analysis for the past few decades. However, the most promising results could reach up to 2-hop distant neighborhoods on average. With the advent of graph neural networks, the issue was addressed as a classical node classification problem in large networks. Node classification problems achieve the best results when there are appropriate node attributes as features and when node classes are balanced. However, unlike other node classification scenarios, the data collected for source identification usually lacks node attributes because of time limitations. Moreover, the detection of the sources among thousands of users is typically a class imbalance problem. If we could deal with these issues skillfully, then the dominance of the deep learning method compared to other conventional probabilistic methods in multiple rumor source detection will be prominent. We have proposed here a deep learning-based multiple source node classification framework that can predict the sources with promising accuracy. The primary hurdles in non-attributed network classification are navigated by generating feature vectors that capture the structural characteristics of the network and spreading pattern. These features are further solidified with the Graph embedding technique, incorporating the neighborhood features. We have triumphed over the challenge of imbalanced node classes by synthetic node generation with a suitable mathematical model. The concern is further resolved by selective sampling and weighted loss estimation in the deep learning network for classification used in the framework. A study of the Immuno-Diffuse Likelihood parameter in Label propagation based feature construction and its influence on accurate prediction is examined. Our approach demonstrates superior performance compared to existing methods in the datasets available in public repositories, making it a reliable and robust tool for rumor source detection in the complex landscape of social media.
期刊介绍:
The journal is specially intended to support the development of new computational and cognitive paradigms stemming from the cross-fertilization of various research fields. These fields include, but are not limited to, programming (logic, constraint, functional, object-oriented), distributed/parallel computing, knowledge-based systems, agent-oriented systems, and cognitive aspects of human embodied knowledge. It also encourages theoretical and/or practical papers concerning all types of learning, knowledge discovery, evolutionary mechanisms, human cognition and learning, and emergent systems that can lead to key technologies enabling us to build more complex and intelligent systems. The editorial board hopes that New Generation Computing will work as a catalyst among active researchers with broad interests by ensuring a smooth publication process.