首页 > 最新文献

arXiv - CS - Social and Information Networks最新文献

英文 中文
MAPX: An explainable model-agnostic framework for the detection of false information on social media networks MAPX:用于检测社交媒体网络虚假信息的可解释模型无关框架
Pub Date : 2024-09-13 DOI: arxiv-2409.08522
Sarah Condran, Michael Bewong, Selasi Kwashie, Md Zahidul Islam, Irfan Altas, Joshua Condran
The automated detection of false information has become a fundamental task incombating the spread of "fake news" on online social media networks (OSMN) asit reduces the need for manual discernment by individuals. In the literature,leveraging various content or context features of OSMN documents have beenfound useful. However, most of the existing detection models often utilisethese features in isolation without regard to the temporal and dynamic changesoft-seen in reality, thus, limiting the robustness of the models. Furthermore,there has been little to no consideration of the impact of the quality ofdocuments' features on the trustworthiness of the final prediction. In thispaper, we introduce a novel model-agnostic framework, called MAPX, which allowsevidence based aggregation of predictions from existing models in anexplainable manner. Indeed, the developed aggregation method is adaptive,dynamic and considers the quality of OSMN document features. Further, weperform extensive experiments on benchmarked fake news datasets to demonstratethe effectiveness of MAPX using various real-world data quality scenarios. Ourempirical results show that the proposed framework consistently outperforms allstate-of-the-art models evaluated. For reproducibility, a demo of MAPX isavailable at href{https://github.com/SCondran/MAPX_framework}{this link}
自动检测虚假信息已成为应对在线社交媒体网络(OSMN)上 "假新闻 "传播的一项基本任务,因为它减少了个人手动辨别的需要。在文献中,人们发现利用 OSMN 文档的各种内容或上下文特征非常有用。然而,现有的大多数检测模型往往孤立地利用这些特征,而不考虑现实中的时间和动态变化,从而限制了模型的鲁棒性。此外,几乎没有人考虑过文档特征的质量对最终预测可信度的影响。在本文中,我们介绍了一种名为 MAPX 的新型模型无关框架,它允许以可解释的方式基于证据聚合现有模型的预测结果。事实上,所开发的聚合方法是自适应的、动态的,并考虑了 OSMN 文档特征的质量。此外,我们还在基准假新闻数据集上进行了大量实验,利用各种真实世界的数据质量场景来证明 MAPX 的有效性。实证结果表明,所提出的框架始终优于所有接受评估的最先进模型。为便于重现,MAPX 的演示可在(href{https://github.com/SCondran/MAPX_framework}{this link} )上下载。
{"title":"MAPX: An explainable model-agnostic framework for the detection of false information on social media networks","authors":"Sarah Condran, Michael Bewong, Selasi Kwashie, Md Zahidul Islam, Irfan Altas, Joshua Condran","doi":"arxiv-2409.08522","DOIUrl":"https://doi.org/arxiv-2409.08522","url":null,"abstract":"The automated detection of false information has become a fundamental task in\u0000combating the spread of \"fake news\" on online social media networks (OSMN) as\u0000it reduces the need for manual discernment by individuals. In the literature,\u0000leveraging various content or context features of OSMN documents have been\u0000found useful. However, most of the existing detection models often utilise\u0000these features in isolation without regard to the temporal and dynamic changes\u0000oft-seen in reality, thus, limiting the robustness of the models. Furthermore,\u0000there has been little to no consideration of the impact of the quality of\u0000documents' features on the trustworthiness of the final prediction. In this\u0000paper, we introduce a novel model-agnostic framework, called MAPX, which allows\u0000evidence based aggregation of predictions from existing models in an\u0000explainable manner. Indeed, the developed aggregation method is adaptive,\u0000dynamic and considers the quality of OSMN document features. Further, we\u0000perform extensive experiments on benchmarked fake news datasets to demonstrate\u0000the effectiveness of MAPX using various real-world data quality scenarios. Our\u0000empirical results show that the proposed framework consistently outperforms all\u0000state-of-the-art models evaluated. For reproducibility, a demo of MAPX is\u0000available at href{https://github.com/SCondran/MAPX_framework}{this link}","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling User Engagement Patterns on Stack Exchange Through Network Analysis 通过网络分析揭示用户在 Stack Exchange 上的参与模式
Pub Date : 2024-09-13 DOI: arxiv-2409.08944
Agnik Saha, Mohammad Shahidul Kader, Mohammad Masum
Stack Exchange, a question-and-answer(Q&A) platform, has exhibited signs of adeclining user engagement. This paper investigates user engagement dynamicsacross various Stack Exchange communities including Data science, AI, softwareengineering, project management, and GenAI. We propose a network graphrepresenting users as nodes and their interactions as edges. We exploreengagement patterns through key network metrics including Degree Centerality,Betweenness Centrality, and PageRank. The study findings reveal distinctcommunity dynamics across these platforms, with smaller communitiesdemonstrating more concentrated user influence, while larger platforms showcasemore distributed engagement. Besides, the results showed insights into userroles, influence, and potential strategies for enhancing engagement. Thisresearch contributes to understanding of online community behavior and providesa framework for future studies to improve the Stack Exchange user experience.
问答平台 Stack Exchange 显示出用户参与度下降的迹象。本文研究了 Stack Exchange 各个社区的用户参与动态,包括数据科学、人工智能、软件工程、项目管理和 GenAI。我们提出了一个网络图,将用户表示为节点,将他们之间的互动表示为边。我们通过关键的网络指标(包括度中心性、间中心性和 PageRank)来探索管理模式。研究结果表明,这些平台上的社区动态各不相同,规模较小的社区显示出更集中的用户影响力,而规模较大的平台则显示出更分散的参与度。此外,研究结果还揭示了用户角色、影响力以及提高参与度的潜在策略。这项研究有助于人们了解在线社区行为,并为今后改善 Stack Exchange 用户体验的研究提供了一个框架。
{"title":"Unveiling User Engagement Patterns on Stack Exchange Through Network Analysis","authors":"Agnik Saha, Mohammad Shahidul Kader, Mohammad Masum","doi":"arxiv-2409.08944","DOIUrl":"https://doi.org/arxiv-2409.08944","url":null,"abstract":"Stack Exchange, a question-and-answer(Q&A) platform, has exhibited signs of a\u0000declining user engagement. This paper investigates user engagement dynamics\u0000across various Stack Exchange communities including Data science, AI, software\u0000engineering, project management, and GenAI. We propose a network graph\u0000representing users as nodes and their interactions as edges. We explore\u0000engagement patterns through key network metrics including Degree Centerality,\u0000Betweenness Centrality, and PageRank. The study findings reveal distinct\u0000community dynamics across these platforms, with smaller communities\u0000demonstrating more concentrated user influence, while larger platforms showcase\u0000more distributed engagement. Besides, the results showed insights into user\u0000roles, influence, and potential strategies for enhancing engagement. This\u0000research contributes to understanding of online community behavior and provides\u0000a framework for future studies to improve the Stack Exchange user experience.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sybil Detection using Graph Neural Networks 利用图神经网络检测假冒者
Pub Date : 2024-09-13 DOI: arxiv-2409.08631
Stuart Heeb, Andreas Plesner, Roger Wattenhofer
This paper presents SYBILGAT, a novel approach to Sybil detection in socialnetworks using Graph Attention Networks (GATs). Traditional methods for Sybildetection primarily leverage structural properties of networks; however, theytend to struggle with a large number of attack edges and are often unable tosimultaneously utilize both known Sybil and honest nodes. Our proposed methodaddresses these limitations by dynamically assigning attention weights todifferent nodes during aggregations, enhancing detection performance. Weconducted extensive experiments in various scenarios, including pretraining insampled subgraphs, synthetic networks, and networks under targeted attacks. Theresults show that SYBILGAT significantly outperforms the state-of-the-artalgorithms, particularly in scenarios with high attack complexity and when thenumber of attack edges increases. Our approach shows robust performance acrossdifferent network models and sizes, even as the detection task becomes morechallenging. We successfully applied the model to a real-world Twitter graphwith more than 269k nodes and 6.8M edges. The flexibility and generalizabilityof SYBILGAT make it a promising tool to defend against Sybil attacks in onlinesocial networks with only structural information.
本文介绍了 SYBILGAT,这是一种利用图注意网络(GAT)在社交网络中进行假冒者检测的新方法。传统的假冒者检测方法主要利用网络的结构特性;然而,这些方法往往难以应对大量的攻击边,而且往往无法同时利用已知的假冒者节点和诚实节点。我们提出的方法通过在聚合过程中动态分配不同节点的关注权重来解决这些局限性,从而提高检测性能。我们在各种场景下进行了广泛的实验,包括在采样子图、合成网络和目标攻击下的网络中进行预训练。结果表明,SYBILGAT 的性能明显优于最先进的算法,尤其是在攻击复杂度较高和攻击边数量增加的情况下。即使检测任务变得更具挑战性,我们的方法也能在不同的网络模型和规模下表现出稳健的性能。我们成功地将该模型应用于一个拥有超过 269k 个节点和 680 万条边的真实 Twitter 图。SYBILGAT 的灵活性和通用性使其成为在仅有结构信息的在线社交网络中防御仿冒攻击的理想工具。
{"title":"Sybil Detection using Graph Neural Networks","authors":"Stuart Heeb, Andreas Plesner, Roger Wattenhofer","doi":"arxiv-2409.08631","DOIUrl":"https://doi.org/arxiv-2409.08631","url":null,"abstract":"This paper presents SYBILGAT, a novel approach to Sybil detection in social\u0000networks using Graph Attention Networks (GATs). Traditional methods for Sybil\u0000detection primarily leverage structural properties of networks; however, they\u0000tend to struggle with a large number of attack edges and are often unable to\u0000simultaneously utilize both known Sybil and honest nodes. Our proposed method\u0000addresses these limitations by dynamically assigning attention weights to\u0000different nodes during aggregations, enhancing detection performance. We\u0000conducted extensive experiments in various scenarios, including pretraining in\u0000sampled subgraphs, synthetic networks, and networks under targeted attacks. The\u0000results show that SYBILGAT significantly outperforms the state-of-the-art\u0000algorithms, particularly in scenarios with high attack complexity and when the\u0000number of attack edges increases. Our approach shows robust performance across\u0000different network models and sizes, even as the detection task becomes more\u0000challenging. We successfully applied the model to a real-world Twitter graph\u0000with more than 269k nodes and 6.8M edges. The flexibility and generalizability\u0000of SYBILGAT make it a promising tool to defend against Sybil attacks in online\u0000social networks with only structural information.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents 融合动力学方程:基于 LLM 代理的社会意见预测算法
Pub Date : 2024-09-13 DOI: arxiv-2409.08717
Junchi Yao, Hongjie Zhang, Jie Ou, Dingyi Zuo, Zheng Yang, Zhicheng Dong
In the context where social media is increasingly becoming a significantplatform for social movements and the formation of public opinion, accuratelysimulating and predicting the dynamics of user opinions is of great importancefor understanding social phenomena, policy making, and guiding public opinion.However, existing simulation methods face challenges in capturing thecomplexity and dynamics of user behavior. Addressing this issue, this paperproposes an innovative simulation method for the dynamics of social media useropinions, the FDE-LLM algorithm, which incorporates opinion dynamics andepidemic model. This effectively constrains the actions and opinion evolutionprocess of large language models (LLM), making them more aligned with the realcyber world. In particular, the FDE-LLM categorizes users into opinion leadersand followers. Opinion leaders are based on LLM role-playing and areconstrained by the CA model, while opinion followers are integrated into adynamic system that combines the CA model with the SIR model. This innovativedesign significantly improves the accuracy and efficiency of the simulation.Experiments were conducted on four real Weibo datasets and validated using theopen-source model ChatGLM. The results show that, compared to traditionalagent-based modeling (ABM) opinion dynamics algorithms and LLM-based opiniondiffusion algorithms, our FDE-LLM algorithm demonstrates higher accuracy andinterpretability.
在社交媒体日益成为社会运动和舆论形成的重要平台的背景下,准确模拟和预测用户意见的动态变化对于理解社会现象、制定政策和引导舆论具有重要意义。然而,现有的模拟方法在捕捉用户行为的复杂性和动态性方面面临挑战。针对这一问题,本文提出了一种创新的社交媒体用户舆论动态模拟方法--FDE-LLM 算法,该算法结合了舆论动态和流行病模型。这有效地约束了大型语言模型(LLM)的行动和观点演变过程,使其更加贴近真实的网络世界。其中,FDE-LLM 将用户分为意见领袖和追随者。意见领袖以 LLM 角色扮演为基础,受到 CA 模型的约束,而意见追随者则被整合到结合了 CA 模型和 SIR 模型的动态系统中。我们在四个真实微博数据集上进行了实验,并使用开源模型 ChatGLM 进行了验证。实验结果表明,与传统的基于代理建模(ABM)的舆情动态算法和基于 LLM 的舆情扩散算法相比,我们的 FDE-LLM 算法具有更高的准确性和可解释性。
{"title":"Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents","authors":"Junchi Yao, Hongjie Zhang, Jie Ou, Dingyi Zuo, Zheng Yang, Zhicheng Dong","doi":"arxiv-2409.08717","DOIUrl":"https://doi.org/arxiv-2409.08717","url":null,"abstract":"In the context where social media is increasingly becoming a significant\u0000platform for social movements and the formation of public opinion, accurately\u0000simulating and predicting the dynamics of user opinions is of great importance\u0000for understanding social phenomena, policy making, and guiding public opinion.\u0000However, existing simulation methods face challenges in capturing the\u0000complexity and dynamics of user behavior. Addressing this issue, this paper\u0000proposes an innovative simulation method for the dynamics of social media user\u0000opinions, the FDE-LLM algorithm, which incorporates opinion dynamics and\u0000epidemic model. This effectively constrains the actions and opinion evolution\u0000process of large language models (LLM), making them more aligned with the real\u0000cyber world. In particular, the FDE-LLM categorizes users into opinion leaders\u0000and followers. Opinion leaders are based on LLM role-playing and are\u0000constrained by the CA model, while opinion followers are integrated into a\u0000dynamic system that combines the CA model with the SIR model. This innovative\u0000design significantly improves the accuracy and efficiency of the simulation.\u0000Experiments were conducted on four real Weibo datasets and validated using the\u0000open-source model ChatGLM. The results show that, compared to traditional\u0000agent-based modeling (ABM) opinion dynamics algorithms and LLM-based opinion\u0000diffusion algorithms, our FDE-LLM algorithm demonstrates higher accuracy and\u0000interpretability.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Community Fact-Checks Trigger Moral Outrage in Replies to Misleading Posts on Social Media 社区事实核查引发道德愤怒,回应社交媒体上的误导性帖子
Pub Date : 2024-09-13 DOI: arxiv-2409.08829
Yuwei Chuai, Anastasia Sergeeva, Gabriele Lenzini, Nicolas Pröllochs
Displaying community fact-checks is a promising approach to reduce engagementwith misinformation on social media. However, how users respond to misleadingcontent emotionally after community fact-checks are displayed on posts isunclear. Here, we employ quasi-experimental methods to causally analyze changesin sentiments and (moral) emotions in replies to misleading posts following thedisplay of community fact-checks. Our evaluation is based on a large-scalepanel dataset comprising N=2,225,260 replies across 1841 source posts from X'sCommunity Notes platform. We find that informing users about falsehoods throughcommunity fact-checks significantly increases negativity (by 7.3%), anger (by13.2%), disgust (by 4.7%), and moral outrage (by 16.0%) in the correspondingreplies. These results indicate that users perceive spreading misinformation asa violation of social norms and that those who spread misinformation shouldexpect negative reactions once their content is debunked. We derive importantimplications for the design of community-based fact-checking systems.
在社交媒体上显示社区事实核查是减少参与错误信息的一种有前途的方法。然而,在帖子上显示社区事实核查后,用户如何对误导性内容做出情绪反应还不清楚。在此,我们采用准实验方法对社区事实核查显示后误导性帖子回复中的情绪和(道德)情感变化进行因果分析。我们的评估基于一个大型面板数据集,该数据集包含来自 X's 社区笔记平台的 1841 个源帖子的 N=2,225,260 条回复。我们发现,通过社区事实核查告知用户虚假信息会显著增加相应回复中的负面情绪(7.3%)、愤怒(13.2%)、厌恶(4.7%)和道德愤怒(16.0%)。这些结果表明,用户认为传播错误信息违反了社会规范,而那些传播错误信息的人应该预料到一旦他们的内容被揭穿后会出现负面反应。我们得出了设计基于社区的事实核查系统的重要启示。
{"title":"Community Fact-Checks Trigger Moral Outrage in Replies to Misleading Posts on Social Media","authors":"Yuwei Chuai, Anastasia Sergeeva, Gabriele Lenzini, Nicolas Pröllochs","doi":"arxiv-2409.08829","DOIUrl":"https://doi.org/arxiv-2409.08829","url":null,"abstract":"Displaying community fact-checks is a promising approach to reduce engagement\u0000with misinformation on social media. However, how users respond to misleading\u0000content emotionally after community fact-checks are displayed on posts is\u0000unclear. Here, we employ quasi-experimental methods to causally analyze changes\u0000in sentiments and (moral) emotions in replies to misleading posts following the\u0000display of community fact-checks. Our evaluation is based on a large-scale\u0000panel dataset comprising N=2,225,260 replies across 1841 source posts from X's\u0000Community Notes platform. We find that informing users about falsehoods through\u0000community fact-checks significantly increases negativity (by 7.3%), anger (by\u000013.2%), disgust (by 4.7%), and moral outrage (by 16.0%) in the corresponding\u0000replies. These results indicate that users perceive spreading misinformation as\u0000a violation of social norms and that those who spread misinformation should\u0000expect negative reactions once their content is debunked. We derive important\u0000implications for the design of community-based fact-checking systems.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Community-based fact-checking reduces the spread of misleading posts on social media 基于社区的事实核查减少了误导性帖子在社交媒体上的传播
Pub Date : 2024-09-13 DOI: arxiv-2409.08781
Yuwei Chuai, Moritz Pilarski, Thomas Renault, David Restrepo-Amariles, Aurore Troussel-Clément, Gabriele Lenzini, Nicolas Pröllochs
Community-based fact-checking is a promising approach to verify social mediacontent and correct misleading posts at scale. Yet, causal evidence regardingits effectiveness in reducing the spread of misinformation on social media ismissing. Here, we performed a large-scale empirical study to analyze whethercommunity notes reduce the spread of misleading posts on X. Using aDifference-in-Differences design and repost time series data for N=237,677(community fact-checked) cascades that had been reposted more than 431 milliontimes, we found that exposing users to community notes reduced the spread ofmisleading posts by, on average, 62.0%. Furthermore, community notes increasedthe odds that users delete their misleading posts by 103.4%. However, ourfindings also suggest that community notes might be too slow to intervene inthe early (and most viral) stage of the diffusion. Our work offers importantimplications to enhance the effectiveness of community-based fact-checkingapproaches on social media.
基于社区的事实核查是验证社交媒体内容和大规模纠正误导性帖子的一种很有前途的方法。然而,有关其在减少社交媒体上错误信息传播方面的有效性的因果证据还很缺乏。在此,我们进行了一项大规模的实证研究,分析社区注释是否减少了 X 上误导性帖子的传播。我们使用了差分法设计和 N=237,677 个(社区事实核查)级联的转贴时间序列数据,这些级联的转贴次数超过了 4.31 亿次,我们发现,让用户接触社区注释平均减少了 62.0% 的误导性帖子的传播。此外,社区注释还使用户删除其误导性帖子的几率提高了 103.4%。然而,我们的研究结果也表明,社区注释可能过于缓慢,无法在传播的早期(也是病毒最猖獗的阶段)进行干预。我们的工作为提高社交媒体上基于社区的事实核查方法的有效性提供了重要启示。
{"title":"Community-based fact-checking reduces the spread of misleading posts on social media","authors":"Yuwei Chuai, Moritz Pilarski, Thomas Renault, David Restrepo-Amariles, Aurore Troussel-Clément, Gabriele Lenzini, Nicolas Pröllochs","doi":"arxiv-2409.08781","DOIUrl":"https://doi.org/arxiv-2409.08781","url":null,"abstract":"Community-based fact-checking is a promising approach to verify social media\u0000content and correct misleading posts at scale. Yet, causal evidence regarding\u0000its effectiveness in reducing the spread of misinformation on social media is\u0000missing. Here, we performed a large-scale empirical study to analyze whether\u0000community notes reduce the spread of misleading posts on X. Using a\u0000Difference-in-Differences design and repost time series data for N=237,677\u0000(community fact-checked) cascades that had been reposted more than 431 million\u0000times, we found that exposing users to community notes reduced the spread of\u0000misleading posts by, on average, 62.0%. Furthermore, community notes increased\u0000the odds that users delete their misleading posts by 103.4%. However, our\u0000findings also suggest that community notes might be too slow to intervene in\u0000the early (and most viral) stage of the diffusion. Our work offers important\u0000implications to enhance the effectiveness of community-based fact-checking\u0000approaches on social media.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of Graph Features Based on Random Walks Using Neighbors' Properties 基于随机漫步的图形特征估算(利用邻域属性
Pub Date : 2024-09-13 DOI: arxiv-2409.08599
Tsuyoshi Hasegawa, Shiori Hironaka, Kazuyuki Shudo
Using random walks for sampling has proven advantageous in assessing thecharacteristics of large and unknown social networks. Several algorithms basedon random walks have been introduced in recent years. In the practicalapplication of social network sampling, there is a recurrent reliance on anapplication programming interface (API) for obtaining adjacent nodes. However,owing to constraints related to query frequency and associated API expenses, itis preferable to minimize API calls during the feature estimation process. Inthis study, considering the acquisition of neighboring nodes as a cost factor,we introduce a feature estimation algorithm that outperforms existingalgorithms in terms of accuracy. Through experiments that simulate sampling onknown graphs, we demonstrate the superior accuracy of our proposed algorithmwhen compared to existing alternatives.
事实证明,使用随机游走进行抽样在评估大型未知社交网络的特征方面具有优势。近年来,已经推出了几种基于随机游走的算法。在社交网络采样的实际应用中,人们经常依赖应用编程接口(API)来获取相邻节点。然而,由于查询频率和相关 API 费用的限制,在特征估计过程中最好尽量减少 API 调用。在本研究中,考虑到获取相邻节点的成本因素,我们引入了一种在准确性方面优于现有算法的特征估计算法。通过在已知图上模拟采样的实验,我们证明了与现有算法相比,我们提出的算法具有更高的准确性。
{"title":"Estimation of Graph Features Based on Random Walks Using Neighbors' Properties","authors":"Tsuyoshi Hasegawa, Shiori Hironaka, Kazuyuki Shudo","doi":"arxiv-2409.08599","DOIUrl":"https://doi.org/arxiv-2409.08599","url":null,"abstract":"Using random walks for sampling has proven advantageous in assessing the\u0000characteristics of large and unknown social networks. Several algorithms based\u0000on random walks have been introduced in recent years. In the practical\u0000application of social network sampling, there is a recurrent reliance on an\u0000application programming interface (API) for obtaining adjacent nodes. However,\u0000owing to constraints related to query frequency and associated API expenses, it\u0000is preferable to minimize API calls during the feature estimation process. In\u0000this study, considering the acquisition of neighboring nodes as a cost factor,\u0000we introduce a feature estimation algorithm that outperforms existing\u0000algorithms in terms of accuracy. Through experiments that simulate sampling on\u0000known graphs, we demonstrate the superior accuracy of our proposed algorithm\u0000when compared to existing alternatives.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DELTA: Dual Consistency Delving with Topological Uncertainty for Active Graph Domain Adaptation DELTA:带拓扑不确定性的双一致性迭代,用于主动图域自适应
Pub Date : 2024-09-13 DOI: arxiv-2409.08946
Pengyun Wang, Yadi Cao, Chris Russell, Siyu Heng, Junyu Luo, Yanxin Shen, Xiao Luo
Graph domain adaptation has recently enabled knowledge transfer acrossdifferent graphs. However, without the semantic information on target graphs,the performance on target graphs is still far from satisfactory. To address theissue, we study the problem of active graph domain adaptation, which selects asmall quantitative of informative nodes on the target graph for extraannotation. This problem is highly challenging due to the complicatedtopological relationships and the distribution discrepancy across graphs. Inthis paper, we propose a novel approach named Dual Consistency Delving withTopological Uncertainty (DELTA) for active graph domain adaptation. Our DELTAconsists of an edge-oriented graph subnetwork and a path-oriented graphsubnetwork, which can explore topological semantics from complementaryperspectives. In particular, our edge-oriented graph subnetwork utilizes themessage passing mechanism to learn neighborhood information, while ourpath-oriented graph subnetwork explores high-order relationships fromsubstructures. To jointly learn from two subnetworks, we roughly selectinformative candidate nodes with the consideration of consistency across twosubnetworks. Then, we aggregate local semantics from its K-hop subgraph basedon node degrees for topological uncertainty estimation. To overcome potentialdistribution shifts, we compare target nodes and their corresponding sourcenodes for discrepancy scores as an additional component for fine selection.Extensive experiments on benchmark datasets demonstrate that DELTA outperformsvarious state-of-the-art approaches.
最近,图域适应技术实现了跨不同图的知识转移。然而,如果没有目标图的语义信息,在目标图上的性能仍然远远不能令人满意。为了解决这个问题,我们研究了主动图域适配问题,即选择目标图上的少量信息节点进行额外标注。由于图之间存在复杂的拓扑关系和分布差异,这个问题极具挑战性。在本文中,我们提出了一种名为 "带拓扑不确定性的双一致性掘取(Dual Consistency Delving withTopological Uncertainty,DELTA)"的主动图域适应新方法。我们的 DELTA 由面向边缘的图子网络和面向路径的图子网络组成,可以从互补的角度探索拓扑语义。其中,面向边缘的图子网络利用消息传递机制学习邻域信息,而面向路径的图子网络则从子结构中探索高阶关系。为了从两个子网络中联合学习,我们在考虑两个子网络一致性的基础上,粗略地选择有信息的候选节点。然后,我们根据节点度从其 K 跳子图中汇总局部语义,以进行拓扑不确定性估计。为了克服潜在的分布偏移,我们比较了目标节点和其相应来源节点的差异分数,作为精细选择的额外组成部分。
{"title":"DELTA: Dual Consistency Delving with Topological Uncertainty for Active Graph Domain Adaptation","authors":"Pengyun Wang, Yadi Cao, Chris Russell, Siyu Heng, Junyu Luo, Yanxin Shen, Xiao Luo","doi":"arxiv-2409.08946","DOIUrl":"https://doi.org/arxiv-2409.08946","url":null,"abstract":"Graph domain adaptation has recently enabled knowledge transfer across\u0000different graphs. However, without the semantic information on target graphs,\u0000the performance on target graphs is still far from satisfactory. To address the\u0000issue, we study the problem of active graph domain adaptation, which selects a\u0000small quantitative of informative nodes on the target graph for extra\u0000annotation. This problem is highly challenging due to the complicated\u0000topological relationships and the distribution discrepancy across graphs. In\u0000this paper, we propose a novel approach named Dual Consistency Delving with\u0000Topological Uncertainty (DELTA) for active graph domain adaptation. Our DELTA\u0000consists of an edge-oriented graph subnetwork and a path-oriented graph\u0000subnetwork, which can explore topological semantics from complementary\u0000perspectives. In particular, our edge-oriented graph subnetwork utilizes the\u0000message passing mechanism to learn neighborhood information, while our\u0000path-oriented graph subnetwork explores high-order relationships from\u0000substructures. To jointly learn from two subnetworks, we roughly select\u0000informative candidate nodes with the consideration of consistency across two\u0000subnetworks. Then, we aggregate local semantics from its K-hop subgraph based\u0000on node degrees for topological uncertainty estimation. To overcome potential\u0000distribution shifts, we compare target nodes and their corresponding source\u0000nodes for discrepancy scores as an additional component for fine selection.\u0000Extensive experiments on benchmark datasets demonstrate that DELTA outperforms\u0000various state-of-the-art approaches.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate and Fast Estimation of Temporal Motifs using Path Sampling 利用路径采样准确快速地估计时空动机
Pub Date : 2024-09-13 DOI: arxiv-2409.08975
Yunjie Pan, Omkar Bhalerao, C. Seshadhri, Nishil Talati
Counting the number of small subgraphs, called motifs, is a fundamentalproblem in social network analysis and graph mining. Many real-world networksare directed and temporal, where edges have timestamps. Motif counting indirected, temporal graphs is especially challenging because there are aplethora of different kinds of patterns. Temporal motif counts reveal muchricher information and there is a need for scalable algorithms for motifcounting. A major challenge in counting is that there can be trillions of temporalmotif matches even with a graph with only millions of vertices. Both the motifsand the input graphs can have multiple edges between two vertices, leading to acombinatorial explosion problem. Counting temporal motifs involving just fourvertices is not feasible with current state-of-the-art algorithms. We design an algorithm, TEACUPS, that addresses this problem using a noveltechnique of temporal path sampling. We combine a path sampling method withcarefully designed temporal data structures, to propose an efficientapproximate algorithm for temporal motif counting. TEACUPS is an unbiasedestimator with provable concentration behavior, which can be used to bound theestimation error. For a Bitcoin graph with hundreds of millions of edges,TEACUPS runs in less than 1 minute, while the exact counting algorithm takesmore than a day. We empirically demonstrate the accuracy of TEACUPS on largedatasets, showing an average of 30$times$ speedup (up to 2000$times$ speedup)compared to existing GPU-based exact counting methods while preserving highcount estimation accuracy.
计算小型子图(称为主题图)的数量是社交网络分析和图挖掘中的一个基本问题。现实世界中的许多网络都是有向和时态的,其边缘都有时间戳。由于存在大量不同类型的模式,因此对间接的时间图进行图案计数尤其具有挑战性。时态图案计数能揭示更丰富的信息,因此需要可扩展的图案计数算法。计数的一大挑战在于,即使只有数百万顶点的图,也可能有数万亿个时态图案匹配。主题图和输入图的两个顶点之间都可能有多条边,从而导致组合爆炸问题。目前最先进的算法无法计算只涉及四个顶点的时空主题。我们设计了一种名为 TEACUPS 的算法,利用新颖的时空路径采样技术来解决这个问题。我们将路径采样方法与精心设计的时态数据结构相结合,提出了一种高效的近似时态图案计数算法。TEACUPS 是一种无偏估计器,具有可证明的集中行为,可用于限制估计误差。对于具有数亿条边的比特币图,TEACUPS 的运行时间不到 1 分钟,而精确计数算法则需要一天以上。我们通过实证证明了TEACUPS在大型数据集上的准确性,与现有的基于GPU的精确计数方法相比,TEACUPS平均提速30倍(最高提速2000倍),同时保持了较高的计数估计准确性。
{"title":"Accurate and Fast Estimation of Temporal Motifs using Path Sampling","authors":"Yunjie Pan, Omkar Bhalerao, C. Seshadhri, Nishil Talati","doi":"arxiv-2409.08975","DOIUrl":"https://doi.org/arxiv-2409.08975","url":null,"abstract":"Counting the number of small subgraphs, called motifs, is a fundamental\u0000problem in social network analysis and graph mining. Many real-world networks\u0000are directed and temporal, where edges have timestamps. Motif counting in\u0000directed, temporal graphs is especially challenging because there are a\u0000plethora of different kinds of patterns. Temporal motif counts reveal much\u0000richer information and there is a need for scalable algorithms for motif\u0000counting. A major challenge in counting is that there can be trillions of temporal\u0000motif matches even with a graph with only millions of vertices. Both the motifs\u0000and the input graphs can have multiple edges between two vertices, leading to a\u0000combinatorial explosion problem. Counting temporal motifs involving just four\u0000vertices is not feasible with current state-of-the-art algorithms. We design an algorithm, TEACUPS, that addresses this problem using a novel\u0000technique of temporal path sampling. We combine a path sampling method with\u0000carefully designed temporal data structures, to propose an efficient\u0000approximate algorithm for temporal motif counting. TEACUPS is an unbiased\u0000estimator with provable concentration behavior, which can be used to bound the\u0000estimation error. For a Bitcoin graph with hundreds of millions of edges,\u0000TEACUPS runs in less than 1 minute, while the exact counting algorithm takes\u0000more than a day. We empirically demonstrate the accuracy of TEACUPS on large\u0000datasets, showing an average of 30$times$ speedup (up to 2000$times$ speedup)\u0000compared to existing GPU-based exact counting methods while preserving high\u0000count estimation accuracy.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War 俄乌战争期间 Telegram 上的信息叙事检测和演变建模
Pub Date : 2024-09-12 DOI: arxiv-2409.07684
Patrick Gerard, Svitlana Volkova, Louis Penafiel, Kristina Lerman, Tim Weninger
Following the Russian Federation's full-scale invasion of Ukraine in February2022, a multitude of information narratives emerged within both pro-Russian andpro-Ukrainian communities online. As the conflict progresses, so too do theinformation narratives, constantly adapting and influencing local and globalcommunity perceptions and attitudes. This dynamic nature of the evolvinginformation environment (IE) underscores a critical need to fully discern hownarratives evolve and affect online communities. Existing research, however,often fails to capture information narrative evolution, overlooking both thefluid nature of narratives and the internal mechanisms that drive theirevolution. Recognizing this, we introduce a novel approach designed to bothmodel narrative evolution and uncover the underlying mechanisms driving them.In this work we perform a comparative discourse analysis across communities onTelegram covering the initial three months following the invasion. First, weuncover substantial disparities in narratives and perceptions betweenpro-Russian and pro-Ukrainian communities. Then, we probe deeper into prevalentnarratives of each group, identifying key themes and examining the underlyingmechanisms fueling their evolution. Finally, we explore influences and factorsthat may shape the development and spread of narratives.
俄罗斯联邦于 2022 年 2 月全面入侵乌克兰后,亲俄和亲乌网络社区内出现了大量的信息叙事。随着冲突的发展,信息叙事也在不断调整,并影响着当地和全球社区的看法和态度。不断演变的信息环境(IE)的这一动态性质突出表明,我们亟需全面了解叙事如何演变并影响网络社区。然而,现有的研究往往无法捕捉到信息叙事的演变,忽略了叙事的流动性以及推动叙事演变的内部机制。认识到这一点后,我们引入了一种新颖的方法,旨在对叙事演变进行建模,并揭示驱动叙事演变的内在机制。在这项工作中,我们对 Telegram 上的社区进行了比较话语分析,分析范围涵盖了入侵后的最初三个月。首先,我们发现了亲俄和亲乌克兰社区在叙事和认知上的巨大差异。然后,我们深入探讨了每个群体的普遍叙事,确定了关键主题,并研究了推动其演变的潜在机制。最后,我们探讨了可能影响叙事发展和传播的影响因素。
{"title":"Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War","authors":"Patrick Gerard, Svitlana Volkova, Louis Penafiel, Kristina Lerman, Tim Weninger","doi":"arxiv-2409.07684","DOIUrl":"https://doi.org/arxiv-2409.07684","url":null,"abstract":"Following the Russian Federation's full-scale invasion of Ukraine in February\u00002022, a multitude of information narratives emerged within both pro-Russian and\u0000pro-Ukrainian communities online. As the conflict progresses, so too do the\u0000information narratives, constantly adapting and influencing local and global\u0000community perceptions and attitudes. This dynamic nature of the evolving\u0000information environment (IE) underscores a critical need to fully discern how\u0000narratives evolve and affect online communities. Existing research, however,\u0000often fails to capture information narrative evolution, overlooking both the\u0000fluid nature of narratives and the internal mechanisms that drive their\u0000evolution. Recognizing this, we introduce a novel approach designed to both\u0000model narrative evolution and uncover the underlying mechanisms driving them.\u0000In this work we perform a comparative discourse analysis across communities on\u0000Telegram covering the initial three months following the invasion. First, we\u0000uncover substantial disparities in narratives and perceptions between\u0000pro-Russian and pro-Ukrainian communities. Then, we probe deeper into prevalent\u0000narratives of each group, identifying key themes and examining the underlying\u0000mechanisms fueling their evolution. Finally, we explore influences and factors\u0000that may shape the development and spread of narratives.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Social and Information Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1