arXiv - CS - Social and Information Networks最新文献_第3页

MAPX: An explainable model-agnostic framework for the detection of false information on social media networks MAPX：用于检测社交媒体网络虚假信息的可解释模型无关框架

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08522

Sarah Condran, Michael Bewong, Selasi Kwashie, Md Zahidul Islam, Irfan Altas, Joshua Condran

The automated detection of false information has become a fundamental task incombating the spread of "fake news" on online social media networks (OSMN) asit reduces the need for manual discernment by individuals. In the literature,leveraging various content or context features of OSMN documents have beenfound useful. However, most of the existing detection models often utilisethese features in isolation without regard to the temporal and dynamic changesoft-seen in reality, thus, limiting the robustness of the models. Furthermore,there has been little to no consideration of the impact of the quality ofdocuments' features on the trustworthiness of the final prediction. In thispaper, we introduce a novel model-agnostic framework, called MAPX, which allowsevidence based aggregation of predictions from existing models in anexplainable manner. Indeed, the developed aggregation method is adaptive,dynamic and considers the quality of OSMN document features. Further, weperform extensive experiments on benchmarked fake news datasets to demonstratethe effectiveness of MAPX using various real-world data quality scenarios. Ourempirical results show that the proposed framework consistently outperforms allstate-of-the-art models evaluated. For reproducibility, a demo of MAPX isavailable at href{https://github.com/SCondran/MAPX_framework}{this link}

自动检测虚假信息已成为应对在线社交媒体网络（OSMN）上 "假新闻 "传播的一项基本任务，因为它减少了个人手动辨别的需要。在文献中，人们发现利用 OSMN 文档的各种内容或上下文特征非常有用。然而，现有的大多数检测模型往往孤立地利用这些特征，而不考虑现实中的时间和动态变化，从而限制了模型的鲁棒性。此外，几乎没有人考虑过文档特征的质量对最终预测可信度的影响。在本文中，我们介绍了一种名为 MAPX 的新型模型无关框架，它允许以可解释的方式基于证据聚合现有模型的预测结果。事实上，所开发的聚合方法是自适应的、动态的，并考虑了 OSMN 文档特征的质量。此外，我们还在基准假新闻数据集上进行了大量实验，利用各种真实世界的数据质量场景来证明 MAPX 的有效性。实证结果表明，所提出的框架始终优于所有接受评估的最先进模型。为便于重现，MAPX 的演示可在（href{https://github.com/SCondran/MAPX_framework}{this link} ）上下载。

{"title":"MAPX: An explainable model-agnostic framework for the detection of false information on social media networks","authors":"Sarah Condran, Michael Bewong, Selasi Kwashie, Md Zahidul Islam, Irfan Altas, Joshua Condran","doi":"arxiv-2409.08522","DOIUrl":"https://doi.org/arxiv-2409.08522","url":null,"abstract":"The automated detection of false information has become a fundamental task in\u0000combating the spread of \"fake news\" on online social media networks (OSMN) as\u0000it reduces the need for manual discernment by individuals. In the literature,\u0000leveraging various content or context features of OSMN documents have been\u0000found useful. However, most of the existing detection models often utilise\u0000these features in isolation without regard to the temporal and dynamic changes\u0000oft-seen in reality, thus, limiting the robustness of the models. Furthermore,\u0000there has been little to no consideration of the impact of the quality of\u0000documents' features on the trustworthiness of the final prediction. In this\u0000paper, we introduce a novel model-agnostic framework, called MAPX, which allows\u0000evidence based aggregation of predictions from existing models in an\u0000explainable manner. Indeed, the developed aggregation method is adaptive,\u0000dynamic and considers the quality of OSMN document features. Further, we\u0000perform extensive experiments on benchmarked fake news datasets to demonstrate\u0000the effectiveness of MAPX using various real-world data quality scenarios. Our\u0000empirical results show that the proposed framework consistently outperforms all\u0000state-of-the-art models evaluated. For reproducibility, a demo of MAPX is\u0000available at href{https://github.com/SCondran/MAPX_framework}{this link}","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"104 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unveiling User Engagement Patterns on Stack Exchange Through Network Analysis 通过网络分析揭示用户在 Stack Exchange 上的参与模式

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08944

Agnik Saha, Mohammad Shahidul Kader, Mohammad Masum

Stack Exchange, a question-and-answer(Q&A) platform, has exhibited signs of adeclining user engagement. This paper investigates user engagement dynamicsacross various Stack Exchange communities including Data science, AI, softwareengineering, project management, and GenAI. We propose a network graphrepresenting users as nodes and their interactions as edges. We exploreengagement patterns through key network metrics including Degree Centerality,Betweenness Centrality, and PageRank. The study findings reveal distinctcommunity dynamics across these platforms, with smaller communitiesdemonstrating more concentrated user influence, while larger platforms showcasemore distributed engagement. Besides, the results showed insights into userroles, influence, and potential strategies for enhancing engagement. Thisresearch contributes to understanding of online community behavior and providesa framework for future studies to improve the Stack Exchange user experience.

问答平台 Stack Exchange 显示出用户参与度下降的迹象。本文研究了 Stack Exchange 各个社区的用户参与动态，包括数据科学、人工智能、软件工程、项目管理和 GenAI。我们提出了一个网络图，将用户表示为节点，将他们之间的互动表示为边。我们通过关键的网络指标（包括度中心性、间中心性和 PageRank）来探索管理模式。研究结果表明，这些平台上的社区动态各不相同，规模较小的社区显示出更集中的用户影响力，而规模较大的平台则显示出更分散的参与度。此外，研究结果还揭示了用户角色、影响力以及提高参与度的潜在策略。这项研究有助于人们了解在线社区行为，并为今后改善 Stack Exchange 用户体验的研究提供了一个框架。

引用次数: 0

Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents 融合动力学方程：基于 LLM 代理的社会意见预测算法

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08717

Junchi Yao, Hongjie Zhang, Jie Ou, Dingyi Zuo, Zheng Yang, Zhicheng Dong

In the context where social media is increasingly becoming a significantplatform for social movements and the formation of public opinion, accuratelysimulating and predicting the dynamics of user opinions is of great importancefor understanding social phenomena, policy making, and guiding public opinion.However, existing simulation methods face challenges in capturing thecomplexity and dynamics of user behavior. Addressing this issue, this paperproposes an innovative simulation method for the dynamics of social media useropinions, the FDE-LLM algorithm, which incorporates opinion dynamics andepidemic model. This effectively constrains the actions and opinion evolutionprocess of large language models (LLM), making them more aligned with the realcyber world. In particular, the FDE-LLM categorizes users into opinion leadersand followers. Opinion leaders are based on LLM role-playing and areconstrained by the CA model, while opinion followers are integrated into adynamic system that combines the CA model with the SIR model. This innovativedesign significantly improves the accuracy and efficiency of the simulation.Experiments were conducted on four real Weibo datasets and validated using theopen-source model ChatGLM. The results show that, compared to traditionalagent-based modeling (ABM) opinion dynamics algorithms and LLM-based opiniondiffusion algorithms, our FDE-LLM algorithm demonstrates higher accuracy andinterpretability.

在社交媒体日益成为社会运动和舆论形成的重要平台的背景下，准确模拟和预测用户意见的动态变化对于理解社会现象、制定政策和引导舆论具有重要意义。然而，现有的模拟方法在捕捉用户行为的复杂性和动态性方面面临挑战。针对这一问题，本文提出了一种创新的社交媒体用户舆论动态模拟方法--FDE-LLM 算法，该算法结合了舆论动态和流行病模型。这有效地约束了大型语言模型（LLM）的行动和观点演变过程，使其更加贴近真实的网络世界。其中，FDE-LLM 将用户分为意见领袖和追随者。意见领袖以 LLM 角色扮演为基础，受到 CA 模型的约束，而意见追随者则被整合到结合了 CA 模型和 SIR 模型的动态系统中。我们在四个真实微博数据集上进行了实验，并使用开源模型 ChatGLM 进行了验证。实验结果表明，与传统的基于代理建模（ABM）的舆情动态算法和基于 LLM 的舆情扩散算法相比，我们的 FDE-LLM 算法具有更高的准确性和可解释性。

{"title":"Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents","authors":"Junchi Yao, Hongjie Zhang, Jie Ou, Dingyi Zuo, Zheng Yang, Zhicheng Dong","doi":"arxiv-2409.08717","DOIUrl":"https://doi.org/arxiv-2409.08717","url":null,"abstract":"In the context where social media is increasingly becoming a significant\u0000platform for social movements and the formation of public opinion, accurately\u0000simulating and predicting the dynamics of user opinions is of great importance\u0000for understanding social phenomena, policy making, and guiding public opinion.\u0000However, existing simulation methods face challenges in capturing the\u0000complexity and dynamics of user behavior. Addressing this issue, this paper\u0000proposes an innovative simulation method for the dynamics of social media user\u0000opinions, the FDE-LLM algorithm, which incorporates opinion dynamics and\u0000epidemic model. This effectively constrains the actions and opinion evolution\u0000process of large language models (LLM), making them more aligned with the real\u0000cyber world. In particular, the FDE-LLM categorizes users into opinion leaders\u0000and followers. Opinion leaders are based on LLM role-playing and are\u0000constrained by the CA model, while opinion followers are integrated into a\u0000dynamic system that combines the CA model with the SIR model. This innovative\u0000design significantly improves the accuracy and efficiency of the simulation.\u0000Experiments were conducted on four real Weibo datasets and validated using the\u0000open-source model ChatGLM. The results show that, compared to traditional\u0000agent-based modeling (ABM) opinion dynamics algorithms and LLM-based opinion\u0000diffusion algorithms, our FDE-LLM algorithm demonstrates higher accuracy and\u0000interpretability.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sybil Detection using Graph Neural Networks 利用图神经网络检测假冒者

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08631

Stuart Heeb, Andreas Plesner, Roger Wattenhofer

This paper presents SYBILGAT, a novel approach to Sybil detection in socialnetworks using Graph Attention Networks (GATs). Traditional methods for Sybildetection primarily leverage structural properties of networks; however, theytend to struggle with a large number of attack edges and are often unable tosimultaneously utilize both known Sybil and honest nodes. Our proposed methodaddresses these limitations by dynamically assigning attention weights todifferent nodes during aggregations, enhancing detection performance. Weconducted extensive experiments in various scenarios, including pretraining insampled subgraphs, synthetic networks, and networks under targeted attacks. Theresults show that SYBILGAT significantly outperforms the state-of-the-artalgorithms, particularly in scenarios with high attack complexity and when thenumber of attack edges increases. Our approach shows robust performance acrossdifferent network models and sizes, even as the detection task becomes morechallenging. We successfully applied the model to a real-world Twitter graphwith more than 269k nodes and 6.8M edges. The flexibility and generalizabilityof SYBILGAT make it a promising tool to defend against Sybil attacks in onlinesocial networks with only structural information.

本文介绍了 SYBILGAT，这是一种利用图注意网络（GAT）在社交网络中进行假冒者检测的新方法。传统的假冒者检测方法主要利用网络的结构特性；然而，这些方法往往难以应对大量的攻击边，而且往往无法同时利用已知的假冒者节点和诚实节点。我们提出的方法通过在聚合过程中动态分配不同节点的关注权重来解决这些局限性，从而提高检测性能。我们在各种场景下进行了广泛的实验，包括在采样子图、合成网络和目标攻击下的网络中进行预训练。结果表明，SYBILGAT 的性能明显优于最先进的算法，尤其是在攻击复杂度较高和攻击边数量增加的情况下。即使检测任务变得更具挑战性，我们的方法也能在不同的网络模型和规模下表现出稳健的性能。我们成功地将该模型应用于一个拥有超过 269k 个节点和 680 万条边的真实 Twitter 图。SYBILGAT 的灵活性和通用性使其成为在仅有结构信息的在线社交网络中防御仿冒攻击的理想工具。

{"title":"Sybil Detection using Graph Neural Networks","authors":"Stuart Heeb, Andreas Plesner, Roger Wattenhofer","doi":"arxiv-2409.08631","DOIUrl":"https://doi.org/arxiv-2409.08631","url":null,"abstract":"This paper presents SYBILGAT, a novel approach to Sybil detection in social\u0000networks using Graph Attention Networks (GATs). Traditional methods for Sybil\u0000detection primarily leverage structural properties of networks; however, they\u0000tend to struggle with a large number of attack edges and are often unable to\u0000simultaneously utilize both known Sybil and honest nodes. Our proposed method\u0000addresses these limitations by dynamically assigning attention weights to\u0000different nodes during aggregations, enhancing detection performance. We\u0000conducted extensive experiments in various scenarios, including pretraining in\u0000sampled subgraphs, synthetic networks, and networks under targeted attacks. The\u0000results show that SYBILGAT significantly outperforms the state-of-the-art\u0000algorithms, particularly in scenarios with high attack complexity and when the\u0000number of attack edges increases. Our approach shows robust performance across\u0000different network models and sizes, even as the detection task becomes more\u0000challenging. We successfully applied the model to a real-world Twitter graph\u0000with more than 269k nodes and 6.8M edges. The flexibility and generalizability\u0000of SYBILGAT make it a promising tool to defend against Sybil attacks in online\u0000social networks with only structural information.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Community Fact-Checks Trigger Moral Outrage in Replies to Misleading Posts on Social Media 社区事实核查引发道德愤怒，回应社交媒体上的误导性帖子

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08829

Yuwei Chuai, Anastasia Sergeeva, Gabriele Lenzini, Nicolas Pröllochs

Displaying community fact-checks is a promising approach to reduce engagementwith misinformation on social media. However, how users respond to misleadingcontent emotionally after community fact-checks are displayed on posts isunclear. Here, we employ quasi-experimental methods to causally analyze changesin sentiments and (moral) emotions in replies to misleading posts following thedisplay of community fact-checks. Our evaluation is based on a large-scalepanel dataset comprising N=2,225,260 replies across 1841 source posts from X'sCommunity Notes platform. We find that informing users about falsehoods throughcommunity fact-checks significantly increases negativity (by 7.3%), anger (by13.2%), disgust (by 4.7%), and moral outrage (by 16.0%) in the correspondingreplies. These results indicate that users perceive spreading misinformation asa violation of social norms and that those who spread misinformation shouldexpect negative reactions once their content is debunked. We derive importantimplications for the design of community-based fact-checking systems.

在社交媒体上显示社区事实核查是减少参与错误信息的一种有前途的方法。然而，在帖子上显示社区事实核查后，用户如何对误导性内容做出情绪反应还不清楚。在此，我们采用准实验方法对社区事实核查显示后误导性帖子回复中的情绪和（道德）情感变化进行因果分析。我们的评估基于一个大型面板数据集，该数据集包含来自 X's 社区笔记平台的 1841 个源帖子的 N=2,225,260 条回复。我们发现，通过社区事实核查告知用户虚假信息会显著增加相应回复中的负面情绪（7.3%）、愤怒（13.2%）、厌恶（4.7%）和道德愤怒（16.0%）。这些结果表明，用户认为传播错误信息违反了社会规范，而那些传播错误信息的人应该预料到一旦他们的内容被揭穿后会出现负面反应。我们得出了设计基于社区的事实核查系统的重要启示。

{"title":"Community Fact-Checks Trigger Moral Outrage in Replies to Misleading Posts on Social Media","authors":"Yuwei Chuai, Anastasia Sergeeva, Gabriele Lenzini, Nicolas Pröllochs","doi":"arxiv-2409.08829","DOIUrl":"https://doi.org/arxiv-2409.08829","url":null,"abstract":"Displaying community fact-checks is a promising approach to reduce engagement\u0000with misinformation on social media. However, how users respond to misleading\u0000content emotionally after community fact-checks are displayed on posts is\u0000unclear. Here, we employ quasi-experimental methods to causally analyze changes\u0000in sentiments and (moral) emotions in replies to misleading posts following the\u0000display of community fact-checks. Our evaluation is based on a large-scale\u0000panel dataset comprising N=2,225,260 replies across 1841 source posts from X's\u0000Community Notes platform. We find that informing users about falsehoods through\u0000community fact-checks significantly increases negativity (by 7.3%), anger (by\u000013.2%), disgust (by 4.7%), and moral outrage (by 16.0%) in the corresponding\u0000replies. These results indicate that users perceive spreading misinformation as\u0000a violation of social norms and that those who spread misinformation should\u0000expect negative reactions once their content is debunked. We derive important\u0000implications for the design of community-based fact-checking systems.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Community-based fact-checking reduces the spread of misleading posts on social media 基于社区的事实核查减少了误导性帖子在社交媒体上的传播

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08781

Yuwei Chuai, Moritz Pilarski, Thomas Renault, David Restrepo-Amariles, Aurore Troussel-Clément, Gabriele Lenzini, Nicolas Pröllochs

Community-based fact-checking is a promising approach to verify social mediacontent and correct misleading posts at scale. Yet, causal evidence regardingits effectiveness in reducing the spread of misinformation on social media ismissing. Here, we performed a large-scale empirical study to analyze whethercommunity notes reduce the spread of misleading posts on X. Using aDifference-in-Differences design and repost time series data for N=237,677(community fact-checked) cascades that had been reposted more than 431 milliontimes, we found that exposing users to community notes reduced the spread ofmisleading posts by, on average, 62.0%. Furthermore, community notes increasedthe odds that users delete their misleading posts by 103.4%. However, ourfindings also suggest that community notes might be too slow to intervene inthe early (and most viral) stage of the diffusion. Our work offers importantimplications to enhance the effectiveness of community-based fact-checkingapproaches on social media.

基于社区的事实核查是验证社交媒体内容和大规模纠正误导性帖子的一种很有前途的方法。然而，有关其在减少社交媒体上错误信息传播方面的有效性的因果证据还很缺乏。在此，我们进行了一项大规模的实证研究，分析社区注释是否减少了 X 上误导性帖子的传播。我们使用了差分法设计和 N=237,677 个（社区事实核查）级联的转贴时间序列数据，这些级联的转贴次数超过了 4.31 亿次，我们发现，让用户接触社区注释平均减少了 62.0% 的误导性帖子的传播。此外，社区注释还使用户删除其误导性帖子的几率提高了 103.4%。然而，我们的研究结果也表明，社区注释可能过于缓慢，无法在传播的早期（也是病毒最猖獗的阶段）进行干预。我们的工作为提高社交媒体上基于社区的事实核查方法的有效性提供了重要启示。

{"title":"Community-based fact-checking reduces the spread of misleading posts on social media","authors":"Yuwei Chuai, Moritz Pilarski, Thomas Renault, David Restrepo-Amariles, Aurore Troussel-Clément, Gabriele Lenzini, Nicolas Pröllochs","doi":"arxiv-2409.08781","DOIUrl":"https://doi.org/arxiv-2409.08781","url":null,"abstract":"Community-based fact-checking is a promising approach to verify social media\u0000content and correct misleading posts at scale. Yet, causal evidence regarding\u0000its effectiveness in reducing the spread of misinformation on social media is\u0000missing. Here, we performed a large-scale empirical study to analyze whether\u0000community notes reduce the spread of misleading posts on X. Using a\u0000Difference-in-Differences design and repost time series data for N=237,677\u0000(community fact-checked) cascades that had been reposted more than 431 million\u0000times, we found that exposing users to community notes reduced the spread of\u0000misleading posts by, on average, 62.0%. Furthermore, community notes increased\u0000the odds that users delete their misleading posts by 103.4%. However, our\u0000findings also suggest that community notes might be too slow to intervene in\u0000the early (and most viral) stage of the diffusion. Our work offers important\u0000implications to enhance the effectiveness of community-based fact-checking\u0000approaches on social media.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Estimation of Graph Features Based on Random Walks Using Neighbors' Properties 基于随机漫步的图形特征估算（利用邻域属性

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08599

Tsuyoshi Hasegawa, Shiori Hironaka, Kazuyuki Shudo

Using random walks for sampling has proven advantageous in assessing thecharacteristics of large and unknown social networks. Several algorithms basedon random walks have been introduced in recent years. In the practicalapplication of social network sampling, there is a recurrent reliance on anapplication programming interface (API) for obtaining adjacent nodes. However,owing to constraints related to query frequency and associated API expenses, itis preferable to minimize API calls during the feature estimation process. Inthis study, considering the acquisition of neighboring nodes as a cost factor,we introduce a feature estimation algorithm that outperforms existingalgorithms in terms of accuracy. Through experiments that simulate sampling onknown graphs, we demonstrate the superior accuracy of our proposed algorithmwhen compared to existing alternatives.

事实证明，使用随机游走进行抽样在评估大型未知社交网络的特征方面具有优势。近年来，已经推出了几种基于随机游走的算法。在社交网络采样的实际应用中，人们经常依赖应用编程接口（API）来获取相邻节点。然而，由于查询频率和相关 API 费用的限制，在特征估计过程中最好尽量减少 API 调用。在本研究中，考虑到获取相邻节点的成本因素，我们引入了一种在准确性方面优于现有算法的特征估计算法。通过在已知图上模拟采样的实验，我们证明了与现有算法相比，我们提出的算法具有更高的准确性。

引用次数: 0

DELTA: Dual Consistency Delving with Topological Uncertainty for Active Graph Domain Adaptation DELTA：带拓扑不确定性的双一致性迭代，用于主动图域自适应

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08946

Pengyun Wang, Yadi Cao, Chris Russell, Siyu Heng, Junyu Luo, Yanxin Shen, Xiao Luo

Graph domain adaptation has recently enabled knowledge transfer acrossdifferent graphs. However, without the semantic information on target graphs,the performance on target graphs is still far from satisfactory. To address theissue, we study the problem of active graph domain adaptation, which selects asmall quantitative of informative nodes on the target graph for extraannotation. This problem is highly challenging due to the complicatedtopological relationships and the distribution discrepancy across graphs. Inthis paper, we propose a novel approach named Dual Consistency Delving withTopological Uncertainty (DELTA) for active graph domain adaptation. Our DELTAconsists of an edge-oriented graph subnetwork and a path-oriented graphsubnetwork, which can explore topological semantics from complementaryperspectives. In particular, our edge-oriented graph subnetwork utilizes themessage passing mechanism to learn neighborhood information, while ourpath-oriented graph subnetwork explores high-order relationships fromsubstructures. To jointly learn from two subnetworks, we roughly selectinformative candidate nodes with the consideration of consistency across twosubnetworks. Then, we aggregate local semantics from its K-hop subgraph basedon node degrees for topological uncertainty estimation. To overcome potentialdistribution shifts, we compare target nodes and their corresponding sourcenodes for discrepancy scores as an additional component for fine selection.Extensive experiments on benchmark datasets demonstrate that DELTA outperformsvarious state-of-the-art approaches.

最近，图域适应技术实现了跨不同图的知识转移。然而，如果没有目标图的语义信息，在目标图上的性能仍然远远不能令人满意。为了解决这个问题，我们研究了主动图域适配问题，即选择目标图上的少量信息节点进行额外标注。由于图之间存在复杂的拓扑关系和分布差异，这个问题极具挑战性。在本文中，我们提出了一种名为 "带拓扑不确定性的双一致性掘取（Dual Consistency Delving withTopological Uncertainty，DELTA）"的主动图域适应新方法。我们的 DELTA 由面向边缘的图子网络和面向路径的图子网络组成，可以从互补的角度探索拓扑语义。其中，面向边缘的图子网络利用消息传递机制学习邻域信息，而面向路径的图子网络则从子结构中探索高阶关系。为了从两个子网络中联合学习，我们在考虑两个子网络一致性的基础上，粗略地选择有信息的候选节点。然后，我们根据节点度从其 K 跳子图中汇总局部语义，以进行拓扑不确定性估计。为了克服潜在的分布偏移，我们比较了目标节点和其相应来源节点的差异分数，作为精细选择的额外组成部分。

{"title":"DELTA: Dual Consistency Delving with Topological Uncertainty for Active Graph Domain Adaptation","authors":"Pengyun Wang, Yadi Cao, Chris Russell, Siyu Heng, Junyu Luo, Yanxin Shen, Xiao Luo","doi":"arxiv-2409.08946","DOIUrl":"https://doi.org/arxiv-2409.08946","url":null,"abstract":"Graph domain adaptation has recently enabled knowledge transfer across\u0000different graphs. However, without the semantic information on target graphs,\u0000the performance on target graphs is still far from satisfactory. To address the\u0000issue, we study the problem of active graph domain adaptation, which selects a\u0000small quantitative of informative nodes on the target graph for extra\u0000annotation. This problem is highly challenging due to the complicated\u0000topological relationships and the distribution discrepancy across graphs. In\u0000this paper, we propose a novel approach named Dual Consistency Delving with\u0000Topological Uncertainty (DELTA) for active graph domain adaptation. Our DELTA\u0000consists of an edge-oriented graph subnetwork and a path-oriented graph\u0000subnetwork, which can explore topological semantics from complementary\u0000perspectives. In particular, our edge-oriented graph subnetwork utilizes the\u0000message passing mechanism to learn neighborhood information, while our\u0000path-oriented graph subnetwork explores high-order relationships from\u0000substructures. To jointly learn from two subnetworks, we roughly select\u0000informative candidate nodes with the consideration of consistency across two\u0000subnetworks. Then, we aggregate local semantics from its K-hop subgraph based\u0000on node degrees for topological uncertainty estimation. To overcome potential\u0000distribution shifts, we compare target nodes and their corresponding source\u0000nodes for discrepancy scores as an additional component for fine selection.\u0000Extensive experiments on benchmark datasets demonstrate that DELTA outperforms\u0000various state-of-the-art approaches.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accurate and Fast Estimation of Temporal Motifs using Path Sampling 利用路径采样准确快速地估计时空动机

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-13 DOI: arxiv-2409.08975

Yunjie Pan, Omkar Bhalerao, C. Seshadhri, Nishil Talati

Counting the number of small subgraphs, called motifs, is a fundamentalproblem in social network analysis and graph mining. Many real-world networksare directed and temporal, where edges have timestamps. Motif counting indirected, temporal graphs is especially challenging because there are aplethora of different kinds of patterns. Temporal motif counts reveal muchricher information and there is a need for scalable algorithms for motifcounting. A major challenge in counting is that there can be trillions of temporalmotif matches even with a graph with only millions of vertices. Both the motifsand the input graphs can have multiple edges between two vertices, leading to acombinatorial explosion problem. Counting temporal motifs involving just fourvertices is not feasible with current state-of-the-art algorithms. We design an algorithm, TEACUPS, that addresses this problem using a noveltechnique of temporal path sampling. We combine a path sampling method withcarefully designed temporal data structures, to propose an efficientapproximate algorithm for temporal motif counting. TEACUPS is an unbiasedestimator with provable concentration behavior, which can be used to bound theestimation error. For a Bitcoin graph with hundreds of millions of edges,TEACUPS runs in less than 1 minute, while the exact counting algorithm takesmore than a day. We empirically demonstrate the accuracy of TEACUPS on largedatasets, showing an average of 30$times$ speedup (up to 2000$times$ speedup)compared to existing GPU-based exact counting methods while preserving highcount estimation accuracy.

计算小型子图（称为主题图）的数量是社交网络分析和图挖掘中的一个基本问题。现实世界中的许多网络都是有向和时态的，其边缘都有时间戳。由于存在大量不同类型的模式，因此对间接的时间图进行图案计数尤其具有挑战性。时态图案计数能揭示更丰富的信息，因此需要可扩展的图案计数算法。计数的一大挑战在于，即使只有数百万顶点的图，也可能有数万亿个时态图案匹配。主题图和输入图的两个顶点之间都可能有多条边，从而导致组合爆炸问题。目前最先进的算法无法计算只涉及四个顶点的时空主题。我们设计了一种名为 TEACUPS 的算法，利用新颖的时空路径采样技术来解决这个问题。我们将路径采样方法与精心设计的时态数据结构相结合，提出了一种高效的近似时态图案计数算法。TEACUPS 是一种无偏估计器，具有可证明的集中行为，可用于限制估计误差。对于具有数亿条边的比特币图，TEACUPS 的运行时间不到 1 分钟，而精确计数算法则需要一天以上。我们通过实证证明了TEACUPS在大型数据集上的准确性，与现有的基于GPU的精确计数方法相比，TEACUPS平均提速30倍（最高提速2000倍），同时保持了较高的计数估计准确性。

{"title":"Accurate and Fast Estimation of Temporal Motifs using Path Sampling","authors":"Yunjie Pan, Omkar Bhalerao, C. Seshadhri, Nishil Talati","doi":"arxiv-2409.08975","DOIUrl":"https://doi.org/arxiv-2409.08975","url":null,"abstract":"Counting the number of small subgraphs, called motifs, is a fundamental\u0000problem in social network analysis and graph mining. Many real-world networks\u0000are directed and temporal, where edges have timestamps. Motif counting in\u0000directed, temporal graphs is especially challenging because there are a\u0000plethora of different kinds of patterns. Temporal motif counts reveal much\u0000richer information and there is a need for scalable algorithms for motif\u0000counting. A major challenge in counting is that there can be trillions of temporal\u0000motif matches even with a graph with only millions of vertices. Both the motifs\u0000and the input graphs can have multiple edges between two vertices, leading to a\u0000combinatorial explosion problem. Counting temporal motifs involving just four\u0000vertices is not feasible with current state-of-the-art algorithms. We design an algorithm, TEACUPS, that addresses this problem using a novel\u0000technique of temporal path sampling. We combine a path sampling method with\u0000carefully designed temporal data structures, to propose an efficient\u0000approximate algorithm for temporal motif counting. TEACUPS is an unbiased\u0000estimator with provable concentration behavior, which can be used to bound the\u0000estimation error. For a Bitcoin graph with hundreds of millions of edges,\u0000TEACUPS runs in less than 1 minute, while the exact counting algorithm takes\u0000more than a day. We empirically demonstrate the accuracy of TEACUPS on large\u0000datasets, showing an average of 30$times$ speedup (up to 2000$times$ speedup)\u0000compared to existing GPU-based exact counting methods while preserving high\u0000count estimation accuracy.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Informal Labor of Content Creators: Situating Xiaohongshu's Key Opinion Consumers in Relationships to Marketers, Consumer Brands, and the Platform 内容创作者的非正式劳动：小红书关键意见消费者与营销者、消费品牌和平台的关系定位

arXiv - CS - Social and Information Networks

Pub Date : 2024-09-12 DOI: arxiv-2409.08360

Huiran Yi, Lu Xian

This paper critically examines flexible content creation conducted by KeyOpinion Consumers (KOCs) on a prominent social media and e-commerce platform inChina, Xiaohongshu (RED). Drawing on nine-month ethnographic work conductedonline, we find that the production of the KOC role on RED is predicated on theinteractions and negotiations among multiple stakeholders -- content creators,marketers, consumer brands (corporations), and the platform. KOCs areinstrumental in RED influencer marketing tactics and amplify the mundane anddaily life content popular on the platform. They navigate the dynamics in thetriangulated relations with other stakeholders in order to secure economicopportunities for producing advertorial content, and yet, the labor involved inproducing such content is deliberately obscured to make it appear asspontaneous, ordinary user posts for the sake of marketing campaigns.Meanwhile, the commercial value of their work is often underestimated andovershadowed in corporate paperwork, platform technological mechanisms, andbusiness models, resulting in and reinforcing inadequate recognition andcompensation of KOCs. We propose the concept of ``informal labor'' to offer anew lens to understand content creation labor that is indispensable yetunrecognized by the social media industry. We advocate for a contextualized andnuanced examination of how labor is valued and compensated and urge for betterprotections and working conditions for informal laborers like KOCs.

本文对关键意见消费者（KOC）在中国著名社交媒体和电子商务平台小红书（RED）上进行的灵活内容创作进行了批判性研究。通过为期九个月的在线人种学研究，我们发现小红书平台上的关键意见消费者角色的产生是以内容创作者、营销者、消费品牌（企业）和平台等多方利益相关者之间的互动和协商为前提的。KOC 是 RED 影响力营销战术中的重要角色，他们放大了平台上流行的世俗和日常生活内容。他们在与其他利益相关者的三角关系中游刃有余，以获得制作广告内容的经济机会，然而，制作这些内容所涉及的劳动却被刻意掩盖，使其看起来像是为了营销活动而发布的自发、普通的用户帖子。同时，他们工作的商业价值往往被低估，并在企业文书、平台技术机制和商业模式中被遮蔽，导致并强化了对KOC的不充分认可和补偿。我们提出了 "非正式劳动 "的概念，以提供一个新的视角来理解社交媒体行业中不可或缺但却不被认可的内容创作劳动。我们主张对劳动的价值和补偿方式进行因地制宜的均衡考察，并敦促为 KOC 等非正规劳动者提供更好的保护和工作条件。

{"title":"The Informal Labor of Content Creators: Situating Xiaohongshu's Key Opinion Consumers in Relationships to Marketers, Consumer Brands, and the Platform","authors":"Huiran Yi, Lu Xian","doi":"arxiv-2409.08360","DOIUrl":"https://doi.org/arxiv-2409.08360","url":null,"abstract":"This paper critically examines flexible content creation conducted by Key\u0000Opinion Consumers (KOCs) on a prominent social media and e-commerce platform in\u0000China, Xiaohongshu (RED). Drawing on nine-month ethnographic work conducted\u0000online, we find that the production of the KOC role on RED is predicated on the\u0000interactions and negotiations among multiple stakeholders -- content creators,\u0000marketers, consumer brands (corporations), and the platform. KOCs are\u0000instrumental in RED influencer marketing tactics and amplify the mundane and\u0000daily life content popular on the platform. They navigate the dynamics in the\u0000triangulated relations with other stakeholders in order to secure economic\u0000opportunities for producing advertorial content, and yet, the labor involved in\u0000producing such content is deliberately obscured to make it appear as\u0000spontaneous, ordinary user posts for the sake of marketing campaigns.\u0000Meanwhile, the commercial value of their work is often underestimated and\u0000overshadowed in corporate paperwork, platform technological mechanisms, and\u0000business models, resulting in and reinforcing inadequate recognition and\u0000compensation of KOCs. We propose the concept of ``informal labor'' to offer a\u0000new lens to understand content creation labor that is indispensable yet\u0000unrecognized by the social media industry. We advocate for a contextualized and\u0000nuanced examination of how labor is valued and compensated and urge for better\u0000protections and working conditions for informal laborers like KOCs.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0