首页 > 最新文献

arXiv - CS - Social and Information Networks最新文献

英文 中文
Let's Influence Algorithms Together: How Millions of Fans Build Collective Understanding of Algorithms and Organize Coordinated Algorithmic Actions 让我们一起影响算法:数百万粉丝如何建立对算法的集体理解并组织协调算法行动
Pub Date : 2024-09-16 DOI: arxiv-2409.10670
Qing Xiao, Yuhang Zheng, Xianzhe Fan, Bingbing Zhang, Zhicong Lu
Previous research pays attention to how users strategically understand andconsciously interact with algorithms but mainly focuses on an individual level,making it difficult to explore how users within communities could develop acollective understanding of algorithms and organize collective algorithmicactions. Through a two-year ethnography of online fan activities, this studyinvestigates 43 core fans who always organize large-scale fans' collectiveactions and their corresponding general fan groups. This study aims to revealhow these core fans mobilize millions of general fans through collectivealgorithmic actions. These core fans reported the rhetorical strategies used topersuade general fans, the steps taken to build a collective understanding ofalgorithms, and the collaborative processes that adapt collective actionsacross platforms and cultures. Our findings highlight the key factors thatenable computer-supported collective algorithmic actions and extend collectiveaction research into large-scale domain targeting algorithms.
以往的研究关注用户如何战略性地理解算法并有意识地与算法互动,但主要集中在个人层面,难以探索社区内的用户如何形成对算法的集体理解并组织集体算法互动。本研究通过对网络粉丝活动进行为期两年的人种学研究,调查了 43 名经常组织大规模粉丝集体活动的核心粉丝及其相应的普通粉丝群体。本研究旨在揭示这些核心粉丝如何通过集体算法行动动员数百万普通粉丝。这些核心粉丝报告了用于说服普通粉丝的修辞策略、建立对算法的集体理解所采取的步骤,以及调整跨平台和跨文化集体行动的协作过程。我们的研究结果强调了计算机支持集体算法行动的关键因素,并将集体行动研究扩展到了大规模领域目标算法。
{"title":"Let's Influence Algorithms Together: How Millions of Fans Build Collective Understanding of Algorithms and Organize Coordinated Algorithmic Actions","authors":"Qing Xiao, Yuhang Zheng, Xianzhe Fan, Bingbing Zhang, Zhicong Lu","doi":"arxiv-2409.10670","DOIUrl":"https://doi.org/arxiv-2409.10670","url":null,"abstract":"Previous research pays attention to how users strategically understand and\u0000consciously interact with algorithms but mainly focuses on an individual level,\u0000making it difficult to explore how users within communities could develop a\u0000collective understanding of algorithms and organize collective algorithmic\u0000actions. Through a two-year ethnography of online fan activities, this study\u0000investigates 43 core fans who always organize large-scale fans' collective\u0000actions and their corresponding general fan groups. This study aims to reveal\u0000how these core fans mobilize millions of general fans through collective\u0000algorithmic actions. These core fans reported the rhetorical strategies used to\u0000persuade general fans, the steps taken to build a collective understanding of\u0000algorithms, and the collaborative processes that adapt collective actions\u0000across platforms and cultures. Our findings highlight the key factors that\u0000enable computer-supported collective algorithmic actions and extend collective\u0000action research into large-scale domain targeting algorithms.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Graph Pooling Based on Minimum Description Length 基于最小描述长度的分层图池化
Pub Date : 2024-09-16 DOI: arxiv-2409.10263
Jan von Pichowski, Christopher Blöcker, Ingo Scholtes
Graph pooling is an essential part of deep graph representation learning. Weintroduce MapEqPool, a principled pooling operator that takes the inherenthierarchical structure of real-world graphs into account. MapEqPool builds onthe map equation, an information-theoretic objective function for communitydetection based on the minimum description length principle which naturallyimplements Occam's razor and balances between model complexity and fit. Wedemonstrate MapEqPool's competitive performance with an empirical comparisonagainst various baselines across standard graph classification datasets.
图池化是深度图表示学习的重要组成部分。我们引入了 MapEqPool,这是一种有原则的池化算子,它将现实世界图的内在层次结构考虑在内。MapEqPool 建立在地图方程的基础上,地图方程是基于最小描述长度原则的社区检测信息理论目标函数,它自然地实现了奥卡姆剃刀原则,并在模型复杂度和拟合度之间取得了平衡。通过在标准图分类数据集上与各种基线进行实证比较,我们展示了 MapEqPool 的竞争性能。
{"title":"Hierarchical Graph Pooling Based on Minimum Description Length","authors":"Jan von Pichowski, Christopher Blöcker, Ingo Scholtes","doi":"arxiv-2409.10263","DOIUrl":"https://doi.org/arxiv-2409.10263","url":null,"abstract":"Graph pooling is an essential part of deep graph representation learning. We\u0000introduce MapEqPool, a principled pooling operator that takes the inherent\u0000hierarchical structure of real-world graphs into account. MapEqPool builds on\u0000the map equation, an information-theoretic objective function for community\u0000detection based on the minimum description length principle which naturally\u0000implements Occam's razor and balances between model complexity and fit. We\u0000demonstrate MapEqPool's competitive performance with an empirical comparison\u0000against various baselines across standard graph classification datasets.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Signed Graph Autoencoder for Explainable and Polarization-Aware Network Embeddings 用于可解释和极化感知网络嵌入的有符号图自动编码器
Pub Date : 2024-09-16 DOI: arxiv-2409.10452
Nikolaos Nakis, Chrysoula Kosma, Giannis Nikolentzos, Michalis Chatzianastasis, Iakovos Evdaimon, Michalis Vazirgiannis
Autoencoders based on Graph Neural Networks (GNNs) have garnered significantattention in recent years for their ability to extract informative latentrepresentations, characterizing the structure of complex topologies, such asgraphs. Despite the prevalence of Graph Autoencoders, there has been limitedfocus on developing and evaluating explainable neural-based graph generativemodels specifically designed for signed networks. To address this gap, wepropose the Signed Graph Archetypal Autoencoder (SGAAE) framework. SGAAEextracts node-level representations that express node memberships over distinctextreme profiles, referred to as archetypes, within the network. This isachieved by projecting the graph onto a learned polytope, which governs itspolarization. The framework employs a recently proposed likelihood foranalyzing signed networks based on the Skellam distribution, combined withrelational archetypal analysis and GNNs. Our experimental evaluationdemonstrates the SGAAEs' capability to successfully infer node memberships overthe different underlying latent structures while extracting competingcommunities formed through the participation of the opposing views in thenetwork. Additionally, we introduce the 2-level network polarization problemand show how SGAAE is able to characterize such a setting. The proposed modelachieves high performance in different tasks of signed link prediction acrossfour real-world datasets, outperforming several baseline models.
近年来,基于图神经网络(GNN)的自动编码器因其能够提取信息性潜在表征,描述复杂拓扑结构(如图)的特征而备受关注。尽管图自动编码器非常普遍,但专门为签名网络设计的、基于神经的可解释图生成模型的开发和评估却一直受到关注。为了弥补这一不足,我们提出了签名图原型自动编码器(SGAAE)框架。SGAAE 提取节点级表示,表达网络中不同极端剖面(称为原型)上的节点成员身份。这是通过将图投影到学习多面体上实现的,学习多面体控制着图的极化。该框架采用了最近提出的基于 Skellam 分布的签名网络分析似然法,并结合了相关原型分析和 GNN。我们的实验评估证明,SGAAEs 能够成功推断出不同潜在结构的节点成员身份,同时提取出网络中对立观点参与形成的竞争群体。此外,我们还引入了两级网络极化问题,并展示了 SGAAE 是如何描述这种情况的。所提出的模型在四个真实世界数据集的不同签名链接预测任务中都取得了很高的性能,超过了几个基线模型。
{"title":"Signed Graph Autoencoder for Explainable and Polarization-Aware Network Embeddings","authors":"Nikolaos Nakis, Chrysoula Kosma, Giannis Nikolentzos, Michalis Chatzianastasis, Iakovos Evdaimon, Michalis Vazirgiannis","doi":"arxiv-2409.10452","DOIUrl":"https://doi.org/arxiv-2409.10452","url":null,"abstract":"Autoencoders based on Graph Neural Networks (GNNs) have garnered significant\u0000attention in recent years for their ability to extract informative latent\u0000representations, characterizing the structure of complex topologies, such as\u0000graphs. Despite the prevalence of Graph Autoencoders, there has been limited\u0000focus on developing and evaluating explainable neural-based graph generative\u0000models specifically designed for signed networks. To address this gap, we\u0000propose the Signed Graph Archetypal Autoencoder (SGAAE) framework. SGAAE\u0000extracts node-level representations that express node memberships over distinct\u0000extreme profiles, referred to as archetypes, within the network. This is\u0000achieved by projecting the graph onto a learned polytope, which governs its\u0000polarization. The framework employs a recently proposed likelihood for\u0000analyzing signed networks based on the Skellam distribution, combined with\u0000relational archetypal analysis and GNNs. Our experimental evaluation\u0000demonstrates the SGAAEs' capability to successfully infer node memberships over\u0000the different underlying latent structures while extracting competing\u0000communities formed through the participation of the opposing views in the\u0000network. Additionally, we introduce the 2-level network polarization problem\u0000and show how SGAAE is able to characterize such a setting. The proposed model\u0000achieves high performance in different tasks of signed link prediction across\u0000four real-world datasets, outperforming several baseline models.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantics Preserving Emoji Recommendation with Large Language Models 利用大型语言模型进行语义保存型表情符号推荐
Pub Date : 2024-09-16 DOI: arxiv-2409.10760
Zhongyi Qiu, Kangyi Qiu, Hanjia Lyu, Wei Xiong, Jiebo Luo
Emojis have become an integral part of digital communication, enriching textby conveying emotions, tone, and intent. Existing emoji recommendation methodsare primarily evaluated based on their ability to match the exact emoji a userchooses in the original text. However, they ignore the essence of users'behavior on social media in that each text can correspond to multiplereasonable emojis. To better assess a model's ability to align with suchreal-world emoji usage, we propose a new semantics preserving evaluationframework for emoji recommendation, which measures a model's ability torecommend emojis that maintain the semantic consistency with the user's text.To evaluate how well a model preserves semantics, we assess whether thepredicted affective state, demographic profile, and attitudinal stance of theuser remain unchanged. If these attributes are preserved, we consider therecommended emojis to have maintained the original semantics. The advancedabilities of Large Language Models (LLMs) in understanding and generatingnuanced, contextually relevant output make them well-suited for handling thecomplexities of semantics preserving emoji recommendation. To this end, weconstruct a comprehensive benchmark to systematically assess the performance ofsix proprietary and open-source LLMs using different prompting techniques onour task. Our experiments demonstrate that GPT-4o outperforms other LLMs,achieving a semantics preservation score of 79.23%. Additionally, we conductcase studies to analyze model biases in downstream classification tasks andevaluate the diversity of the recommended emojis.
表情符号已成为数字通信中不可或缺的一部分,通过传达情感、语气和意图来丰富文本内容。现有的表情符号推荐方法主要是根据是否能准确匹配用户在原始文本中选择的表情符号来进行评估的。然而,这些方法忽略了用户在社交媒体上行为的本质,即每篇文本可以对应多个合理的表情符号。为了更好地评估模型与真实世界中的表情符号使用情况保持一致的能力,我们提出了一个新的表情符号推荐语义保护评估框架,用于衡量模型推荐与用户文本语义保持一致的表情符号的能力。为了评估模型的语义保护程度,我们评估用户的预测情感状态、人口统计学特征和态度立场是否保持不变。如果这些属性保持不变,我们就认为推荐的表情符号保持了原有语义。大语言模型(LLM)在理解和生成增强的、与上下文相关的输出方面具有先进的能力,这使它们非常适合处理语义保留表情符号推荐的复杂问题。为此,我们构建了一个综合基准,系统地评估了在我们的任务中使用不同提示技术的六种专有和开源 LLM 的性能。实验证明,GPT-4o 的性能优于其他 LLM,语义保存率达到 79.23%。此外,我们还进行了案例研究,分析了下游分类任务中的模型偏差,并评估了推荐表情符号的多样性。
{"title":"Semantics Preserving Emoji Recommendation with Large Language Models","authors":"Zhongyi Qiu, Kangyi Qiu, Hanjia Lyu, Wei Xiong, Jiebo Luo","doi":"arxiv-2409.10760","DOIUrl":"https://doi.org/arxiv-2409.10760","url":null,"abstract":"Emojis have become an integral part of digital communication, enriching text\u0000by conveying emotions, tone, and intent. Existing emoji recommendation methods\u0000are primarily evaluated based on their ability to match the exact emoji a user\u0000chooses in the original text. However, they ignore the essence of users'\u0000behavior on social media in that each text can correspond to multiple\u0000reasonable emojis. To better assess a model's ability to align with such\u0000real-world emoji usage, we propose a new semantics preserving evaluation\u0000framework for emoji recommendation, which measures a model's ability to\u0000recommend emojis that maintain the semantic consistency with the user's text.\u0000To evaluate how well a model preserves semantics, we assess whether the\u0000predicted affective state, demographic profile, and attitudinal stance of the\u0000user remain unchanged. If these attributes are preserved, we consider the\u0000recommended emojis to have maintained the original semantics. The advanced\u0000abilities of Large Language Models (LLMs) in understanding and generating\u0000nuanced, contextually relevant output make them well-suited for handling the\u0000complexities of semantics preserving emoji recommendation. To this end, we\u0000construct a comprehensive benchmark to systematically assess the performance of\u0000six proprietary and open-source LLMs using different prompting techniques on\u0000our task. Our experiments demonstrate that GPT-4o outperforms other LLMs,\u0000achieving a semantics preservation score of 79.23%. Additionally, we conduct\u0000case studies to analyze model biases in downstream classification tasks and\u0000evaluate the diversity of the recommended emojis.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact Of Emotions on Information Seeking And Sharing Behaviors During Pandemic 大流行病期间情绪对信息搜索和共享行为的影响
Pub Date : 2024-09-16 DOI: arxiv-2409.10754
Smitha Muthya Sudheendra, Hao Xu, Jisu Huh, Jaideep Srivastava
We propose a novel approach to assess the public's coping behavior during theCOVID-19 outbreak by examining the emotions. Specifically, we explore (1)changes in the public's emotions with the COVID-19 crisis progression and (2)the impacts of the public's emotions on their information-seeking,information-sharing behaviors, and compliance with stay-at-home policies. Webase the study on the appraisal tendency framework, detect the public'semotions by fine-tuning a pre-trained RoBERTa model, and cross-analyzethird-party behavioral data. We demonstrate the feasibility and reliability ofour proposed approach in providing a large-scale examination of the publi'semotions and coping behaviors in a real-world crisis: COVID-19. The approachcomplements prior crisis communication research, mainly based on self-reported,small-scale experiments and survey data. Our results show that anger and fearare more prominent than other emotions experienced by the public at thepandemic's outbreak stage. Results also show that the extent of low certaintyand passive emotions (e.g., sadness, fear) was related to increasedinformation-seeking and information-sharing behaviors. Additionally,high-certainty (e.g., anger) and low-certainty (e.g., sadness, fear) emotionsduring the outbreak correlated to the public's compliance with stay-at-homeorders.
我们提出了一种新方法,通过研究情绪来评估公众在 COVID-19 爆发期间的应对行为。具体来说,我们探讨了(1)公众情绪随着 COVID-19 危机进展的变化;(2)公众情绪对其信息搜寻、信息共享行为和遵守留守政策的影响。我们的研究基于评价倾向框架,通过微调预先训练好的 RoBERTa 模型来检测公众情绪,并交叉分析第三方行为数据。我们证明了所提出方法的可行性和可靠性,可以对真实世界危机中的公众情绪和应对行为进行大规模检测:COVID-19。该方法补充了之前的危机传播研究,这些研究主要基于自我报告、小规模实验和调查数据。我们的研究结果表明,在疫情爆发阶段,公众的愤怒和恐惧情绪比其他情绪更为突出。结果还显示,低确定性和被动情绪(如悲伤、恐惧)的程度与信息搜寻和信息分享行为的增加有关。此外,疫情爆发期间的高确定性情绪(如愤怒)和低确定性情绪(如悲伤、恐惧)与公众遵守留在家中的命令有关。
{"title":"Impact Of Emotions on Information Seeking And Sharing Behaviors During Pandemic","authors":"Smitha Muthya Sudheendra, Hao Xu, Jisu Huh, Jaideep Srivastava","doi":"arxiv-2409.10754","DOIUrl":"https://doi.org/arxiv-2409.10754","url":null,"abstract":"We propose a novel approach to assess the public's coping behavior during the\u0000COVID-19 outbreak by examining the emotions. Specifically, we explore (1)\u0000changes in the public's emotions with the COVID-19 crisis progression and (2)\u0000the impacts of the public's emotions on their information-seeking,\u0000information-sharing behaviors, and compliance with stay-at-home policies. We\u0000base the study on the appraisal tendency framework, detect the public's\u0000emotions by fine-tuning a pre-trained RoBERTa model, and cross-analyze\u0000third-party behavioral data. We demonstrate the feasibility and reliability of\u0000our proposed approach in providing a large-scale examination of the publi's\u0000emotions and coping behaviors in a real-world crisis: COVID-19. The approach\u0000complements prior crisis communication research, mainly based on self-reported,\u0000small-scale experiments and survey data. Our results show that anger and fear\u0000are more prominent than other emotions experienced by the public at the\u0000pandemic's outbreak stage. Results also show that the extent of low certainty\u0000and passive emotions (e.g., sadness, fear) was related to increased\u0000information-seeking and information-sharing behaviors. Additionally,\u0000high-certainty (e.g., anger) and low-certainty (e.g., sadness, fear) emotions\u0000during the outbreak correlated to the public's compliance with stay-at-home\u0000orders.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible Diffusion Scopes with Parameterized Laplacian for Heterophilic Graph Learning 针对嗜异图形学习的参数化拉普拉奇灵活扩散范围
Pub Date : 2024-09-15 DOI: arxiv-2409.09888
Qincheng Lu, Jiaqi Zhu, Sitao Luan, Xiao-Wen Chang
The ability of Graph Neural Networks (GNNs) to capture long-range and globaltopology information is limited by the scope of conventional graph Laplacian,leading to unsatisfactory performance on some datasets, particularly onheterophilic graphs. To address this limitation, we propose a new class ofparameterized Laplacian matrices, which provably offers more flexibility incontrolling the diffusion distance between nodes than the conventional graphLaplacian, allowing long-range information to be adaptively captured throughdiffusion on graph. Specifically, we first prove that the diffusion distanceand spectral distance on graph have an order-preserving relationship. With thisresult, we demonstrate that the parameterized Laplacian can accelerate thediffusion of long-range information, and the parameters in the Laplacian enableflexibility of the diffusion scopes. Based on the theoretical results, wepropose topology-guided rewiring mechanism to capture helpful long-rangeneighborhood information for heterophilic graphs. With this mechanism and thenew Laplacian, we propose two GNNs with flexible diffusion scopes: namely theParameterized Diffusion based Graph Convolutional Networks (PD-GCN) and GraphAttention Networks (PD-GAT). Synthetic experiments reveal the high correlationsbetween the parameters of the new Laplacian and the performance ofparameterized GNNs under various graph homophily levels, which verifies thatour new proposed GNNs indeed have the ability to adjust the parameters toadaptively capture the global information for different levels of heterophilicgraphs. They also outperform the state-of-the-art (SOTA) models on 6 out of 7real-world benchmark datasets, which further confirms their superiority.
图神经网络(GNN)捕捉远距离和全局拓扑信息的能力受到传统图拉普拉斯矩阵范围的限制,导致其在某些数据集上的性能不尽如人意,尤其是在嗜异图上。针对这一局限性,我们提出了一类新的参数化拉普拉斯矩阵,与传统图拉普拉斯矩阵相比,该矩阵在控制节点间扩散距离方面具有更大的灵活性,允许通过图上的扩散自适应地捕捉长程信息。具体来说,我们首先证明了图上的扩散距离和光谱距离具有保序关系。根据这一结果,我们证明了参数化的拉普拉斯可以加速长程信息的扩散,并且拉普拉斯中的参数可以实现扩散范围的灵活性。在理论结果的基础上,我们提出了拓扑引导的重布线机制,以捕捉异嗜图中有用的长邻域信息。利用这种机制和新的拉普拉斯函数,我们提出了两种具有灵活扩散范围的 GNN:即基于参数化扩散的图卷积网络(Parameterized Diffusion based Graph Convolutional Networks,PD-GCN)和图注意力网络(GraphAttention Networks,PD-GAT)。合成实验显示,在不同的图同亲程度下,新拉普拉斯参数与参数化 GNN 的性能之间存在高度相关性,这验证了我们提出的新 GNN 确实有能力调整参数,以适应性地捕捉不同异亲程度图的全局信息。在 7 个真实世界基准数据集中的 6 个数据集上,它们的表现也优于最先进的(SOTA)模型,这进一步证实了它们的优越性。
{"title":"Flexible Diffusion Scopes with Parameterized Laplacian for Heterophilic Graph Learning","authors":"Qincheng Lu, Jiaqi Zhu, Sitao Luan, Xiao-Wen Chang","doi":"arxiv-2409.09888","DOIUrl":"https://doi.org/arxiv-2409.09888","url":null,"abstract":"The ability of Graph Neural Networks (GNNs) to capture long-range and global\u0000topology information is limited by the scope of conventional graph Laplacian,\u0000leading to unsatisfactory performance on some datasets, particularly on\u0000heterophilic graphs. To address this limitation, we propose a new class of\u0000parameterized Laplacian matrices, which provably offers more flexibility in\u0000controlling the diffusion distance between nodes than the conventional graph\u0000Laplacian, allowing long-range information to be adaptively captured through\u0000diffusion on graph. Specifically, we first prove that the diffusion distance\u0000and spectral distance on graph have an order-preserving relationship. With this\u0000result, we demonstrate that the parameterized Laplacian can accelerate the\u0000diffusion of long-range information, and the parameters in the Laplacian enable\u0000flexibility of the diffusion scopes. Based on the theoretical results, we\u0000propose topology-guided rewiring mechanism to capture helpful long-range\u0000neighborhood information for heterophilic graphs. With this mechanism and the\u0000new Laplacian, we propose two GNNs with flexible diffusion scopes: namely the\u0000Parameterized Diffusion based Graph Convolutional Networks (PD-GCN) and Graph\u0000Attention Networks (PD-GAT). Synthetic experiments reveal the high correlations\u0000between the parameters of the new Laplacian and the performance of\u0000parameterized GNNs under various graph homophily levels, which verifies that\u0000our new proposed GNNs indeed have the ability to adjust the parameters to\u0000adaptively capture the global information for different levels of heterophilic\u0000graphs. They also outperform the state-of-the-art (SOTA) models on 6 out of 7\u0000real-world benchmark datasets, which further confirms their superiority.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Fraud Detection: Integrating Reinforcement Learning into Graph Neural Networks 动态欺诈检测:将强化学习融入图神经网络
Pub Date : 2024-09-15 DOI: arxiv-2409.09892
Yuxin Dong, Jianhua Yao, Jiajing Wang, Yingbin Liang, Shuhan Liao, Minheng Xiao
Financial fraud refers to the act of obtaining financial benefits throughdishonest means. Such behavior not only disrupts the order of the financialmarket but also harms economic and social development and breeds other illegaland criminal activities. With the popularization of the internet and onlinepayment methods, many fraudulent activities and money laundering behaviors inlife have shifted from offline to online, posing a great challenge toregulatory authorities. How to efficiently detect these financial fraudactivities has become an urgent issue that needs to be resolved. Graph neuralnetworks are a type of deep learning model that can utilize the interactiverelationships within graph structures, and they have been widely applied in thefield of fraud detection. However, there are still some issues. First,fraudulent activities only account for a very small part of transactiontransfers, leading to an inevitable problem of label imbalance in frauddetection. At the same time, fraudsters often disguise their behavior, whichcan have a negative impact on the final prediction results. In addition,existing research has overlooked the importance of balancing neighborinformation and central node information. For example, when the central nodehas too many neighbors, the features of the central node itself are oftenneglected. Finally, fraud activities and patterns are constantly changing overtime, so considering the dynamic evolution of graph edge relationships is alsovery important.
金融欺诈是指通过不正当手段获取金融利益的行为。这种行为不仅扰乱了金融市场秩序,而且危害了经济和社会发展,并滋生了其他违法犯罪活动。随着互联网和在线支付方式的普及,生活中的许多欺诈活动和洗钱行为已经从线下转移到了线上,这给监管部门带来了巨大的挑战。如何有效地检测这些金融欺诈活动已成为亟待解决的问题。图神经网络是一种深度学习模型,可以利用图结构中的交互关系,在欺诈检测领域得到了广泛应用。然而,目前仍存在一些问题。首先,欺诈活动只占交易转账的很小一部分,导致欺诈检测中不可避免地存在标签不平衡的问题。同时,欺诈者往往会伪装自己的行为,这会对最终预测结果产生负面影响。此外,现有研究还忽略了平衡邻居信息和中心节点信息的重要性。例如,当中心节点有太多邻居时,中心节点本身的特征往往会被忽略。最后,欺诈活动和模式会随着时间的推移而不断变化,因此考虑图边关系的动态演化也非常重要。
{"title":"Dynamic Fraud Detection: Integrating Reinforcement Learning into Graph Neural Networks","authors":"Yuxin Dong, Jianhua Yao, Jiajing Wang, Yingbin Liang, Shuhan Liao, Minheng Xiao","doi":"arxiv-2409.09892","DOIUrl":"https://doi.org/arxiv-2409.09892","url":null,"abstract":"Financial fraud refers to the act of obtaining financial benefits through\u0000dishonest means. Such behavior not only disrupts the order of the financial\u0000market but also harms economic and social development and breeds other illegal\u0000and criminal activities. With the popularization of the internet and online\u0000payment methods, many fraudulent activities and money laundering behaviors in\u0000life have shifted from offline to online, posing a great challenge to\u0000regulatory authorities. How to efficiently detect these financial fraud\u0000activities has become an urgent issue that needs to be resolved. Graph neural\u0000networks are a type of deep learning model that can utilize the interactive\u0000relationships within graph structures, and they have been widely applied in the\u0000field of fraud detection. However, there are still some issues. First,\u0000fraudulent activities only account for a very small part of transaction\u0000transfers, leading to an inevitable problem of label imbalance in fraud\u0000detection. At the same time, fraudsters often disguise their behavior, which\u0000can have a negative impact on the final prediction results. In addition,\u0000existing research has overlooked the importance of balancing neighbor\u0000information and central node information. For example, when the central node\u0000has too many neighbors, the features of the central node itself are often\u0000neglected. Finally, fraud activities and patterns are constantly changing over\u0000time, so considering the dynamic evolution of graph edge relationships is also\u0000very important.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What you say or how you say it? Predicting Conflict Outcomes in Real and LLM-Generated Conversations 说什么还是怎么说?预测真实对话和 LLM 生成的对话中的冲突结果
Pub Date : 2024-09-14 DOI: arxiv-2409.09338
Priya Ronald D'Costa, Evan Rowbotham, Xinlan Emily Hu
When conflicts escalate, is it due to what is said or how it is said? In theconflict literature, two theoretical approaches take opposing views: onefocuses on the content of the disagreement, while the other focuses on how itis expressed. This paper aims to integrate these two perspectives through acomputational analysis of 191 communication features -- 128 related toexpression and 63 to content. We analyze 1,200 GPT-4 simulated conversationsand 12,630 real-world discussions from Reddit. We find that expression featuresmore reliably predict destructive conflict outcomes across both settings,although the most important features differ. In the Reddit data, conversationaldynamics such as turn-taking and conversational equality are highly predictive,but they are not predictive in simulated conversations. These results maysuggest a possible limitation in simulating social interactions with languagemodels, and we discuss the implications for our findings on building socialcomputing systems.
当冲突升级时,是因为说了什么还是怎么说的?在冲突文献中,有两种理论观点截然相反:一种侧重于分歧的内容,而另一种则侧重于分歧的表达方式。本文旨在通过对 191 个交流特征(其中 128 个与表达有关,63 个与内容有关)进行计算分析,将这两种观点进行整合。我们分析了 1,200 个 GPT-4 模拟对话和 12,630 个来自 Reddit 的真实讨论。我们发现,在两种情况下,表达特征都能更可靠地预测破坏性冲突的结果,尽管最重要的特征有所不同。在 Reddit 数据中,诸如轮流发言和会话平等等会话动态具有很高的预测性,但在模拟会话中却不具有预测性。这些结果表明,用语言模型模拟社交互动可能存在局限性,我们将讨论我们的发现对构建社交计算系统的影响。
{"title":"What you say or how you say it? Predicting Conflict Outcomes in Real and LLM-Generated Conversations","authors":"Priya Ronald D'Costa, Evan Rowbotham, Xinlan Emily Hu","doi":"arxiv-2409.09338","DOIUrl":"https://doi.org/arxiv-2409.09338","url":null,"abstract":"When conflicts escalate, is it due to what is said or how it is said? In the\u0000conflict literature, two theoretical approaches take opposing views: one\u0000focuses on the content of the disagreement, while the other focuses on how it\u0000is expressed. This paper aims to integrate these two perspectives through a\u0000computational analysis of 191 communication features -- 128 related to\u0000expression and 63 to content. We analyze 1,200 GPT-4 simulated conversations\u0000and 12,630 real-world discussions from Reddit. We find that expression features\u0000more reliably predict destructive conflict outcomes across both settings,\u0000although the most important features differ. In the Reddit data, conversational\u0000dynamics such as turn-taking and conversational equality are highly predictive,\u0000but they are not predictive in simulated conversations. These results may\u0000suggest a possible limitation in simulating social interactions with language\u0000models, and we discuss the implications for our findings on building social\u0000computing systems.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142269902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design 实现令人满意的公共无障碍环境:通过在线评论为包容性城市设计提供众包方法
Pub Date : 2024-09-13 DOI: arxiv-2409.08459
Lingyao Li, Songhua Hu, Yinpei Dai, Min Deng, Parisa Momeni, Gabriel Laverghetta, Lizhou Fan, Zihui Ma, Xi Wang, Siyuan Ma, Jay Ligatti, Libby Hemphill
As urban populations grow, the need for accessible urban design has becomeurgent. Traditional survey methods for assessing public perceptions ofaccessibility are often limited in scope. Crowdsourcing via online reviewsoffers a valuable alternative to understanding public perceptions, andadvancements in large language models can facilitate their use. This study usesGoogle Maps reviews across the United States and fine-tunes Llama 3 model withthe Low-Rank Adaptation technique to analyze public sentiment on accessibility.At the POI level, most categories -- restaurants, retail, hotels, andhealthcare -- show negative sentiments. Socio-spatial analysis reveals thatareas with higher proportions of white residents and greater socioeconomicstatus report more positive sentiment, while areas with more elderly,highly-educated residents exhibit more negative sentiment. Interestingly, noclear link is found between the presence of disabilities and public sentiments.Overall, this study highlights the potential of crowdsourcing for identifyingaccessibility challenges and providing insights for urban planners.
随着城市人口的增长,对无障碍城市设计的需求日益迫切。评估公众对无障碍环境看法的传统调查方法往往范围有限。通过在线评论进行众包,为了解公众看法提供了一种有价值的替代方法,而大型语言模型的进步可以促进其使用。本研究使用谷歌地图在美国各地的评论,并利用低等级适应技术对 Llama 3 模型进行了微调,以分析公众对无障碍环境的看法。社会空间分析表明,白人居民比例较高、社会经济地位较高的地区报告了更多的积极情绪,而老年人和高学历居民较多的地区则表现出更多的消极情绪。总之,这项研究强调了众包在识别无障碍挑战和为城市规划者提供见解方面的潜力。
{"title":"Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design","authors":"Lingyao Li, Songhua Hu, Yinpei Dai, Min Deng, Parisa Momeni, Gabriel Laverghetta, Lizhou Fan, Zihui Ma, Xi Wang, Siyuan Ma, Jay Ligatti, Libby Hemphill","doi":"arxiv-2409.08459","DOIUrl":"https://doi.org/arxiv-2409.08459","url":null,"abstract":"As urban populations grow, the need for accessible urban design has become\u0000urgent. Traditional survey methods for assessing public perceptions of\u0000accessibility are often limited in scope. Crowdsourcing via online reviews\u0000offers a valuable alternative to understanding public perceptions, and\u0000advancements in large language models can facilitate their use. This study uses\u0000Google Maps reviews across the United States and fine-tunes Llama 3 model with\u0000the Low-Rank Adaptation technique to analyze public sentiment on accessibility.\u0000At the POI level, most categories -- restaurants, retail, hotels, and\u0000healthcare -- show negative sentiments. Socio-spatial analysis reveals that\u0000areas with higher proportions of white residents and greater socioeconomic\u0000status report more positive sentiment, while areas with more elderly,\u0000highly-educated residents exhibit more negative sentiment. Interestingly, no\u0000clear link is found between the presence of disabilities and public sentiments.\u0000Overall, this study highlights the potential of crowdsourcing for identifying\u0000accessibility challenges and providing insights for urban planners.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User Identity Linkage on Social Networks: A Review of Modern Techniques and Applications 社交网络上的用户身份链接:现代技术与应用综述
Pub Date : 2024-09-13 DOI: arxiv-2409.08966
Caterina Senette, Marco Siino, Maurizio Tesconi
In an Online Social Network (OSN), users can create a unique public personaby crafting a user identity that may encompass profile details, content, andnetwork-related information. As a result, a relevant task of interest isrelated to the ability to link identities across different OSNs. Linking usersacross social networks can have multiple implications in several contexts bothat the individual level and at the group level. At the individual level, themain interest in linking the same identity across social networks is to enablea better knowledge of each user. At the group level, linking user identitiesthrough different OSNs helps in predicting user behaviors, network dynamics,information diffusion, and migration phenomena across social media. The processof tying together user accounts on different OSNs is challenging and hasattracted more and more research attention in the last fifteen years. Thepurpose of this work is to provide a comprehensive review of recent studies(from 2016 to the present) on User Identity Linkage (UIL) methods across onlinesocial networks. This review aims to offer guidance for other researchers inthe field by outlining the main problem formulations, the different featureextraction strategies, algorithms, machine learning models, datasets, andevaluation metrics proposed by researchers working in this area. The proposedoverview takes a pragmatic perspective to highlight the concrete possibilitiesfor accomplishing this task depending on the type of available data.
在在线社交网络(OSN)中,用户可以创建一个独一无二的公共个人信息,精心制作用户身份,其中可能包括个人资料、内容和网络相关信息。因此,人们感兴趣的一项相关任务与在不同的 OSN 之间链接身份的能力有关。跨社交网络链接用户在个人和群体两个层面都会产生多重影响。在个人层面,跨社交网络链接同一身份的主要目的是为了更好地了解每个用户。在群体层面,通过不同的 OSNs 链接用户身份有助于预测用户行为、网络动态、信息扩散和社交媒体间的迁移现象。将不同 OSN 上的用户账户绑定在一起的过程极具挑战性,在过去 15 年里吸引了越来越多的研究关注。这项工作的目的是全面回顾近期(2016 年至今)关于跨网络社交网络用户身份关联(UIL)方法的研究。本综述旨在通过概述该领域研究人员提出的主要问题表述、不同的特征提取策略、算法、机器学习模型、数据集和评估指标,为该领域的其他研究人员提供指导。本综述从务实的角度出发,强调了根据可用数据类型完成这项任务的具体可能性。
{"title":"User Identity Linkage on Social Networks: A Review of Modern Techniques and Applications","authors":"Caterina Senette, Marco Siino, Maurizio Tesconi","doi":"arxiv-2409.08966","DOIUrl":"https://doi.org/arxiv-2409.08966","url":null,"abstract":"In an Online Social Network (OSN), users can create a unique public persona\u0000by crafting a user identity that may encompass profile details, content, and\u0000network-related information. As a result, a relevant task of interest is\u0000related to the ability to link identities across different OSNs. Linking users\u0000across social networks can have multiple implications in several contexts both\u0000at the individual level and at the group level. At the individual level, the\u0000main interest in linking the same identity across social networks is to enable\u0000a better knowledge of each user. At the group level, linking user identities\u0000through different OSNs helps in predicting user behaviors, network dynamics,\u0000information diffusion, and migration phenomena across social media. The process\u0000of tying together user accounts on different OSNs is challenging and has\u0000attracted more and more research attention in the last fifteen years. The\u0000purpose of this work is to provide a comprehensive review of recent studies\u0000(from 2016 to the present) on User Identity Linkage (UIL) methods across online\u0000social networks. This review aims to offer guidance for other researchers in\u0000the field by outlining the main problem formulations, the different feature\u0000extraction strategies, algorithms, machine learning models, datasets, and\u0000evaluation metrics proposed by researchers working in this area. The proposed\u0000overview takes a pragmatic perspective to highlight the concrete possibilities\u0000for accomplishing this task depending on the type of available data.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Social and Information Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1