首页 > 最新文献

arXiv - CS - Social and Information Networks最新文献

英文 中文
Sequential Classification of Misinformation 错误信息的顺序分类
Pub Date : 2024-09-07 DOI: arxiv-2409.04860
Daniel Toma, Wasim Huleihel
In recent years there have been a growing interest in online auditing ofinformation flow over social networks with the goal of monitoring undesirableeffects, such as, misinformation and fake news. Most previous work on thesubject, focus on the binary classification problem of classifying informationas fake or genuine. Nonetheless, in many practical scenarios, themulti-class/label setting is of particular importance. For example, it could bethe case that a social media platform may want to distinguish between ``true",``partly-true", and ``false" information. Accordingly, in this paper, weconsider the problem of online multiclass classification of information flow.To that end, driven by empirical studies on information flow over real-worldsocial media networks, we propose a probabilistic information flow model overgraphs. Then, the learning task is to detect the label of the information flow,with the goal of minimizing a combination of the classification error and thedetection time. For this problem, we propose two detection algorithms; thefirst is based on the well-known multiple sequential probability ratio test,while the second is a novel graph neural network based sequential decisionalgorithm. For both algorithms, we prove several strong statistical guarantees.We also construct a data driven algorithm for learning the proposedprobabilistic model. Finally, we test our algorithms over two real-worlddatasets, and show that they outperform other state-of-the-art misinformationdetection algorithms, in terms of detection time and classification error.
近年来,人们对社交网络信息流在线审计的兴趣与日俱增,其目标是监控不良影响,如错误信息和假新闻。以前关于这一主题的大部分工作都集中在信息真假的二元分类问题上。然而,在许多实际场景中,多类别/标签设置尤为重要。例如,社交媒体平台可能希望区分 "真实"、"部分真实 "和 "虚假 "信息。因此,在本文中,我们考虑了信息流的在线多类分类问题。为此,在对现实世界社交媒体网络上的信息流进行实证研究的基础上,我们提出了一个图上的概率信息流模型。然后,学习任务是检测信息流的标签,目标是最小化分类误差和检测时间的组合。针对这个问题,我们提出了两种检测算法:第一种是基于著名的多序列概率比检验,第二种是基于新型图神经网络的序列判定算法。我们还构建了一种数据驱动算法,用于学习所提出的概率模型。最后,我们在两个真实世界数据集上测试了我们的算法,结果表明它们在检测时间和分类误差方面优于其他最先进的错误信息检测算法。
{"title":"Sequential Classification of Misinformation","authors":"Daniel Toma, Wasim Huleihel","doi":"arxiv-2409.04860","DOIUrl":"https://doi.org/arxiv-2409.04860","url":null,"abstract":"In recent years there have been a growing interest in online auditing of\u0000information flow over social networks with the goal of monitoring undesirable\u0000effects, such as, misinformation and fake news. Most previous work on the\u0000subject, focus on the binary classification problem of classifying information\u0000as fake or genuine. Nonetheless, in many practical scenarios, the\u0000multi-class/label setting is of particular importance. For example, it could be\u0000the case that a social media platform may want to distinguish between ``true\",\u0000``partly-true\", and ``false\" information. Accordingly, in this paper, we\u0000consider the problem of online multiclass classification of information flow.\u0000To that end, driven by empirical studies on information flow over real-world\u0000social media networks, we propose a probabilistic information flow model over\u0000graphs. Then, the learning task is to detect the label of the information flow,\u0000with the goal of minimizing a combination of the classification error and the\u0000detection time. For this problem, we propose two detection algorithms; the\u0000first is based on the well-known multiple sequential probability ratio test,\u0000while the second is a novel graph neural network based sequential decision\u0000algorithm. For both algorithms, we prove several strong statistical guarantees.\u0000We also construct a data driven algorithm for learning the proposed\u0000probabilistic model. Finally, we test our algorithms over two real-world\u0000datasets, and show that they outperform other state-of-the-art misinformation\u0000detection algorithms, in terms of detection time and classification error.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Veracity Problem: Detecting False Information and its Propagation on Online Social Media Networks 真实性问题:检测虚假信息及其在网络社交媒体上的传播
Pub Date : 2024-09-06 DOI: arxiv-2409.03948
Sarah Condran
Detecting false information on social media is critical in mitigating itsnegative societal impacts. To reduce the propagation of false information,automated detection provide scalable, unbiased, and cost-effective methods.However, there are three potential research areas identified which once solvedimprove detection. First, current AI-based solutions often provide auni-dimensional analysis on a complex, multi-dimensional issue, with solutionsdiffering based on the features used. Furthermore, these methods do not accountfor the temporal and dynamic changes observed within the document's life cycle.Second, there has been little research on the detection of coordinatedinformation campaigns and in understanding the intent of the actors and thecampaign. Thirdly, there is a lack of consideration of cross-platform analysis,with existing datasets focusing on a single platform, such as X, and detectionmodels designed for specific platform. This work aims to develop methods for effective detection of falseinformation and its propagation. To this end, firstly we aim to propose thecreation of an ensemble multi-faceted framework that leverages multiple aspectsof false information. Secondly, we propose a method to identify actors andtheir intent when working in coordination to manipulate a narrative. Thirdly,we aim to analyse the impact of cross-platform interactions on the propagationof false information via the creation of a new dataset.
检测社交媒体上的虚假信息对于减轻其负面社会影响至关重要。为了减少虚假信息的传播,自动检测提供了可扩展、无偏见和具有成本效益的方法。然而,目前发现了三个潜在的研究领域,一旦解决了这些问题,就能提高检测能力。首先,当前基于人工智能的解决方案通常会对复杂的多维问题进行单维分析,根据所使用的特征,解决方案会有所不同。此外,这些方法没有考虑到在文档生命周期内观察到的时间和动态变化。第二,在检测协调信息活动以及理解参与者和活动意图方面的研究很少。第三,缺乏对跨平台分析的考虑,现有的数据集主要集中在 X 等单一平台上,检测模型也是针对特定平台设计的。这项工作旨在开发有效检测虚假信息及其传播的方法。为此,我们首先提出创建一个多元集合框架,以利用虚假信息的多个方面。其次,我们提出了一种方法来识别行动者以及他们在协同操纵叙事时的意图。第三,我们旨在通过创建一个新的数据集,分析跨平台互动对虚假信息传播的影响。
{"title":"The Veracity Problem: Detecting False Information and its Propagation on Online Social Media Networks","authors":"Sarah Condran","doi":"arxiv-2409.03948","DOIUrl":"https://doi.org/arxiv-2409.03948","url":null,"abstract":"Detecting false information on social media is critical in mitigating its\u0000negative societal impacts. To reduce the propagation of false information,\u0000automated detection provide scalable, unbiased, and cost-effective methods.\u0000However, there are three potential research areas identified which once solved\u0000improve detection. First, current AI-based solutions often provide a\u0000uni-dimensional analysis on a complex, multi-dimensional issue, with solutions\u0000differing based on the features used. Furthermore, these methods do not account\u0000for the temporal and dynamic changes observed within the document's life cycle.\u0000Second, there has been little research on the detection of coordinated\u0000information campaigns and in understanding the intent of the actors and the\u0000campaign. Thirdly, there is a lack of consideration of cross-platform analysis,\u0000with existing datasets focusing on a single platform, such as X, and detection\u0000models designed for specific platform. This work aims to develop methods for effective detection of false\u0000information and its propagation. To this end, firstly we aim to propose the\u0000creation of an ensemble multi-faceted framework that leverages multiple aspects\u0000of false information. Secondly, we propose a method to identify actors and\u0000their intent when working in coordination to manipulate a narrative. Thirdly,\u0000we aim to analyse the impact of cross-platform interactions on the propagation\u0000of false information via the creation of a new dataset.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sustainability of Scale-Free Properties in Synchronizations of Dynamic Scale-Free Networks 动态无标度网络同步中无标度特性的可持续性
Pub Date : 2024-09-06 DOI: arxiv-2409.08298
Rakib Hassan Pran
Scale-free networks are ubiquitous in social, biological and technologicalnetworked systems. Dynamic Scale-free networks and their synchronizations areimportant to understand and predict the behavior of social, biological andtechnological networked systems. In this research, computational experimentshave been conducted to understand the sustainability of scale-free propertiesduring the time of synchronizations in dynamic scale-free networks. Twosynchronization phenomena which are synchronization based on states of nodeswith coupling configuration matrix and synchronization based on states of nodeswith network centralities have been implemented for the synchronization indynamic scale-free networks. In experiments, dynamic scale-free networks havebeen generated with a network generation algorithm and analyzed to understandthe fluctuation from the scale-free properties in their phases during the timeof synchronizations.
无标度网络在社会、生物和技术网络系统中无处不在。动态无标度网络及其同步对于理解和预测社会、生物和技术网络系统的行为非常重要。本研究通过计算实验来了解动态无标度网络同步期间无标度特性的可持续性。针对动态无标度网络中的同步问题,研究了两种同步现象,即基于节点状态与耦合配置矩阵的同步和基于节点状态与网络中心性的同步。在实验中,利用网络生成算法生成了动态无标度网络,并对其进行了分析,以了解在同步过程中无标度特性在其相位上的波动。
{"title":"Sustainability of Scale-Free Properties in Synchronizations of Dynamic Scale-Free Networks","authors":"Rakib Hassan Pran","doi":"arxiv-2409.08298","DOIUrl":"https://doi.org/arxiv-2409.08298","url":null,"abstract":"Scale-free networks are ubiquitous in social, biological and technological\u0000networked systems. Dynamic Scale-free networks and their synchronizations are\u0000important to understand and predict the behavior of social, biological and\u0000technological networked systems. In this research, computational experiments\u0000have been conducted to understand the sustainability of scale-free properties\u0000during the time of synchronizations in dynamic scale-free networks. Two\u0000synchronization phenomena which are synchronization based on states of nodes\u0000with coupling configuration matrix and synchronization based on states of nodes\u0000with network centralities have been implemented for the synchronization in\u0000dynamic scale-free networks. In experiments, dynamic scale-free networks have\u0000been generated with a network generation algorithm and analyzed to understand\u0000the fluctuation from the scale-free properties in their phases during the time\u0000of synchronizations.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preserving Individuality while Following the Crowd: Understanding the Role of User Taste and Crowd Wisdom in Online Product Rating Prediction 从众的同时保留个性:了解用户品味和群众智慧在在线产品评分预测中的作用
Pub Date : 2024-09-06 DOI: arxiv-2409.04649
Liang Wang, Shubham Jain, Yingtong Dou, Junpeng Wang, Chin-Chia Michael Yeh, Yujie Fan, Prince Aboagye, Yan Zheng, Xin Dai, Zhongfang Zhuang, Uday Singh Saini, Wei Zhang
Numerous algorithms have been developed for online product rating prediction,but the specific influence of user and product information in determining thefinal prediction score remains largely unexplored. Existing research oftenrelies on narrowly defined data settings, which overlooks real-world challengessuch as the cold-start problem, cross-category information utilization, andscalability and deployment issues. To delve deeper into these aspects, andparticularly to uncover the roles of individual user taste and collectivewisdom, we propose a unique and practical approach that emphasizes historicalratings at both the user and product levels, encapsulated using a continuouslyupdated dynamic tree representation. This representation effectively capturesthe temporal dynamics of users and products, leverages user information acrossproduct categories, and provides a natural solution to the cold-start problem.Furthermore, we have developed an efficient data processing strategy that makesthis approach highly scalable and easily deployable. Comprehensive experimentsin real industry settings demonstrate the effectiveness of our approach.Notably, our findings reveal that individual taste dominates over collectivewisdom in online product rating prediction, a perspective that contrasts withthe commonly observed wisdom of the crowd phenomenon in other domains. Thisdominance of individual user taste is consistent across various model types,including the boosting tree model, recurrent neural network (RNN), andtransformer-based architectures. This observation holds true across the overallpopulation, within individual product categories, and in cold-start scenarios.Our findings underscore the significance of individual user tastes in thecontext of online product rating prediction and the robustness of our approachacross different model architectures.
针对在线产品评级预测开发的算法不胜枚举,但用户和产品信息对最终预测得分的具体影响在很大程度上仍未得到探讨。现有研究往往依赖于狭义的数据设置,从而忽略了现实世界中的挑战,如冷启动问题、跨类别信息利用以及可扩展性和部署问题。为了深入研究这些方面,特别是揭示用户个人品味和集体智慧的作用,我们提出了一种独特而实用的方法,强调用户和产品层面的历史评价,并使用持续更新的动态树形表示法进行封装。此外,我们还开发了一种高效的数据处理策略,使这种方法具有高度的可扩展性和易部署性。值得注意的是,我们的研究结果表明,在在线产品评级预测中,个人品味比集体智慧更有优势,这与在其他领域普遍观察到的群体智慧现象形成了鲜明对比。用户个人品味的优势在各种类型的模型中都是一致的,包括提升树模型、递归神经网络(RNN)和基于变换器的架构。我们的发现强调了用户个人品味在在线产品评分预测中的重要性,以及我们的方法在不同模型架构中的稳健性。
{"title":"Preserving Individuality while Following the Crowd: Understanding the Role of User Taste and Crowd Wisdom in Online Product Rating Prediction","authors":"Liang Wang, Shubham Jain, Yingtong Dou, Junpeng Wang, Chin-Chia Michael Yeh, Yujie Fan, Prince Aboagye, Yan Zheng, Xin Dai, Zhongfang Zhuang, Uday Singh Saini, Wei Zhang","doi":"arxiv-2409.04649","DOIUrl":"https://doi.org/arxiv-2409.04649","url":null,"abstract":"Numerous algorithms have been developed for online product rating prediction,\u0000but the specific influence of user and product information in determining the\u0000final prediction score remains largely unexplored. Existing research often\u0000relies on narrowly defined data settings, which overlooks real-world challenges\u0000such as the cold-start problem, cross-category information utilization, and\u0000scalability and deployment issues. To delve deeper into these aspects, and\u0000particularly to uncover the roles of individual user taste and collective\u0000wisdom, we propose a unique and practical approach that emphasizes historical\u0000ratings at both the user and product levels, encapsulated using a continuously\u0000updated dynamic tree representation. This representation effectively captures\u0000the temporal dynamics of users and products, leverages user information across\u0000product categories, and provides a natural solution to the cold-start problem.\u0000Furthermore, we have developed an efficient data processing strategy that makes\u0000this approach highly scalable and easily deployable. Comprehensive experiments\u0000in real industry settings demonstrate the effectiveness of our approach.\u0000Notably, our findings reveal that individual taste dominates over collective\u0000wisdom in online product rating prediction, a perspective that contrasts with\u0000the commonly observed wisdom of the crowd phenomenon in other domains. This\u0000dominance of individual user taste is consistent across various model types,\u0000including the boosting tree model, recurrent neural network (RNN), and\u0000transformer-based architectures. This observation holds true across the overall\u0000population, within individual product categories, and in cold-start scenarios.\u0000Our findings underscore the significance of individual user tastes in the\u0000context of online product rating prediction and the robustness of our approach\u0000across different model architectures.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding Online Discussion Across Difference: Insights from Gun Discourse on Reddit 了解跨越差异的在线讨论:从 Reddit 上的枪支讨论中获得的启示
Pub Date : 2024-09-06 DOI: arxiv-2409.03989
Rijul Magu, Nivedhitha Mathan Kumar, Yihe Liu, Xander Koo, Diyi Yang, Amy Bruckman
When discussing difficult topics online, is it common to meaningfully engagewith people from diverse perspectives? Why or why not? Could features of theonline environment be redesigned to encourage civil conversation acrossdifference? In this paper, we study discussions of gun policy on Reddit, withthe overarching goal of developing insights into the potential of the internetto support understanding across difference. We use two methods: a clusteringanalysis of Reddit posts to contribute insights about what people discuss, andan interview study of twenty Reddit users to help us understand why certainkinds of conversation take place and others don't. We find that the discussionof gun politics falls into three groups: conservative pro-gun, liberal pro-gun,and liberal anti-gun. Each type of group has its own characteristic topics.While our subjects state that they would be willing to engage with othersacross the ideological divide, in practice they rarely do. Subjects are siloedinto like-minded subreddits through a two-pronged effect, where they aresimultaneously pushed away from opposing-view communities while activelyseeking belonging in like-minded ones. Another contributing factor is Reddit's"karma" mechanism: fear of being downvoted and losing karma points and socialapproval of peers causes our subjects to hesitate to say anything in conflictwith group norms. The pseudonymous nature of discussion on Reddit plays acomplex role, with some subjects finding it freeing and others fearing reprisalfrom others not bound by face-to-face norms of politeness. Our subjects believethat content moderation can help ameliorate these issues; however, our findingssuggest that moderators need different tools to do so effectively. We concludeby suggesting platform design changes that might increase discussion acrossdifference.
在网上讨论难题时,与不同观点的人进行有意义的交流是否常见?为什么?是否可以重新设计网络环境的特征,以鼓励跨越差异的文明对话?在本文中,我们研究了 Reddit 上关于枪支政策的讨论,其总体目标是深入了解互联网支持跨差异理解的潜力。我们使用了两种方法:一种是对 Reddit 帖子进行聚类分析,以深入了解人们讨论的内容;另一种是对 20 位 Reddit 用户进行访谈研究,以帮助我们理解为什么会出现某些类型的对话,而另一些则没有。我们发现,枪支政治的讨论分为三类:保守派拥枪、自由派拥枪和自由派反枪。每一类群体都有自己的特色话题。虽然我们的受试者表示愿意与其他跨越意识形态鸿沟的人进行交流,但实际上他们很少这样做。通过双管齐下的效应,受试者被孤立在观点相同的子论坛中,他们在积极寻求志同道合者归属感的同时,也被挤出了观点对立的社区。另一个因素是 Reddit 的 "因果报应 "机制:由于害怕被降权、失去因果报应点数和同伴的社会认可,我们的研究对象在发表与群体规范相冲突的言论时会犹豫不决。Reddit 上的匿名讨论性质起着复杂的作用,有些受试者认为这种讨论方式很自由,而另一些受试者则担心会遭到不受面对面礼貌规范约束的其他人的报复。我们的研究对象认为,内容审核可以帮助改善这些问题;然而,我们的研究结果表明,审核人员需要不同的工具才能有效地进行审核。最后,我们建议改变平台设计,以增加跨差异讨论。
{"title":"Understanding Online Discussion Across Difference: Insights from Gun Discourse on Reddit","authors":"Rijul Magu, Nivedhitha Mathan Kumar, Yihe Liu, Xander Koo, Diyi Yang, Amy Bruckman","doi":"arxiv-2409.03989","DOIUrl":"https://doi.org/arxiv-2409.03989","url":null,"abstract":"When discussing difficult topics online, is it common to meaningfully engage\u0000with people from diverse perspectives? Why or why not? Could features of the\u0000online environment be redesigned to encourage civil conversation across\u0000difference? In this paper, we study discussions of gun policy on Reddit, with\u0000the overarching goal of developing insights into the potential of the internet\u0000to support understanding across difference. We use two methods: a clustering\u0000analysis of Reddit posts to contribute insights about what people discuss, and\u0000an interview study of twenty Reddit users to help us understand why certain\u0000kinds of conversation take place and others don't. We find that the discussion\u0000of gun politics falls into three groups: conservative pro-gun, liberal pro-gun,\u0000and liberal anti-gun. Each type of group has its own characteristic topics.\u0000While our subjects state that they would be willing to engage with others\u0000across the ideological divide, in practice they rarely do. Subjects are siloed\u0000into like-minded subreddits through a two-pronged effect, where they are\u0000simultaneously pushed away from opposing-view communities while actively\u0000seeking belonging in like-minded ones. Another contributing factor is Reddit's\u0000\"karma\" mechanism: fear of being downvoted and losing karma points and social\u0000approval of peers causes our subjects to hesitate to say anything in conflict\u0000with group norms. The pseudonymous nature of discussion on Reddit plays a\u0000complex role, with some subjects finding it freeing and others fearing reprisal\u0000from others not bound by face-to-face norms of politeness. Our subjects believe\u0000that content moderation can help ameliorate these issues; however, our findings\u0000suggest that moderators need different tools to do so effectively. We conclude\u0000by suggesting platform design changes that might increase discussion across\u0000difference.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structure and dynamics of growing networks of Reddit threads 不断增长的 Reddit 线程网络的结构与动态
Pub Date : 2024-09-06 DOI: arxiv-2409.04085
Diletta Goglia, Davide Vega
Millions of people use online social networks to reinforce their sense ofbelonging, for example by giving and asking for feedback as a form of socialvalidation and self-recognition. It is common to observe disagreement amongpeople beliefs and points of view when expressing this feedback. Modeling andanalyzing such interactions is crucial to understand social phenomena thathappen when people face different opinions while expressing and discussingtheir values. In this work, we study a Reddit community in which peopleparticipate to judge or be judged with respect to some behavior, as itrepresents a valuable source to study how users express judgments online. Wemodel threads of this community as complex networks of user interactionsgrowing in time, and we analyze the evolution of their structural properties.We show that the evolution of Reddit networks differ from other real socialnetworks, despite falling in the same category. This happens because theirglobal clustering coefficient is extremely small and the average shortest pathlength increases over time. Such properties reveal how users discuss inthreads, i.e. with mostly one other user and often by a single message. Westrengthen such result by analyzing the role that disagreement and reciprocityplay in such conversations. We also show that Reddit thread's evolution overtime is governed by two subgraphs growing at different speeds. We discoverthat, in the studied community, the difference of such speed is higher than inother communities because of the user guidelines enforcing specific userinteractions. Finally, we interpret the obtained results on user behaviordrawing back to Social Judgment Theory.
数百万人使用在线社交网络来加强其归属感,例如,通过提供和征求反馈意见作为一种社会认可和自我认可的形式。在表达这种反馈意见时,人们经常会看到信仰和观点上的分歧。对这种互动进行建模和分析,对于理解人们在表达和讨论自己的价值观时面对不同意见所产生的社会现象至关重要。在这项工作中,我们研究了一个 Reddit 社区,在这个社区中,人们参与对某些行为进行评判或被评判,因为它是研究用户如何在网上表达评判的一个有价值的来源。我们将该社区的线程建模为随时间增长的复杂用户互动网络,并分析了其结构特性的演变。这是因为它们的全局聚类系数极小,平均最短路径长度随时间增长。这些特性揭示了用户是如何在线程中进行讨论的,即主要是与另一个用户进行讨论,而且往往是通过单条消息进行讨论。通过分析分歧和互惠在此类会话中的作用,我们进一步证实了这一结果。我们还表明,Reddit 线程的演化是由两个以不同速度增长的子图所控制的。我们发现,在所研究的社区中,这种速度的差异要高于其他社区,这是因为用户指南强制规定了特定的用户交互。最后,我们从社会判断理论出发,对所获得的用户行为结果进行了解释。
{"title":"Structure and dynamics of growing networks of Reddit threads","authors":"Diletta Goglia, Davide Vega","doi":"arxiv-2409.04085","DOIUrl":"https://doi.org/arxiv-2409.04085","url":null,"abstract":"Millions of people use online social networks to reinforce their sense of\u0000belonging, for example by giving and asking for feedback as a form of social\u0000validation and self-recognition. It is common to observe disagreement among\u0000people beliefs and points of view when expressing this feedback. Modeling and\u0000analyzing such interactions is crucial to understand social phenomena that\u0000happen when people face different opinions while expressing and discussing\u0000their values. In this work, we study a Reddit community in which people\u0000participate to judge or be judged with respect to some behavior, as it\u0000represents a valuable source to study how users express judgments online. We\u0000model threads of this community as complex networks of user interactions\u0000growing in time, and we analyze the evolution of their structural properties.\u0000We show that the evolution of Reddit networks differ from other real social\u0000networks, despite falling in the same category. This happens because their\u0000global clustering coefficient is extremely small and the average shortest path\u0000length increases over time. Such properties reveal how users discuss in\u0000threads, i.e. with mostly one other user and often by a single message. We\u0000strengthen such result by analyzing the role that disagreement and reciprocity\u0000play in such conversations. We also show that Reddit thread's evolution over\u0000time is governed by two subgraphs growing at different speeds. We discover\u0000that, in the studied community, the difference of such speed is higher than in\u0000other communities because of the user guidelines enforcing specific user\u0000interactions. Finally, we interpret the obtained results on user behavior\u0000drawing back to Social Judgment Theory.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey on Signed Graph Embedding: Methods and Applications 有符号图嵌入概览:方法与应用
Pub Date : 2024-09-05 DOI: arxiv-2409.03916
Shrabani Ghosh
A signed graph (SG) is a graph where edges carry sign information attached toit. The sign of a network can be positive, negative, or neutral. A signednetwork is ubiquitous in a real-world network like social networks, citationnetworks, and various technical networks. There are many network embeddingmodels have been proposed and developed for signed networks for bothhomogeneous and heterogeneous types. SG embedding learns low-dimensional vectorrepresentations for nodes of a network, which helps to do many network analysistasks such as link prediction, node classification, and community detection. Inthis survey, we perform a comprehensive study of SG embedding methods andapplications. We introduce here the basic theories and methods of SGs andsurvey the current state of the art of signed graph embedding methods. Inaddition, we explore the applications of different types of SG embeddingmethods in real-world scenarios. As an application, we have explored thecitation network to analyze authorship networks. We also provide source codeand datasets to give future direction. Lastly, we explore the challenges of SGembedding and forecast various future research directions in this field.
符号图(SG)是一种边上附带符号信息的图。网络的符号可以是正符号、负符号或中性符号。符号网络在现实世界的网络中无处不在,如社交网络、引用网络和各种技术网络。针对同构和异构类型的签名网络,人们提出并开发了许多网络嵌入模型。SG embedding 可以学习网络节点的低维向量表示,有助于完成许多网络分析任务,如链接预测、节点分类和社区检测。在本研究中,我们对 SG 嵌入方法和应用进行了全面研究。我们介绍了 SG 的基本理论和方法,并调查了签名图嵌入方法的技术现状。此外,我们还探讨了不同类型的 SG 嵌入方法在实际场景中的应用。作为一种应用,我们探索了引用网络来分析作者网络。我们还提供了源代码和数据集,以指明未来的发展方向。最后,我们探讨了 SG 嵌入所面临的挑战,并预测了该领域未来的各种研究方向。
{"title":"A Survey on Signed Graph Embedding: Methods and Applications","authors":"Shrabani Ghosh","doi":"arxiv-2409.03916","DOIUrl":"https://doi.org/arxiv-2409.03916","url":null,"abstract":"A signed graph (SG) is a graph where edges carry sign information attached to\u0000it. The sign of a network can be positive, negative, or neutral. A signed\u0000network is ubiquitous in a real-world network like social networks, citation\u0000networks, and various technical networks. There are many network embedding\u0000models have been proposed and developed for signed networks for both\u0000homogeneous and heterogeneous types. SG embedding learns low-dimensional vector\u0000representations for nodes of a network, which helps to do many network analysis\u0000tasks such as link prediction, node classification, and community detection. In\u0000this survey, we perform a comprehensive study of SG embedding methods and\u0000applications. We introduce here the basic theories and methods of SGs and\u0000survey the current state of the art of signed graph embedding methods. In\u0000addition, we explore the applications of different types of SG embedding\u0000methods in real-world scenarios. As an application, we have explored the\u0000citation network to analyze authorship networks. We also provide source code\u0000and datasets to give future direction. Lastly, we explore the challenges of SG\u0000embedding and forecast various future research directions in this field.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Topic-wise Exploration of the Telegram Group-verse 按主题划分的 Telegram 群组探索
Pub Date : 2024-09-04 DOI: arxiv-2409.02525
Alessandro Perlo, Giordano Paoletti, Nikhil Jha, Luca Vassio, Jussara Almeida, Marco Mellia
Although currently one of the most popular instant messaging apps worldwide,Telegram has been largely understudied in the past years. In this paper, we aimto address this gap by presenting an analysis of publicly accessible groupscovering discussions encompassing different topics, as diverse as Education,Erotic, Politics, and Cryptocurrencies. We engineer and offer an open-sourcetool to automate the collection of messages from Telegram groups, anon-straightforward problem. We use it to collect more than 50 million messagesfrom 669 groups. Here, we present a first-of-its-kind, per-topic analysis,contrasting the characteristics of the messages sent on the platform fromdifferent angles -- the language, the presence of bots, the type and volume ofshared media content. Our results confirm some anecdotal evidence, e.g., cluesthat Telegram is used to share possibly illicit content, and unveil someunexpected findings, e.g., the different sharing patterns of video and stickersin groups of different topics. While preliminary, we hope that our work pavesthe road for several avenues of future research on the understudied Telegramplatform.
尽管 Telegram 目前是全球最流行的即时通讯应用程序之一,但在过去几年中,人们对它的研究却很少。在本文中,我们旨在通过分析可公开访问的群组来填补这一空白,这些群组涵盖了教育、情色、政治和加密货币等不同主题的讨论。我们设计并提供了一个开源工具,用于自动收集 Telegram 群组中的消息,这不是一个简单的问题。我们用它从 669 个群组中收集了 5000 多万条信息。在此,我们首次按主题进行分析,从语言、机器人的存在、共享媒体内容的类型和数量等不同角度对比了平台上发送的消息的特征。我们的研究结果证实了一些轶事证据,例如 Telegram 被用于分享可能是非法内容的线索,同时也揭示了一些意想不到的发现,例如在不同主题的群组中视频和贴纸的不同分享模式。虽然我们的研究是初步的,但我们希望我们的工作能为今后对研究不足的 Telegram 平台的研究铺平道路。
{"title":"A Topic-wise Exploration of the Telegram Group-verse","authors":"Alessandro Perlo, Giordano Paoletti, Nikhil Jha, Luca Vassio, Jussara Almeida, Marco Mellia","doi":"arxiv-2409.02525","DOIUrl":"https://doi.org/arxiv-2409.02525","url":null,"abstract":"Although currently one of the most popular instant messaging apps worldwide,\u0000Telegram has been largely understudied in the past years. In this paper, we aim\u0000to address this gap by presenting an analysis of publicly accessible groups\u0000covering discussions encompassing different topics, as diverse as Education,\u0000Erotic, Politics, and Cryptocurrencies. We engineer and offer an open-source\u0000tool to automate the collection of messages from Telegram groups, a\u0000non-straightforward problem. We use it to collect more than 50 million messages\u0000from 669 groups. Here, we present a first-of-its-kind, per-topic analysis,\u0000contrasting the characteristics of the messages sent on the platform from\u0000different angles -- the language, the presence of bots, the type and volume of\u0000shared media content. Our results confirm some anecdotal evidence, e.g., clues\u0000that Telegram is used to share possibly illicit content, and unveil some\u0000unexpected findings, e.g., the different sharing patterns of video and stickers\u0000in groups of different topics. While preliminary, we hope that our work paves\u0000the road for several avenues of future research on the understudied Telegram\u0000platform.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Study of Offline Models and Online LLMs in Fake News Detection 离线模型和在线 LLM 在假新闻检测中的比较研究
Pub Date : 2024-09-04 DOI: arxiv-2409.03067
Ruoyu Xu, Gaoxiang Li
Fake news detection remains a critical challenge in today's rapidly evolvingdigital landscape, where misinformation can spread faster than ever before.Traditional fake news detection models often rely on static datasets andauxiliary information, such as metadata or social media interactions, whichlimits their adaptability to real-time scenarios. Recent advancements in LargeLanguage Models (LLMs) have demonstrated significant potential in addressingthese challenges due to their extensive pre-trained knowledge and ability toanalyze textual content without relying on auxiliary data. However, many ofthese LLM-based approaches are still rooted in static datasets, with limitedexploration into their real-time processing capabilities. This paper presents asystematic evaluation of both traditional offline models and state-of-the-artLLMs for real-time fake news detection. We demonstrate the limitations ofexisting offline models, including their inability to adapt to dynamicmisinformation patterns. Furthermore, we show that newer LLM models with onlinecapabilities, such as GPT-4, Claude, and Gemini, are better suited fordetecting emerging fake news in real-time contexts. Our findings emphasize theimportance of transitioning from offline to online LLM models for real-timefake news detection. Additionally, the public accessibility of LLMs enhancestheir scalability and democratizes the tools needed to combat misinformation.By leveraging real-time data, our work marks a significant step toward moreadaptive, effective, and scalable fake news detection systems.
传统的假新闻检测模型通常依赖于静态数据集和辅助信息,如元数据或社交媒体互动,这限制了它们对实时场景的适应性。大型语言模型(LLMs)凭借其广泛的预训练知识和不依赖辅助数据分析文本内容的能力,在应对这些挑战方面展现出了巨大的潜力。然而,许多基于 LLM 的方法仍植根于静态数据集,对其实时处理能力的探索十分有限。本文对用于实时假新闻检测的传统离线模型和最先进的 LLM 进行了系统评估。我们证明了现有离线模型的局限性,包括它们无法适应动态的虚假信息模式。此外,我们还展示了具有在线能力的新型 LLM 模型,如 GPT-4、Claude 和 Gemini,更适合在实时环境中检测新出现的假新闻。我们的发现强调了将离线 LLM 模型过渡到在线 LLM 模型对于实时假新闻检测的重要性。通过利用实时数据,我们的工作标志着向更具适应性、有效性和可扩展性的假新闻检测系统迈出了重要一步。
{"title":"A Comparative Study of Offline Models and Online LLMs in Fake News Detection","authors":"Ruoyu Xu, Gaoxiang Li","doi":"arxiv-2409.03067","DOIUrl":"https://doi.org/arxiv-2409.03067","url":null,"abstract":"Fake news detection remains a critical challenge in today's rapidly evolving\u0000digital landscape, where misinformation can spread faster than ever before.\u0000Traditional fake news detection models often rely on static datasets and\u0000auxiliary information, such as metadata or social media interactions, which\u0000limits their adaptability to real-time scenarios. Recent advancements in Large\u0000Language Models (LLMs) have demonstrated significant potential in addressing\u0000these challenges due to their extensive pre-trained knowledge and ability to\u0000analyze textual content without relying on auxiliary data. However, many of\u0000these LLM-based approaches are still rooted in static datasets, with limited\u0000exploration into their real-time processing capabilities. This paper presents a\u0000systematic evaluation of both traditional offline models and state-of-the-art\u0000LLMs for real-time fake news detection. We demonstrate the limitations of\u0000existing offline models, including their inability to adapt to dynamic\u0000misinformation patterns. Furthermore, we show that newer LLM models with online\u0000capabilities, such as GPT-4, Claude, and Gemini, are better suited for\u0000detecting emerging fake news in real-time contexts. Our findings emphasize the\u0000importance of transitioning from offline to online LLM models for real-time\u0000fake news detection. Additionally, the public accessibility of LLMs enhances\u0000their scalability and democratizes the tools needed to combat misinformation.\u0000By leveraging real-time data, our work marks a significant step toward more\u0000adaptive, effective, and scalable fake news detection systems.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts 过度分析的语言是可怕的:用论证理论驱动的提示解读隐含的厌恶女性推理
Pub Date : 2024-09-04 DOI: arxiv-2409.02519
Arianna Muti, Federico Ruggeri, Khalid Al-Khatib, Alberto Barrón-Cedeño, Tommaso Caselli
We propose misogyny detection as an Argumentative Reasoning task and weinvestigate the capacity of large language models (LLMs) to understand theimplicit reasoning used to convey misogyny in both Italian and English. Thecentral aim is to generate the missing reasoning link between a message and theimplied meanings encoding the misogyny. Our study uses argumentation theory asa foundation to form a collection of prompts in both zero-shot and few-shotsettings. These prompts integrate different techniques, includingchain-of-thought reasoning and augmented knowledge. Our findings show that LLMsfall short on reasoning capabilities about misogynistic comments and that theymostly rely on their implicit knowledge derived from internalized commonstereotypes about women to generate implied assumptions, rather than oninductive reasoning.
我们提出将厌女症检测作为一项论证推理任务,并研究了大语言模型(LLM)理解意大利语和英语中用于表达厌女症的隐含推理的能力。研究的核心目的是在信息和编码厌女症的隐含意义之间生成缺失的推理联系。我们的研究以论证理论为基础,形成了一系列零镜头和少镜头的提示语。这些提示整合了不同的技术,包括思维链推理和增强知识。我们的研究结果表明,法学硕士对厌恶女性言论的推理能力不足,他们主要依赖于从内化的对女性的共同成见中获得的内隐知识来产生隐含假设,而不是归纳推理。
{"title":"Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts","authors":"Arianna Muti, Federico Ruggeri, Khalid Al-Khatib, Alberto Barrón-Cedeño, Tommaso Caselli","doi":"arxiv-2409.02519","DOIUrl":"https://doi.org/arxiv-2409.02519","url":null,"abstract":"We propose misogyny detection as an Argumentative Reasoning task and we\u0000investigate the capacity of large language models (LLMs) to understand the\u0000implicit reasoning used to convey misogyny in both Italian and English. The\u0000central aim is to generate the missing reasoning link between a message and the\u0000implied meanings encoding the misogyny. Our study uses argumentation theory as\u0000a foundation to form a collection of prompts in both zero-shot and few-shot\u0000settings. These prompts integrate different techniques, including\u0000chain-of-thought reasoning and augmented knowledge. Our findings show that LLMs\u0000fall short on reasoning capabilities about misogynistic comments and that they\u0000mostly rely on their implicit knowledge derived from internalized common\u0000stereotypes about women to generate implied assumptions, rather than on\u0000inductive reasoning.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Social and Information Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1