首页 > 最新文献

Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining最新文献

英文 中文
Multi-stage graph peeling algorithm for probabilistic core decomposition 概率核分解的多阶段图剥离算法
Yang Guo, Xuekui Zhang, F. Esfahani, Venkatesh Srinivasan, Alex Thomo, Li Xing
Mining dense subgraphs where vertices connect closely with each other is a common task when analyzing graphs. A very popular notion in subgraph analysis is core decomposition. Recently, Esfahani et al. presented a probabilistic core decomposition algorithm based on graph peeling and Central Limit Theorem (CLT) that is capable of handling very large graphs. Their proposed peeling algorithm (PA) starts from the lowest degree vertices and recursively deletes these vertices, assigning core numbers, and updating the degree of neighbour vertices until it reached the maximum core. However, in many applications, particularly in biology, more valuable information can be obtained from dense sub-communities and we are not interested in small cores where vertices do not interact much with others. To make the previous PA focus more on dense subgraphs, we propose a multi-stage graph peeling algorithm (M-PA) that has a two-stage data screening procedure added before the previous PA. After removing vertices from the graph based on the user-defined thresholds, we can reduce the graph complexity largely and without affecting the vertices in subgraphs that we are interested in. We show that M-PA is more efficient than the previous PA and with the properly set filtering threshold, can produce very similar if not identical dense subgraphs to the previous PA (in terms of graph density and clustering coefficient).
在分析图时,挖掘顶点紧密相连的密集子图是一项常见的任务。子图分析中一个非常流行的概念是核心分解。最近,Esfahani等人提出了一种基于图剥离和中心极限定理(CLT)的概率核心分解算法,能够处理非常大的图。他们提出的剥离算法(PA)从最低度的顶点开始,递归地删除这些顶点,分配核数,并更新相邻顶点的度,直到达到最大核。然而,在许多应用中,特别是在生物学中,更有价值的信息可以从密集的子群落中获得,我们对小的核心不感兴趣,那里的顶点与其他顶点没有太多的交互。为了使先前的PA更关注密集子图,我们提出了一种多阶段图剥离算法(M-PA),该算法在先前的PA之前添加了两阶段的数据筛选过程。在根据用户定义的阈值从图中删除顶点后,我们可以在不影响我们感兴趣的子图中的顶点的情况下大大降低图的复杂性。我们证明M-PA比之前的PA更有效,并且通过适当设置的过滤阈值,可以产生与之前的PA非常相似(如果不是完全相同的话)的密集子图(就图密度和聚类系数而言)。
{"title":"Multi-stage graph peeling algorithm for probabilistic core decomposition","authors":"Yang Guo, Xuekui Zhang, F. Esfahani, Venkatesh Srinivasan, Alex Thomo, Li Xing","doi":"10.1145/3487351.3489470","DOIUrl":"https://doi.org/10.1145/3487351.3489470","url":null,"abstract":"Mining dense subgraphs where vertices connect closely with each other is a common task when analyzing graphs. A very popular notion in subgraph analysis is core decomposition. Recently, Esfahani et al. presented a probabilistic core decomposition algorithm based on graph peeling and Central Limit Theorem (CLT) that is capable of handling very large graphs. Their proposed peeling algorithm (PA) starts from the lowest degree vertices and recursively deletes these vertices, assigning core numbers, and updating the degree of neighbour vertices until it reached the maximum core. However, in many applications, particularly in biology, more valuable information can be obtained from dense sub-communities and we are not interested in small cores where vertices do not interact much with others. To make the previous PA focus more on dense subgraphs, we propose a multi-stage graph peeling algorithm (M-PA) that has a two-stage data screening procedure added before the previous PA. After removing vertices from the graph based on the user-defined thresholds, we can reduce the graph complexity largely and without affecting the vertices in subgraphs that we are interested in. We show that M-PA is more efficient than the previous PA and with the properly set filtering threshold, can produce very similar if not identical dense subgraphs to the previous PA (in terms of graph density and clustering coefficient).","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115681757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Group-node attention for community evolution prediction 群体节点关注的群落进化预测
Matt Revelle, C. Domeniconi, Ben U. Gelman
Communities in social networks evolve over time as people enter and leave the network and their activity behaviors shift. The task of predicting structural changes in communities over time is known as community evolution prediction. Existing work in this area has focused on the development of frameworks for defining events while using traditional classification methods to perform the actual prediction. We present a novel graph neural network for predicting community evolution events from structural and temporal information. The model (GNAN) includes a group-node attention component which enables support for variable-sized inputs and learned representation of groups based on member and neighbor node features. A comparative evaluation with standard baseline methods is performed and we demonstrate that our model outperforms the baselines. Additionally, we show the effects of network trends on model performance.
社交网络中的社区随着人们进入和离开网络以及他们的活动行为的变化而不断发展。预测群落结构随时间变化的任务被称为群落进化预测。该领域的现有工作主要集中在开发用于定义事件的框架,同时使用传统的分类方法来执行实际预测。本文提出了一种基于结构和时间信息预测群落进化事件的新型图神经网络。该模型(GNAN)包括一个组节点关注组件,支持可变大小的输入和基于成员和邻居节点特征的学习组表示。与标准基线方法进行比较评估,我们证明我们的模型优于基线。此外,我们还展示了网络趋势对模型性能的影响。
{"title":"Group-node attention for community evolution prediction","authors":"Matt Revelle, C. Domeniconi, Ben U. Gelman","doi":"10.1145/3487351.3488348","DOIUrl":"https://doi.org/10.1145/3487351.3488348","url":null,"abstract":"Communities in social networks evolve over time as people enter and leave the network and their activity behaviors shift. The task of predicting structural changes in communities over time is known as community evolution prediction. Existing work in this area has focused on the development of frameworks for defining events while using traditional classification methods to perform the actual prediction. We present a novel graph neural network for predicting community evolution events from structural and temporal information. The model (GNAN) includes a group-node attention component which enables support for variable-sized inputs and learned representation of groups based on member and neighbor node features. A comparative evaluation with standard baseline methods is performed and we demonstrate that our model outperforms the baselines. Additionally, we show the effects of network trends on model performance.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127180643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
BotRGCN: Twitter bot detection with relational graph convolutional networks BotRGCN: Twitter机器人检测与关系图卷积网络
Shangbin Feng, Herun Wan, Ningnan Wang, Minnan Luo
Twitter bot detection is an important and challenging task. Existing bot detection measures fail to address the challenge of community and disguise, falling short of detecting bots that disguise as genuine users and attack collectively. To address these two challenges of Twitter bot detection, we propose BotRGCN, which is short for Bot detection with Relational Graph Convolutional Networks. BotRGCN addresses the challenge of community by constructing a heterogeneous graph from follow relationships and applies relational graph convolutional networks. Apart from that, BotRGCN makes use of multi-modal user semantic and property information to avoid feature engineering and augment its ability to capture bots with diversified disguise. Extensive experiments demonstrate that BotRGCN outperforms competitive baselines on a comprehensive benchmark TwiBot-20 which provides follow relationships.
Twitter机器人检测是一项重要且具有挑战性的任务。现有的机器人检测措施无法解决社区和伪装的挑战,无法检测伪装成真实用户并集体攻击的机器人。为了解决Twitter机器人检测的这两个挑战,我们提出了BotRGCN,它是使用关系图卷积网络进行机器人检测的缩写。BotRGCN通过从关注关系中构造异构图,并应用关系图卷积网络来解决社区的挑战。除此之外,BotRGCN利用多模态用户语义和属性信息来避免特征工程,增强其捕获具有多种伪装的机器人的能力。广泛的实验表明,BotRGCN在提供跟随关系的综合基准twitbot -20上优于竞争基准。
{"title":"BotRGCN: Twitter bot detection with relational graph convolutional networks","authors":"Shangbin Feng, Herun Wan, Ningnan Wang, Minnan Luo","doi":"10.1145/3487351.3488336","DOIUrl":"https://doi.org/10.1145/3487351.3488336","url":null,"abstract":"Twitter bot detection is an important and challenging task. Existing bot detection measures fail to address the challenge of community and disguise, falling short of detecting bots that disguise as genuine users and attack collectively. To address these two challenges of Twitter bot detection, we propose BotRGCN, which is short for Bot detection with Relational Graph Convolutional Networks. BotRGCN addresses the challenge of community by constructing a heterogeneous graph from follow relationships and applies relational graph convolutional networks. Apart from that, BotRGCN makes use of multi-modal user semantic and property information to avoid feature engineering and augment its ability to capture bots with diversified disguise. Extensive experiments demonstrate that BotRGCN outperforms competitive baselines on a comprehensive benchmark TwiBot-20 which provides follow relationships.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115629450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
Full Bitcoin blockchain data made easy 完整的比特币区块链数据变得很容易
Jules Azad Emery, Matthieu Latapy
Despite the fact that it is publicly available, collecting and processing the full bitcoin blockchain data is not trivial. Its mere size, history, and other features indeed raise quite specific challenges, that we address in this paper. The strengths of our approach are the following: it relies on very basic and standard tools, which makes the procedure reliable and easily reproducible; it is a purely lossless procedure ensuring that we catch and preserve all existing data; it provides additional indexing that makes it easy to further process the whole data and select appropriate subsets of it. We present our procedure in details and provide an implementation online, as well as the obtained dataset.
尽管它是公开可用的,但收集和处理完整的比特币区块链数据并非微不足道。它的规模、历史和其他特征确实提出了相当具体的挑战,我们将在本文中加以解决。我们的方法的优势如下:它依赖于非常基本和标准的工具,这使得过程可靠且易于重复;这是一个纯粹无损的程序,确保我们捕捉和保存所有现有的数据;它提供了额外的索引,使进一步处理整个数据和选择适当的子集变得容易。我们详细介绍了我们的过程,并提供了一个在线实现,以及获得的数据集。
{"title":"Full Bitcoin blockchain data made easy","authors":"Jules Azad Emery, Matthieu Latapy","doi":"10.1145/3487351.3488326","DOIUrl":"https://doi.org/10.1145/3487351.3488326","url":null,"abstract":"Despite the fact that it is publicly available, collecting and processing the full bitcoin blockchain data is not trivial. Its mere size, history, and other features indeed raise quite specific challenges, that we address in this paper. The strengths of our approach are the following: it relies on very basic and standard tools, which makes the procedure reliable and easily reproducible; it is a purely lossless procedure ensuring that we catch and preserve all existing data; it provides additional indexing that makes it easy to further process the whole data and select appropriate subsets of it. We present our procedure in details and provide an implementation online, as well as the obtained dataset.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"35643 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123658074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
What's kooking?: characterizing India's emerging social network, Koo 疯子是什么?印度新兴社交网络的特点
A. Singh, Chirag Jain, Jivitesh Jain, R. Jain, Shradha Sehgal, Tanisha Pandey, P. Kumaraguru
Social media has grown exponentially in a short period, coming to the forefront of communications and online interactions. Despite their rapid growth, social media platforms have been unable to scale to different languages globally and remain inaccessible to many. In this paper, we characterize Koo, a multilingual micro-blogging site that rose in popularity in 2021, as an Indian alternative to Twitter. We collected a dataset of 4.07 million users, 163.12 million follower-following relationships, and their content and activity across 12 languages. We study the user demographic along the lines of language, location, gender, and profession. The prominent presence of Indian languages in the discourse on Koo indicates the platform's success in promoting regional languages. We observe Koo's follower-following network to be much denser than Twitter's, comprising of closely-knit linguistic communities. An N-gram analysis of posts on Koo shows a #KooVsTwitter rhetoric, revealing the debate comparing the two platforms. Our characterization highlights the dynamics of the multilingual social network and its diverse Indian user base.
社交媒体在短时间内呈指数级增长,成为交流和在线互动的前沿。尽管发展迅速,但社交媒体平台一直无法扩展到全球不同的语言,许多人仍然无法访问。在本文中,我们将多语言微博网站Koo描述为Twitter的印度替代品,该网站于2021年流行起来。我们收集了一个数据集,包含407万用户,1.6312亿关注者-关注者关系,以及他们的内容和活动,涵盖12种语言。我们根据语言、地点、性别和职业来研究用户人口统计。印度语言在Koo的话语中显著存在,表明该平台在推广地区语言方面取得了成功。我们观察到辜朝明的追随者-追随者网络比Twitter的要密集得多,由紧密结合的语言社区组成。对具某的帖子进行N-gram分析,可以看到# koovtwitter的修辞,揭示了两个平台的比较争论。我们的特征突出了多语言社交网络的动态及其多样化的印度用户基础。
{"title":"What's kooking?: characterizing India's emerging social network, Koo","authors":"A. Singh, Chirag Jain, Jivitesh Jain, R. Jain, Shradha Sehgal, Tanisha Pandey, P. Kumaraguru","doi":"10.1145/3487351.3488354","DOIUrl":"https://doi.org/10.1145/3487351.3488354","url":null,"abstract":"Social media has grown exponentially in a short period, coming to the forefront of communications and online interactions. Despite their rapid growth, social media platforms have been unable to scale to different languages globally and remain inaccessible to many. In this paper, we characterize Koo, a multilingual micro-blogging site that rose in popularity in 2021, as an Indian alternative to Twitter. We collected a dataset of 4.07 million users, 163.12 million follower-following relationships, and their content and activity across 12 languages. We study the user demographic along the lines of language, location, gender, and profession. The prominent presence of Indian languages in the discourse on Koo indicates the platform's success in promoting regional languages. We observe Koo's follower-following network to be much denser than Twitter's, comprising of closely-knit linguistic communities. An N-gram analysis of posts on Koo shows a #KooVsTwitter rhetoric, revealing the debate comparing the two platforms. Our characterization highlights the dynamics of the multilingual social network and its diverse Indian user base.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115772270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Examining factors associated with Twitter account suspension following the 2020 U.S. presidential election 正在研究2020年美国总统大选后推特账户被关闭的相关因素
Farhan Asif Chowdhury, Dheeman Saha, Md Rashidul Hasan, Koustuv Saha, A. Mueen
Online social media enables mass-level, transparent, and democratized discussion on numerous socio-political issues. Due to such openness, these platforms often endure manipulation and misinformation - leading to negative impacts. To prevent such harmful activities, platform moderators employ countermeasures to safeguard against actors violating their rules. However, the correlation between publicly outlined policies and employed action is less clear to general people. In this work, we examine violations and subsequent moderations related to the 2020 U.S. President Election discussion on Twitter. We focus on quantifying plausible reasons for the suspension, drawing on Twitter's rules and policies by identifying suspended users (Case) and comparing their activities and properties with (yet) non-suspended (Control) users. Using a dataset of 240M election-related tweets made by 21M unique users, we observe that Suspended users violate Twitter's rules at a higher rate (statistically significant) than Control users across all the considered aspects - hate speech, offensiveness, spamming, and civic integrity. Moreover, through the lens of Twitter's suspension mechanism, we qualitatively examine the targeted topics for manipulation.
在线社交媒体使大众能够就许多社会政治问题进行透明和民主化的讨论。由于这种开放性,这些平台经常受到操纵和错误信息的影响,从而产生负面影响。为了防止此类有害活动,平台版主采取了对策,以防止参与者违反其规则。然而,一般人不太清楚公开概述的政策和雇佣行为之间的关系。在这项工作中,我们研究了与Twitter上2020年美国总统选举讨论相关的违规行为和随后的缓和。我们专注于量化暂停的合理原因,利用Twitter的规则和政策,识别暂停的用户(Case),并将他们的活动和属性与(尚未)暂停的用户(Control)进行比较。使用2100万独立用户发布的240M条与选举相关的推文数据集,我们观察到,在所有考虑的方面——仇恨言论、冒犯性、垃圾邮件和公民诚信方面,暂停用户违反Twitter规则的比率(统计上显著)高于控制用户。此外,通过Twitter的暂停机制,我们定性地检查了操纵的目标主题。
{"title":"Examining factors associated with Twitter account suspension following the 2020 U.S. presidential election","authors":"Farhan Asif Chowdhury, Dheeman Saha, Md Rashidul Hasan, Koustuv Saha, A. Mueen","doi":"10.1145/3487351.3492715","DOIUrl":"https://doi.org/10.1145/3487351.3492715","url":null,"abstract":"Online social media enables mass-level, transparent, and democratized discussion on numerous socio-political issues. Due to such openness, these platforms often endure manipulation and misinformation - leading to negative impacts. To prevent such harmful activities, platform moderators employ countermeasures to safeguard against actors violating their rules. However, the correlation between publicly outlined policies and employed action is less clear to general people. In this work, we examine violations and subsequent moderations related to the 2020 U.S. President Election discussion on Twitter. We focus on quantifying plausible reasons for the suspension, drawing on Twitter's rules and policies by identifying suspended users (Case) and comparing their activities and properties with (yet) non-suspended (Control) users. Using a dataset of 240M election-related tweets made by 21M unique users, we observe that Suspended users violate Twitter's rules at a higher rate (statistically significant) than Control users across all the considered aspects - hate speech, offensiveness, spamming, and civic integrity. Moreover, through the lens of Twitter's suspension mechanism, we qualitatively examine the targeted topics for manipulation.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131312570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
From #jobsearch to #mask: improving COVID-19 cascade prediction with spillover effects 从#jobsearch到#mask:改进具有溢出效应的COVID-19级联预测
Ninghan Chen, Zhiqiang Zhong, Jun Pang
An information outbreak occurs on social media along with the COVID-19 pandemic and leads to infodemic. Predicting the popularity of online content, known as cascade prediction, allows for not only catching in advance hot information that deserves attention, but also identifying false information that will widely spread and require quick response to mitigate its impact. Among the various information diffusion patterns leveraged in previous works, the spillover effect of the information exposed to users on their decision to participate in diffusing certain information is still not studied. In this paper, we focus on the diffusion of information related to COVID-19 preventive measures. Through our collected Twitter dataset, we validated the existence of this spillover effect. Building on the finding, we proposed extensions to three cascade prediction methods based on Graph Neural Networks (GNNs). Experiments conducted on our dataset demonstrated that the use of the identified spillover effect significantly improves the state-of-the-art GNNs methods in predicting the popularity of not only preventive measure messages, but also other COVID-19 related messages.
随着新冠肺炎大流行,社交媒体上的信息爆发,导致信息大流行。预测网络内容的受欢迎程度,被称为级联预测,不仅可以提前捕捉到值得关注的热点信息,还可以识别出广泛传播并需要快速反应以减轻其影响的虚假信息。在前人研究的各种信息扩散模式中,暴露给用户的信息对用户参与传播某一信息的决策的溢出效应尚未得到研究。在本文中,我们重点关注与COVID-19预防措施相关的信息传播。通过我们收集的Twitter数据集,我们验证了这种溢出效应的存在。基于这一发现,我们提出了三种基于图神经网络(gnn)的级联预测方法的扩展。在我们的数据集上进行的实验表明,使用已识别的溢出效应显著提高了最先进的gnn方法,不仅可以预测预防措施信息的受欢迎程度,还可以预测其他与COVID-19相关的信息。
{"title":"From #jobsearch to #mask: improving COVID-19 cascade prediction with spillover effects","authors":"Ninghan Chen, Zhiqiang Zhong, Jun Pang","doi":"10.1145/3487351.3488555","DOIUrl":"https://doi.org/10.1145/3487351.3488555","url":null,"abstract":"An information outbreak occurs on social media along with the COVID-19 pandemic and leads to infodemic. Predicting the popularity of online content, known as cascade prediction, allows for not only catching in advance hot information that deserves attention, but also identifying false information that will widely spread and require quick response to mitigate its impact. Among the various information diffusion patterns leveraged in previous works, the spillover effect of the information exposed to users on their decision to participate in diffusing certain information is still not studied. In this paper, we focus on the diffusion of information related to COVID-19 preventive measures. Through our collected Twitter dataset, we validated the existence of this spillover effect. Building on the finding, we proposed extensions to three cascade prediction methods based on Graph Neural Networks (GNNs). Experiments conducted on our dataset demonstrated that the use of the identified spillover effect significantly improves the state-of-the-art GNNs methods in predicting the popularity of not only preventive measure messages, but also other COVID-19 related messages.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129661949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Racism is a virus: anti-asian hate and counterspeech in social media during the COVID-19 crisis 种族主义是一种病毒:在2019冠状病毒病危机期间,社交媒体上出现了反亚洲的仇恨和反言论
Caleb Ziems, Bing He, Sandeep Soni, Srijan Kumar
The spread of COVID-19 has sparked racism and hate on social media targeted towards Asian communities. However, little is known about how racial hate spreads during a pandemic and the role of counterspeech in mitigating this spread. In this work, we study the evolution and spread of anti-Asian hate speech through the lens of Twitter. We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months, containing over 206 million tweets, and a social network with over 127 million nodes. By creating a novel hand-labeled dataset of 3,355 tweets, we train a text classifier to identify hateful and counterspeech tweets that achieves an average macro-F1 score of 0.832. Using this dataset, we conduct longitudinal analysis of tweets and users. Analysis of the social network reveals that hateful and counterspeech users interact and engage extensively with one another, instead of living in isolated polarized communities. We find that nodes were highly likely to become hateful after being exposed to hateful content in the year 2020. Notably, counterspeech messages discourage users from turning hateful, potentially suggesting a solution to curb hate on web and social media platforms. Data and code is available at http://claws.cc.gatech.edu/covid.
新冠肺炎疫情的蔓延在社交媒体上引发了针对亚洲社区的种族主义和仇恨。然而,人们对种族仇恨在大流行期间如何传播以及反言论在减轻这种传播方面的作用知之甚少。在这项工作中,我们通过Twitter的镜头研究了反亚洲仇恨言论的演变和传播。我们创建了COVID-HATE,这是历时14个月的最大的反亚洲仇恨和反言论数据集,包含超过2.06亿条推文,以及拥有超过1.27亿个节点的社交网络。通过创建一个包含3355条推文的全新手工标记数据集,我们训练了一个文本分类器来识别仇恨和反言论推文,这些推文的平均宏观f1得分为0.832。利用该数据集,我们对推文和用户进行纵向分析。对社交网络的分析表明,仇恨言论和反言论的用户彼此之间进行了广泛的互动和参与,而不是生活在孤立的两极分化社区中。我们发现,在2020年,节点在接触到仇恨内容后极有可能变得仇恨。值得注意的是,反言论信息会阻止用户变得充满仇恨,这可能为遏制网络和社交媒体平台上的仇恨提供了一个解决方案。数据和代码可在http://claws.cc.gatech.edu/covid上获得。
{"title":"Racism is a virus: anti-asian hate and counterspeech in social media during the COVID-19 crisis","authors":"Caleb Ziems, Bing He, Sandeep Soni, Srijan Kumar","doi":"10.1145/3487351.3488324","DOIUrl":"https://doi.org/10.1145/3487351.3488324","url":null,"abstract":"The spread of COVID-19 has sparked racism and hate on social media targeted towards Asian communities. However, little is known about how racial hate spreads during a pandemic and the role of counterspeech in mitigating this spread. In this work, we study the evolution and spread of anti-Asian hate speech through the lens of Twitter. We create COVID-HATE, the largest dataset of anti-Asian hate and counterspeech spanning 14 months, containing over 206 million tweets, and a social network with over 127 million nodes. By creating a novel hand-labeled dataset of 3,355 tweets, we train a text classifier to identify hateful and counterspeech tweets that achieves an average macro-F1 score of 0.832. Using this dataset, we conduct longitudinal analysis of tweets and users. Analysis of the social network reveals that hateful and counterspeech users interact and engage extensively with one another, instead of living in isolated polarized communities. We find that nodes were highly likely to become hateful after being exposed to hateful content in the year 2020. Notably, counterspeech messages discourage users from turning hateful, potentially suggesting a solution to curb hate on web and social media platforms. Data and code is available at http://claws.cc.gatech.edu/covid.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123510855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 135
Fairness and diversity in the recommendation and ranking of participatory media content 参与性媒体内容推荐和排名的公平性和多样性
Muskaan, Mehak Preet Dhaliwal, Aaditeshwar Seth
Online participatory media platforms that enable one-to-many communication among users, see a significant amount of user generated content and consequently face a problem of being able to recommend a subset of this content to its users. We address the problem of recommending and ranking this content such that different viewpoints about a topic get exposure in a fair and diverse manner. We build our model in the context of a voice-based participatory media platform running in rural central India, for low-income and less-literate communities, that plays audio messages in a ranked list to users over a phone call and allows them to contribute their own messages. In this paper, we describe our model and evaluate it using call-logs from the platform, to compare the fairness and diversity performance of our model with the manual editorial processes currently being followed. Our models are generic and can be adapted and applied to other participatory media platforms as well.
在线参与式媒体平台允许用户之间进行一对多的交流,看到大量用户生成的内容,因此面临着能够向用户推荐这些内容的子集的问题。我们解决了推荐和排名这些内容的问题,这样关于一个话题的不同观点就能以公平和多样化的方式曝光。我们的模型是在一个基于语音的参与式媒体平台的背景下建立的,该平台运行在印度中部农村地区,面向低收入和文化水平较低的社区,该平台通过电话向用户播放音频信息,并允许他们贡献自己的信息。在本文中,我们描述了我们的模型,并使用来自平台的调用日志对其进行评估,以比较我们模型的公平性和多样性性能与目前遵循的手动编辑过程。我们的模型是通用的,也可以适用于其他参与式媒体平台。
{"title":"Fairness and diversity in the recommendation and ranking of participatory media content","authors":"Muskaan, Mehak Preet Dhaliwal, Aaditeshwar Seth","doi":"10.1145/3487351.3488363","DOIUrl":"https://doi.org/10.1145/3487351.3488363","url":null,"abstract":"Online participatory media platforms that enable one-to-many communication among users, see a significant amount of user generated content and consequently face a problem of being able to recommend a subset of this content to its users. We address the problem of recommending and ranking this content such that different viewpoints about a topic get exposure in a fair and diverse manner. We build our model in the context of a voice-based participatory media platform running in rural central India, for low-income and less-literate communities, that plays audio messages in a ranked list to users over a phone call and allows them to contribute their own messages. In this paper, we describe our model and evaluate it using call-logs from the platform, to compare the fairness and diversity performance of our model with the manual editorial processes currently being followed. Our models are generic and can be adapted and applied to other participatory media platforms as well.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115321680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1