This article introduces a heterophily-based metric for assessing polarization in social networks in which multiple opposing ideological communities coexist. The proposed metric measures polarization at the node level, based on a node’s affinity for other communities. Node-level values can then be aggregated at the community, network, or any intermediate level, yielding a more comprehensive map of polarization. We evaluated our metric on the Polblogs network, the White Helmets Twitter interaction network with two communities, and the VoterFraud2020 domain network with five communities. Additionally, we evaluated it on several sets of synthetic graphs to confirm that it yields low polarization scores where expected. We built synthetic networks in three ways, via synthetic labeling, dK-series, and network models, in order to assess how the proposed measure behaves under various topologies and network features. We then compared our metric to two commonly used polarization metrics, Guerra’s boundary polarization and the random walk controversy score, and examined how it correlates with two network metrics: assortativity and modularity.
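The abstract does not give the metric’s formula; as a minimal illustrative sketch, under the assumption that a node’s polarization grows as its affinity for other communities shrinks, one could score each node by the share of its neighbors inside its own community and then average those scores per community (all names and conventions below are hypothetical, not the paper’s actual definition):

```python
from collections import defaultdict

def node_polarization(adj, community):
    """Toy heterophily-style score (NOT the paper's actual metric):
    1.0 when all of a node's neighbors share its community (insular,
    highly polarized), 0.0 when all neighbors belong to other ones."""
    scores = {}
    for node, neighbors in adj.items():
        if not neighbors:
            scores[node] = 0.0
            continue
        cross = sum(1 for n in neighbors if community[n] != community[node])
        scores[node] = 1.0 - cross / len(neighbors)
    return scores

def aggregate(scores, community):
    """Aggregate node-level scores to the community level by averaging."""
    buckets = defaultdict(list)
    for node, s in scores.items():
        buckets[community[node]].append(s)
    return {c: sum(v) / len(v) for c, v in buckets.items()}
```

The same `aggregate` step could equally be applied to the whole network or to any intermediate grouping, which is what makes the node-level formulation flexible.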
The main objective of our research is to gain a comprehensive understanding of the relationship between language usage within different communities and the ideological narratives that language delineates. We focus specifically on utilizing Natural Language Processing techniques to identify the underlying narratives in the coded or suggestive language employed by non-normative communities associated with targeted violence. Earlier studies detected ideological affiliation through surveys and user studies, with only a limited number relying on the content of text articles, which still requires label curation. Previous work addressed label curation by using ideological subreddits (r/Liberal and r/Conservative for the Liberal and Conservative classes) to label the articles shared on those subreddits according to their prescribed ideologies, albeit with a limited dataset.
Building upon previous work, we use subreddit ideologies to categorize shared articles. In addition to the conservative and liberal classes, we introduce a new category called “Restricted” which encompasses text articles shared in subreddits that are restricted, privatized, or banned, such as r/TheDonald. The “Restricted” class encompasses posts tied to violence, regardless of conservative or liberal affiliations. Additionally, we augment our dataset with text articles from self-identified subreddits like r/progressive and r/askaconservative for the liberal and conservative classes, respectively. This results in an expanded dataset of 377,144 text articles, consisting of 72,488 liberal, 79,573 conservative, and 225,083 restricted class articles. Our goal is to analyze language variances in different ideological communities, investigate keyword relevance in labeling article orientations, especially in unseen cases (922,522 text articles), and delve into radicalized communities, conducting thorough analysis and interpretation of the results.
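A labeling rule of the kind described above can be sketched as a simple lookup. The subreddit names come from the abstract; the function name, the class strings, and the `None` fallback for unseen subreddits are illustrative assumptions:

```python
# Subreddit-to-class lookup (sets of lowercase subreddit names).
LIBERAL = {"liberal", "progressive"}
CONSERVATIVE = {"conservative", "askaconservative"}
RESTRICTED = {"thedonald"}  # restricted, privatized, or banned subreddits

def label_article(subreddit):
    """Label an article by the ideology of the subreddit it was shared in.
    Restricted subreddits take precedence regardless of ideology."""
    s = subreddit.lower()
    if s.startswith("r/"):
        s = s[2:]
    if s in RESTRICTED:
        return "restricted"
    if s in LIBERAL:
        return "liberal"
    if s in CONSERVATIVE:
        return "conservative"
    return None  # unseen subreddit: no label
```

Checking the “Restricted” class first mirrors the abstract’s point that such posts are set apart from the conservative/liberal axis rather than merged into it.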
Ensuring the security of personal accounts has become a key concern due to the widespread use of password attack techniques. Although passwords are the primary defense against unauthorized access, the practice of reusing easy-to-remember passwords increases users’ security risks. Traditional methods for evaluating password strength are often insufficient because they overlook the public personal information that users frequently share on social networks. In addition, while users tend to limit access to their data on single profiles, personal data is often unintentionally shared across multiple profiles, exposing users to password threats. In this paper, we present an extension of a data reconstruction tool, namely soda advance, which incorporates a new module to evaluate password strength based on publicly available data across multiple social networks. The module relies on a new metric to provide a comprehensive evaluation of password strength. Moreover, we investigate the capabilities and risks associated with emerging Large Language Models (LLMs) in evaluating and generating passwords, respectively. Specifically, the proliferation of LLMs made it possible to interact with many of them through Automated Template Learning methodologies. Experimental evaluations, performed with 100 real users, demonstrate the effectiveness of LLMs in generating strong passwords with respect to the data associated with users’ profiles. Furthermore, LLMs also proved effective in evaluation tasks, but the combined usage of LLMs and soda advance yielded better classifications, with improvements of more than 10% in F1-score.
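The abstract does not specify soda advance’s metric; a hypothetical sketch of the underlying idea, starting from a conventional length/character-class baseline and penalizing overlap with tokens harvested from a user’s public profiles, might look like this (function name, weights, and thresholds are all assumptions):

```python
import re

def profile_penalized_strength(password, profile_tokens):
    """Toy profile-aware strength score in [0, 1] (illustrative only,
    not soda advance's actual metric). Baseline = length + character
    variety; penalty = matches against public-profile tokens."""
    classes = sum(bool(re.search(p, password))
                  for p in (r"[a-z]", r"[A-Z]", r"\d", r"[^A-Za-z0-9]"))
    base = min(len(password), 16) / 16 * 0.5 + classes / 4 * 0.5
    lowered = password.lower()
    # Each profile-derived token found inside the password costs 0.25.
    hits = sum(1 for t in profile_tokens
               if len(t) >= 3 and t.lower() in lowered)
    penalty = min(hits * 0.25, 0.8)
    return round(max(base - penalty, 0.0), 3)
```

The point of the sketch is the cross-profile aspect: a password that looks strong in isolation scores poorly once it matches a pet’s name or a birth year visible on another social profile.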
This study offers nuanced insights into the diverse dimensions that dictate the success of social media influencers. Analyzing more than 210,000 social media posts and utilizing the Heuristic-Systematic Model of Information Processing (HSM), it explores diverse factors, including individual appearance characteristics, depth of persuasive power, and various influencer types. The findings shed light on the distinct impacts of different influencer archetypes, such as celebrities and micro-celebrities, on user engagement and reveal the nuanced moderating effects of these archetypes on the relationships among personal attributes, persuasive potency, and influencer success. The proposed model suggests that influencers who leverage deeper, systematic processing strategies, marked by detailed information analysis and conveyance, are poised to experience higher user engagement than counterparts employing heuristic modalities characterized by mental shortcuts and superficial examination. This underscores the importance of balancing heuristic and systematic approaches for emerging influencers and brands aspiring to optimize user engagement and effectively shape consumer behavior. The paper thus offers a comprehensive exploration of the dynamic landscape of influencer marketing through the HSM lens, delivering insights and practical implications for scholars, marketers, and influencers aiming to navigate the intricate determinants of influence in the ever-evolving digital marketing domain.
Echo chambers naturally occur on social networks, where individuals join groups to share and discuss their own interests, driven by algorithms that steer their beliefs and behaviours based on their emotions, biases, and cognitive vulnerabilities. According to recent research on information manipulation and interference, echo chambers have become crucial weapons in the arsenal of Cognitive Warfare, amplifying the effect of psychological techniques aimed at altering information and narratives to influence public perception and shape opinions. Current research focuses on defining assessment methods for detecting emerging echo chambers and monitoring their evolution over time. In this sense, this work stresses the complementary roles of existing topology-based metrics and the semantics of the viewpoints held by groups and their member users. Indeed, this paper proposes a metric based on consensus Group Decision-Making (GDM) that acquires community members’ opinions through Aspect-Based Sentiment Analysis (ABSA) and applies consensus metrics to determine the agreement within a single community and between distinct communities. The potential of the proposed metric has been evaluated on two public datasets of tweets through comparisons with sentiment-aware opinion analysis and state-of-the-art metrics for polarization and echo chamber detection. The results reveal that topology-based metrics that depend strictly on random walks over individuals are not sufficient to fully depict communities’ closeness on topics and the prevailing beliefs that emerge from content analysis.
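The GDM consensus machinery is not detailed in the abstract; a much-simplified stand-in treats each member’s ABSA output as a scalar opinion in [-1, 1] and measures agreement as one minus the mean pairwise distance, both within a community and across two communities (names and normalization are assumptions for illustration):

```python
from itertools import combinations

def within_consensus(opinions):
    """Agreement inside one community: 1 minus the mean pairwise
    |difference| of opinion scores in [-1, 1] (max distance is 2)."""
    if len(opinions) < 2:
        return 1.0
    pairs = list(combinations(opinions, 2))
    return 1 - sum(abs(a - b) for a, b in pairs) / (2 * len(pairs))

def between_consensus(group_a, group_b):
    """Agreement across two communities: 1 minus the mean |difference|
    over all cross-community pairs."""
    total = sum(abs(a - b) for a in group_a for b in group_b)
    return 1 - total / (2 * len(group_a) * len(group_b))
```

In this toy picture, an echo-chamber signature would be high `within_consensus` in each community combined with low `between_consensus` across them, which is exactly the kind of semantic signal a pure random-walk topology metric cannot see.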
Users are often overwhelmed by the amount of information generated on online social networks and media (OSNEM), in particular Twitter, during major events. Summarizing these information streams would help them stay informed in a reasonable time. In parallel, the recent state of the art in summarization focuses on deep neural models and pre-trained language models.
In this context, we aim at (i) evaluating different pre-trained language models (PLMs) for representing microblogs (i.e., tweets), (ii) identifying the most suitable ones in a summarization context, and (iii) investigating how neural models can be used given the input-size limitation of such models. For this purpose, we divided the problem into three questions and conducted experiments on three different datasets. Using a simple greedy algorithm, we first compared several pre-trained models for single-tweet representation. We then evaluated the quality of the average representation of the stream and sought to use it as a starting point for a neural approach. Initial results show the benefit of using USE and Sentence-BERT representations for tweet stream summarization, as well as the great potential of using the average representation of the stream.
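The greedy algorithm itself is not spelled out in the abstract; one plausible reading, sketched below on plain lists standing in for USE or Sentence-BERT embeddings, ranks tweets by cosine similarity to the stream’s average embedding and skips near-duplicates of tweets already selected (the duplicate threshold and selection rule are assumptions):

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def greedy_summary(embeddings, k, dup_threshold=0.95):
    """Pick k tweet indices: closest to the stream centroid first,
    discarding candidates too similar to already-selected tweets."""
    dim = len(embeddings[0])
    centroid = [sum(e[i] for e in embeddings) / len(embeddings)
                for i in range(dim)]
    ranked = sorted(range(len(embeddings)),
                    key=lambda i: cosine(embeddings[i], centroid),
                    reverse=True)
    selected = []
    for i in ranked:
        if all(cosine(embeddings[i], embeddings[j]) < dup_threshold
               for j in selected):
            selected.append(i)
        if len(selected) == k:
            break
    return selected
```

Using the centroid as the target also shows why the average representation of the stream is a natural starting point for a neural approach: it is a fixed-size summary of an arbitrarily long stream, sidestepping the input-size limitation.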
News stories circulating online, especially on social media platforms, are nowadays a primary source of information. Given the nature of social media, news items are no longer just news: they are embedded in the conversations of the users interacting with them. This is particularly relevant for inaccurate information or even outright misinformation, because user interaction has a crucial impact on whether information is disseminated uncritically. Biased coverage has been shown to affect personal decision-making. Still, it remains an open question whether users are aware of the biased reporting they encounter and how they react to it. The latter is particularly relevant given that user reactions help contextualize reporting for other users and can thus mitigate, but may also exacerbate, the impact of biased media coverage.
This paper approaches the question from a measurement point of view, examining whether reactions to news articles on Twitter can serve as bias indicators, i.e., whether the way users comment on a given article relates to its actual level of bias. We first give an overview of research on media bias before discussing key concepts related to how individuals engage with online content, focusing on the sentiment (or valence) of comments and on outright hate speech. We then present the first dataset connecting reliable human-made media bias classifications of news articles with the reactions these articles received on Twitter. We call our dataset BAT - Bias And Twitter. BAT covers 2,800 (bias-rated) news articles from 255 English-speaking news outlets. Additionally, BAT includes 175,807 comments and retweets referring to these articles.
Based on BAT, we conduct a multi-feature analysis to identify comment characteristics and to analyze whether Twitter reactions correlate with an article’s bias. First, we fine-tune and apply two XLNet-based classifiers for hate speech detection and sentiment analysis. Second, we relate the results of the classifiers to the article bias annotations within a multi-level regression. The results show that Twitter reactions to an article indicate its bias, and vice versa. With a regression coefficient of 0.703, we present evidence that Twitter reactions to biased articles are significantly more hateful. Our analysis also shows that a news outlet’s individual stance reinforces the hate-bias relationship. In future work, we will extend the dataset and analysis to include additional concepts related to media bias.
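A multi-level regression needs grouped data (here, articles nested within outlets, which is how the outlet-stance moderation enters); as a deliberately simplified single-level illustration of the core relationship, the slope linking per-article hate scores to bias ratings can be recovered with ordinary least squares (this is a sketch of the statistical idea, not the paper’s model):

```python
def ols(x, y):
    """Ordinary least-squares fit y = slope * x + intercept.
    A single-level simplification: a multi-level model would add
    per-outlet random intercepts/slopes on top of this."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return slope, my - slope * mx
```

A positive slope here corresponds to the reported finding: the more biased an article, the more hateful the Twitter reactions it attracts.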