Social Science Computer Review最新文献_第4页

Using OpenStreetMap, Census, and Survey Data to Predict Interethnic Group Relations in Belgium: A Machine Learning Approach 利用 OpenStreetMap、人口普查和调查数据预测比利时的族群间关系：机器学习方法

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-08 DOI: 10.1177/08944393241269098

Daria Dementeva, Cecil Meeusen, Bart Meuleman

Neighborhoods are important contexts in shaping interethnic group relationships and sites in which these may materialize through everyday routines in shared local spaces. In this paper, we approach neighborhoods as a small-scale set of spaces of encounter, defined as local public or semi-public spaces, where residents of different ethnic backgrounds may meet. Relying on the classical contact and group threat theories, the main assumption is that local spaces of encounter are facets of an intergroup neighborhood context and may shape intergroup relations, defined as perceived ethnic threat and intergroup friendship. Drawing on the georeferenced survey data from the Belgian National Election Study 2020 enriched with spatial features from OpenStreetMap, an innovative big geospatial data source, and census-based neighborhood characteristics, the study employs machine learning algorithms to investigate whether, which, and how neighborhood spaces of encounter can predict perceived ethnic threat and intergroup friendship, while also taking into account traditional local ethnic, socioeconomic, and individual indicators. By using OpenStreetMap to measure spaces of encounter as a novel neighborhood indicator, we develop a fine-grained typology of local spaces that is rooted in urban and intergroup relations research. The results show that for predicting intergroup friendship, the important spaces were educational, functional, public open, and user-selecting spaces, while for predicting threat functional, third, retail, and other spaces stood out prediction-wise. The results also revealed the predictive importance of individual characteristics for intergroup relations, while neighborhood characteristics were not so important, both in absolute and relative terms. We conclude by reflecting on what local spaces might matter and discuss the combination of OpenStreetMap and intergroup relations as a proof of concept and prospects for future research.

邻里关系是形成族群间关系的重要背景，也是这些关系通过在当地共享空间中的日常例行活动得以具体化的场所。在本文中，我们将邻里作为一个小规模的相遇空间，定义为当地的公共或半公共空间，不同种族背景的居民可能在此相遇。根据经典的接触理论和群体威胁理论，我们的主要假设是，当地的相遇空间是群体间邻里关系的一个方面，并可能塑造群体间关系，即感知到的种族威胁和群体间友谊。本研究利用比利时 2020 年全国选举研究的地理参照调查数据，并结合创新的大地理空间数据来源 OpenStreetMap 的空间特征和基于人口普查的邻里特征，采用机器学习算法研究邻里相遇空间是否、哪些以及如何预测感知到的种族威胁和群体间友谊，同时也考虑到传统的地方种族、社会经济和个人指标。通过使用 OpenStreetMap 来测量作为一种新型邻里指标的相遇空间，我们开发出了一种植根于城市和群体间关系研究的地方空间精细类型学。结果显示，在预测群体间友谊方面，重要的空间是教育空间、功能空间、公共开放空间和用户选择空间，而在预测威胁方面，功能空间、第三空间、零售空间和其他空间在预测方面表现突出。研究结果还揭示了个人特征对群体间关系的重要预测作用，而邻里特征无论从绝对值还是相对值来看都不太重要。最后，我们对哪些本地空间可能重要进行了反思，并讨论了将 OpenStreetMap 与群体间关系相结合的概念验证和未来研究前景。

{"title":"Using OpenStreetMap, Census, and Survey Data to Predict Interethnic Group Relations in Belgium: A Machine Learning Approach","authors":"Daria Dementeva, Cecil Meeusen, Bart Meuleman","doi":"10.1177/08944393241269098","DOIUrl":"https://doi.org/10.1177/08944393241269098","url":null,"abstract":"Neighborhoods are important contexts in shaping interethnic group relationships and sites in which these may materialize through everyday routines in shared local spaces. In this paper, we approach neighborhoods as a small-scale set of spaces of encounter, defined as local public or semi-public spaces, where residents of different ethnic backgrounds may meet. Relying on the classical contact and group threat theories, the main assumption is that local spaces of encounter are facets of an intergroup neighborhood context and may shape intergroup relations, defined as perceived ethnic threat and intergroup friendship. Drawing on the georeferenced survey data from the Belgian National Election Study 2020 enriched with spatial features from OpenStreetMap, an innovative big geospatial data source, and census-based neighborhood characteristics, the study employs machine learning algorithms to investigate whether, which, and how neighborhood spaces of encounter can predict perceived ethnic threat and intergroup friendship, while also taking into account traditional local ethnic, socioeconomic, and individual indicators. By using OpenStreetMap to measure spaces of encounter as a novel neighborhood indicator, we develop a fine-grained typology of local spaces that is rooted in urban and intergroup relations research. The results show that for predicting intergroup friendship, the important spaces were educational, functional, public open, and user-selecting spaces, while for predicting threat functional, third, retail, and other spaces stood out prediction-wise. The results also revealed the predictive importance of individual characteristics for intergroup relations, while neighborhood characteristics were not so important, both in absolute and relative terms. We conclude by reflecting on what local spaces might matter and discuss the combination of OpenStreetMap and intergroup relations as a proof of concept and prospects for future research.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"26 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141908960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Airbnb on TikTok: Brand Perception Through User Engagement and Sentiment Trends TikTok 上的 Airbnb：通过用户参与和情绪趋势感知品牌

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-08 DOI: 10.1177/08944393241260242

Julia Marti-Ochoa, Eva Martin-Fuentes, Berta Ferrer-Rosell

This study delves into Airbnb’s brand presence on TikTok by analyzing textual content in posts, and human audio in videos. This approach aims to decipher the brand narrative and gauge user engagement. In the dynamic realm of social media marketing, TikTok has emerged as a key platform in shaping brand perception. This research specifically concentrates on Airbnb’s content, distinguishing between official narratives and user-generated content (UGC). Notably, themes of “Travel” dominate official posts, contrasting with “Real Estate” and “Business” in UGC. The methodology employed involves advanced data collection techniques, including web scraping for textual data and artificial intelligence for transcribing human audio to text. The findings reveal that UGC commands greater engagement and volume compared to Airbnb’s own brand content, underscoring the increasing significance of user involvement in brand storytelling. An analysis of the study results is conducted using linguistic natural processing (LNP) for the sentiment base, and the vector space model for emotion analysis. Sentiment analysis reveals a predominance of the emotion “happiness” and a significant presence of “surprise” in the posts, both of which are critical for audience engagement. Moreover, the study indicates a high approval rate for Airbnb-related content, reflecting a positive reception of the brand. Additionally, the research observes that influencers, particularly nano influencers, have higher engagement rates, indicating that their authenticity and relatability appeal especially to Generation Z audiences. This study not only sheds light on the intricate relationship between brand narrative, user engagement, and sentiment on TikTok but also offers valuable insights into effective brand image construction and propagation in the digital era, highlighting the importance of diverse emotions in enhancing audience engagement.

本研究通过分析帖子中的文字内容和视频中的人声，深入研究 Airbnb 在 TikTok 上的品牌形象。这种方法旨在解读品牌叙事并衡量用户参与度。在充满活力的社交媒体营销领域，TikTok 已成为塑造品牌认知的重要平台。本研究特别关注 Airbnb 的内容，区分官方叙事和用户生成内容（UGC）。值得注意的是，"旅游 "主题在官方帖子中占主导地位，与 UGC 中的 "房地产 "和 "商业 "形成鲜明对比。所采用的方法涉及先进的数据收集技术，包括文本数据的网络搜刮和将人类音频转录为文本的人工智能。研究结果表明，与 Airbnb 自己的品牌内容相比，UGC 的参与度更高，数量更大，这表明用户参与品牌故事讲述的意义日益重大。研究使用语言自然处理（LNP）进行情感基础分析，并使用向量空间模型进行情感分析。情感分析表明，"快乐 "和 "惊喜 "这两种情感在帖子中占主导地位，而这两种情感对于受众的参与度至关重要。此外，研究还表明，Airbnb 相关内容的支持率很高，这反映了受众对该品牌的积极认可。此外，研究还发现，有影响力者，尤其是 "纳米级 "有影响力者的参与率更高，这表明他们的真实性和亲和力尤其吸引 Z 世代受众。这项研究不仅揭示了 TikTok 上品牌叙事、用户参与和情感之间错综复杂的关系，还为数字时代有效的品牌形象建设和传播提供了宝贵的见解，强调了多样化情感在提高受众参与度方面的重要性。

{"title":"Airbnb on TikTok: Brand Perception Through User Engagement and Sentiment Trends","authors":"Julia Marti-Ochoa, Eva Martin-Fuentes, Berta Ferrer-Rosell","doi":"10.1177/08944393241260242","DOIUrl":"https://doi.org/10.1177/08944393241260242","url":null,"abstract":"This study delves into Airbnb’s brand presence on TikTok by analyzing textual content in posts, and human audio in videos. This approach aims to decipher the brand narrative and gauge user engagement. In the dynamic realm of social media marketing, TikTok has emerged as a key platform in shaping brand perception. This research specifically concentrates on Airbnb’s content, distinguishing between official narratives and user-generated content (UGC). Notably, themes of “Travel” dominate official posts, contrasting with “Real Estate” and “Business” in UGC. The methodology employed involves advanced data collection techniques, including web scraping for textual data and artificial intelligence for transcribing human audio to text. The findings reveal that UGC commands greater engagement and volume compared to Airbnb’s own brand content, underscoring the increasing significance of user involvement in brand storytelling. An analysis of the study results is conducted using linguistic natural processing (LNP) for the sentiment base, and the vector space model for emotion analysis. Sentiment analysis reveals a predominance of the emotion “happiness” and a significant presence of “surprise” in the posts, both of which are critical for audience engagement. Moreover, the study indicates a high approval rate for Airbnb-related content, reflecting a positive reception of the brand. Additionally, the research observes that influencers, particularly nano influencers, have higher engagement rates, indicating that their authenticity and relatability appeal especially to Generation Z audiences. This study not only sheds light on the intricate relationship between brand narrative, user engagement, and sentiment on TikTok but also offers valuable insights into effective brand image construction and propagation in the digital era, highlighting the importance of diverse emotions in enhancing audience engagement.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"62 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141908959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gender Gap in All Academic Fields Over Time 所有学术领域的性别差距随时间变化

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-08 DOI: 10.1177/08944393241270633

Dariusz Jemielniak, Maciej Wilamowski

Academic publishing gender gap has been surprisingly under covered across all disciplines and over a longer timeframe. Our study fills this gap, by analyzing how the proportions of women authors change in academic publications over 20 years in all fields from 31,219 journals from 2001 to 2021. Our results indicate that the ratio of female to male authors keeps increasing steadily across disciplines. The increases are field-neutral—in other words, they are not bigger, for example, in science, technology, engineering, and mathematics, in spite of multiple initiatives focusing specifically on STEM. The increases are also decelerating in time, which could suggest that the equilibrium of female to male authors may be plateauing. Finally, although the within-field gender gap is decreasing, it actually widened between fields. Thus, our results have major consequences for science policy in the area of the gender gap.

令人惊讶的是，学术出版性别差距在所有学科和更长的时间框架内都没有得到充分关注。我们的研究填补了这一空白，从 2001 年至 2021 年的 31219 种期刊中，分析了 20 年间女性作者在各领域学术出版物中所占比例的变化情况。我们的研究结果表明，各学科中女性作者与男性作者的比例一直在稳步上升。这种增长是中性的--换句话说，尽管有多项专门针对科学、技术、工程和数学的计划，但这些增长在科学、技术、工程和数学等领域并不明显。这种增长在时间上也在减速，这可能表明女性与男性作者之间的平衡可能正在趋于稳定。最后，虽然领域内的性别差距在缩小，但领域间的差距实际上在扩大。因此，我们的研究结果对性别差距领域的科学政策具有重大影响。

引用次数: 0

Sexism and Media Communication. An Application to the Italian Case 性别歧视与媒体传播。意大利案例的应用

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-06 DOI: 10.1177/08944393241269415

Elia A. G. Arfini, Luigi Curini, Fabiana G. Giannuzzi

Acknowledging the importance of focusing on media’s communication for studying linguistic sexism, we propose a new method to analyze a corpus of texts via a machine learning approach built around an original training-set. We seek to establish a framework of the current use of talking about women in newspapers that expands beyond merely the objective forms of discrimination by also measuring the degree to which it implicitly conveys sexist messages through combination of words, expressions, and lexical aspects of language. As an illustrative example, we then apply such an approach to around 15,000 Italian newspapers’ headlines to investigate the impact of newspapers’ political orientations on the linguistic choices made by journalists in writing articles’ headlines.

鉴于关注媒体传播对研究语言性别歧视的重要性，我们提出了一种新方法，通过围绕原始训练集建立的机器学习方法来分析文本语料库。我们试图建立一个关于当前报纸中谈论女性的使用框架，该框架不仅仅局限于客观形式的歧视，还可以通过词语、表达方式和语言词汇方面的组合来衡量其隐含传达性别歧视信息的程度。作为一个示例，我们随后将这种方法应用于约 15,000 份意大利报纸的标题，以研究报纸的政治取向对记者在撰写文章标题时所作语言选择的影响。

引用次数: 0

Journalists’ Ethical Responsibility: Tackling Hate Speech Against Women Politicians in Social Media Through Natural Language Processing Techniques 记者的道德责任：通过自然语言处理技术应对社交媒体中针对女政治家的仇恨言论

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-05 DOI: 10.1177/08944393241269417

Maria Iranzo-Cabrera, Maria Jose Castro-Bleda, Iris Simón-Astudillo, Lluís-F. Hurtado

Social media has led to a redefinition of the journalist’s role. Specifically on Twitter, these professionals assume an influential position and their discourse is dominated by personal opinions. Taking into consideration that this platform has proven to be a breeding ground for polarization, digital harassment and hate speech, notably against women politicians, this research aims to analyze journalists’ involvement in this complex scenario. The investigation aims to determine whether, immersed in online and gender defamation campaigns, journalists enhance the quality of public debate or, on the contrary, they reinforce the visibility of this hostile content. To this end, we examined a sample of 63,926 tweets published from 23 to 25 November 2022 related to a campaign of political violence against the Spanish Minister of Equality using Natural Language Processing tools and qualitative content analysis. Results show that during those three days, at least half of the tweets contained hate speech and improper language. In this climate of hostility, journalists participating in the debate not only have an ability to attract likes and retweets but also exhibit polarization and use hate speech. Each ideological position—for and against the Minister—is also reflected in their own uncivil strategies. Under the umbrella of free speech and regardless of argumentative discourses, those journalists who lean towards ideological progressivism tend to insult their opponents, and those on the political right use divisive constructions, stereotyping and irony as attack techniques.

社交媒体重新定义了记者的角色。特别是在 Twitter 上，这些专业人士占据了有影响力的位置，他们的言论以个人观点为主。考虑到这一平台已被证明是滋生两极分化、数字骚扰和仇恨言论的温床，尤其是针对女性政治家的言论，本研究旨在分析记者在这一复杂局面中的参与情况。调查旨在确定，在网络和性别诽谤运动中，记者是提高了公共辩论的质量，还是相反地加强了这些敌对内容的可见度。为此，我们使用自然语言处理工具和定性内容分析，对 2022 年 11 月 23 日至 25 日发布的 63926 条推文进行了抽样调查，这些推文与针对西班牙平等部部长的政治暴力运动有关。结果显示，在这三天中，至少有一半的推文包含仇恨言论和不当语言。在这种充满敌意的氛围中，参与辩论的记者不仅有能力吸引点赞和转发，还表现出两极分化并使用仇恨言论。支持和反对部长的意识形态立场也体现在各自的不文明策略中。在言论自由的保护伞下，无论论证话语如何，那些倾向于意识形态进步主义的记者倾向于侮辱他们的对手，而那些政治右派则使用分裂性建构、刻板印象和讽刺作为攻击技巧。

{"title":"Journalists’ Ethical Responsibility: Tackling Hate Speech Against Women Politicians in Social Media Through Natural Language Processing Techniques","authors":"Maria Iranzo-Cabrera, Maria Jose Castro-Bleda, Iris Simón-Astudillo, Lluís-F. Hurtado","doi":"10.1177/08944393241269417","DOIUrl":"https://doi.org/10.1177/08944393241269417","url":null,"abstract":"Social media has led to a redefinition of the journalist’s role. Specifically on Twitter, these professionals assume an influential position and their discourse is dominated by personal opinions. Taking into consideration that this platform has proven to be a breeding ground for polarization, digital harassment and hate speech, notably against women politicians, this research aims to analyze journalists’ involvement in this complex scenario. The investigation aims to determine whether, immersed in online and gender defamation campaigns, journalists enhance the quality of public debate or, on the contrary, they reinforce the visibility of this hostile content. To this end, we examined a sample of 63,926 tweets published from 23 to 25 November 2022 related to a campaign of political violence against the Spanish Minister of Equality using Natural Language Processing tools and qualitative content analysis. Results show that during those three days, at least half of the tweets contained hate speech and improper language. In this climate of hostility, journalists participating in the debate not only have an ability to attract likes and retweets but also exhibit polarization and use hate speech. Each ideological position—for and against the Minister—is also reflected in their own uncivil strategies. Under the umbrella of free speech and regardless of argumentative discourses, those journalists who lean towards ideological progressivism tend to insult their opponents, and those on the political right use divisive constructions, stereotyping and irony as attack techniques.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"55 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141895575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Forty Thousand Fake Twitter Profiles: A Computational Framework for the Visual Analysis of Social Media Propaganda 四万个虚假 Twitter 简介：社交媒体宣传可视化分析的计算框架

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-08-02 DOI: 10.1177/08944393241269394

Noel George, Azhar Sham, Thanvi Ajith, Marco Bastos

Successful disinformation campaigns depend on the availability of fake social media profiles used for coordinated inauthentic behavior with networks of false accounts including bots, trolls, and sockpuppets. This study presents a scalable and unsupervised framework to identify visual elements in user profiles strategically exploited in nearly 60 influence operations, including camera angle, photo composition, gender, and race, but also more context-dependent categories like sensuality and emotion. We leverage Google’s Teachable Machine and the DeepFace Library to classify fake user accounts in the Twitter Moderation Research Consortium database, a large repository of social media accounts linked to foreign influence operations. We discuss the performance of these classifiers against manually coded data and their applicability in large-scale data analysis. The proposed framework demonstrates promising results for the identification of fake online profiles used in influence operations and by the cottage industry specialized in crafting desirable online personas.

成功的造谣活动依赖于虚假社交媒体资料的可用性，这些资料被用于与虚假账户网络（包括机器人、巨魔和 sockpuppets）协调不真实行为。本研究提出了一个可扩展的无监督框架，用于识别用户资料中的视觉元素，这些元素在近 60 次影响行动中被策略性地利用，包括拍摄角度、照片构图、性别和种族，以及感性和情感等更多依赖于上下文的类别。我们利用谷歌的 "可教机器"（Teachable Machine）和 DeepFace 库，对 Twitter 节制研究联盟数据库（Twitter Moderation Research Consortium）中的虚假用户账户进行分类，该数据库是一个与外国影响力行动相关的大型社交媒体账户库。我们讨论了这些分类器在人工编码数据方面的性能及其在大规模数据分析中的适用性。所提出的框架在识别影响行动中使用的虚假在线配置文件以及专门制作理想在线角色的山寨产业方面取得了可喜的成果。

引用次数: 0

Combining Natural Language Processing and Statistical Methods to Assess Gender Gaps in the Mediated Personalization of Politics 结合自然语言处理和统计方法，评估政治个性化中介中的性别差距

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-07-31 DOI: 10.1177/08944393241269097

Emanuele Brugnoli, Rosaria Simone, Marco Delmastro

The media attention to the personal sphere of famous and important individuals has become a key element of the gender narrative. In this setting, we aim at assessing gender gaps in the mediated personalization of a wide range of political office holders in Italy during the period 2017–2020 by means of a combination of NLP and statistical methods. The proposed analysis hinges on the definition of a new score for each word in the corpus that adjusts the incidence rate for the under representation of women in politics. On this basis, evidence is found that political personalization in Italy is more detrimental for women than it is for men, with the persistence of entrenched stereotypes including a masculine connotation of leadership, the resulting women’s unsuitability to hold political functions, and a greater deal of focus on their attractiveness and body parts. In addition, women politicians are covered with a more negative tone than their men counterpart when personal details are reported. By distinguishing between different types of media, we also show that the observed gender differences are primarily found in online news rather than print news. This suggests that the expression of certain stereotypes may be favored when click baiting and personal targeting have a major impact.

媒体对名人和重要人物个人领域的关注已成为性别叙事的一个关键要素。在这一背景下，我们旨在通过结合 NLP 和统计方法，评估 2017-2020 年间意大利各类政治职位担任者的媒介个性化中的性别差距。所提议的分析依赖于为语料库中的每个单词定义一个新的分数，该分数会根据女性在政治领域代表性不足的情况调整发生率。在此基础上，我们发现有证据表明，意大利的政治人格化对女性的不利影响比对男性更大，因为根深蒂固的定型观念持续存在，包括领导力的男性内涵，由此导致女性不适合担任政治职务，以及更多关注女性的吸引力和身体部位。此外，在报道女性政治家的个人细节时，其语气也比男性政治家更为负面。通过区分不同类型的媒体，我们还发现观察到的性别差异主要出现在网络新闻而非印刷新闻中。这表明，当点击诱饵和个人目标产生重大影响时，某些刻板印象的表达可能会受到青睐。

{"title":"Combining Natural Language Processing and Statistical Methods to Assess Gender Gaps in the Mediated Personalization of Politics","authors":"Emanuele Brugnoli, Rosaria Simone, Marco Delmastro","doi":"10.1177/08944393241269097","DOIUrl":"https://doi.org/10.1177/08944393241269097","url":null,"abstract":"The media attention to the personal sphere of famous and important individuals has become a key element of the gender narrative. In this setting, we aim at assessing gender gaps in the mediated personalization of a wide range of political office holders in Italy during the period 2017–2020 by means of a combination of NLP and statistical methods. The proposed analysis hinges on the definition of a new score for each word in the corpus that adjusts the incidence rate for the under representation of women in politics. On this basis, evidence is found that political personalization in Italy is more detrimental for women than it is for men, with the persistence of entrenched stereotypes including a masculine connotation of leadership, the resulting women’s unsuitability to hold political functions, and a greater deal of focus on their attractiveness and body parts. In addition, women politicians are covered with a more negative tone than their men counterpart when personal details are reported. By distinguishing between different types of media, we also show that the observed gender differences are primarily found in online news rather than print news. This suggests that the expression of certain stereotypes may be favored when click baiting and personal targeting have a major impact.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"178 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141877344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How Algorithms Promote Self-Radicalization: Audit of TikTok’s Algorithm Using a Reverse Engineering Method 算法如何促进自我激进化？使用逆向工程方法审计 TikTok 算法

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-07-30 DOI: 10.1177/08944393231225547

Donghee Shin, Kulsawasd Jitkajornwanich

Algorithmic radicalization is the idea that algorithms used by social media platforms push people down digital “rabbit holes” by framing personal online activity. Algorithms control what people see and when they see it and learn from their past activities. As such, people gradually and subconsciously adopt the ideas presented to them by the rabbit hole down which they have been pushed. In this study, TikTok’s role in fostering radicalized ideology is examined to offer a critical analysis of the state of radicalism and extremism on platforms. This study conducted an algorithm audit of the role of radicalizing information in social media by examining how TikTok’s algorithms are being used to radicalize, polarize, and spread extremism and societal instability. The results revealed that the pathways through which users access far-right content are manifold and that a large portion of the content can be ascribed to platform recommendations through radicalization pipelines. Algorithms are not simple tools that offer personalized services but rather contributors to radicalism, societal violence, and polarization. Such personalization processes have been instrumental in how artificial intelligence (AI) has been deployed, designed, and used to the detrimental outcomes that it has generated. Thus, the generation and adoption of extreme content on TikTok are, by and large, not only a reflection of user inputs and interactions with the platform but also the platform’s ability to slot users into specific categories and reinforce their ideas.

算法激进化是指社交媒体平台使用的算法通过框定个人在线活动，将人们推向数字 "兔子洞"。算法控制着人们看到什么、何时看到，并从他们过去的活动中学习。因此，人们会逐渐地、下意识地接受被推下兔子洞的人向他们展示的想法。在本研究中，我们研究了 TikTok 在培养激进意识形态方面的作用，对平台上的激进主义和极端主义状况进行了批判性分析。本研究通过研究 TikTok 的算法如何被用于激进化、分化和传播极端主义和社会不稳定，对激进化信息在社交媒体中的作用进行了算法审计。研究结果表明，用户获取极右内容的途径是多方面的，其中很大一部分内容可以通过激进化管道归因于平台推荐。算法不是提供个性化服务的简单工具，而是激进主义、社会暴力和两极分化的助推器。在人工智能（AI）的部署、设计和使用过程中，这种个性化过程发挥了重要作用，并产生了有害的结果。因此，TikTok 上极端内容的产生和采用在很大程度上不仅反映了用户对平台的投入和互动，也反映了平台将用户归入特定类别并强化其观念的能力。

{"title":"How Algorithms Promote Self-Radicalization: Audit of TikTok’s Algorithm Using a Reverse Engineering Method","authors":"Donghee Shin, Kulsawasd Jitkajornwanich","doi":"10.1177/08944393231225547","DOIUrl":"https://doi.org/10.1177/08944393231225547","url":null,"abstract":"Algorithmic radicalization is the idea that algorithms used by social media platforms push people down digital “rabbit holes” by framing personal online activity. Algorithms control what people see and when they see it and learn from their past activities. As such, people gradually and subconsciously adopt the ideas presented to them by the rabbit hole down which they have been pushed. In this study, TikTok’s role in fostering radicalized ideology is examined to offer a critical analysis of the state of radicalism and extremism on platforms. This study conducted an algorithm audit of the role of radicalizing information in social media by examining how TikTok’s algorithms are being used to radicalize, polarize, and spread extremism and societal instability. The results revealed that the pathways through which users access far-right content are manifold and that a large portion of the content can be ascribed to platform recommendations through radicalization pipelines. Algorithms are not simple tools that offer personalized services but rather contributors to radicalism, societal violence, and polarization. Such personalization processes have been instrumental in how artificial intelligence (AI) has been deployed, designed, and used to the detrimental outcomes that it has generated. Thus, the generation and adoption of extreme content on TikTok are, by and large, not only a reflection of user inputs and interactions with the platform but also the platform’s ability to slot users into specific categories and reinforce their ideas.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"26 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tracking Census Online Self-Completion Using Twitter Posts 利用 Twitter 帖子跟踪人口普查在线自我填写情况

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-07-30 DOI: 10.1177/08944393241268461

Mao Li, Frederick Conrad

From the start of data collection for the 2020 US Census, official and celebrity users tweeted about the importance of everyone being counted in the Census and urged followers to complete the questionnaire (so-called social media campaign.) At the same time, social media posts expressing skepticism about the Census became increasingly common. This study distinguishes between different prototypical Twitter user groups and investigates their possible impact on (online) self-completion rate for the 2020 Census, according to Census Bureau data. Using a network analysis method, Community Detection, and a clustering algorithm, Latent Dirichlet Allocation (LDA), three prototypical user groups were identified: “Official Government Agency,” “Census Advocate,” and “Census Skeptic.” The prototypical Census Skeptic user was motivated by events about which an influential person had tweeted (e.g., “Republicans in Congress signal Census cannot take extra time to count”). This group became the largest one over the study period. The prototypical Census Advocate was motivated more by official tweets and was more active than the prototypical Census Skeptic. The Official Government Agency user group was the smallest of the three, but their messages—primarily promoting completion of the Census—seemed to have been amplified by Census Advocate, especially celebrities and politicians. We found that the daily size of the Census Advocate user group—but not the other two—predicted the 2020 Census online self-completion rate within five days after a tweet was posted. This finding suggests that the Census social media campaign was successful in promoting completion, apparently due to the help of Census Advocate users who encouraged people to fill out the Census and amplified official tweets. This finding demonstrates that a social media campaign can positively affect public behavior regarding an essential national project like the Decennial Census.

从 2020 年美国人口普查的数据收集开始，官方用户和名人用户就在推特上大肆宣扬人口普查中每个人都被计算在内的重要性，并敦促关注者填写调查问卷（即所谓的社交媒体活动）。根据人口普查局的数据，本研究区分了不同的推特用户群体原型，并调查了他们对 2020 年人口普查（在线）自我填写率可能产生的影响。使用网络分析方法 "社区检测 "和聚类算法 "潜在德里希特分配"（LDA），确定了三个原型用户群："官方政府机构"、"人口普查倡导者 "和 "人口普查怀疑者"。人口普查怀疑论者 "原型用户的动机是某个有影响力的人在推特上发布的事件（例如，"国会中的共和党人表示人口普查不能花额外的时间来统计"）。在研究期间，这一群体成为最大的群体。原型人口普查拥护者更多受到官方推文的激励，比原型人口普查怀疑者更活跃。官方政府机构用户群是三个用户群中最小的，但他们的信息--主要是促进人口普查的完成--似乎被人口普查倡导者，尤其是名人和政客所放大。我们发现，"人口普查倡导者 "用户群（而非其他两个用户群）的每日规模可以预测推文发布后五天内的 2020 年人口普查在线自我完成率。这一发现表明，人口普查社交媒体活动在促进填写方面取得了成功，这显然要归功于人口普查倡导者用户的帮助，他们鼓励人们填写人口普查并放大了官方推文。这一发现表明，对于像十年一次的人口普查这样重要的国家项目，社交媒体活动可以对公众行为产生积极影响。

{"title":"Tracking Census Online Self-Completion Using Twitter Posts","authors":"Mao Li, Frederick Conrad","doi":"10.1177/08944393241268461","DOIUrl":"https://doi.org/10.1177/08944393241268461","url":null,"abstract":"From the start of data collection for the 2020 US Census, official and celebrity users tweeted about the importance of everyone being counted in the Census and urged followers to complete the questionnaire (so-called social media campaign.) At the same time, social media posts expressing skepticism about the Census became increasingly common. This study distinguishes between different prototypical Twitter user groups and investigates their possible impact on (online) self-completion rate for the 2020 Census, according to Census Bureau data. Using a network analysis method, Community Detection, and a clustering algorithm, Latent Dirichlet Allocation (LDA), three prototypical user groups were identified: “Official Government Agency,” “Census Advocate,” and “Census Skeptic.” The prototypical Census Skeptic user was motivated by events about which an influential person had tweeted (e.g., “Republicans in Congress signal Census cannot take extra time to count”). This group became the largest one over the study period. The prototypical Census Advocate was motivated more by official tweets and was more active than the prototypical Census Skeptic. The Official Government Agency user group was the smallest of the three, but their messages—primarily promoting completion of the Census—seemed to have been amplified by Census Advocate, especially celebrities and politicians. We found that the daily size of the Census Advocate user group—but not the other two—predicted the 2020 Census online self-completion rate within five days after a tweet was posted. This finding suggests that the Census social media campaign was successful in promoting completion, apparently due to the help of Census Advocate users who encouraged people to fill out the Census and amplified official tweets. This finding demonstrates that a social media campaign can positively affect public behavior regarding an essential national project like the Decennial Census.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"81 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study 利用跨语境训练进行宣言分类的转换器模型：厄瓜多尔案例研究

IF 4.1 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review

Pub Date : 2024-07-24 DOI: 10.1177/08944393241266220

Fernanda Barzallo, Maria Baldeon-Calisto, Margorie Pérez, Maria Emilia Moscoso, Danny Navarrete, Daniel Riofrío, Pablo Medina-Peréz, Susana K Lai-Yuen, Diego Benítez, Noel Peréz, Ricardo Flores Moyano, Mateo Fierro

Content analysis of political manifestos is necessary to understand the policies and proposed actions of a party. However, manually labeling political texts is time-consuming and labor-intensive. Transformer networks have become essential tools for automating this task. Nevertheless, these models require extensive datasets to achieve good performance. This can be a limitation in manifesto classification, where the availability of publicly labeled datasets can be scarce. To address this challenge, in this work, we developed a Transformer network for the classification of manifestos using a cross-domain training strategy. Using the database of the Comparative Manifesto Project, we implemented a fractional factorial experimental design to determine which Spanish-written manifestos form the best training set for Ecuadorian manifesto labeling. Furthermore, we statistically analyzed which Transformer architecture and preprocessing operations improve the model accuracy. The results indicate that creating a training set with manifestos from Spain and Uruguay, along with implementing stemming and lemmatization preprocessing operations, produces the highest classification accuracy. In addition, we found that the DistilBERT and RoBERTa transformer networks perform statistically similarly and consistently well in manifesto classification. Using the cross-context training strategy, DistilBERT and RoBERTa achieve 60.05% and 57.64% accuracy, respectively, in the classification of the Ecuadorian manifesto. Finally, we investigated the effect of the composition of the training set on performance. The experiments demonstrate that training DistilBERT solely with Ecuadorian manifestos achieves the highest accuracy and F1-score. Furthermore, in the absence of the Ecuadorian dataset, competitive performance is achieved by training the model with datasets from Spain and Uruguay.

要了解一个政党的政策和拟议行动，就必须对政治宣言进行内容分析。然而，手动标注政治文本既耗时又耗力。变压器网络已成为实现这一任务自动化的重要工具。然而，这些模型需要大量的数据集才能实现良好的性能。这在宣言分类中可能是一个限制，因为公开标注的数据集可能很少。为了应对这一挑战，在这项工作中，我们采用跨领域训练策略，开发了一种用于宣言分类的 Transformer 网络。利用比较宣言项目的数据库，我们实施了一个分数因子实验设计，以确定哪些西班牙文撰写的宣言是厄瓜多尔宣言标注的最佳训练集。此外，我们还统计分析了哪些 Transformer 架构和预处理操作可以提高模型的准确性。结果表明，创建一个包含西班牙和乌拉圭宣言的训练集，并实施词干化和词素化预处理操作，能产生最高的分类准确率。此外，我们还发现 DistilBERT 和 RoBERTa 变换器网络在宣言分类方面的表现在统计上相似且一致良好。使用跨语境训练策略，DistilBERT 和 RoBERTa 在厄瓜多尔宣言的分类中分别达到了 60.05% 和 57.64% 的准确率。最后，我们研究了训练集的组成对性能的影响。实验表明，仅使用厄瓜多尔宣言对 DistilBERT 进行训练可获得最高的准确率和 F1 分数。此外，在没有厄瓜多尔数据集的情况下，使用西班牙和乌拉圭的数据集对该模型进行训练，也能获得具有竞争力的性能。

{"title":"A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study","authors":"Fernanda Barzallo, Maria Baldeon-Calisto, Margorie Pérez, Maria Emilia Moscoso, Danny Navarrete, Daniel Riofrío, Pablo Medina-Peréz, Susana K Lai-Yuen, Diego Benítez, Noel Peréz, Ricardo Flores Moyano, Mateo Fierro","doi":"10.1177/08944393241266220","DOIUrl":"https://doi.org/10.1177/08944393241266220","url":null,"abstract":"Content analysis of political manifestos is necessary to understand the policies and proposed actions of a party. However, manually labeling political texts is time-consuming and labor-intensive. Transformer networks have become essential tools for automating this task. Nevertheless, these models require extensive datasets to achieve good performance. This can be a limitation in manifesto classification, where the availability of publicly labeled datasets can be scarce. To address this challenge, in this work, we developed a Transformer network for the classification of manifestos using a cross-domain training strategy. Using the database of the Comparative Manifesto Project, we implemented a fractional factorial experimental design to determine which Spanish-written manifestos form the best training set for Ecuadorian manifesto labeling. Furthermore, we statistically analyzed which Transformer architecture and preprocessing operations improve the model accuracy. The results indicate that creating a training set with manifestos from Spain and Uruguay, along with implementing stemming and lemmatization preprocessing operations, produces the highest classification accuracy. In addition, we found that the DistilBERT and RoBERTa transformer networks perform statistically similarly and consistently well in manifesto classification. Using the cross-context training strategy, DistilBERT and RoBERTa achieve 60.05% and 57.64% accuracy, respectively, in the classification of the Ecuadorian manifesto. Finally, we investigated the effect of the composition of the training set on performance. The experiments demonstrate that training DistilBERT solely with Ecuadorian manifestos achieves the highest accuracy and F1-score. Furthermore, in the absence of the Ecuadorian dataset, competitive performance is achieved by training the model with datasets from Spain and Uruguay.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"53 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141755367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0