首页 > 最新文献

Social Science Computer Review最新文献

英文 中文
Sexism and Media Communication. An Application to the Italian Case 性别歧视与媒体传播。意大利案例的应用
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-06 DOI: 10.1177/08944393241269415
Elia A. G. Arfini, Luigi Curini, Fabiana G. Giannuzzi
Acknowledging the importance of focusing on media’s communication for studying linguistic sexism, we propose a new method to analyze a corpus of texts via a machine learning approach built around an original training-set. We seek to establish a framework of the current use of talking about women in newspapers that expands beyond merely the objective forms of discrimination by also measuring the degree to which it implicitly conveys sexist messages through combination of words, expressions, and lexical aspects of language. As an illustrative example, we then apply such an approach to around 15,000 Italian newspapers’ headlines to investigate the impact of newspapers’ political orientations on the linguistic choices made by journalists in writing articles’ headlines.
鉴于关注媒体传播对研究语言性别歧视的重要性,我们提出了一种新方法,通过围绕原始训练集建立的机器学习方法来分析文本语料库。我们试图建立一个关于当前报纸中谈论女性的使用框架,该框架不仅仅局限于客观形式的歧视,还可以通过词语、表达方式和语言词汇方面的组合来衡量其隐含传达性别歧视信息的程度。作为一个示例,我们随后将这种方法应用于约 15,000 份意大利报纸的标题,以研究报纸的政治取向对记者在撰写文章标题时所作语言选择的影响。
{"title":"Sexism and Media Communication. An Application to the Italian Case","authors":"Elia A. G. Arfini, Luigi Curini, Fabiana G. Giannuzzi","doi":"10.1177/08944393241269415","DOIUrl":"https://doi.org/10.1177/08944393241269415","url":null,"abstract":"Acknowledging the importance of focusing on media’s communication for studying linguistic sexism, we propose a new method to analyze a corpus of texts via a machine learning approach built around an original training-set. We seek to establish a framework of the current use of talking about women in newspapers that expands beyond merely the objective forms of discrimination by also measuring the degree to which it implicitly conveys sexist messages through combination of words, expressions, and lexical aspects of language. As an illustrative example, we then apply such an approach to around 15,000 Italian newspapers’ headlines to investigate the impact of newspapers’ political orientations on the linguistic choices made by journalists in writing articles’ headlines.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"2 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141899594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Journalists’ Ethical Responsibility: Tackling Hate Speech Against Women Politicians in Social Media Through Natural Language Processing Techniques 记者的道德责任:通过自然语言处理技术应对社交媒体中针对女政治家的仇恨言论
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-05 DOI: 10.1177/08944393241269417
Maria Iranzo-Cabrera, Maria Jose Castro-Bleda, Iris Simón-Astudillo, Lluís-F. Hurtado
Social media has led to a redefinition of the journalist’s role. Specifically on Twitter, these professionals assume an influential position and their discourse is dominated by personal opinions. Taking into consideration that this platform has proven to be a breeding ground for polarization, digital harassment and hate speech, notably against women politicians, this research aims to analyze journalists’ involvement in this complex scenario. The investigation aims to determine whether, immersed in online and gender defamation campaigns, journalists enhance the quality of public debate or, on the contrary, they reinforce the visibility of this hostile content. To this end, we examined a sample of 63,926 tweets published from 23 to 25 November 2022 related to a campaign of political violence against the Spanish Minister of Equality using Natural Language Processing tools and qualitative content analysis. Results show that during those three days, at least half of the tweets contained hate speech and improper language. In this climate of hostility, journalists participating in the debate not only have an ability to attract likes and retweets but also exhibit polarization and use hate speech. Each ideological position—for and against the Minister—is also reflected in their own uncivil strategies. Under the umbrella of free speech and regardless of argumentative discourses, those journalists who lean towards ideological progressivism tend to insult their opponents, and those on the political right use divisive constructions, stereotyping and irony as attack techniques.
社交媒体重新定义了记者的角色。特别是在 Twitter 上,这些专业人士占据了有影响力的位置,他们的言论以个人观点为主。考虑到这一平台已被证明是滋生两极分化、数字骚扰和仇恨言论的温床,尤其是针对女性政治家的言论,本研究旨在分析记者在这一复杂局面中的参与情况。调查旨在确定,在网络和性别诽谤运动中,记者是提高了公共辩论的质量,还是相反地加强了这些敌对内容的可见度。为此,我们使用自然语言处理工具和定性内容分析,对 2022 年 11 月 23 日至 25 日发布的 63926 条推文进行了抽样调查,这些推文与针对西班牙平等部部长的政治暴力运动有关。结果显示,在这三天中,至少有一半的推文包含仇恨言论和不当语言。在这种充满敌意的氛围中,参与辩论的记者不仅有能力吸引点赞和转发,还表现出两极分化并使用仇恨言论。支持和反对部长的意识形态立场也体现在各自的不文明策略中。在言论自由的保护伞下,无论论证话语如何,那些倾向于意识形态进步主义的记者倾向于侮辱他们的对手,而那些政治右派则使用分裂性建构、刻板印象和讽刺作为攻击技巧。
{"title":"Journalists’ Ethical Responsibility: Tackling Hate Speech Against Women Politicians in Social Media Through Natural Language Processing Techniques","authors":"Maria Iranzo-Cabrera, Maria Jose Castro-Bleda, Iris Simón-Astudillo, Lluís-F. Hurtado","doi":"10.1177/08944393241269417","DOIUrl":"https://doi.org/10.1177/08944393241269417","url":null,"abstract":"Social media has led to a redefinition of the journalist’s role. Specifically on Twitter, these professionals assume an influential position and their discourse is dominated by personal opinions. Taking into consideration that this platform has proven to be a breeding ground for polarization, digital harassment and hate speech, notably against women politicians, this research aims to analyze journalists’ involvement in this complex scenario. The investigation aims to determine whether, immersed in online and gender defamation campaigns, journalists enhance the quality of public debate or, on the contrary, they reinforce the visibility of this hostile content. To this end, we examined a sample of 63,926 tweets published from 23 to 25 November 2022 related to a campaign of political violence against the Spanish Minister of Equality using Natural Language Processing tools and qualitative content analysis. Results show that during those three days, at least half of the tweets contained hate speech and improper language. In this climate of hostility, journalists participating in the debate not only have an ability to attract likes and retweets but also exhibit polarization and use hate speech. Each ideological position—for and against the Minister—is also reflected in their own uncivil strategies. Under the umbrella of free speech and regardless of argumentative discourses, those journalists who lean towards ideological progressivism tend to insult their opponents, and those on the political right use divisive constructions, stereotyping and irony as attack techniques.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"55 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141895575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forty Thousand Fake Twitter Profiles: A Computational Framework for the Visual Analysis of Social Media Propaganda 四万个虚假 Twitter 简介:社交媒体宣传可视化分析的计算框架
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-02 DOI: 10.1177/08944393241269394
Noel George, Azhar Sham, Thanvi Ajith, Marco Bastos
Successful disinformation campaigns depend on the availability of fake social media profiles used for coordinated inauthentic behavior with networks of false accounts including bots, trolls, and sockpuppets. This study presents a scalable and unsupervised framework to identify visual elements in user profiles strategically exploited in nearly 60 influence operations, including camera angle, photo composition, gender, and race, but also more context-dependent categories like sensuality and emotion. We leverage Google’s Teachable Machine and the DeepFace Library to classify fake user accounts in the Twitter Moderation Research Consortium database, a large repository of social media accounts linked to foreign influence operations. We discuss the performance of these classifiers against manually coded data and their applicability in large-scale data analysis. The proposed framework demonstrates promising results for the identification of fake online profiles used in influence operations and by the cottage industry specialized in crafting desirable online personas.
成功的造谣活动依赖于虚假社交媒体资料的可用性,这些资料被用于与虚假账户网络(包括机器人、巨魔和 sockpuppets)协调不真实行为。本研究提出了一个可扩展的无监督框架,用于识别用户资料中的视觉元素,这些元素在近 60 次影响行动中被策略性地利用,包括拍摄角度、照片构图、性别和种族,以及感性和情感等更多依赖于上下文的类别。我们利用谷歌的 "可教机器"(Teachable Machine)和 DeepFace 库,对 Twitter 节制研究联盟数据库(Twitter Moderation Research Consortium)中的虚假用户账户进行分类,该数据库是一个与外国影响力行动相关的大型社交媒体账户库。我们讨论了这些分类器在人工编码数据方面的性能及其在大规模数据分析中的适用性。所提出的框架在识别影响行动中使用的虚假在线配置文件以及专门制作理想在线角色的山寨产业方面取得了可喜的成果。
{"title":"Forty Thousand Fake Twitter Profiles: A Computational Framework for the Visual Analysis of Social Media Propaganda","authors":"Noel George, Azhar Sham, Thanvi Ajith, Marco Bastos","doi":"10.1177/08944393241269394","DOIUrl":"https://doi.org/10.1177/08944393241269394","url":null,"abstract":"Successful disinformation campaigns depend on the availability of fake social media profiles used for coordinated inauthentic behavior with networks of false accounts including bots, trolls, and sockpuppets. This study presents a scalable and unsupervised framework to identify visual elements in user profiles strategically exploited in nearly 60 influence operations, including camera angle, photo composition, gender, and race, but also more context-dependent categories like sensuality and emotion. We leverage Google’s Teachable Machine and the DeepFace Library to classify fake user accounts in the Twitter Moderation Research Consortium database, a large repository of social media accounts linked to foreign influence operations. We discuss the performance of these classifiers against manually coded data and their applicability in large-scale data analysis. The proposed framework demonstrates promising results for the identification of fake online profiles used in influence operations and by the cottage industry specialized in crafting desirable online personas.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"75 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141880310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining Natural Language Processing and Statistical Methods to Assess Gender Gaps in the Mediated Personalization of Politics 结合自然语言处理和统计方法,评估政治个性化中介中的性别差距
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-31 DOI: 10.1177/08944393241269097
Emanuele Brugnoli, Rosaria Simone, Marco Delmastro
The media attention to the personal sphere of famous and important individuals has become a key element of the gender narrative. In this setting, we aim at assessing gender gaps in the mediated personalization of a wide range of political office holders in Italy during the period 2017–2020 by means of a combination of NLP and statistical methods. The proposed analysis hinges on the definition of a new score for each word in the corpus that adjusts the incidence rate for the under representation of women in politics. On this basis, evidence is found that political personalization in Italy is more detrimental for women than it is for men, with the persistence of entrenched stereotypes including a masculine connotation of leadership, the resulting women’s unsuitability to hold political functions, and a greater deal of focus on their attractiveness and body parts. In addition, women politicians are covered with a more negative tone than their men counterpart when personal details are reported. By distinguishing between different types of media, we also show that the observed gender differences are primarily found in online news rather than print news. This suggests that the expression of certain stereotypes may be favored when click baiting and personal targeting have a major impact.
媒体对名人和重要人物个人领域的关注已成为性别叙事的一个关键要素。在这一背景下,我们旨在通过结合 NLP 和统计方法,评估 2017-2020 年间意大利各类政治职位担任者的媒介个性化中的性别差距。所提议的分析依赖于为语料库中的每个单词定义一个新的分数,该分数会根据女性在政治领域代表性不足的情况调整发生率。在此基础上,我们发现有证据表明,意大利的政治人格化对女性的不利影响比对男性更大,因为根深蒂固的定型观念持续存在,包括领导力的男性内涵,由此导致女性不适合担任政治职务,以及更多关注女性的吸引力和身体部位。此外,在报道女性政治家的个人细节时,其语气也比男性政治家更为负面。通过区分不同类型的媒体,我们还发现观察到的性别差异主要出现在网络新闻而非印刷新闻中。这表明,当点击诱饵和个人目标产生重大影响时,某些刻板印象的表达可能会受到青睐。
{"title":"Combining Natural Language Processing and Statistical Methods to Assess Gender Gaps in the Mediated Personalization of Politics","authors":"Emanuele Brugnoli, Rosaria Simone, Marco Delmastro","doi":"10.1177/08944393241269097","DOIUrl":"https://doi.org/10.1177/08944393241269097","url":null,"abstract":"The media attention to the personal sphere of famous and important individuals has become a key element of the gender narrative. In this setting, we aim at assessing gender gaps in the mediated personalization of a wide range of political office holders in Italy during the period 2017–2020 by means of a combination of NLP and statistical methods. The proposed analysis hinges on the definition of a new score for each word in the corpus that adjusts the incidence rate for the under representation of women in politics. On this basis, evidence is found that political personalization in Italy is more detrimental for women than it is for men, with the persistence of entrenched stereotypes including a masculine connotation of leadership, the resulting women’s unsuitability to hold political functions, and a greater deal of focus on their attractiveness and body parts. In addition, women politicians are covered with a more negative tone than their men counterpart when personal details are reported. By distinguishing between different types of media, we also show that the observed gender differences are primarily found in online news rather than print news. This suggests that the expression of certain stereotypes may be favored when click baiting and personal targeting have a major impact.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"178 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141877344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Algorithms Promote Self-Radicalization: Audit of TikTok’s Algorithm Using a Reverse Engineering Method 算法如何促进自我激进化?使用逆向工程方法审计 TikTok 算法
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-30 DOI: 10.1177/08944393231225547
Donghee Shin, Kulsawasd Jitkajornwanich
Algorithmic radicalization is the idea that algorithms used by social media platforms push people down digital “rabbit holes” by framing personal online activity. Algorithms control what people see and when they see it and learn from their past activities. As such, people gradually and subconsciously adopt the ideas presented to them by the rabbit hole down which they have been pushed. In this study, TikTok’s role in fostering radicalized ideology is examined to offer a critical analysis of the state of radicalism and extremism on platforms. This study conducted an algorithm audit of the role of radicalizing information in social media by examining how TikTok’s algorithms are being used to radicalize, polarize, and spread extremism and societal instability. The results revealed that the pathways through which users access far-right content are manifold and that a large portion of the content can be ascribed to platform recommendations through radicalization pipelines. Algorithms are not simple tools that offer personalized services but rather contributors to radicalism, societal violence, and polarization. Such personalization processes have been instrumental in how artificial intelligence (AI) has been deployed, designed, and used to the detrimental outcomes that it has generated. Thus, the generation and adoption of extreme content on TikTok are, by and large, not only a reflection of user inputs and interactions with the platform but also the platform’s ability to slot users into specific categories and reinforce their ideas.
算法激进化是指社交媒体平台使用的算法通过框定个人在线活动,将人们推向数字 "兔子洞"。算法控制着人们看到什么、何时看到,并从他们过去的活动中学习。因此,人们会逐渐地、下意识地接受被推下兔子洞的人向他们展示的想法。在本研究中,我们研究了 TikTok 在培养激进意识形态方面的作用,对平台上的激进主义和极端主义状况进行了批判性分析。本研究通过研究 TikTok 的算法如何被用于激进化、分化和传播极端主义和社会不稳定,对激进化信息在社交媒体中的作用进行了算法审计。研究结果表明,用户获取极右内容的途径是多方面的,其中很大一部分内容可以通过激进化管道归因于平台推荐。算法不是提供个性化服务的简单工具,而是激进主义、社会暴力和两极分化的助推器。在人工智能(AI)的部署、设计和使用过程中,这种个性化过程发挥了重要作用,并产生了有害的结果。因此,TikTok 上极端内容的产生和采用在很大程度上不仅反映了用户对平台的投入和互动,也反映了平台将用户归入特定类别并强化其观念的能力。
{"title":"How Algorithms Promote Self-Radicalization: Audit of TikTok’s Algorithm Using a Reverse Engineering Method","authors":"Donghee Shin, Kulsawasd Jitkajornwanich","doi":"10.1177/08944393231225547","DOIUrl":"https://doi.org/10.1177/08944393231225547","url":null,"abstract":"Algorithmic radicalization is the idea that algorithms used by social media platforms push people down digital “rabbit holes” by framing personal online activity. Algorithms control what people see and when they see it and learn from their past activities. As such, people gradually and subconsciously adopt the ideas presented to them by the rabbit hole down which they have been pushed. In this study, TikTok’s role in fostering radicalized ideology is examined to offer a critical analysis of the state of radicalism and extremism on platforms. This study conducted an algorithm audit of the role of radicalizing information in social media by examining how TikTok’s algorithms are being used to radicalize, polarize, and spread extremism and societal instability. The results revealed that the pathways through which users access far-right content are manifold and that a large portion of the content can be ascribed to platform recommendations through radicalization pipelines. Algorithms are not simple tools that offer personalized services but rather contributors to radicalism, societal violence, and polarization. Such personalization processes have been instrumental in how artificial intelligence (AI) has been deployed, designed, and used to the detrimental outcomes that it has generated. Thus, the generation and adoption of extreme content on TikTok are, by and large, not only a reflection of user inputs and interactions with the platform but also the platform’s ability to slot users into specific categories and reinforce their ideas.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"26 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tracking Census Online Self-Completion Using Twitter Posts 利用 Twitter 帖子跟踪人口普查在线自我填写情况
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-30 DOI: 10.1177/08944393241268461
Mao Li, Frederick Conrad
From the start of data collection for the 2020 US Census, official and celebrity users tweeted about the importance of everyone being counted in the Census and urged followers to complete the questionnaire (so-called social media campaign.) At the same time, social media posts expressing skepticism about the Census became increasingly common. This study distinguishes between different prototypical Twitter user groups and investigates their possible impact on (online) self-completion rate for the 2020 Census, according to Census Bureau data. Using a network analysis method, Community Detection, and a clustering algorithm, Latent Dirichlet Allocation (LDA), three prototypical user groups were identified: “Official Government Agency,” “Census Advocate,” and “Census Skeptic.” The prototypical Census Skeptic user was motivated by events about which an influential person had tweeted (e.g., “Republicans in Congress signal Census cannot take extra time to count”). This group became the largest one over the study period. The prototypical Census Advocate was motivated more by official tweets and was more active than the prototypical Census Skeptic. The Official Government Agency user group was the smallest of the three, but their messages—primarily promoting completion of the Census—seemed to have been amplified by Census Advocate, especially celebrities and politicians. We found that the daily size of the Census Advocate user group—but not the other two—predicted the 2020 Census online self-completion rate within five days after a tweet was posted. This finding suggests that the Census social media campaign was successful in promoting completion, apparently due to the help of Census Advocate users who encouraged people to fill out the Census and amplified official tweets. This finding demonstrates that a social media campaign can positively affect public behavior regarding an essential national project like the Decennial Census.
从 2020 年美国人口普查的数据收集开始,官方用户和名人用户就在推特上大肆宣扬人口普查中每个人都被计算在内的重要性,并敦促关注者填写调查问卷(即所谓的社交媒体活动)。根据人口普查局的数据,本研究区分了不同的推特用户群体原型,并调查了他们对 2020 年人口普查(在线)自我填写率可能产生的影响。使用网络分析方法 "社区检测 "和聚类算法 "潜在德里希特分配"(LDA),确定了三个原型用户群:"官方政府机构"、"人口普查倡导者 "和 "人口普查怀疑者"。人口普查怀疑论者 "原型用户的动机是某个有影响力的人在推特上发布的事件(例如,"国会中的共和党人表示人口普查不能花额外的时间来统计")。在研究期间,这一群体成为最大的群体。原型人口普查拥护者更多受到官方推文的激励,比原型人口普查怀疑者更活跃。官方政府机构用户群是三个用户群中最小的,但他们的信息--主要是促进人口普查的完成--似乎被人口普查倡导者,尤其是名人和政客所放大。我们发现,"人口普查倡导者 "用户群(而非其他两个用户群)的每日规模可以预测推文发布后五天内的 2020 年人口普查在线自我完成率。这一发现表明,人口普查社交媒体活动在促进填写方面取得了成功,这显然要归功于人口普查倡导者用户的帮助,他们鼓励人们填写人口普查并放大了官方推文。这一发现表明,对于像十年一次的人口普查这样重要的国家项目,社交媒体活动可以对公众行为产生积极影响。
{"title":"Tracking Census Online Self-Completion Using Twitter Posts","authors":"Mao Li, Frederick Conrad","doi":"10.1177/08944393241268461","DOIUrl":"https://doi.org/10.1177/08944393241268461","url":null,"abstract":"From the start of data collection for the 2020 US Census, official and celebrity users tweeted about the importance of everyone being counted in the Census and urged followers to complete the questionnaire (so-called social media campaign.) At the same time, social media posts expressing skepticism about the Census became increasingly common. This study distinguishes between different prototypical Twitter user groups and investigates their possible impact on (online) self-completion rate for the 2020 Census, according to Census Bureau data. Using a network analysis method, Community Detection, and a clustering algorithm, Latent Dirichlet Allocation (LDA), three prototypical user groups were identified: “Official Government Agency,” “Census Advocate,” and “Census Skeptic.” The prototypical Census Skeptic user was motivated by events about which an influential person had tweeted (e.g., “Republicans in Congress signal Census cannot take extra time to count”). This group became the largest one over the study period. The prototypical Census Advocate was motivated more by official tweets and was more active than the prototypical Census Skeptic. The Official Government Agency user group was the smallest of the three, but their messages—primarily promoting completion of the Census—seemed to have been amplified by Census Advocate, especially celebrities and politicians. We found that the daily size of the Census Advocate user group—but not the other two—predicted the 2020 Census online self-completion rate within five days after a tweet was posted. This finding suggests that the Census social media campaign was successful in promoting completion, apparently due to the help of Census Advocate users who encouraged people to fill out the Census and amplified official tweets. This finding demonstrates that a social media campaign can positively affect public behavior regarding an essential national project like the Decennial Census.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"81 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study 利用跨语境训练进行宣言分类的转换器模型:厄瓜多尔案例研究
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-24 DOI: 10.1177/08944393241266220
Fernanda Barzallo, Maria Baldeon-Calisto, Margorie Pérez, Maria Emilia Moscoso, Danny Navarrete, Daniel Riofrío, Pablo Medina-Peréz, Susana K Lai-Yuen, Diego Benítez, Noel Peréz, Ricardo Flores Moyano, Mateo Fierro
Content analysis of political manifestos is necessary to understand the policies and proposed actions of a party. However, manually labeling political texts is time-consuming and labor-intensive. Transformer networks have become essential tools for automating this task. Nevertheless, these models require extensive datasets to achieve good performance. This can be a limitation in manifesto classification, where the availability of publicly labeled datasets can be scarce. To address this challenge, in this work, we developed a Transformer network for the classification of manifestos using a cross-domain training strategy. Using the database of the Comparative Manifesto Project, we implemented a fractional factorial experimental design to determine which Spanish-written manifestos form the best training set for Ecuadorian manifesto labeling. Furthermore, we statistically analyzed which Transformer architecture and preprocessing operations improve the model accuracy. The results indicate that creating a training set with manifestos from Spain and Uruguay, along with implementing stemming and lemmatization preprocessing operations, produces the highest classification accuracy. In addition, we found that the DistilBERT and RoBERTa transformer networks perform statistically similarly and consistently well in manifesto classification. Using the cross-context training strategy, DistilBERT and RoBERTa achieve 60.05% and 57.64% accuracy, respectively, in the classification of the Ecuadorian manifesto. Finally, we investigated the effect of the composition of the training set on performance. The experiments demonstrate that training DistilBERT solely with Ecuadorian manifestos achieves the highest accuracy and F1-score. Furthermore, in the absence of the Ecuadorian dataset, competitive performance is achieved by training the model with datasets from Spain and Uruguay.
要了解一个政党的政策和拟议行动,就必须对政治宣言进行内容分析。然而,手动标注政治文本既耗时又耗力。变压器网络已成为实现这一任务自动化的重要工具。然而,这些模型需要大量的数据集才能实现良好的性能。这在宣言分类中可能是一个限制,因为公开标注的数据集可能很少。为了应对这一挑战,在这项工作中,我们采用跨领域训练策略,开发了一种用于宣言分类的 Transformer 网络。利用比较宣言项目的数据库,我们实施了一个分数因子实验设计,以确定哪些西班牙文撰写的宣言是厄瓜多尔宣言标注的最佳训练集。此外,我们还统计分析了哪些 Transformer 架构和预处理操作可以提高模型的准确性。结果表明,创建一个包含西班牙和乌拉圭宣言的训练集,并实施词干化和词素化预处理操作,能产生最高的分类准确率。此外,我们还发现 DistilBERT 和 RoBERTa 变换器网络在宣言分类方面的表现在统计上相似且一致良好。使用跨语境训练策略,DistilBERT 和 RoBERTa 在厄瓜多尔宣言的分类中分别达到了 60.05% 和 57.64% 的准确率。最后,我们研究了训练集的组成对性能的影响。实验表明,仅使用厄瓜多尔宣言对 DistilBERT 进行训练可获得最高的准确率和 F1 分数。此外,在没有厄瓜多尔数据集的情况下,使用西班牙和乌拉圭的数据集对该模型进行训练,也能获得具有竞争力的性能。
{"title":"A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study","authors":"Fernanda Barzallo, Maria Baldeon-Calisto, Margorie Pérez, Maria Emilia Moscoso, Danny Navarrete, Daniel Riofrío, Pablo Medina-Peréz, Susana K Lai-Yuen, Diego Benítez, Noel Peréz, Ricardo Flores Moyano, Mateo Fierro","doi":"10.1177/08944393241266220","DOIUrl":"https://doi.org/10.1177/08944393241266220","url":null,"abstract":"Content analysis of political manifestos is necessary to understand the policies and proposed actions of a party. However, manually labeling political texts is time-consuming and labor-intensive. Transformer networks have become essential tools for automating this task. Nevertheless, these models require extensive datasets to achieve good performance. This can be a limitation in manifesto classification, where the availability of publicly labeled datasets can be scarce. To address this challenge, in this work, we developed a Transformer network for the classification of manifestos using a cross-domain training strategy. Using the database of the Comparative Manifesto Project, we implemented a fractional factorial experimental design to determine which Spanish-written manifestos form the best training set for Ecuadorian manifesto labeling. Furthermore, we statistically analyzed which Transformer architecture and preprocessing operations improve the model accuracy. The results indicate that creating a training set with manifestos from Spain and Uruguay, along with implementing stemming and lemmatization preprocessing operations, produces the highest classification accuracy. In addition, we found that the DistilBERT and RoBERTa transformer networks perform statistically similarly and consistently well in manifesto classification. Using the cross-context training strategy, DistilBERT and RoBERTa achieve 60.05% and 57.64% accuracy, respectively, in the classification of the Ecuadorian manifesto. Finally, we investigated the effect of the composition of the training set on performance. The experiments demonstrate that training DistilBERT solely with Ecuadorian manifestos achieves the highest accuracy and F1-score. Furthermore, in the absence of the Ecuadorian dataset, competitive performance is achieved by training the model with datasets from Spain and Uruguay.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"53 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141755367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Harassment: The Mediating and Moderating Role of Thoughtfully Reflective Decision-Making 网络骚扰:深思熟虑的反思性决策的中介和调节作用
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-20 DOI: 10.1177/08944393241261983
C. Jordan Howell, Saeed Kabiri, Fangzhou Wang, Caitlyn N. Muniz, Eden Kamar, Mahmoud Sharepour, John Cochran, Seyyedeh Masoomeh (Shamila) Shadmanfaat
The current study employs a construct from the criminological literature, thoughtfully reflective decision-making (TRDM), to understand cyber offenders’ decision-making and offer relevant insights to prevent online harassment. Using a sample of Iranian high school students ( N = 366), we employ OLS and SEM to test whether and how TRDM, perceived deterrence, and prior victimization influence the most common forms of online harassment: cyberbullying and cyberstalking. Findings demonstrate cyberbullying and cyberstalking victimization increase engagement in offending behavior while participants’ fear of sanction reduces engagement in both cyberbullying and cyberstalking perpetration. Notably, results demonstrate that TRDM has a direct, mediating, and moderating effect on both forms of offending. TRDM also has an indirect effect on cyberbullying and cyberstalking perpetration through victimization and participants’ perceptions of sanction. Unlike contemporary, pre-dispositional theories of crime, TRDM is dynamic and can be improved via educational programming. We posit that current cyber hygiene campaigns should include elements aimed to improve individuals’ cognitive decision-making capabilities. Guided by theory, and based on the results of the current study, this translational approach could prevent victimization while simultaneously improving other elements of the participants’ life.
本研究采用犯罪学文献中的一个概念--深思熟虑的反思性决策(TRDM)--来理解网络犯罪者的决策,并为预防网络骚扰提供相关见解。我们以伊朗高中生(366 人)为样本,采用 OLS 和 SEM 方法检验 TRDM、感知威慑力和先前受害情况是否以及如何影响最常见的网络骚扰形式:网络欺凌和网络跟踪。研究结果表明,网络欺凌和网络跟踪的受害情况会增加犯罪行为的参与度,而参与者对制裁的恐惧则会减少网络欺凌和网络跟踪行为的参与度。值得注意的是,研究结果表明,TRDM 对这两种形式的犯罪行为都有直接、中介和调节作用。TRDM还通过受害情况和参与者对制裁的看法对网络欺凌和网络跟踪的实施产生间接影响。与当代预设犯罪理论不同,TRDM 是动态的,可以通过教育计划加以改进。我们认为,当前的网络卫生活动应包括旨在提高个人认知决策能力的内容。在理论指导下,根据当前研究的结果,这种转化方法可以在防止受害的同时改善参与者生活的其他方面。
{"title":"Online Harassment: The Mediating and Moderating Role of Thoughtfully Reflective Decision-Making","authors":"C. Jordan Howell, Saeed Kabiri, Fangzhou Wang, Caitlyn N. Muniz, Eden Kamar, Mahmoud Sharepour, John Cochran, Seyyedeh Masoomeh (Shamila) Shadmanfaat","doi":"10.1177/08944393241261983","DOIUrl":"https://doi.org/10.1177/08944393241261983","url":null,"abstract":"The current study employs a construct from the criminological literature, thoughtfully reflective decision-making (TRDM), to understand cyber offenders’ decision-making and offer relevant insights to prevent online harassment. Using a sample of Iranian high school students ( N = 366), we employ OLS and SEM to test whether and how TRDM, perceived deterrence, and prior victimization influence the most common forms of online harassment: cyberbullying and cyberstalking. Findings demonstrate cyberbullying and cyberstalking victimization increase engagement in offending behavior while participants’ fear of sanction reduces engagement in both cyberbullying and cyberstalking perpetration. Notably, results demonstrate that TRDM has a direct, mediating, and moderating effect on both forms of offending. TRDM also has an indirect effect on cyberbullying and cyberstalking perpetration through victimization and participants’ perceptions of sanction. Unlike contemporary, pre-dispositional theories of crime, TRDM is dynamic and can be improved via educational programming. We posit that current cyber hygiene campaigns should include elements aimed to improve individuals’ cognitive decision-making capabilities. Guided by theory, and based on the results of the current study, this translational approach could prevent victimization while simultaneously improving other elements of the participants’ life.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"136 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141730514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Typing or Speaking? Comparing Text and Voice Answers to Open Questions on Sensitive Topics in Smartphone Surveys 打字还是说话?比较智能手机调查中对敏感话题开放式问题的文字和语音回答
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-28 DOI: 10.1177/08944393231160961
Jan Karem Höhne, Konstantin Gavras, Joshua Claassen
The smartphone increase in web surveys, coupled with technological developments, provides novel opportunities for measuring attitudes. For example, smartphones allow the collection of voice instead of text answers by using the built-in microphone. This may facilitate answering questions with open answer formats resulting in richer information and higher data quality. So far, there is only a little body of research investigating voice and text answers to open questions. In this study, we therefore compare the linguistic and content characteristics of voice and text answers to open questions on sensitive topics. For this purpose, we ran an experiment in a smartphone survey ( N = 1001) and randomly assigned respondents to an answer format condition (text or voice). The findings indicate that voice answers have a higher number of words and a higher number of topics than their text counterparts. We find no differences regarding sentiments (or extremity of answers). Our study provides new insights into the linguistic and content characteristics of voice and text answers. Furthermore, it helps to evaluate the usefulness and usability of voice answers for future smartphone surveys.
在网络调查中,智能手机的增加以及技术的发展为测量态度提供了新的机会。例如,智能手机允许使用内置麦克风收集语音而非文字答案。这可能有助于以开放的回答格式回答问题,从而获得更丰富的信息和更高的数据质量。迄今为止,对开放式问题的语音和文字回答的研究还很少。因此,在本研究中,我们比较了语音和文本回答敏感话题开放式问题的语言和内容特征。为此,我们在智能手机调查中进行了一次实验(N = 1001),并将受访者随机分配到一种回答格式条件下(文本或语音)。实验结果表明,语音回答的字数和话题数量均高于文字回答。我们没有发现情感(或答案的极端性)方面的差异。我们的研究为了解语音和文本答案的语言和内容特征提供了新的视角。此外,它还有助于评估语音回答在未来智能手机调查中的实用性和可用性。
{"title":"Typing or Speaking? Comparing Text and Voice Answers to Open Questions on Sensitive Topics in Smartphone Surveys","authors":"Jan Karem Höhne, Konstantin Gavras, Joshua Claassen","doi":"10.1177/08944393231160961","DOIUrl":"https://doi.org/10.1177/08944393231160961","url":null,"abstract":"The smartphone increase in web surveys, coupled with technological developments, provides novel opportunities for measuring attitudes. For example, smartphones allow the collection of voice instead of text answers by using the built-in microphone. This may facilitate answering questions with open answer formats resulting in richer information and higher data quality. So far, there is only a little body of research investigating voice and text answers to open questions. In this study, we therefore compare the linguistic and content characteristics of voice and text answers to open questions on sensitive topics. For this purpose, we ran an experiment in a smartphone survey ( N = 1001) and randomly assigned respondents to an answer format condition (text or voice). The findings indicate that voice answers have a higher number of words and a higher number of topics than their text counterparts. We find no differences regarding sentiments (or extremity of answers). Our study provides new insights into the linguistic and content characteristics of voice and text answers. Furthermore, it helps to evaluate the usefulness and usability of voice answers for future smartphone surveys.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"40 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating Measurement Quality in Digital Trace Data and Surveys Using the MultiTrait MultiMethod Model 使用多特征多方法模型估算数字痕迹数据和调查的测量质量
IF 4.1 2区 社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-22 DOI: 10.1177/08944393241254464
Alexandru Cernat, Florian Keusch, Ruben L. Bach, Paulina K. Pankowska
Digital trace data are receiving increased attention as a potential way to capture human behavior. Nevertheless, this type of data is far from perfect and may not always provide better data compared to traditional social surveys. In this study we estimate measurement quality of survey and digital trace data on smartphone usage with a MultiTrait MultiMethod (MTMM) model. The experimental design included five topics relating to the use of smartphones (traits) measured with five methods: three different survey scales (a 5- and a 7-point frequency scale and an open-ended question on duration) and two measures from digital trace data (frequency and duration). We show that surveys and digital trace data measures have very low correlation with each other. We also show that all measures are far from perfect and, while digital trace data appears to have often better quality compared to surveys, that is not always the case.
数字跟踪数据作为一种捕捉人类行为的潜在方式,正受到越来越多的关注。然而,这类数据远非完美,与传统的社会调查相比,可能并不总能提供更好的数据。在本研究中,我们采用多特质多方法(MTMM)模型对智能手机使用调查和数字追踪数据的测量质量进行了评估。实验设计包括用五种方法测量与智能手机使用相关的五个主题(特质):三种不同的调查量表(一个 5 分和 7 分频率量表以及一个关于持续时间的开放式问题)和来自数字追踪数据的两种测量方法(频率和持续时间)。我们的研究表明,调查和数字跟踪数据测量之间的相关性非常低。我们还表明,所有测量方法都远非完美,虽然与调查相比,数字跟踪数据的质量似乎更高,但事实并非总是如此。
{"title":"Estimating Measurement Quality in Digital Trace Data and Surveys Using the MultiTrait MultiMethod Model","authors":"Alexandru Cernat, Florian Keusch, Ruben L. Bach, Paulina K. Pankowska","doi":"10.1177/08944393241254464","DOIUrl":"https://doi.org/10.1177/08944393241254464","url":null,"abstract":"Digital trace data are receiving increased attention as a potential way to capture human behavior. Nevertheless, this type of data is far from perfect and may not always provide better data compared to traditional social surveys. In this study we estimate measurement quality of survey and digital trace data on smartphone usage with a MultiTrait MultiMethod (MTMM) model. The experimental design included five topics relating to the use of smartphones (traits) measured with five methods: three different survey scales (a 5- and a 7-point frequency scale and an open-ended question on duration) and two measures from digital trace data (frequency and duration). We show that surveys and digital trace data measures have very low correlation with each other. We also show that all measures are far from perfect and, while digital trace data appears to have often better quality compared to surveys, that is not always the case.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"64 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141085419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Social Science Computer Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1