Authors: Sai Wang, Ki Joon Kim
DOI: 10.1089/cyber.2022.0158
Journal: Cyberpsychology, Behavior and Social Networking, 26(7), pp. 527-534 (Journal Article)
Published: 2023-07-01
Content Moderation on Social Media: Does It Matter Who and Why Moderates Hate Speech?
Artificial intelligence (AI) has been increasingly integrated into content moderation to detect and remove hate speech on social media. An online experiment (N = 478) was conducted to examine how moderation agents (AI vs. human vs. human-AI collaboration) and removal explanations (with vs. without) affect users' perceptions and acceptance of removal decisions for hate speech targeting social groups with certain characteristics, such as religion or sexual orientation. The results showed that individuals exhibited consistent levels of perceived trustworthiness and acceptance of removal decisions regardless of the type of moderation agent. When explanations for the content takedown were provided, removal decisions made jointly by humans and AI were perceived as more trustworthy than the same decisions made by humans alone, which in turn increased users' willingness to accept the verdict. However, this moderated mediation effect was significant only when Muslims, not homosexuals, were the target of hate speech.
About the journal:
Cyberpsychology, Behavior, and Social Networking is a leading peer-reviewed journal that is recognized for its authoritative research on the social, behavioral, and psychological impacts of contemporary social networking practices. The journal covers a wide range of platforms, including Twitter, Facebook, internet gaming, and e-commerce, and examines how these digital environments shape human interaction and societal norms.
For over two decades, this journal has been a pioneering voice in the exploration of social networking and virtual reality, establishing itself as an indispensable resource for professionals and academics in the field. It is particularly celebrated for its swift dissemination of findings through rapid communication articles, alongside comprehensive, in-depth studies that delve into the multifaceted effects of interactive technologies on both individual behavior and broader societal trends.
The journal's scope encompasses the full spectrum of impacts—highlighting not only the potential benefits but also the challenges that arise as a result of these technologies. By providing a platform for rigorous research and critical discussions, it fosters a deeper understanding of the complex interplay between technology and human behavior.