“The Enemy Among Us”

Q3 Social Sciences NASSP Bulletin Pub Date : 2019-07-26 DOI:10.1145/3324997

Wafa Alorainy, P. Burnap, Han Liu, M. Williams

{"title":"“The Enemy Among Us”","authors":"Wafa Alorainy, P. Burnap, Han Liu, M. Williams","doi":"10.1145/3324997","DOIUrl":null,"url":null,"abstract":"Offensive or antagonistic language targeted at individuals and social groups based on their personal characteristics (also known as cyber hate speech or cyberhate) has been frequently posted and widely circulated via the World Wide Web. This can be considered as a key risk factor for individual and societal tension surrounding regional instability. Automated Web-based cyberhate detection is important for observing and understanding community and regional societal tension—especially in online social networks where posts can be rapidly and widely viewed and disseminated. While previous work has involved using lexicons, bags-of-words, or probabilistic language parsing approaches, they often suffer from a similar issue, which is that cyberhate can be subtle and indirect—thus, depending on the occurrence of individual words or phrases, can lead to a significant number of false negatives, providing inaccurate representation of the trends in cyberhate. This problem motivated us to challenge thinking around the representation of subtle language use, such as references to perceived threats from “the other” including immigration or job prosperity in a hateful context. We propose a novel “othering” feature set that utilizes language use around the concept of “othering” and intergroup threat theory to identify these subtleties, and we implement a wide range of classification methods using embedding learning to compute semantic distances between parts of speech considered to be part of an “othering” narrative. To validate our approach, we conducted two sets of experiments. The first involved comparing the results of our novel method with state-of-the-art baseline models from the literature. Our approach outperformed all existing methods. The second tested the best performing models from the first phase on unseen datasets for different types of cyberhate, namely religion, disability, race, and sexual orientation. The results showed F-measure scores for classifying hateful instances obtained through applying our model of 0.81, 0.71, 0.89, and 0.72, respectively, demonstrating the ability of the “othering” narrative to be an important part of model generalization.","PeriodicalId":39340,"journal":{"name":"NASSP Bulletin","volume":"13 1","pages":"1 - 26"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3324997","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NASSP Bulletin","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3324997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 25

Abstract

Offensive or antagonistic language targeted at individuals and social groups based on their personal characteristics (also known as cyber hate speech or cyberhate) has been frequently posted and widely circulated via the World Wide Web. This can be considered as a key risk factor for individual and societal tension surrounding regional instability. Automated Web-based cyberhate detection is important for observing and understanding community and regional societal tension—especially in online social networks where posts can be rapidly and widely viewed and disseminated. While previous work has involved using lexicons, bags-of-words, or probabilistic language parsing approaches, they often suffer from a similar issue, which is that cyberhate can be subtle and indirect—thus, depending on the occurrence of individual words or phrases, can lead to a significant number of false negatives, providing inaccurate representation of the trends in cyberhate. This problem motivated us to challenge thinking around the representation of subtle language use, such as references to perceived threats from “the other” including immigration or job prosperity in a hateful context. We propose a novel “othering” feature set that utilizes language use around the concept of “othering” and intergroup threat theory to identify these subtleties, and we implement a wide range of classification methods using embedding learning to compute semantic distances between parts of speech considered to be part of an “othering” narrative. To validate our approach, we conducted two sets of experiments. The first involved comparing the results of our novel method with state-of-the-art baseline models from the literature. Our approach outperformed all existing methods. The second tested the best performing models from the first phase on unseen datasets for different types of cyberhate, namely religion, disability, race, and sexual orientation. The results showed F-measure scores for classifying hateful instances obtained through applying our model of 0.81, 0.71, 0.89, and 0.72, respectively, demonstrating the ability of the “othering” narrative to be an important part of model generalization.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

“我们中间的敌人”

基于个人特征针对个人和社会群体的攻击性或对抗性语言(也称为网络仇恨言论或网络仇恨)经常在万维网上发布和广泛传播。这可以被视为围绕区域不稳定的个人和社会紧张局势的一个关键风险因素。基于网络的自动网络仇恨检测对于观察和理解社区和区域社会紧张局势非常重要，特别是在帖子可以迅速广泛浏览和传播的在线社交网络中。虽然以前的工作涉及使用词典、词袋或概率语言解析方法，但它们经常遇到类似的问题，即网络仇恨可能是微妙和间接的——因此，根据单个单词或短语的出现，可能导致大量假阴性，从而提供对网络仇恨趋势的不准确表示。这个问题促使我们挑战围绕微妙语言使用表现的思考，比如在一个充满仇恨的环境中提到来自“他者”的感知威胁，包括移民或就业繁荣。我们提出了一种新颖的“他者”特征集，它利用围绕“他者”概念的语言使用和群体间威胁理论来识别这些微妙之处，我们实现了广泛的分类方法，使用嵌入学习来计算被认为是“他者”叙事的一部分的词性之间的语义距离。为了验证我们的方法，我们进行了两组实验。第一个涉及将我们的新方法的结果与文献中最先进的基线模型进行比较。我们的方法优于所有现有的方法。第二阶段在不同类型的网络仇恨(即宗教、残疾、种族和性取向)的未知数据集上测试了第一阶段中表现最好的模型。结果显示，通过应用我们的模型获得的仇恨实例分类的f测量得分分别为0.81,0.71,0.89和0.72，表明“他者”叙事是模型泛化的重要组成部分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

NASSP Bulletin Social Sciences-Education

CiteScore

1.40

自引率

0.00%

发文量