“我们中间的敌人”

Q3 Social Sciences NASSP Bulletin Pub Date : 2019-07-26 DOI:10.1145/3324997
Wafa Alorainy, P. Burnap, Han Liu, M. Williams
{"title":"“我们中间的敌人”","authors":"Wafa Alorainy, P. Burnap, Han Liu, M. Williams","doi":"10.1145/3324997","DOIUrl":null,"url":null,"abstract":"Offensive or antagonistic language targeted at individuals and social groups based on their personal characteristics (also known as cyber hate speech or cyberhate) has been frequently posted and widely circulated via the World Wide Web. This can be considered as a key risk factor for individual and societal tension surrounding regional instability. Automated Web-based cyberhate detection is important for observing and understanding community and regional societal tension—especially in online social networks where posts can be rapidly and widely viewed and disseminated. While previous work has involved using lexicons, bags-of-words, or probabilistic language parsing approaches, they often suffer from a similar issue, which is that cyberhate can be subtle and indirect—thus, depending on the occurrence of individual words or phrases, can lead to a significant number of false negatives, providing inaccurate representation of the trends in cyberhate. This problem motivated us to challenge thinking around the representation of subtle language use, such as references to perceived threats from “the other” including immigration or job prosperity in a hateful context. We propose a novel “othering” feature set that utilizes language use around the concept of “othering” and intergroup threat theory to identify these subtleties, and we implement a wide range of classification methods using embedding learning to compute semantic distances between parts of speech considered to be part of an “othering” narrative. To validate our approach, we conducted two sets of experiments. The first involved comparing the results of our novel method with state-of-the-art baseline models from the literature. Our approach outperformed all existing methods. The second tested the best performing models from the first phase on unseen datasets for different types of cyberhate, namely religion, disability, race, and sexual orientation. The results showed F-measure scores for classifying hateful instances obtained through applying our model of 0.81, 0.71, 0.89, and 0.72, respectively, demonstrating the ability of the “othering” narrative to be an important part of model generalization.","PeriodicalId":39340,"journal":{"name":"NASSP Bulletin","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3324997","citationCount":"25","resultStr":"{\"title\":\"“The Enemy Among Us”\",\"authors\":\"Wafa Alorainy, P. Burnap, Han Liu, M. Williams\",\"doi\":\"10.1145/3324997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Offensive or antagonistic language targeted at individuals and social groups based on their personal characteristics (also known as cyber hate speech or cyberhate) has been frequently posted and widely circulated via the World Wide Web. This can be considered as a key risk factor for individual and societal tension surrounding regional instability. Automated Web-based cyberhate detection is important for observing and understanding community and regional societal tension—especially in online social networks where posts can be rapidly and widely viewed and disseminated. While previous work has involved using lexicons, bags-of-words, or probabilistic language parsing approaches, they often suffer from a similar issue, which is that cyberhate can be subtle and indirect—thus, depending on the occurrence of individual words or phrases, can lead to a significant number of false negatives, providing inaccurate representation of the trends in cyberhate. This problem motivated us to challenge thinking around the representation of subtle language use, such as references to perceived threats from “the other” including immigration or job prosperity in a hateful context. We propose a novel “othering” feature set that utilizes language use around the concept of “othering” and intergroup threat theory to identify these subtleties, and we implement a wide range of classification methods using embedding learning to compute semantic distances between parts of speech considered to be part of an “othering” narrative. To validate our approach, we conducted two sets of experiments. The first involved comparing the results of our novel method with state-of-the-art baseline models from the literature. Our approach outperformed all existing methods. The second tested the best performing models from the first phase on unseen datasets for different types of cyberhate, namely religion, disability, race, and sexual orientation. The results showed F-measure scores for classifying hateful instances obtained through applying our model of 0.81, 0.71, 0.89, and 0.72, respectively, demonstrating the ability of the “othering” narrative to be an important part of model generalization.\",\"PeriodicalId\":39340,\"journal\":{\"name\":\"NASSP Bulletin\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1145/3324997\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NASSP Bulletin\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3324997\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NASSP Bulletin","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3324997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 25

摘要

基于个人特征针对个人和社会群体的攻击性或对抗性语言(也称为网络仇恨言论或网络仇恨)经常在万维网上发布和广泛传播。这可以被视为围绕区域不稳定的个人和社会紧张局势的一个关键风险因素。基于网络的自动网络仇恨检测对于观察和理解社区和区域社会紧张局势非常重要,特别是在帖子可以迅速广泛浏览和传播的在线社交网络中。虽然以前的工作涉及使用词典、词袋或概率语言解析方法,但它们经常遇到类似的问题,即网络仇恨可能是微妙和间接的——因此,根据单个单词或短语的出现,可能导致大量假阴性,从而提供对网络仇恨趋势的不准确表示。这个问题促使我们挑战围绕微妙语言使用表现的思考,比如在一个充满仇恨的环境中提到来自“他者”的感知威胁,包括移民或就业繁荣。我们提出了一种新颖的“他者”特征集,它利用围绕“他者”概念的语言使用和群体间威胁理论来识别这些微妙之处,我们实现了广泛的分类方法,使用嵌入学习来计算被认为是“他者”叙事的一部分的词性之间的语义距离。为了验证我们的方法,我们进行了两组实验。第一个涉及将我们的新方法的结果与文献中最先进的基线模型进行比较。我们的方法优于所有现有的方法。第二阶段在不同类型的网络仇恨(即宗教、残疾、种族和性取向)的未知数据集上测试了第一阶段中表现最好的模型。结果显示,通过应用我们的模型获得的仇恨实例分类的f测量得分分别为0.81,0.71,0.89和0.72,表明“他者”叙事是模型泛化的重要组成部分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
“The Enemy Among Us”
Offensive or antagonistic language targeted at individuals and social groups based on their personal characteristics (also known as cyber hate speech or cyberhate) has been frequently posted and widely circulated via the World Wide Web. This can be considered as a key risk factor for individual and societal tension surrounding regional instability. Automated Web-based cyberhate detection is important for observing and understanding community and regional societal tension—especially in online social networks where posts can be rapidly and widely viewed and disseminated. While previous work has involved using lexicons, bags-of-words, or probabilistic language parsing approaches, they often suffer from a similar issue, which is that cyberhate can be subtle and indirect—thus, depending on the occurrence of individual words or phrases, can lead to a significant number of false negatives, providing inaccurate representation of the trends in cyberhate. This problem motivated us to challenge thinking around the representation of subtle language use, such as references to perceived threats from “the other” including immigration or job prosperity in a hateful context. We propose a novel “othering” feature set that utilizes language use around the concept of “othering” and intergroup threat theory to identify these subtleties, and we implement a wide range of classification methods using embedding learning to compute semantic distances between parts of speech considered to be part of an “othering” narrative. To validate our approach, we conducted two sets of experiments. The first involved comparing the results of our novel method with state-of-the-art baseline models from the literature. Our approach outperformed all existing methods. The second tested the best performing models from the first phase on unseen datasets for different types of cyberhate, namely religion, disability, race, and sexual orientation. The results showed F-measure scores for classifying hateful instances obtained through applying our model of 0.81, 0.71, 0.89, and 0.72, respectively, demonstrating the ability of the “othering” narrative to be an important part of model generalization.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
NASSP Bulletin
NASSP Bulletin Social Sciences-Education
CiteScore
1.40
自引率
0.00%
发文量
14
期刊最新文献
Principal and Assistant Principal Involvement in and Barriers to Supporting Secondary Transition for Students with Disabilities Leveraging Co-Teaching as a Social Justice Pedagogy: What School Leaders Need to Know Faculty and Administrator Attitudes Toward Internet-Based Technologies and Virtual Networking Leading to the Development and Support of Social Capital In this Issue… How Principals’ Knowledge and Experiences Inform Their Use of Restorative in Lieu of Exclusionary Practices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1