Modeling Temporal Patterns of Cyberbullying Detection with Hierarchical Attention Networks

ACM/IMS transactions on data science Pub Date : 2021-04-02 DOI:10.1145/3441141

Lu Cheng, Ruocheng Guo, Yasin N. Silva, Deborah L. Hall, Huan Liu

{"title":"Modeling Temporal Patterns of Cyberbullying Detection with Hierarchical Attention Networks","authors":"Lu Cheng, Ruocheng Guo, Yasin N. Silva, Deborah L. Hall, Huan Liu","doi":"10.1145/3441141","DOIUrl":null,"url":null,"abstract":"Cyberbullying is rapidly becoming one of the most serious online risks for adolescents. This has motivated work on machine learning methods to automate the process of cyberbullying detection, which have so far mostly viewed cyberbullying as one-off incidents that occur at a single point in time. Comparatively less is known about how cyberbullying behavior occurs and evolves over time. This oversight highlights a crucial open challenge for cyberbullying-related research, given that cyberbullying is typically defined as intentional acts of aggression via electronic communication that occur repeatedly and persistently. In this article, we center our discussion on the challenge of modeling temporal patterns of cyberbullying behavior. Specifically, we investigate how temporal information within a social media session, which has an inherently hierarchical structure (e.g., words form a comment and comments form a session), can be leveraged to facilitate cyberbullying detection. Recent findings from interdisciplinary research suggest that the temporal characteristics of bullying sessions differ from those of non-bullying sessions and that the temporal information from users’ comments can improve cyberbullying detection. The proposed framework consists of three distinctive features: (1) a hierarchical structure that reflects how a social media session is formed in a bottom-up manner; (2) attention mechanisms applied at the word- and comment-level to differentiate the contributions of words and comments to the representation of a social media session; and (3) the incorporation of temporal features in modeling cyberbullying behavior at the comment-level. Quantitative and qualitative evaluations are conducted on a real-world dataset collected from Instagram, the social networking site with the highest percentage of users reporting cyberbullying experiences. Results from empirical evaluations show the significance of the proposed methods, which are tailored to capture temporal patterns of cyberbullying detection.","PeriodicalId":93404,"journal":{"name":"ACM/IMS transactions on data science","volume":"2 1","pages":"1 - 23"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3441141","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IMS transactions on data science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3441141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

Cyberbullying is rapidly becoming one of the most serious online risks for adolescents. This has motivated work on machine learning methods to automate the process of cyberbullying detection, which have so far mostly viewed cyberbullying as one-off incidents that occur at a single point in time. Comparatively less is known about how cyberbullying behavior occurs and evolves over time. This oversight highlights a crucial open challenge for cyberbullying-related research, given that cyberbullying is typically defined as intentional acts of aggression via electronic communication that occur repeatedly and persistently. In this article, we center our discussion on the challenge of modeling temporal patterns of cyberbullying behavior. Specifically, we investigate how temporal information within a social media session, which has an inherently hierarchical structure (e.g., words form a comment and comments form a session), can be leveraged to facilitate cyberbullying detection. Recent findings from interdisciplinary research suggest that the temporal characteristics of bullying sessions differ from those of non-bullying sessions and that the temporal information from users’ comments can improve cyberbullying detection. The proposed framework consists of three distinctive features: (1) a hierarchical structure that reflects how a social media session is formed in a bottom-up manner; (2) attention mechanisms applied at the word- and comment-level to differentiate the contributions of words and comments to the representation of a social media session; and (3) the incorporation of temporal features in modeling cyberbullying behavior at the comment-level. Quantitative and qualitative evaluations are conducted on a real-world dataset collected from Instagram, the social networking site with the highest percentage of users reporting cyberbullying experiences. Results from empirical evaluations show the significance of the proposed methods, which are tailored to capture temporal patterns of cyberbullying detection.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用层次注意网络建模网络欺凌检测的时间模式

网络欺凌正迅速成为青少年最严重的网络风险之一。这推动了机器学习方法的研究，使网络欺凌检测过程自动化，到目前为止，这些方法大多将网络欺凌视为在单个时间点发生的一次性事件。相对而言，人们对网络欺凌行为是如何随着时间的推移而发生和演变的知之甚少。这一监督突显了网络欺凌相关研究面临的一个关键的公开挑战，因为网络欺凌通常被定义为通过电子通信反复持续发生的故意攻击行为。在这篇文章中，我们集中讨论了网络欺凌行为时间模式建模的挑战。具体而言，我们研究了社交媒体会话中的时间信息如何被利用来促进网络欺凌检测，社交媒体会话具有固有的层次结构（例如，单词形成评论，评论形成会话）。跨学科研究的最新发现表明，欺凌会话的时间特征与非欺凌会话不同，来自用户评论的时间信息可以提高网络欺凌检测。所提出的框架由三个显著特征组成：（1）层次结构，反映了社交媒体会话是如何以自下而上的方式形成的；（2）在单词和评论层面应用的注意力机制，以区分单词和评论对社交媒体会话表现的贡献；以及（3）将时间特征纳入评论层面的网络欺凌行为建模中。定量和定性评估是在从Instagram收集的真实世界数据集上进行的，Instagram是报告网络欺凌经历的用户比例最高的社交网站。实证评估的结果表明了所提出的方法的重要性，这些方法是为捕捉网络欺凌检测的时间模式而定制的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM/IMS transactions on data science

自引率

0.00%

发文量