Myron Darrel L. Montefalcon, Jay Rhald Padilla, J. Paulino, Jeline G. Go, Ramon Llabanes Rodriguez, Joseph Marvin Imperial
{"title":"Understanding Facial Expression Expressing Hate from Online Short-form Videos","authors":"Myron Darrel L. Montefalcon, Jay Rhald Padilla, J. Paulino, Jeline G. Go, Ramon Llabanes Rodriguez, Joseph Marvin Imperial","doi":"10.1145/3485768.3485785","DOIUrl":null,"url":null,"abstract":"The impact of hate speech is not only detrimental to an individual's human rights; but also, a grave threat to social stability and democracy. Through social media, the spread of hate speech has alarmingly increased across the globe. Various social media platform's goal is to eliminate hateful content and this challenge poses the need for automatic and accurate hate speech detection. Presently, known techniques in this research primarily made use of either text or audio features. However, the use of the facial expression in hate speech detection is not that explored. Thus, for this study, the use of facial expressions to understand hate speech has been thoroughly investigated. The dataset used is image data generated from Filipino Tiktok videos with a frame size of 1080 x 1920 pixels and divided into 5 frames per second. Two approaches namely conventional and deep learning-based frameworks have been implemented in building the Facial Expression Recognition (FER) model to understand hate speech. Based on the experimentation, the conventional approach using the Random Forest approach has achieved the best performance with 86.9% training accuracy and 84.8% validation accuracy, outperforming the other conventional classifiers and the DL-based approach significantly. For future direction, facial expression features combined with text or audio input type will be implemented to examine whether the use of facial expression can complement or improve hate speech detection models.","PeriodicalId":328771,"journal":{"name":"2021 5th International Conference on E-Society, E-Education and E-Technology","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th International Conference on E-Society, E-Education and E-Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3485768.3485785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The impact of hate speech is not only detrimental to an individual's human rights; but also, a grave threat to social stability and democracy. Through social media, the spread of hate speech has alarmingly increased across the globe. Various social media platform's goal is to eliminate hateful content and this challenge poses the need for automatic and accurate hate speech detection. Presently, known techniques in this research primarily made use of either text or audio features. However, the use of the facial expression in hate speech detection is not that explored. Thus, for this study, the use of facial expressions to understand hate speech has been thoroughly investigated. The dataset used is image data generated from Filipino Tiktok videos with a frame size of 1080 x 1920 pixels and divided into 5 frames per second. Two approaches namely conventional and deep learning-based frameworks have been implemented in building the Facial Expression Recognition (FER) model to understand hate speech. Based on the experimentation, the conventional approach using the Random Forest approach has achieved the best performance with 86.9% training accuracy and 84.8% validation accuracy, outperforming the other conventional classifiers and the DL-based approach significantly. For future direction, facial expression features combined with text or audio input type will be implemented to examine whether the use of facial expression can complement or improve hate speech detection models.
仇恨言论的影响不仅损害个人的人权;但同时也是对社会稳定和民主的严重威胁。通过社交媒体,仇恨言论在全球范围内的传播速度惊人地增加。各种社交媒体平台的目标是消除仇恨内容,这一挑战提出了对自动准确的仇恨言论检测的需求。目前,在这项研究中已知的技术主要是利用文本或音频特征。然而,面部表情在仇恨言论检测中的应用并没有得到深入的探讨。因此,在这项研究中,使用面部表情来理解仇恨言论已经得到了彻底的研究。使用的数据集是从菲律宾抖音视频中生成的图像数据,帧大小为1080 x 1920像素,每秒分为5帧。在构建面部表情识别(FER)模型以理解仇恨言论方面,采用了传统的和基于深度学习的两种方法。实验结果表明,采用随机森林方法的传统分类器的训练准确率为86.9%,验证准确率为84.8%,显著优于其他传统分类器和基于dl的分类器。对于未来的方向,将实现与文本或音频输入类型相结合的面部表情特征,以研究使用面部表情是否可以补充或改进仇恨言论检测模型。