Alwin T. Aind, Akashdeep Ramnaney, Divyashikha Sethia
{"title":"Q-Bully:基于强化学习的网络欺凌检测框架","authors":"Alwin T. Aind, Akashdeep Ramnaney, Divyashikha Sethia","doi":"10.1109/incet49848.2020.9154092","DOIUrl":null,"url":null,"abstract":"With the ever-increasing involvement of people into social media and online gaming, Cyberbullying has become a serious issue affecting almost all parts of the demographic. Cyberbullying can cause severe mental and emotional impacts on a person, especially on minors; hence, there is a requirement of having intelligent automated systems to detect questionable content present on social media platforms and remove it. In this paper, we introduce our novel algorithm Q-Bully which can automatically detect cyberbullying on various social media and online gaming platforms using Reinforcement Learning along with Natural Language Processing techniques. Previously the techniques used to detect cyberbullying have a good accuracy related to the text they have been trained on and do not incorporate new word patterns without complete retraining of model. In this paper, we incorporated the use of Reinforcement Learning and have conducted an experimental study in which we feed the messages and posts of bullies as well as victims to a Reinforcement Learning Agent for classification. We compare our model with the other baseline models on based on F1 scores (0.86 a benchmark dataset of 16K annotated tweets) and are able to infer that our model outperforms other state-of-the-art models when the dataset is highly dynamic and populated with words which are deliberately misspelled to trick the conventional detection systems.","PeriodicalId":174411,"journal":{"name":"2020 International Conference for Emerging Technology (INCET)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Q-Bully: A Reinforcement Learning based Cyberbullying Detection Framework\",\"authors\":\"Alwin T. Aind, Akashdeep Ramnaney, Divyashikha Sethia\",\"doi\":\"10.1109/incet49848.2020.9154092\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the ever-increasing involvement of people into social media and online gaming, Cyberbullying has become a serious issue affecting almost all parts of the demographic. Cyberbullying can cause severe mental and emotional impacts on a person, especially on minors; hence, there is a requirement of having intelligent automated systems to detect questionable content present on social media platforms and remove it. In this paper, we introduce our novel algorithm Q-Bully which can automatically detect cyberbullying on various social media and online gaming platforms using Reinforcement Learning along with Natural Language Processing techniques. Previously the techniques used to detect cyberbullying have a good accuracy related to the text they have been trained on and do not incorporate new word patterns without complete retraining of model. In this paper, we incorporated the use of Reinforcement Learning and have conducted an experimental study in which we feed the messages and posts of bullies as well as victims to a Reinforcement Learning Agent for classification. We compare our model with the other baseline models on based on F1 scores (0.86 a benchmark dataset of 16K annotated tweets) and are able to infer that our model outperforms other state-of-the-art models when the dataset is highly dynamic and populated with words which are deliberately misspelled to trick the conventional detection systems.\",\"PeriodicalId\":174411,\"journal\":{\"name\":\"2020 International Conference for Emerging Technology (INCET)\",\"volume\":\"105 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/incet49848.2020.9154092\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/incet49848.2020.9154092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Q-Bully: A Reinforcement Learning based Cyberbullying Detection Framework
With the ever-increasing involvement of people into social media and online gaming, Cyberbullying has become a serious issue affecting almost all parts of the demographic. Cyberbullying can cause severe mental and emotional impacts on a person, especially on minors; hence, there is a requirement of having intelligent automated systems to detect questionable content present on social media platforms and remove it. In this paper, we introduce our novel algorithm Q-Bully which can automatically detect cyberbullying on various social media and online gaming platforms using Reinforcement Learning along with Natural Language Processing techniques. Previously the techniques used to detect cyberbullying have a good accuracy related to the text they have been trained on and do not incorporate new word patterns without complete retraining of model. In this paper, we incorporated the use of Reinforcement Learning and have conducted an experimental study in which we feed the messages and posts of bullies as well as victims to a Reinforcement Learning Agent for classification. We compare our model with the other baseline models on based on F1 scores (0.86 a benchmark dataset of 16K annotated tweets) and are able to infer that our model outperforms other state-of-the-art models when the dataset is highly dynamic and populated with words which are deliberately misspelled to trick the conventional detection systems.