Manash Sarker, Md. Forhad Hossain, Fahmida Rahman Liza, S. N. Sakib, Abdullah Al Farooq
{"title":"分类社交媒体上反社会孟加拉语评论的机器学习方法","authors":"Manash Sarker, Md. Forhad Hossain, Fahmida Rahman Liza, S. N. Sakib, Abdullah Al Farooq","doi":"10.1109/icaeee54957.2022.9836407","DOIUrl":null,"url":null,"abstract":"The growth of social media is causing the emergence of hate speech. Email extortion and cyberbullying are on the rise in Bangladesh, along with online sexual harassment of women. In order to prevent these crimes, studies on Bengali comments on social media have become progressively important. However, the requisite datasets are scarce for this kind of study. The motive of this research is to create a dataset of Bangla comments from social platforms and develop a classifier model as well as to detect whether the comments are social or anti-social quickly and efficiently. 2000 comments were gathered from Facebook and YouTube, two prominent platforms for social media. In our study, an artificial neural network model like Gated Recurrent Unit (GRU), and supervised machine learning classifiers like Logistic Regression (LR), Random Forest (RF), Multinomial Naive Bayes (MNB), and Support Vector Machine (SVM) were utilized in our study to distinguish between anti-social and socially acceptable comments. Finally, language models such as unigrams, bigrams, and trigrams have been implemented in our research. To the best of our knowledge, there are no studies regarding the anti-social classification in Bangla language. This work will help to prevent anti-social activities in Bangla community.","PeriodicalId":383872,"journal":{"name":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Machine Learning Approach to Classify Anti-social Bengali Comments on Social Media\",\"authors\":\"Manash Sarker, Md. Forhad Hossain, Fahmida Rahman Liza, S. N. Sakib, Abdullah Al Farooq\",\"doi\":\"10.1109/icaeee54957.2022.9836407\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The growth of social media is causing the emergence of hate speech. Email extortion and cyberbullying are on the rise in Bangladesh, along with online sexual harassment of women. In order to prevent these crimes, studies on Bengali comments on social media have become progressively important. However, the requisite datasets are scarce for this kind of study. The motive of this research is to create a dataset of Bangla comments from social platforms and develop a classifier model as well as to detect whether the comments are social or anti-social quickly and efficiently. 2000 comments were gathered from Facebook and YouTube, two prominent platforms for social media. In our study, an artificial neural network model like Gated Recurrent Unit (GRU), and supervised machine learning classifiers like Logistic Regression (LR), Random Forest (RF), Multinomial Naive Bayes (MNB), and Support Vector Machine (SVM) were utilized in our study to distinguish between anti-social and socially acceptable comments. Finally, language models such as unigrams, bigrams, and trigrams have been implemented in our research. To the best of our knowledge, there are no studies regarding the anti-social classification in Bangla language. This work will help to prevent anti-social activities in Bangla community.\",\"PeriodicalId\":383872,\"journal\":{\"name\":\"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icaeee54957.2022.9836407\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaeee54957.2022.9836407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Machine Learning Approach to Classify Anti-social Bengali Comments on Social Media
The growth of social media is causing the emergence of hate speech. Email extortion and cyberbullying are on the rise in Bangladesh, along with online sexual harassment of women. In order to prevent these crimes, studies on Bengali comments on social media have become progressively important. However, the requisite datasets are scarce for this kind of study. The motive of this research is to create a dataset of Bangla comments from social platforms and develop a classifier model as well as to detect whether the comments are social or anti-social quickly and efficiently. 2000 comments were gathered from Facebook and YouTube, two prominent platforms for social media. In our study, an artificial neural network model like Gated Recurrent Unit (GRU), and supervised machine learning classifiers like Logistic Regression (LR), Random Forest (RF), Multinomial Naive Bayes (MNB), and Support Vector Machine (SVM) were utilized in our study to distinguish between anti-social and socially acceptable comments. Finally, language models such as unigrams, bigrams, and trigrams have been implemented in our research. To the best of our knowledge, there are no studies regarding the anti-social classification in Bangla language. This work will help to prevent anti-social activities in Bangla community.