Khin Me Me Chit, Yi Yi Chan Myae Win Shein, Wai Yan, A. Khine
{"title":"警笛!在社交媒体上发现缅甸人的仇恨言论评论","authors":"Khin Me Me Chit, Yi Yi Chan Myae Win Shein, Wai Yan, A. Khine","doi":"10.1109/KST53302.2022.9729075","DOIUrl":null,"url":null,"abstract":"Hate Speech on Social Media is definitely an evolving threat for every nation, especially for countries like Myanmar. Lack of media and digital literacy is playing a huge role in making people insult to each other or misallocating their stresses to others without physical encounter. Moreover, disingenuous politicians fuel online hate speech campaigns backstage of the elections by targeting different religions in the regard of heretics and using racialism. To emphasize this matter, we scraped over 16,000 social media comments from the most popular social media platform in Myanmar and performed hate-speech research using those samples. With the precise definition of a hate speech labelling guideline, annotation on the sample dataset was done systematically and efficiently. Experiments and evaluations were conducted using different linear and non-linear deep-learning classification models. Performances of the models are at the peak in Logistic Regression among linear models with 0.8974 AUC score and XLM-RoBERTa among deep learning models with 0.8958 AUC score on the test dataset. We observed that it is more advantageous to use linear models on our dataset since they achieved comparable results to the deep learning models and have much lower computational cost.","PeriodicalId":433638,"journal":{"name":"2022 14th International Conference on Knowledge and Smart Technology (KST)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SIREN! Detecting Burmese Hate Speech Comments on Social Media\",\"authors\":\"Khin Me Me Chit, Yi Yi Chan Myae Win Shein, Wai Yan, A. Khine\",\"doi\":\"10.1109/KST53302.2022.9729075\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hate Speech on Social Media is definitely an evolving threat for every nation, especially for countries like Myanmar. Lack of media and digital literacy is playing a huge role in making people insult to each other or misallocating their stresses to others without physical encounter. Moreover, disingenuous politicians fuel online hate speech campaigns backstage of the elections by targeting different religions in the regard of heretics and using racialism. To emphasize this matter, we scraped over 16,000 social media comments from the most popular social media platform in Myanmar and performed hate-speech research using those samples. With the precise definition of a hate speech labelling guideline, annotation on the sample dataset was done systematically and efficiently. Experiments and evaluations were conducted using different linear and non-linear deep-learning classification models. Performances of the models are at the peak in Logistic Regression among linear models with 0.8974 AUC score and XLM-RoBERTa among deep learning models with 0.8958 AUC score on the test dataset. We observed that it is more advantageous to use linear models on our dataset since they achieved comparable results to the deep learning models and have much lower computational cost.\",\"PeriodicalId\":433638,\"journal\":{\"name\":\"2022 14th International Conference on Knowledge and Smart Technology (KST)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Knowledge and Smart Technology (KST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KST53302.2022.9729075\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Knowledge and Smart Technology (KST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KST53302.2022.9729075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SIREN! Detecting Burmese Hate Speech Comments on Social Media
Hate Speech on Social Media is definitely an evolving threat for every nation, especially for countries like Myanmar. Lack of media and digital literacy is playing a huge role in making people insult to each other or misallocating their stresses to others without physical encounter. Moreover, disingenuous politicians fuel online hate speech campaigns backstage of the elections by targeting different religions in the regard of heretics and using racialism. To emphasize this matter, we scraped over 16,000 social media comments from the most popular social media platform in Myanmar and performed hate-speech research using those samples. With the precise definition of a hate speech labelling guideline, annotation on the sample dataset was done systematically and efficiently. Experiments and evaluations were conducted using different linear and non-linear deep-learning classification models. Performances of the models are at the peak in Logistic Regression among linear models with 0.8974 AUC score and XLM-RoBERTa among deep learning models with 0.8958 AUC score on the test dataset. We observed that it is more advantageous to use linear models on our dataset since they achieved comparable results to the deep learning models and have much lower computational cost.