A Framework for Early Detection of Cyberbullying in Chinese-English Code-Mixed Social Media Text Using Natural Language Processing and Machine Learning
Carlin Chun-Fai Chu, Raymond So, Simon Siu-Wai Li, Ernest Kan-Lam Kwong, Chun-Hung Chiu
{"title":"A Framework for Early Detection of Cyberbullying in Chinese-English Code-Mixed Social Media Text Using Natural Language Processing and Machine Learning","authors":"Carlin Chun-Fai Chu, Raymond So, Simon Siu-Wai Li, Ernest Kan-Lam Kwong, Chun-Hung Chiu","doi":"10.1109/ICNLP58431.2023.00061","DOIUrl":null,"url":null,"abstract":"This study develops a new expert system framework to address the issue of early detection of cyberbullying incidents in Chinese-English code-mixed language on social media networks. The framework covers the crawling of session-based social media texts with potential cyberbullying messages with a crowdsourcing web application to systematically retrieve and manually annotate a cyberbullying dataset, and most importantly establishes an explainable artificial intelligence model based on natural language processing algorithm for identification of targeted emotional colloquial slang phrases and machine learning method using Shapley value and transfer learning approach for automatic early detection of cyberbullying incidents in Chinese-English codemixed language.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"3 1","pages":"298-302"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Icon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNLP58431.2023.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 0
Abstract
This study develops a new expert system framework to address the issue of early detection of cyberbullying incidents in Chinese-English code-mixed language on social media networks. The framework covers the crawling of session-based social media texts with potential cyberbullying messages with a crowdsourcing web application to systematically retrieve and manually annotate a cyberbullying dataset, and most importantly establishes an explainable artificial intelligence model based on natural language processing algorithm for identification of targeted emotional colloquial slang phrases and machine learning method using Shapley value and transfer learning approach for automatic early detection of cyberbullying incidents in Chinese-English codemixed language.