{"title":"基于平台不可知模型的社交媒体恐华症检测","authors":"Matthew Morgan, Adita Kulkarni","doi":"10.1145/3564746.3587024","DOIUrl":null,"url":null,"abstract":"Although the boom of social media in the past decade has enabled the creation, distribution, and consumption of information at a remarkable rate, it has also led to the growth of different forms of online abuse. Since the outbreak of COVID-19, hate against Chinese or Sinophobia has increased significantly in real world as well as on online platforms making it necessary to design ways to combat it. In this paper, we design a platform-agnostic model to detect Sinophobic content on social media websites automatically. We use pre-trained word embeddings with several machine learning classifiers to detect Sinophobia on three platforms---Parler, Reddit, and Twitter. Our results demonstrate that the BERT model shows the best performance among all the models by achieving an accuracy of 98.51% on Parler, 95.36% on Reddit, and 88.12% on Twitter datasets.","PeriodicalId":322431,"journal":{"name":"Proceedings of the 2023 ACM Southeast Conference","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Platform-agnostic Model to Detect Sinophobia on Social Media\",\"authors\":\"Matthew Morgan, Adita Kulkarni\",\"doi\":\"10.1145/3564746.3587024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although the boom of social media in the past decade has enabled the creation, distribution, and consumption of information at a remarkable rate, it has also led to the growth of different forms of online abuse. Since the outbreak of COVID-19, hate against Chinese or Sinophobia has increased significantly in real world as well as on online platforms making it necessary to design ways to combat it. In this paper, we design a platform-agnostic model to detect Sinophobic content on social media websites automatically. We use pre-trained word embeddings with several machine learning classifiers to detect Sinophobia on three platforms---Parler, Reddit, and Twitter. Our results demonstrate that the BERT model shows the best performance among all the models by achieving an accuracy of 98.51% on Parler, 95.36% on Reddit, and 88.12% on Twitter datasets.\",\"PeriodicalId\":322431,\"journal\":{\"name\":\"Proceedings of the 2023 ACM Southeast Conference\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 ACM Southeast Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3564746.3587024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 ACM Southeast Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3564746.3587024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Platform-agnostic Model to Detect Sinophobia on Social Media
Although the boom of social media in the past decade has enabled the creation, distribution, and consumption of information at a remarkable rate, it has also led to the growth of different forms of online abuse. Since the outbreak of COVID-19, hate against Chinese or Sinophobia has increased significantly in real world as well as on online platforms making it necessary to design ways to combat it. In this paper, we design a platform-agnostic model to detect Sinophobic content on social media websites automatically. We use pre-trained word embeddings with several machine learning classifiers to detect Sinophobia on three platforms---Parler, Reddit, and Twitter. Our results demonstrate that the BERT model shows the best performance among all the models by achieving an accuracy of 98.51% on Parler, 95.36% on Reddit, and 88.12% on Twitter datasets.