Ayu Mutiara Sari, Nurul Fajrin Ariyani, A. Ahmadiyah
Published in: 2021 13th International Conference on Information & Communication Technology and System (ICTS), pp. 336-341, 2021-10-20. DOI: https://doi.org/10.1109/ICTS52701.2021.9607996
Evaluating The Preliminary Models to Identify Fake News on COVID-19 Tweets
The spread of fake news about COVID-19 can make the pandemic harder to handle. Fake and real news on social media needs to be identified as quickly as possible to prevent chaos in the community and to avoid hampering the COVID-19 response. In this study, we conducted experiments to find a model that works well for classifying tweets as fake or real news. We implemented two ways of representing the data used to train machine learning classifiers: syntactic representations using Bag-of-Words and TF-IDF, and semantic representations using Word2Vec and FastText. We evaluated each trained model on two sets of testing data. The results show that the Linear Support Vector Machine model using TF-IDF obtained the best F1-Score on both sets: 92.21% on Testing Data 1 and 93.33% on Testing Data 2.
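To make the syntactic representations and the evaluation metric concrete, the following is a minimal pure-Python sketch. It is not the authors' pipeline. The whitespace tokenizer, the toy tweets, the unsmoothed weighting tf * ln(N / df), and the choice of "fake" as the positive class are all assumptions made for illustration; the abstract does not state the preprocessing, the TF-IDF variant, or how the F1-Score was averaged, and library implementations typically use a smoothed IDF.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split; real tweet cleaning would do more.
    return text.lower().split()


def bag_of_words(docs):
    """Return (vocabulary, raw term-count vectors), one vector per document."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({tok for doc in tokenized for tok in doc})
    vectors = []
    for doc in tokenized:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def tf_idf(docs):
    """Return (vocabulary, TF-IDF vectors) using the weight tf * ln(N / df)."""
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: how many documents contain each vocabulary term.
    df = [sum(1 for vec in counts if vec[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    return vocab, [[c * w for c, w in zip(vec, idf)] for vec in counts]


def f1_score(y_true, y_pred, positive="fake"):
    """F1 for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy tweets, not taken from the study's data.
tweets = [
    "vaccine causes covid",
    "covid vaccine is safe",
    "masks reduce covid spread",
]
vocab, bow_vectors = bag_of_words(tweets)
_, tfidf_vectors = tf_idf(tweets)

# Hypothetical labels and predictions, only to exercise the metric.
y_true = ["fake", "real", "fake", "real"]
y_pred = ["fake", "fake", "fake", "real"]
score = f1_score(y_true, y_pred)
```

In this toy corpus the word "covid" occurs in every tweet, so its TF-IDF weight is zero while its Bag-of-Words count stays at one. That down-weighting of ubiquitous terms is the practical difference between the two syntactic representations; a classifier such as a linear SVM would then be trained on these vectors.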