{"title":"Covid-19大流行对经济的影响:使用Naïve贝叶斯分类器和支持向量机对Twitter的情绪分析","authors":"Qurrotul Aini, Raffie Rizky Fauzi, Eva Khudzaeva","doi":"10.30630/joiv.7.3.1474","DOIUrl":null,"url":null,"abstract":"Covid-19 is an outbreak caused by severe acute respiratory syndrome. Covid-19 first appeared in Indonesia on March 2, 2020, with two confirmed cases and increased to 1285 cases in 30 provinces. One of the impacts of the Covid-19 pandemic is on the economic aspect, which has experienced a drastic decline in income. This study aims to classify public opinion to determine the level of public sentiment on the economic impact of the Covid-19 pandemic and to identify parameters that influence the accuracy of the sentiment analysis classification model. The methods used in this current research are Lexicon, Support Vector Machine (SVM), and Naive Bayes Classifier (NBC). First, Lexicon is used for scoring and labeling the preprocessed data. Second, SVM is used to classify the sentiment, then find the best accuracy using linear, radial, polynomial, and sigmoid kernels. Third, NBC is used to classify sentiment as a comparison method. The results indicated that 255 tweet data consisted of 44 positive tweets (17.25%), 46 neutral tweets (18.04%), and 165 negative tweets (64.71%). Therefore, it can be inferred that the economic impact on the Indonesian people due to the Covid-19 pandemic has a high negative sentiment value. In the performance, SVM yielded a better accuracy of 100%, precision, recall, and F-measure are 1. This study proves that selecting the kernel type and applying underfitting can improve the accuracy of SVM. Also, SVM can perform well on a small amount of training data.","PeriodicalId":32468,"journal":{"name":"JOIV International Journal on Informatics Visualization","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Economic Impact due Covid-19 Pandemic: Sentiment Analysis on Twitter Using Naïve Bayes Classifier and Support Vector Machine\",\"authors\":\"Qurrotul Aini, Raffie Rizky Fauzi, Eva Khudzaeva\",\"doi\":\"10.30630/joiv.7.3.1474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Covid-19 is an outbreak caused by severe acute respiratory syndrome. Covid-19 first appeared in Indonesia on March 2, 2020, with two confirmed cases and increased to 1285 cases in 30 provinces. One of the impacts of the Covid-19 pandemic is on the economic aspect, which has experienced a drastic decline in income. This study aims to classify public opinion to determine the level of public sentiment on the economic impact of the Covid-19 pandemic and to identify parameters that influence the accuracy of the sentiment analysis classification model. The methods used in this current research are Lexicon, Support Vector Machine (SVM), and Naive Bayes Classifier (NBC). First, Lexicon is used for scoring and labeling the preprocessed data. Second, SVM is used to classify the sentiment, then find the best accuracy using linear, radial, polynomial, and sigmoid kernels. Third, NBC is used to classify sentiment as a comparison method. The results indicated that 255 tweet data consisted of 44 positive tweets (17.25%), 46 neutral tweets (18.04%), and 165 negative tweets (64.71%). Therefore, it can be inferred that the economic impact on the Indonesian people due to the Covid-19 pandemic has a high negative sentiment value. In the performance, SVM yielded a better accuracy of 100%, precision, recall, and F-measure are 1. This study proves that selecting the kernel type and applying underfitting can improve the accuracy of SVM. Also, SVM can perform well on a small amount of training data.\",\"PeriodicalId\":32468,\"journal\":{\"name\":\"JOIV International Journal on Informatics Visualization\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JOIV International Journal on Informatics Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30630/joiv.7.3.1474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JOIV International Journal on Informatics Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30630/joiv.7.3.1474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Decision Sciences","Score":null,"Total":0}
Economic Impact due Covid-19 Pandemic: Sentiment Analysis on Twitter Using Naïve Bayes Classifier and Support Vector Machine
Covid-19 is an outbreak caused by severe acute respiratory syndrome. Covid-19 first appeared in Indonesia on March 2, 2020, with two confirmed cases and increased to 1285 cases in 30 provinces. One of the impacts of the Covid-19 pandemic is on the economic aspect, which has experienced a drastic decline in income. This study aims to classify public opinion to determine the level of public sentiment on the economic impact of the Covid-19 pandemic and to identify parameters that influence the accuracy of the sentiment analysis classification model. The methods used in this current research are Lexicon, Support Vector Machine (SVM), and Naive Bayes Classifier (NBC). First, Lexicon is used for scoring and labeling the preprocessed data. Second, SVM is used to classify the sentiment, then find the best accuracy using linear, radial, polynomial, and sigmoid kernels. Third, NBC is used to classify sentiment as a comparison method. The results indicated that 255 tweet data consisted of 44 positive tweets (17.25%), 46 neutral tweets (18.04%), and 165 negative tweets (64.71%). Therefore, it can be inferred that the economic impact on the Indonesian people due to the Covid-19 pandemic has a high negative sentiment value. In the performance, SVM yielded a better accuracy of 100%, precision, recall, and F-measure are 1. This study proves that selecting the kernel type and applying underfitting can improve the accuracy of SVM. Also, SVM can perform well on a small amount of training data.