M. A. Gumilang, T. D. Puspitasari, Hermawan Arief Putranto, Abdul Kholiq, A. Samsudin
{"title":"Sentiment Analysis Based on Tweet Reply at Public Figure Account using Machine Learning and Latent Semantic Analysis","authors":"M. A. Gumilang, T. D. Puspitasari, Hermawan Arief Putranto, Abdul Kholiq, A. Samsudin","doi":"10.1109/ICST56971.2022.10136288","DOIUrl":null,"url":null,"abstract":"Twitter is a social media platform that enables its user to communicate or see various running events online. It also enables everyone to share information. Public figures, such as celebrities, artists, or politicians often dominate the talk and trending topics. The pros and cons among the public figure accounts cause either positive or negative sentiments. Thus this condition urges a system that can classify each of the user's replies to a public figure account as a consideration to change to a better communication pattern. Some of the possible methods are the naive Bayes, SVM, and logistic regression, all of these methods are combined with the Latent Semantic Analysis (LSA). The classification system will be based on 1500 dataset that has been labeled and divided into 80% training data and 20% testing data. The result of the confusion matrix showed the highest accuracy for SVM 80.4%, logistic regression 80.6%, and multinomial Naive Bayes 78.6%.","PeriodicalId":277761,"journal":{"name":"2022 8th International Conference on Science and Technology (ICST)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Science and Technology (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST56971.2022.10136288","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Twitter is a social media platform that enables its user to communicate or see various running events online. It also enables everyone to share information. Public figures, such as celebrities, artists, or politicians often dominate the talk and trending topics. The pros and cons among the public figure accounts cause either positive or negative sentiments. Thus this condition urges a system that can classify each of the user's replies to a public figure account as a consideration to change to a better communication pattern. Some of the possible methods are the naive Bayes, SVM, and logistic regression, all of these methods are combined with the Latent Semantic Analysis (LSA). The classification system will be based on 1500 dataset that has been labeled and divided into 80% training data and 20% testing data. The result of the confusion matrix showed the highest accuracy for SVM 80.4%, logistic regression 80.6%, and multinomial Naive Bayes 78.6%.