{"title":"基于方面的情感分析中的不平衡数据集对游戏《源氏物语》影响的评论","authors":"Prabowo Adi Perwira, Nelly Indriani Widiastuti","doi":"10.20895/infotel.v16i1.984","DOIUrl":null,"url":null,"abstract":"Sentiment analysis was commonly used to determine the polarity of the review text. However, there is a problem if some reviews have more than one aspect with different polarities, so the reviews have more than one polarity. That has happened in some reviews on the game Genshin Impact. Not merely that, the number of sentiments contained in a review is not always the same as other reviews will cause imbalanced data. So, this study will handle imbalance data with Random Under-Sampling and Random Over-Sampling on aspect-based-sentiment-analysis of Genshin Impact Review with Multinomial Naïve-Bayes, so that the classification prediction does not ignore the minority class due to the dominance of the majority class. The classification process used K-Fold Cross Validation (k=10) validation method and the Laplace smoothing technique on Multinomial Naïve Bayes. As a result, the conclusion is that Random Oversampling had better accuracy than Random Undersampling in handling imbalanced data on aspect-based sentiment analysis of Genshin Impact game Review in Indonesian with Naïve Bayes Multinomial, with the highest accuracy of 85.55%.","PeriodicalId":30672,"journal":{"name":"Jurnal Infotel","volume":"24 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Imbalance Dataset in Aspect-Based Sentiment Analysis on Game Genshin Impact Review\",\"authors\":\"Prabowo Adi Perwira, Nelly Indriani Widiastuti\",\"doi\":\"10.20895/infotel.v16i1.984\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sentiment analysis was commonly used to determine the polarity of the review text. However, there is a problem if some reviews have more than one aspect with different polarities, so the reviews have more than one polarity. That has happened in some reviews on the game Genshin Impact. Not merely that, the number of sentiments contained in a review is not always the same as other reviews will cause imbalanced data. So, this study will handle imbalance data with Random Under-Sampling and Random Over-Sampling on aspect-based-sentiment-analysis of Genshin Impact Review with Multinomial Naïve-Bayes, so that the classification prediction does not ignore the minority class due to the dominance of the majority class. The classification process used K-Fold Cross Validation (k=10) validation method and the Laplace smoothing technique on Multinomial Naïve Bayes. As a result, the conclusion is that Random Oversampling had better accuracy than Random Undersampling in handling imbalanced data on aspect-based sentiment analysis of Genshin Impact game Review in Indonesian with Naïve Bayes Multinomial, with the highest accuracy of 85.55%.\",\"PeriodicalId\":30672,\"journal\":{\"name\":\"Jurnal Infotel\",\"volume\":\"24 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Infotel\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20895/infotel.v16i1.984\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Infotel","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20895/infotel.v16i1.984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Imbalance Dataset in Aspect-Based Sentiment Analysis on Game Genshin Impact Review
Sentiment analysis was commonly used to determine the polarity of the review text. However, there is a problem if some reviews have more than one aspect with different polarities, so the reviews have more than one polarity. That has happened in some reviews on the game Genshin Impact. Not merely that, the number of sentiments contained in a review is not always the same as other reviews will cause imbalanced data. So, this study will handle imbalance data with Random Under-Sampling and Random Over-Sampling on aspect-based-sentiment-analysis of Genshin Impact Review with Multinomial Naïve-Bayes, so that the classification prediction does not ignore the minority class due to the dominance of the majority class. The classification process used K-Fold Cross Validation (k=10) validation method and the Laplace smoothing technique on Multinomial Naïve Bayes. As a result, the conclusion is that Random Oversampling had better accuracy than Random Undersampling in handling imbalanced data on aspect-based sentiment analysis of Genshin Impact game Review in Indonesian with Naïve Bayes Multinomial, with the highest accuracy of 85.55%.