{"title":"Using Cohesion-Based and Sentiment-Based Attributes to Classify Spoilers in Movie Reviews","authors":"Rangsipan Marukatat","doi":"10.1109/IC2IE56416.2022.9970137","DOIUrl":null,"url":null,"abstract":"Spoiler reviews have different narrative patterns from non-spoiler reviews. Their narrative is more precise about what happened in the movies, while that of non-spoiler reviews is more obscure due to the omission of specific details. Our research extracted 108 cohesion-based and 6 sentiment-based attributes from movie reviews, which captured these patterns. The classification was done using Naive Bayes and a support vector machine (SVM) with a linear kernel. SVM achieved the best performance of 78% accuracy and 0.78 F -measure of class spoiler. Most contributing attributes were also determined from the weight vector given by the SVM. They supported our initial observation about the differences in narrative patterns between spoilers and non-spoilers.","PeriodicalId":151165,"journal":{"name":"2022 5th International Conference of Computer and Informatics Engineering (IC2IE)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference of Computer and Informatics Engineering (IC2IE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC2IE56416.2022.9970137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Spoiler reviews have different narrative patterns from non-spoiler reviews. Their narrative is more precise about what happened in the movies, while that of non-spoiler reviews is more obscure due to the omission of specific details. Our research extracted 108 cohesion-based and 6 sentiment-based attributes from movie reviews, which captured these patterns. The classification was done using Naive Bayes and a support vector machine (SVM) with a linear kernel. SVM achieved the best performance of 78% accuracy and 0.78 F -measure of class spoiler. Most contributing attributes were also determined from the weight vector given by the SVM. They supported our initial observation about the differences in narrative patterns between spoilers and non-spoilers.
剧透评论与非剧透评论有着不同的叙事模式。他们的叙述对电影中发生的事情更加精确,而非剧透评论由于遗漏了具体细节而更加模糊。我们的研究从电影评论中提取了108个基于凝聚力和6个基于情感的属性,这些属性捕捉到了这些模式。使用朴素贝叶斯和线性核支持向量机(SVM)进行分类。SVM获得了78%的准确率和0.78 F -measure的最佳性能。大多数贡献属性也由支持向量机给出的权重向量确定。他们支持了我们最初关于剧透者和非剧透者之间叙事模式差异的观察。