{"title":"Spam Detection on Indonesian Beauty Product Review","authors":"Muhammad Ahsan Athallah, A. Romadhony","doi":"10.1109/ICoICT52021.2021.9527409","DOIUrl":null,"url":null,"abstract":"A product review is one of the most important sources of information that can help consumers to find the most suitable products for their needs. However, there is a chance a reviewer has other intentions than providing an honest review, including advertising the brand or other brands. A review that does not contain any information related to the product’s aspects/features could be considered spam. This paper presents our work on spam review detection, specifically in the domain of beauty products. We used SVM and Logistic Regression classifier and the following features: the review sentiments, product-related features, and review-centric features extracted from the reviews. We classified the beauty product review texts as spam and non-spam reviews. The experimental result showed that the best accuracy percentage was 81%, obtained when we used the sentiments and review-centric features with the SVM algorithm.","PeriodicalId":191671,"journal":{"name":"2021 9th International Conference on Information and Communication Technology (ICoICT)","volume":"141 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Conference on Information and Communication Technology (ICoICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoICT52021.2021.9527409","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A product review is one of the most important sources of information that can help consumers to find the most suitable products for their needs. However, there is a chance a reviewer has other intentions than providing an honest review, including advertising the brand or other brands. A review that does not contain any information related to the product’s aspects/features could be considered spam. This paper presents our work on spam review detection, specifically in the domain of beauty products. We used SVM and Logistic Regression classifier and the following features: the review sentiments, product-related features, and review-centric features extracted from the reviews. We classified the beauty product review texts as spam and non-spam reviews. The experimental result showed that the best accuracy percentage was 81%, obtained when we used the sentiments and review-centric features with the SVM algorithm.