Hussein Faisol, Kevin Djajadinata, Muljono Muljono
{"title":"Sentiment Analysis of Yelp Review","authors":"Hussein Faisol, Kevin Djajadinata, Muljono Muljono","doi":"10.1109/iSemantic50169.2020.9234213","DOIUrl":null,"url":null,"abstract":"Digital data has developed very fast in the current era. Digital data has various forms, one of which is text data. There are a lot of text data source from many ways, such as review text data. Yelp is a local business directory and forum to review products, services, or places. We used Yelp’s review data to determine user’s sentiment or opinion about products, services, or places. Sentiment or opinion are classified into positive reviews, or negative reviews. The algorithms that used are Gaussian Naïve Bayes, Gaussian Naïve Bayes with AdaBoost, and K-NN. In this research review text data will be through a preprocessing and feature extraction stage. We used n-gram to extract the feature from the data with unigram, bigram and uni+bi-gram for indexing text. The result of this research that algorithms that has the highest accuracy rate was Gaussian Naïve Bayes with combined unigram and bigram with 86.7% from 5 fold cross-validation.","PeriodicalId":345558,"journal":{"name":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"12 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic50169.2020.9234213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Digital data has developed very fast in the current era. Digital data has various forms, one of which is text data. There are a lot of text data source from many ways, such as review text data. Yelp is a local business directory and forum to review products, services, or places. We used Yelp’s review data to determine user’s sentiment or opinion about products, services, or places. Sentiment or opinion are classified into positive reviews, or negative reviews. The algorithms that used are Gaussian Naïve Bayes, Gaussian Naïve Bayes with AdaBoost, and K-NN. In this research review text data will be through a preprocessing and feature extraction stage. We used n-gram to extract the feature from the data with unigram, bigram and uni+bi-gram for indexing text. The result of this research that algorithms that has the highest accuracy rate was Gaussian Naïve Bayes with combined unigram and bigram with 86.7% from 5 fold cross-validation.