{"title":"使用位置LSTM / K-L散度集成方法检测虚假Yelp评论","authors":"Christopher G. Harris","doi":"10.1109/ICISIT54091.2022.9872788","DOIUrl":null,"url":null,"abstract":"Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.","PeriodicalId":214014,"journal":{"name":"2022 1st International Conference on Information System & Information Technology (ICISIT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detecting Fake Yelp Reviews Using a Positional LSTM / K-L Divergence Ensemble Approach\",\"authors\":\"Christopher G. Harris\",\"doi\":\"10.1109/ICISIT54091.2022.9872788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.\",\"PeriodicalId\":214014,\"journal\":{\"name\":\"2022 1st International Conference on Information System & Information Technology (ICISIT)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 1st International Conference on Information System & Information Technology (ICISIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISIT54091.2022.9872788\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 1st International Conference on Information System & Information Technology (ICISIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISIT54091.2022.9872788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detecting Fake Yelp Reviews Using a Positional LSTM / K-L Divergence Ensemble Approach
Online reviews of products and services often have a significant impact on future purchases. Unfortunately, this invites opportunities for fraud –to provide fake reviews to improve a company’s reputation in the eyes of consumers or to disparage the reputation of a competitor. We combine two methods to detect fraud in Yelp restaurant reviews. First, we develop and apply a bi-directional Long-Short Term Memory (LSTM), a type of recurrent neural network, to take advantage of the positional relevancy of comments within reviews. LSTMs are well-suited to examine linguistic features in text and evaluating different regions of text within the review enhances our model’s accuracy. To this component, we apply a Kullback-Leibler (K-L) divergence technique to examine the discrepancy in the term rankings between real and fake Yelp reviews. This ensemble is used to discriminate fake reviews from real ones, achieving an Average Precision (AP) of 0.5402 and an Area Under the Curve (AOC) of 0.866. which is an improvement over other state-of-the-art techniques on the same YelpNYC dataset.