{"title":"基于AspamGAN的半监督虚假评论检测","authors":"Chen Jing-Yu, Wang Ya-jun","doi":"10.36548/jaicn.2022.1.002","DOIUrl":null,"url":null,"abstract":"With the popularization of social software and e-business in recent years, more and more consumers like to share their consumption experiences on social networks and refer to other consumers' reviews and opinions when making consumption decisions. Online reviews have become an essential part of browsing on websites such as shopping, and people's reliance on informative reviews have contributed to the rise of fake reviews. The traditional classification method is affected by the label dataset, which is not only time-consuming, laborious, and subjective, but also the extraction of artificial features also affects the classification accuracy. Due to the relative length of the online text, the possibility of the classifier losing important information increases, this weakens the model’s detection capability. To solve this aforementioned problem, a semi-supervised Generative Adversarial Network (AspamGAN) fake reviews detection method incorporating an attention mechanism is proposed. Using labeled and unlabeled data to correctly learn input distributions, the features required for classification are automatically discovered using deep neural networks, providing better prediction accuracy for online reviews. The approach includes attention mechanisms in the classifier to obtain an adequate semantic representation and relies on a limited dataset of labeled data to detect false reviews, and is applied on the TripAdvisor dataset. Experimental results show that the proposed algorithm outperforms state-of-the-art semi-supervised fake review detection techniques when the label dataset is limited.","PeriodicalId":10940,"journal":{"name":"Day 2 Tue, March 22, 2022","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Semi-Supervised Fake Reviews Detection based on AspamGAN\",\"authors\":\"Chen Jing-Yu, Wang Ya-jun\",\"doi\":\"10.36548/jaicn.2022.1.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the popularization of social software and e-business in recent years, more and more consumers like to share their consumption experiences on social networks and refer to other consumers' reviews and opinions when making consumption decisions. Online reviews have become an essential part of browsing on websites such as shopping, and people's reliance on informative reviews have contributed to the rise of fake reviews. The traditional classification method is affected by the label dataset, which is not only time-consuming, laborious, and subjective, but also the extraction of artificial features also affects the classification accuracy. Due to the relative length of the online text, the possibility of the classifier losing important information increases, this weakens the model’s detection capability. To solve this aforementioned problem, a semi-supervised Generative Adversarial Network (AspamGAN) fake reviews detection method incorporating an attention mechanism is proposed. Using labeled and unlabeled data to correctly learn input distributions, the features required for classification are automatically discovered using deep neural networks, providing better prediction accuracy for online reviews. The approach includes attention mechanisms in the classifier to obtain an adequate semantic representation and relies on a limited dataset of labeled data to detect false reviews, and is applied on the TripAdvisor dataset. Experimental results show that the proposed algorithm outperforms state-of-the-art semi-supervised fake review detection techniques when the label dataset is limited.\",\"PeriodicalId\":10940,\"journal\":{\"name\":\"Day 2 Tue, March 22, 2022\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 2 Tue, March 22, 2022\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36548/jaicn.2022.1.002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Tue, March 22, 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36548/jaicn.2022.1.002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Semi-Supervised Fake Reviews Detection based on AspamGAN
With the popularization of social software and e-business in recent years, more and more consumers like to share their consumption experiences on social networks and refer to other consumers' reviews and opinions when making consumption decisions. Online reviews have become an essential part of browsing on websites such as shopping, and people's reliance on informative reviews have contributed to the rise of fake reviews. The traditional classification method is affected by the label dataset, which is not only time-consuming, laborious, and subjective, but also the extraction of artificial features also affects the classification accuracy. Due to the relative length of the online text, the possibility of the classifier losing important information increases, this weakens the model’s detection capability. To solve this aforementioned problem, a semi-supervised Generative Adversarial Network (AspamGAN) fake reviews detection method incorporating an attention mechanism is proposed. Using labeled and unlabeled data to correctly learn input distributions, the features required for classification are automatically discovered using deep neural networks, providing better prediction accuracy for online reviews. The approach includes attention mechanisms in the classifier to obtain an adequate semantic representation and relies on a limited dataset of labeled data to detect false reviews, and is applied on the TripAdvisor dataset. Experimental results show that the proposed algorithm outperforms state-of-the-art semi-supervised fake review detection techniques when the label dataset is limited.