IN-GFD：针对垃圾评论的可解释图形欺诈检测模型

IEEE transactions on artificial intelligence Pub Date : 2024-06-27 DOI:10.1109/TAI.2024.3420262

Hang Yu;Weixu Liu;Nengjun Zhu;Pengbo Li;Xiangfeng Luo

{"title":"IN-GFD：针对垃圾评论的可解释图形欺诈检测模型","authors":"Hang Yu;Weixu Liu;Nengjun Zhu;Pengbo Li;Xiangfeng Luo","doi":"10.1109/TAI.2024.3420262","DOIUrl":null,"url":null,"abstract":"With the development of the e-commerce platform, more and more reviews of its various formats continue to appear. Reviews help people buy the right item faster, and instead, spam reviews reduce the user experience. To be able to detect spam reviews, statistical machine learning-based methods were commonly used in the past, but these approaches ignored the correlation between reviews. With the development of the graph fraud detection model, people have started to graph model the review data. However, typical graph fraud detection models still have problems with interpretability. Therefore, we propose here an interpretable graph fraud detection model for spam reviews, which is also named IN-GFD. As for the interpretability issue, we leveraged the relationship against the predicted score and whether a review is spam or not to build a loss function on top of the feature-embedding matrix, and introduced a scoring difference threshold mechanism, which can allow our model to have antehoc interpretability. In addition, to address class imbalance issues, IN-GFD utilizes the oversampling of the spam nodes to balance them with normal nodes and introduces an edge-loss function to learn new edge relationships. After extensive experiments, our method proves to be better than other state-of-the-arts (SOTA) models in terms of fraud detection and offers the benefit of interpretability. Finally, our study combines detection models with antehoc interpretability, offering a promising direction in review detection. Our approach has wide applicability, detecting spam reviews in datasets with user reviews and providing reasonable interpretations.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 10","pages":"5325-5339"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IN-GFD: An Interpretable Graph Fraud Detection Model for Spam Reviews\",\"authors\":\"Hang Yu;Weixu Liu;Nengjun Zhu;Pengbo Li;Xiangfeng Luo\",\"doi\":\"10.1109/TAI.2024.3420262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of the e-commerce platform, more and more reviews of its various formats continue to appear. Reviews help people buy the right item faster, and instead, spam reviews reduce the user experience. To be able to detect spam reviews, statistical machine learning-based methods were commonly used in the past, but these approaches ignored the correlation between reviews. With the development of the graph fraud detection model, people have started to graph model the review data. However, typical graph fraud detection models still have problems with interpretability. Therefore, we propose here an interpretable graph fraud detection model for spam reviews, which is also named IN-GFD. As for the interpretability issue, we leveraged the relationship against the predicted score and whether a review is spam or not to build a loss function on top of the feature-embedding matrix, and introduced a scoring difference threshold mechanism, which can allow our model to have antehoc interpretability. In addition, to address class imbalance issues, IN-GFD utilizes the oversampling of the spam nodes to balance them with normal nodes and introduces an edge-loss function to learn new edge relationships. After extensive experiments, our method proves to be better than other state-of-the-arts (SOTA) models in terms of fraud detection and offers the benefit of interpretability. Finally, our study combines detection models with antehoc interpretability, offering a promising direction in review detection. Our approach has wide applicability, detecting spam reviews in datasets with user reviews and providing reasonable interpretations.\",\"PeriodicalId\":73305,\"journal\":{\"name\":\"IEEE transactions on artificial intelligence\",\"volume\":\"5 10\",\"pages\":\"5325-5339\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on artificial intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10574870/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10574870/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着电子商务平台的发展，越来越多的各种形式的评论不断出现。评论可以帮助人们更快地购买到合适的商品，而垃圾评论反而会降低用户体验。为了检测垃圾评论，过去通常使用基于统计的机器学习方法，但这些方法忽略了评论之间的相关性。随着图欺诈检测模型的发展，人们开始对评论数据进行图建模。然而，典型的图欺诈检测模型仍然存在可解释性的问题。因此，我们在此提出一种可解释的垃圾评论图欺诈检测模型，并将其命名为 IN-GFD。针对可解释性问题，我们利用预测得分与评论是否为垃圾评论之间的关系，在特征嵌入矩阵之上建立了一个损失函数，并引入了评分差异阈值机制，从而使我们的模型具有临时可解释性。此外，为了解决类不平衡问题，IN-GFD 利用对垃圾节点的超采样来平衡它们与正常节点的关系，并引入边缘损失函数来学习新的边缘关系。经过大量实验证明，我们的方法在欺诈检测方面优于其他先进（SOTA）模型，并且具有可解释性强的优点。最后，我们的研究将检测模型与前置可解释性相结合，为评论检测提供了一个前景广阔的方向。我们的方法具有广泛的适用性，可以在包含用户评论的数据集中检测出垃圾评论，并提供合理的解释。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

IN-GFD: An Interpretable Graph Fraud Detection Model for Spam Reviews

With the development of the e-commerce platform, more and more reviews of its various formats continue to appear. Reviews help people buy the right item faster, and instead, spam reviews reduce the user experience. To be able to detect spam reviews, statistical machine learning-based methods were commonly used in the past, but these approaches ignored the correlation between reviews. With the development of the graph fraud detection model, people have started to graph model the review data. However, typical graph fraud detection models still have problems with interpretability. Therefore, we propose here an interpretable graph fraud detection model for spam reviews, which is also named IN-GFD. As for the interpretability issue, we leveraged the relationship against the predicted score and whether a review is spam or not to build a loss function on top of the feature-embedding matrix, and introduced a scoring difference threshold mechanism, which can allow our model to have antehoc interpretability. In addition, to address class imbalance issues, IN-GFD utilizes the oversampling of the spam nodes to balance them with normal nodes and introduces an edge-loss function to learn new edge relationships. After extensive experiments, our method proves to be better than other state-of-the-arts (SOTA) models in terms of fraud detection and offers the benefit of interpretability. Finally, our study combines detection models with antehoc interpretability, offering a promising direction in review detection. Our approach has wide applicability, detecting spam reviews in datasets with user reviews and providing reasonable interpretations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助