Anomalous node detection in attributed social networks using dual variational autoencoder with generative adversarial networks

Data Science and Management Pub Date : 2024-06-01 Epub Date: 2023-11-02 DOI:10.1016/j.dsm.2023.10.005

Wasim Khan , Shafiqul Abidin , Mohammad Arif , Mohammad Ishrat , Mohd Haleem , Anwar Ahamed Shaikh , Nafees Akhtar Farooqui , Syed Mohd Faisal

{"title":"Anomalous node detection in attributed social networks using dual variational autoencoder with generative adversarial networks","authors":"Wasim Khan , Shafiqul Abidin , Mohammad Arif , Mohammad Ishrat , Mohd Haleem , Anwar Ahamed Shaikh , Nafees Akhtar Farooqui , Syed Mohd Faisal","doi":"10.1016/j.dsm.2023.10.005","DOIUrl":null,"url":null,"abstract":"<div><p>Many types of real-world information systems, including social media and e-commerce platforms, can be modelled by means of attribute-rich, connected networks. The goal of anomaly detection in artificial intelligence is to identify illustrations that deviate significantly from the main distribution of data or that differ from known cases. Anomalous nodes in node-attributed networks can be identified with greater precision if both graph and node attributes are taken into account. Almost all of the studies in this area focus on supervised techniques for spotting outliers. While supervised algorithms for anomaly detection work well in theory, they cannot be applied to real-world applications owing to a lack of labelled data. Considering the possible data distribution, our model employs a dual variational autoencoder (VAE), while a generative adversarial network (GAN) assures the model is robust to adversarial training. The dual VAEs are used in another capacity: as a fake-node generator. Adversarial training is used to ensure that our latent codes have a Gaussian or uniform distribution. To provide a fair presentation of the graph, the discriminator instructs the generator to generate latent variables with distributions that are more consistent with the actual distribution of the data. Once the model has been learned, the discriminator is used for anomaly detection via reconstruction loss it has been trained to distinguish between the normal and artificial distributions of data. First, using a dual VAE, our model simultaneously captures cross-modality interactions between topological structure and node characteristics and overcomes the problem of unlabeled anomalies, allowing us to better understand the network sparsity and nonlinearity. Second, the proposed model considers the regularization of the latent codes while solving the issue of unregularized embedding techniques that can quickly lead to unsatisfactory representation. Finally, we use the discriminator reconstruction loss for anomaly detection as the discriminator is well-trained to separate the normal and generated data distributions because reconstruction-based loss does not include the adversarial component. Experiments conducted on attributed networks demonstrate the effectiveness of the proposed model and show that it greatly surpasses the previous methods. The area under the curve scores of our proposed model for the BlogCatalog, Flickr, and Enron datasets are 0.83680, 0.82020, and 0.71180, respectively, proving the effectiveness of the proposed model. The result of the proposed model on the Enron dataset is slightly worse than the other models; we attribute this to the dataset's low dimensionality as the most probable explanation.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":"7 2","pages":"Pages 89-98"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666764923000486/pdfft?md5=e26fa7989cfa05fc83b6e2a56b647889&pid=1-s2.0-S2666764923000486-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Science and Management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666764923000486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/11/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Many types of real-world information systems, including social media and e-commerce platforms, can be modelled by means of attribute-rich, connected networks. The goal of anomaly detection in artificial intelligence is to identify illustrations that deviate significantly from the main distribution of data or that differ from known cases. Anomalous nodes in node-attributed networks can be identified with greater precision if both graph and node attributes are taken into account. Almost all of the studies in this area focus on supervised techniques for spotting outliers. While supervised algorithms for anomaly detection work well in theory, they cannot be applied to real-world applications owing to a lack of labelled data. Considering the possible data distribution, our model employs a dual variational autoencoder (VAE), while a generative adversarial network (GAN) assures the model is robust to adversarial training. The dual VAEs are used in another capacity: as a fake-node generator. Adversarial training is used to ensure that our latent codes have a Gaussian or uniform distribution. To provide a fair presentation of the graph, the discriminator instructs the generator to generate latent variables with distributions that are more consistent with the actual distribution of the data. Once the model has been learned, the discriminator is used for anomaly detection via reconstruction loss it has been trained to distinguish between the normal and artificial distributions of data. First, using a dual VAE, our model simultaneously captures cross-modality interactions between topological structure and node characteristics and overcomes the problem of unlabeled anomalies, allowing us to better understand the network sparsity and nonlinearity. Second, the proposed model considers the regularization of the latent codes while solving the issue of unregularized embedding techniques that can quickly lead to unsatisfactory representation. Finally, we use the discriminator reconstruction loss for anomaly detection as the discriminator is well-trained to separate the normal and generated data distributions because reconstruction-based loss does not include the adversarial component. Experiments conducted on attributed networks demonstrate the effectiveness of the proposed model and show that it greatly surpasses the previous methods. The area under the curve scores of our proposed model for the BlogCatalog, Flickr, and Enron datasets are 0.83680, 0.82020, and 0.71180, respectively, proving the effectiveness of the proposed model. The result of the proposed model on the Enron dataset is slightly worse than the other models; we attribute this to the dataset's low dimensionality as the most probable explanation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用生成式对抗网络的双变异自动编码器检测属性社交网络中的异常节点

现实世界中许多类型的信息系统，包括社交媒体和电子商务平台，都可以通过属性丰富的连接网络来建模。人工智能异常检测的目标是识别严重偏离数据主要分布或与已知情况不同的图例。如果同时考虑图和节点的属性，就能更精确地识别节点属性网络中的异常节点。该领域几乎所有的研究都集中在发现异常值的监督技术上。虽然异常检测的监督算法在理论上行之有效，但由于缺乏标记数据，这些算法无法应用于现实世界。考虑到可能的数据分布，我们的模型采用了双变异自动编码器（VAE），而生成式对抗网络（GAN）则确保了模型对对抗训练的鲁棒性。双变自编码器还有另一个用途：假节点生成器。对抗训练用于确保我们的潜码具有高斯或均匀分布。为了公平地展示图形，判别器会指示生成器生成分布更符合数据实际分布的潜变量。一旦学习了模型，判别器就可以通过重构损失进行异常检测，经过训练，它可以区分数据的正态分布和人工分布。首先，利用双 VAE，我们的模型可以同时捕捉拓扑结构和节点特征之间的跨模态交互作用，并克服无标记异常的问题，使我们能够更好地理解网络的稀疏性和非线性。其次，我们提出的模型考虑了潜码的正则化，同时解决了非正则化嵌入技术的问题，因为这种技术很快就会导致不理想的表示。最后，我们使用判别器重构损失进行异常检测，因为基于重构的损失不包括对抗成分，所以判别器经过良好训练，可以分离正常数据分布和生成数据分布。在属性网络上进行的实验证明了所提出模型的有效性，并表明它大大超过了以前的方法。我们提出的模型在 BlogCatalog、Flickr 和安然数据集上的曲线下面积得分分别为 0.83680、0.82020 和 0.71180，证明了所提模型的有效性。在安然数据集上，所提模型的结果比其他模型略差；我们认为最有可能的原因是该数据集的维度较低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Data Science and Management

CiteScore

7.50

自引率

0.00%

发文量