Penghui Xi , Debo Cheng , Guangquan Lu , Zhenyun Deng , Guixian Zhang , Shichao Zhang
{"title":"识别局部有用信息用于属性图异常检测","authors":"Penghui Xi , Debo Cheng , Guangquan Lu , Zhenyun Deng , Guixian Zhang , Shichao Zhang","doi":"10.1016/j.neucom.2024.128900","DOIUrl":null,"url":null,"abstract":"<div><div>Graph anomaly detection primarily relies on shallow learning methods based on feature engineering and deep learning strategies centred on autoencoder-based reconstruction. However, these methods frequently fail to harness the local attributes and structural information within graph data, making it challenging to capture the underlying distribution in scenarios with class-imbalanced graph anomalies, which can result in overfitting. To deal with the above issue, this paper proposes a new anomaly detection method called LIAD (Identifying <u>L</u>ocal Useful <u>I</u>nformation for <u>A</u>ttribute Graph Anomaly <u>D</u>etection), which learns the data’s underlying distribution and captures richer local information. First, LIAD employs data augmentation techniques to create masked graphs and pairs of positive and negative subgraphs. Then, LIAD leverages contrastive learning to derive rich embedding representations from diverse local structural information. Additionally, LIAD utilizes a variational autoencoder (VAE) to generate new graph data, capturing the neighbourhood distribution within the masked graph. During the training process, LIAD aligns the generated graph data with the original to deepen its comprehension of local information. Finally, anomaly scoring is achieved by comparing the discrimination and reconstruction scores of the contrastive pairs, enabling effective anomaly detection. Extensive experiments on five real-world datasets demonstrate the effectiveness of LIAD compared to state-of-the-art methods. Comprehensive ablation studies and parametric analyses further affirm the robustness and efficacy of our model.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"617 ","pages":"Article 128900"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying local useful information for attribute graph anomaly detection\",\"authors\":\"Penghui Xi , Debo Cheng , Guangquan Lu , Zhenyun Deng , Guixian Zhang , Shichao Zhang\",\"doi\":\"10.1016/j.neucom.2024.128900\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph anomaly detection primarily relies on shallow learning methods based on feature engineering and deep learning strategies centred on autoencoder-based reconstruction. However, these methods frequently fail to harness the local attributes and structural information within graph data, making it challenging to capture the underlying distribution in scenarios with class-imbalanced graph anomalies, which can result in overfitting. To deal with the above issue, this paper proposes a new anomaly detection method called LIAD (Identifying <u>L</u>ocal Useful <u>I</u>nformation for <u>A</u>ttribute Graph Anomaly <u>D</u>etection), which learns the data’s underlying distribution and captures richer local information. First, LIAD employs data augmentation techniques to create masked graphs and pairs of positive and negative subgraphs. Then, LIAD leverages contrastive learning to derive rich embedding representations from diverse local structural information. Additionally, LIAD utilizes a variational autoencoder (VAE) to generate new graph data, capturing the neighbourhood distribution within the masked graph. During the training process, LIAD aligns the generated graph data with the original to deepen its comprehension of local information. Finally, anomaly scoring is achieved by comparing the discrimination and reconstruction scores of the contrastive pairs, enabling effective anomaly detection. Extensive experiments on five real-world datasets demonstrate the effectiveness of LIAD compared to state-of-the-art methods. Comprehensive ablation studies and parametric analyses further affirm the robustness and efficacy of our model.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"617 \",\"pages\":\"Article 128900\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231224016710\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224016710","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
图异常检测主要依赖于基于特征工程的浅学习方法和基于自编码器重建的深度学习策略。然而,这些方法经常不能利用图数据中的局部属性和结构信息,使得在类不平衡的图异常场景中捕捉底层分布变得困难,这可能导致过拟合。针对上述问题,本文提出了一种新的异常检测方法LIAD (identification Local Useful Information for Attribute Graph anomaly detection),该方法学习数据的底层分布,捕获更丰富的局部信息。首先,LIAD使用数据增强技术来创建掩码图和正负子图对。然后,LIAD利用对比学习从不同的局部结构信息中获得丰富的嵌入表示。此外,LIAD利用变分自编码器(VAE)生成新的图数据,捕获掩码图内的邻域分布。在训练过程中,LIAD将生成的图数据与原始图数据对齐,以加深对局部信息的理解。最后,通过对比对的判别分数和重建分数进行异常评分,实现有效的异常检测。在五个真实世界数据集上进行的大量实验表明,与最先进的方法相比,LIAD的有效性。综合消融研究和参数分析进一步证实了我们模型的稳健性和有效性。
Identifying local useful information for attribute graph anomaly detection
Graph anomaly detection primarily relies on shallow learning methods based on feature engineering and deep learning strategies centred on autoencoder-based reconstruction. However, these methods frequently fail to harness the local attributes and structural information within graph data, making it challenging to capture the underlying distribution in scenarios with class-imbalanced graph anomalies, which can result in overfitting. To deal with the above issue, this paper proposes a new anomaly detection method called LIAD (Identifying Local Useful Information for Attribute Graph Anomaly Detection), which learns the data’s underlying distribution and captures richer local information. First, LIAD employs data augmentation techniques to create masked graphs and pairs of positive and negative subgraphs. Then, LIAD leverages contrastive learning to derive rich embedding representations from diverse local structural information. Additionally, LIAD utilizes a variational autoencoder (VAE) to generate new graph data, capturing the neighbourhood distribution within the masked graph. During the training process, LIAD aligns the generated graph data with the original to deepen its comprehension of local information. Finally, anomaly scoring is achieved by comparing the discrimination and reconstruction scores of the contrastive pairs, enabling effective anomaly detection. Extensive experiments on five real-world datasets demonstrate the effectiveness of LIAD compared to state-of-the-art methods. Comprehensive ablation studies and parametric analyses further affirm the robustness and efficacy of our model.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.