Robust anomaly detection via adversarial counterfactual generation

IF 2.5 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Knowledge and Information Systems Pub Date : 2024-07-17 DOI:10.1007/s10115-024-02172-w

Angelica Liguori, Ettore Ritacco, Francesco Sergio Pisani, Giuseppe Manco

{"title":"Robust anomaly detection via adversarial counterfactual generation","authors":"Angelica Liguori, Ettore Ritacco, Francesco Sergio Pisani, Giuseppe Manco","doi":"10.1007/s10115-024-02172-w","DOIUrl":null,"url":null,"abstract":"The capability to devise robust outlier and anomaly detection tools is an important research topic in machine learning and data mining. Recent techniques have been focusing on reinforcing detection with sophisticated data generation tools that successfully refine the learning process by generating variants of the data that expand the recognition capabilities of the outlier detector. In this paper, we propose \\(\\textrm{ARN}\\), a semi-supervised anomaly detection and generation method based on adversarial counterfactual reconstruction. \\(\\textrm{ARN}\\) exploits a regularized autoencoder to optimize the reconstruction of variants of normal examples with minimal differences that are recognized as outliers. The combination of regularization and counterfactual reconstruction helps to stabilize the learning process, which results in both realistic outlier generation and substantially extended detection capability. In fact, the counterfactual generation enables a smart exploration of the search space by successfully relating small changes in all the actual samples from the true distribution to high anomaly scores. Experiments on several benchmark datasets show that our model improves the current state of the art by valuable margins because of its ability to model the true boundaries of the data manifold.","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"10 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge and Information Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10115-024-02172-w","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The capability to devise robust outlier and anomaly detection tools is an important research topic in machine learning and data mining. Recent techniques have been focusing on reinforcing detection with sophisticated data generation tools that successfully refine the learning process by generating variants of the data that expand the recognition capabilities of the outlier detector. In this paper, we propose \(\textrm{ARN}\), a semi-supervised anomaly detection and generation method based on adversarial counterfactual reconstruction. \(\textrm{ARN}\) exploits a regularized autoencoder to optimize the reconstruction of variants of normal examples with minimal differences that are recognized as outliers. The combination of regularization and counterfactual reconstruction helps to stabilize the learning process, which results in both realistic outlier generation and substantially extended detection capability. In fact, the counterfactual generation enables a smart exploration of the search space by successfully relating small changes in all the actual samples from the true distribution to high anomaly scores. Experiments on several benchmark datasets show that our model improves the current state of the art by valuable margins because of its ability to model the true boundaries of the data manifold.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过对抗性反事实生成进行稳健异常检测

设计强大的离群点和异常点检测工具是机器学习和数据挖掘领域的一个重要研究课题。最近的技术一直专注于利用复杂的数据生成工具来强化检测，这些工具通过生成数据的变体来扩展离群点检测器的识别能力，从而成功地完善了学习过程。在本文中，我们提出了一种基于对抗性反事实重构的半监督异常检测和生成方法--（\textrm{ARN}\）。\textrm{ARN}（）利用正则化自动编码器来优化正常示例的变体重建，这些变体的差异最小，会被识别为异常值。正则化和反事实重构的结合有助于稳定学习过程，从而既能生成真实的离群值，又能大大提高检测能力。事实上，反事实生成技术通过成功地将所有实际样本与真实分布之间的微小变化与高异常分数联系起来，实现了对搜索空间的智能探索。在多个基准数据集上进行的实验表明，我们的模型能够模拟数据流形的真实边界，因此在很大程度上改善了当前的技术水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Knowledge and Information Systems 工程技术-计算机：人工智能

CiteScore

5.70

自引率

7.40%

发文量

152

审稿时长

7.2 months

期刊介绍： Knowledge and Information Systems (KAIS) provides an international forum for researchers and professionals to share their knowledge and report new advances on all topics related to knowledge systems and advanced information systems. This monthly peer-reviewed archival journal publishes state-of-the-art research reports on emerging topics in KAIS, reviews of important techniques in related areas, and application papers of interest to a general readership.