{"title":"高度不平衡数据的异常检测——实证分析","authors":"Akshat Ajay Das, V. Mayya, Manohara M M Pai","doi":"10.1109/ESCI56872.2023.10100135","DOIUrl":null,"url":null,"abstract":"An event or an observation that is statistically different from the others is termed an anomaly. Anomaly detection is the process of identifying such anomalies. Anomaly detection is an effective tool for risk mitigation, fraud detection, and improving the system's robustness. It is also an active research area, with numerous algorithms being proposed. In this paper, we compare the performance of various anomaly detection algorithms on mul-tivariate as well as univariate datasets. The assessment measures generated are important and can be beneficial for predicting anomalies in a timely and accurate manner. Experimental results demonstrate that on a univariate dataset, the auto-regressive moving average (ARMA), performs better than the local outlier factor (LOF), while on a multivariate dataset, the LOF model performs better. The prototype developed has been extensively tested on publicly available datasets and can be evaluated on larger, more comprehensive datasets for deployment in the real-time anomaly detection setup.","PeriodicalId":441215,"journal":{"name":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Anomaly Detection for Highly Imbalanced Data–an Empirical Analysis\",\"authors\":\"Akshat Ajay Das, V. Mayya, Manohara M M Pai\",\"doi\":\"10.1109/ESCI56872.2023.10100135\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An event or an observation that is statistically different from the others is termed an anomaly. Anomaly detection is the process of identifying such anomalies. Anomaly detection is an effective tool for risk mitigation, fraud detection, and improving the system's robustness. It is also an active research area, with numerous algorithms being proposed. In this paper, we compare the performance of various anomaly detection algorithms on mul-tivariate as well as univariate datasets. The assessment measures generated are important and can be beneficial for predicting anomalies in a timely and accurate manner. Experimental results demonstrate that on a univariate dataset, the auto-regressive moving average (ARMA), performs better than the local outlier factor (LOF), while on a multivariate dataset, the LOF model performs better. The prototype developed has been extensively tested on publicly available datasets and can be evaluated on larger, more comprehensive datasets for deployment in the real-time anomaly detection setup.\",\"PeriodicalId\":441215,\"journal\":{\"name\":\"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ESCI56872.2023.10100135\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCI56872.2023.10100135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Anomaly Detection for Highly Imbalanced Data–an Empirical Analysis
An event or an observation that is statistically different from the others is termed an anomaly. Anomaly detection is the process of identifying such anomalies. Anomaly detection is an effective tool for risk mitigation, fraud detection, and improving the system's robustness. It is also an active research area, with numerous algorithms being proposed. In this paper, we compare the performance of various anomaly detection algorithms on mul-tivariate as well as univariate datasets. The assessment measures generated are important and can be beneficial for predicting anomalies in a timely and accurate manner. Experimental results demonstrate that on a univariate dataset, the auto-regressive moving average (ARMA), performs better than the local outlier factor (LOF), while on a multivariate dataset, the LOF model performs better. The prototype developed has been extensively tested on publicly available datasets and can be evaluated on larger, more comprehensive datasets for deployment in the real-time anomaly detection setup.