{"title":"Unsupervised anomaly detection in time series exploiting local and global information","authors":"Emanuele La Malfa, G. Malfa","doi":"10.1063/1.5138100","DOIUrl":null,"url":null,"abstract":"We introduce a novel machine learning ensemble architecture for anomaly detection, that exploits global and local information from a 1d time series. A double step validation is performed to decide if a time period is anomalous: from one side a Long Short-Term Memory is trained to be reliable at forecasting, hence a parametric test on the forecasting’s error is used spot the anomalies. Concurrently, a Variational Autoencoder is trained to compress both global and local information from the series to a low-dimensional normal distribution, raising an anomaly if a time step’s likelihood is below a threshold. While anomaly detection with deep learning techniques often comes with the assumption that forecasting error is gaussian, we prove that this is in general a wrong assumption: we show that error function is better approximated by a distribution chosen dynamically. We validate our work on some public physical datasets, outperforming the current deep learning methods in terms of precision and recall.We introduce a novel machine learning ensemble architecture for anomaly detection, that exploits global and local information from a 1d time series. A double step validation is performed to decide if a time period is anomalous: from one side a Long Short-Term Memory is trained to be reliable at forecasting, hence a parametric test on the forecasting’s error is used spot the anomalies. Concurrently, a Variational Autoencoder is trained to compress both global and local information from the series to a low-dimensional normal distribution, raising an anomaly if a time step’s likelihood is below a threshold. While anomaly detection with deep learning techniques often comes with the assumption that forecasting error is gaussian, we prove that this is in general a wrong assumption: we show that error function is better approximated by a distribution chosen dynamically. We validate our work on some public physical datasets, outperforming the current deep learning methods in terms of precision and recall.","PeriodicalId":20565,"journal":{"name":"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2019 (ICCMSE-2019)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/1.5138100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We introduce a novel machine learning ensemble architecture for anomaly detection, that exploits global and local information from a 1d time series. A double step validation is performed to decide if a time period is anomalous: from one side a Long Short-Term Memory is trained to be reliable at forecasting, hence a parametric test on the forecasting’s error is used spot the anomalies. Concurrently, a Variational Autoencoder is trained to compress both global and local information from the series to a low-dimensional normal distribution, raising an anomaly if a time step’s likelihood is below a threshold. While anomaly detection with deep learning techniques often comes with the assumption that forecasting error is gaussian, we prove that this is in general a wrong assumption: we show that error function is better approximated by a distribution chosen dynamically. We validate our work on some public physical datasets, outperforming the current deep learning methods in terms of precision and recall.We introduce a novel machine learning ensemble architecture for anomaly detection, that exploits global and local information from a 1d time series. A double step validation is performed to decide if a time period is anomalous: from one side a Long Short-Term Memory is trained to be reliable at forecasting, hence a parametric test on the forecasting’s error is used spot the anomalies. Concurrently, a Variational Autoencoder is trained to compress both global and local information from the series to a low-dimensional normal distribution, raising an anomaly if a time step’s likelihood is below a threshold. While anomaly detection with deep learning techniques often comes with the assumption that forecasting error is gaussian, we prove that this is in general a wrong assumption: we show that error function is better approximated by a distribution chosen dynamically. We validate our work on some public physical datasets, outperforming the current deep learning methods in terms of precision and recall.