{"title":"OML-AD: Online Machine Learning for Anomaly Detection in Time Series Data","authors":"Sebastian Wette, Florian Heinrichs","doi":"arxiv-2409.09742","DOIUrl":null,"url":null,"abstract":"Time series are ubiquitous and occur naturally in a variety of applications\n-- from data recorded by sensors in manufacturing processes, over financial\ndata streams to climate data. Different tasks arise, such as regression,\nclassification or segmentation of the time series. However, to reliably solve\nthese challenges, it is important to filter out abnormal observations that\ndeviate from the usual behavior of the time series. While many anomaly\ndetection methods exist for independent data and stationary time series, these\nmethods are not applicable to non-stationary time series. To allow for\nnon-stationarity in the data, while simultaneously detecting anomalies, we\npropose OML-AD, a novel approach for anomaly detection (AD) based on online\nmachine learning (OML). We provide an implementation of OML-AD within the\nPython library River and show that it outperforms state-of-the-art baseline\nmethods in terms of accuracy and computational efficiency.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09742","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Time series are ubiquitous and occur naturally in a variety of applications
-- from data recorded by sensors in manufacturing processes, over financial
data streams to climate data. Different tasks arise, such as regression,
classification or segmentation of the time series. However, to reliably solve
these challenges, it is important to filter out abnormal observations that
deviate from the usual behavior of the time series. While many anomaly
detection methods exist for independent data and stationary time series, these
methods are not applicable to non-stationary time series. To allow for
non-stationarity in the data, while simultaneously detecting anomalies, we
propose OML-AD, a novel approach for anomaly detection (AD) based on online
machine learning (OML). We provide an implementation of OML-AD within the
Python library River and show that it outperforms state-of-the-art baseline
methods in terms of accuracy and computational efficiency.