Iman Vasheghani Farahani, Alex Chien, R. King, M. Kay, Brad Klenz
{"title":"基于马尔可夫链的时间序列异常检测","authors":"Iman Vasheghani Farahani, Alex Chien, R. King, M. Kay, Brad Klenz","doi":"10.1109/ICMLA.2019.00170","DOIUrl":null,"url":null,"abstract":"This paper introduces a new method for the pattern-wise anomaly detection problem, which aims to find segments whose behaviors are different from the rest of the segments in the time series (as opposed to finding a single data-point in classic anomaly detection problems). An important motivation for studying this problem is to find anomalies whose data-points are within the normal range but they create an unusual pattern. To this end, normal characteristics of the data are found by clustering the overlapping subsequences of the training dataset and analyzing their orders by Markov chains. The trained model is used to assess how well the testing dataset suits the baseline behavior. The designed anomaly detection framework is capable of discovering unusual patterns in both streaming data (online) and stored data (offline). The performance of the methodology is evaluated by applying it to three datasets from different fields: a medical dataset (electrocardiogram), a utility usage dataset, and a New York City taxi demand dataset. The detected anomaly in the medical data agrees with the results of the studies in the literature. A domain expert confirmed the accuracy of the results for the utility usage data, and the anomalies of the New York City taxi demand data referred to major US holidays.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Time Series Anomaly Detection from a Markov Chain Perspective\",\"authors\":\"Iman Vasheghani Farahani, Alex Chien, R. King, M. Kay, Brad Klenz\",\"doi\":\"10.1109/ICMLA.2019.00170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces a new method for the pattern-wise anomaly detection problem, which aims to find segments whose behaviors are different from the rest of the segments in the time series (as opposed to finding a single data-point in classic anomaly detection problems). An important motivation for studying this problem is to find anomalies whose data-points are within the normal range but they create an unusual pattern. To this end, normal characteristics of the data are found by clustering the overlapping subsequences of the training dataset and analyzing their orders by Markov chains. The trained model is used to assess how well the testing dataset suits the baseline behavior. The designed anomaly detection framework is capable of discovering unusual patterns in both streaming data (online) and stored data (offline). The performance of the methodology is evaluated by applying it to three datasets from different fields: a medical dataset (electrocardiogram), a utility usage dataset, and a New York City taxi demand dataset. The detected anomaly in the medical data agrees with the results of the studies in the literature. A domain expert confirmed the accuracy of the results for the utility usage data, and the anomalies of the New York City taxi demand data referred to major US holidays.\",\"PeriodicalId\":436714,\"journal\":{\"name\":\"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2019.00170\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2019.00170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Time Series Anomaly Detection from a Markov Chain Perspective
This paper introduces a new method for the pattern-wise anomaly detection problem, which aims to find segments whose behaviors are different from the rest of the segments in the time series (as opposed to finding a single data-point in classic anomaly detection problems). An important motivation for studying this problem is to find anomalies whose data-points are within the normal range but they create an unusual pattern. To this end, normal characteristics of the data are found by clustering the overlapping subsequences of the training dataset and analyzing their orders by Markov chains. The trained model is used to assess how well the testing dataset suits the baseline behavior. The designed anomaly detection framework is capable of discovering unusual patterns in both streaming data (online) and stored data (offline). The performance of the methodology is evaluated by applying it to three datasets from different fields: a medical dataset (electrocardiogram), a utility usage dataset, and a New York City taxi demand dataset. The detected anomaly in the medical data agrees with the results of the studies in the literature. A domain expert confirmed the accuracy of the results for the utility usage data, and the anomalies of the New York City taxi demand data referred to major US holidays.