{"title":"A Multiscale Concept Drift Detection Method for Learning from Data Streams","authors":"XueSong Wang, Q. Kang, Mengchu Zhou, SiYa Yao","doi":"10.1109/COASE.2018.8560554","DOIUrl":null,"url":null,"abstract":"Concept drifts can occur in data streams, which disable any models built on static data distribution to fit dynamic or recurrent concepts. If there is a feature that can monitor the stableness of such distribution, then we have a proper reference to adapt the model. Based on this idea, we propose a novel approach named Multiscale Drift Detection Test (MDDT) that localizes abrupt drift points when detection feature values fluctuate. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on a broad and a narrow scale. This multiscale structure not only reduces massive time of a constant checking process, but also filters noise in the detection features. Experiments are performed via synthetic and real-world datasets. The results indicate that the proposed method outperforms the state-of-art algorithms in terms of computation cost and average accuracy.","PeriodicalId":6518,"journal":{"name":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","volume":"27 1","pages":"786-790"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COASE.2018.8560554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Concept drifts can occur in data streams, which disable any models built on static data distribution to fit dynamic or recurrent concepts. If there is a feature that can monitor the stableness of such distribution, then we have a proper reference to adapt the model. Based on this idea, we propose a novel approach named Multiscale Drift Detection Test (MDDT) that localizes abrupt drift points when detection feature values fluctuate. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on a broad and a narrow scale. This multiscale structure not only reduces massive time of a constant checking process, but also filters noise in the detection features. Experiments are performed via synthetic and real-world datasets. The results indicate that the proposed method outperforms the state-of-art algorithms in terms of computation cost and average accuracy.