{"title":"基于数据流学习的多尺度概念漂移检测方法","authors":"XueSong Wang, Q. Kang, Mengchu Zhou, SiYa Yao","doi":"10.1109/COASE.2018.8560554","DOIUrl":null,"url":null,"abstract":"Concept drifts can occur in data streams, which disable any models built on static data distribution to fit dynamic or recurrent concepts. If there is a feature that can monitor the stableness of such distribution, then we have a proper reference to adapt the model. Based on this idea, we propose a novel approach named Multiscale Drift Detection Test (MDDT) that localizes abrupt drift points when detection feature values fluctuate. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on a broad and a narrow scale. This multiscale structure not only reduces massive time of a constant checking process, but also filters noise in the detection features. Experiments are performed via synthetic and real-world datasets. The results indicate that the proposed method outperforms the state-of-art algorithms in terms of computation cost and average accuracy.","PeriodicalId":6518,"journal":{"name":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","volume":"27 1","pages":"786-790"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A Multiscale Concept Drift Detection Method for Learning from Data Streams\",\"authors\":\"XueSong Wang, Q. Kang, Mengchu Zhou, SiYa Yao\",\"doi\":\"10.1109/COASE.2018.8560554\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Concept drifts can occur in data streams, which disable any models built on static data distribution to fit dynamic or recurrent concepts. If there is a feature that can monitor the stableness of such distribution, then we have a proper reference to adapt the model. Based on this idea, we propose a novel approach named Multiscale Drift Detection Test (MDDT) that localizes abrupt drift points when detection feature values fluctuate. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on a broad and a narrow scale. This multiscale structure not only reduces massive time of a constant checking process, but also filters noise in the detection features. Experiments are performed via synthetic and real-world datasets. The results indicate that the proposed method outperforms the state-of-art algorithms in terms of computation cost and average accuracy.\",\"PeriodicalId\":6518,\"journal\":{\"name\":\"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)\",\"volume\":\"27 1\",\"pages\":\"786-790\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COASE.2018.8560554\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COASE.2018.8560554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Multiscale Concept Drift Detection Method for Learning from Data Streams
Concept drifts can occur in data streams, which disable any models built on static data distribution to fit dynamic or recurrent concepts. If there is a feature that can monitor the stableness of such distribution, then we have a proper reference to adapt the model. Based on this idea, we propose a novel approach named Multiscale Drift Detection Test (MDDT) that localizes abrupt drift points when detection feature values fluctuate. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on a broad and a narrow scale. This multiscale structure not only reduces massive time of a constant checking process, but also filters noise in the detection features. Experiments are performed via synthetic and real-world datasets. The results indicate that the proposed method outperforms the state-of-art algorithms in terms of computation cost and average accuracy.