Xin Lu, Shaonan Jin, Xun Wang, Jiao Yuan, Kun Fu, Ke Yang
{"title":"基于滑动时间衰减窗口的流数据频繁项集挖掘算法","authors":"Xin Lu, Shaonan Jin, Xun Wang, Jiao Yuan, Kun Fu, Ke Yang","doi":"10.1145/3430199.3430226","DOIUrl":null,"url":null,"abstract":"In order to reduce the time and memory consumption of frequent itemsets mining in stream data, and weaken the impact of historical transactions on data patterns, this paper proposes a frequent itemsets mining algorithm SWFIUT-stream based on sliding decay time window. In this algorithm, the time attenuation factor is introduced to assign different weights to each window unit to weaken their influence on data mode. In order to realize the fast stream data mining processing, when mining the frequent itemsets, the two-dimensional table is used to scan and decompose the itemsets synchronously to mine all the frequent itemsets in the window, and the distributed parallel computing processing is carried out based on storm framework. Experimental data show that the algorithm consumes less time and consumes less memory space than conventional algorithms when mining frequent itemsets in stream data.","PeriodicalId":371055,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Mining Frequent Itemsets Algorithm in Stream Data Based on Sliding Time Decay Window\",\"authors\":\"Xin Lu, Shaonan Jin, Xun Wang, Jiao Yuan, Kun Fu, Ke Yang\",\"doi\":\"10.1145/3430199.3430226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to reduce the time and memory consumption of frequent itemsets mining in stream data, and weaken the impact of historical transactions on data patterns, this paper proposes a frequent itemsets mining algorithm SWFIUT-stream based on sliding decay time window. In this algorithm, the time attenuation factor is introduced to assign different weights to each window unit to weaken their influence on data mode. In order to realize the fast stream data mining processing, when mining the frequent itemsets, the two-dimensional table is used to scan and decompose the itemsets synchronously to mine all the frequent itemsets in the window, and the distributed parallel computing processing is carried out based on storm framework. Experimental data show that the algorithm consumes less time and consumes less memory space than conventional algorithms when mining frequent itemsets in stream data.\",\"PeriodicalId\":371055,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3430199.3430226\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3430199.3430226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Mining Frequent Itemsets Algorithm in Stream Data Based on Sliding Time Decay Window
In order to reduce the time and memory consumption of frequent itemsets mining in stream data, and weaken the impact of historical transactions on data patterns, this paper proposes a frequent itemsets mining algorithm SWFIUT-stream based on sliding decay time window. In this algorithm, the time attenuation factor is introduced to assign different weights to each window unit to weaken their influence on data mode. In order to realize the fast stream data mining processing, when mining the frequent itemsets, the two-dimensional table is used to scan and decompose the itemsets synchronously to mine all the frequent itemsets in the window, and the distributed parallel computing processing is carried out based on storm framework. Experimental data show that the algorithm consumes less time and consumes less memory space than conventional algorithms when mining frequent itemsets in stream data.