{"title":"A Model of Mining Noise-Tolerant Frequent Itemset in Transactional Databases","authors":"Xiaomei Yu, Hong Wang, Xiangwei Zheng, Shuang Liu","doi":"10.1109/INCoS.2015.87","DOIUrl":null,"url":null,"abstract":"Nowadays, mining approximate frequent itemsets from noisy data has attracted much attention in real applications. However, there is not widely accepted algorithm at present to solve the problem under noisy databases, which dues to two key issues. Firstly, the anti-monotonicity property does not hold which is used to prune candidate itemsets efficiently. And secondly, the computation of support counting turns out to be NP-hard. In this paper, we propose a novel model which is based on rough set theory and capable to recover the noise-tolerant frequent itemsets from \"reduced itemsets\". The novel model applies depth-first growing method to generate candidate itemsets and exerts effective pruning strategies, which narrows the searching space and mines indeed meaningful noise-tolerant frequent itemsets efficiently.","PeriodicalId":345650,"journal":{"name":"2015 International Conference on Intelligent Networking and Collaborative Systems","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Intelligent Networking and Collaborative Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCoS.2015.87","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, mining approximate frequent itemsets from noisy data has attracted much attention in real applications. However, there is not widely accepted algorithm at present to solve the problem under noisy databases, which dues to two key issues. Firstly, the anti-monotonicity property does not hold which is used to prune candidate itemsets efficiently. And secondly, the computation of support counting turns out to be NP-hard. In this paper, we propose a novel model which is based on rough set theory and capable to recover the noise-tolerant frequent itemsets from "reduced itemsets". The novel model applies depth-first growing method to generate candidate itemsets and exerts effective pruning strategies, which narrows the searching space and mines indeed meaningful noise-tolerant frequent itemsets efficiently.