{"title":"A graph-based algorithm for frequent closed itemsets mining","authors":"Li Li, Dong-hai Zhai, F. Jin","doi":"10.1109/SIEDS.2003.157999","DOIUrl":null,"url":null,"abstract":"Frequent itemsets mining plays an essential role in data mining, but it often generates a large number of redundant itemsets that reduce the efficiency of the mining task. Frequent closed itemsets are subset of frequent itemsets, but they contain all information of frequent itemsets. The most existing methods of frequent closed itemset mining are apriori-based. The efficiency of those methods is limited to the repeated database scan and the candidate set generation. We propose a graph-based algorithm for mining frequent closed itemsets called GFCG (graph-based frequent closed itemset generation). The new algorithm constructs an association graph to represent the frequent relationship between items, and recursively generates frequent closed itemset based on that graph. It scans the database for only two times, and avoids candidate set generation. GFCG outperforms a priori-based algorithm in experiment study and shows good performance both in speed and scale up properties.","PeriodicalId":256790,"journal":{"name":"IEEE Systems and Information Engineering Design Symposium, 2003","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Systems and Information Engineering Design Symposium, 2003","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIEDS.2003.157999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Frequent itemsets mining plays an essential role in data mining, but it often generates a large number of redundant itemsets that reduce the efficiency of the mining task. Frequent closed itemsets are subset of frequent itemsets, but they contain all information of frequent itemsets. The most existing methods of frequent closed itemset mining are apriori-based. The efficiency of those methods is limited to the repeated database scan and the candidate set generation. We propose a graph-based algorithm for mining frequent closed itemsets called GFCG (graph-based frequent closed itemset generation). The new algorithm constructs an association graph to represent the frequent relationship between items, and recursively generates frequent closed itemset based on that graph. It scans the database for only two times, and avoids candidate set generation. GFCG outperforms a priori-based algorithm in experiment study and shows good performance both in speed and scale up properties.