A Fast Algorithm for Deriving Frequent Itemsets

Cheng-Wei Wu, Yun-Wei Lin, Ming Chen, Jiashu Cheng
{"title":"A Fast Algorithm for Deriving Frequent Itemsets","authors":"Cheng-Wei Wu, Yun-Wei Lin, Ming Chen, Jiashu Cheng","doi":"10.1109/taai54685.2021.00047","DOIUrl":null,"url":null,"abstract":"When mining frequent itemsets (abbr. FIs) from dense datasets, too many itemsets are generated and results in the mining task from a large amount of execution time and high memory consumption. Frequent closed itemset (abbr. FCI) is a lossless and concise representation of FIs. Mining FCIs can not only greatly reduce the execution time and memory consumption, but also retain the complete information all of FI. Although many studies have proposed different mining FCI algorithms, but they have less developed methods that can effectively derive all FIs from FCIs. Form this point of view, this study proposes a novel efficient algorithm named DFI-List for efficiently deriving FIS from FCIs. The algorithm adopts the methodology of depth-first-search and divide-and-conquer to derive all FIs from FCIs. DFI-List efficiently derives all the FIs with vertical index structure called Cid List and uses SC Table to quickly find the support count of the derived FI. Experimental results show that the execution speed and memory consumption of the proposed algorithm with the proposed strategy is better than of the state-of-art algorithm.","PeriodicalId":343821,"journal":{"name":"2021 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/taai54685.2021.00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

When mining frequent itemsets (abbr. FIs) from dense datasets, too many itemsets are generated and results in the mining task from a large amount of execution time and high memory consumption. Frequent closed itemset (abbr. FCI) is a lossless and concise representation of FIs. Mining FCIs can not only greatly reduce the execution time and memory consumption, but also retain the complete information all of FI. Although many studies have proposed different mining FCI algorithms, but they have less developed methods that can effectively derive all FIs from FCIs. Form this point of view, this study proposes a novel efficient algorithm named DFI-List for efficiently deriving FIS from FCIs. The algorithm adopts the methodology of depth-first-search and divide-and-conquer to derive all FIs from FCIs. DFI-List efficiently derives all the FIs with vertical index structure called Cid List and uses SC Table to quickly find the support count of the derived FI. Experimental results show that the execution speed and memory consumption of the proposed algorithm with the proposed strategy is better than of the state-of-art algorithm.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
频繁项集的快速生成算法
在从密集数据集中挖掘频繁项集(简称fi)时,会产生过多的项集,导致挖掘任务执行时间长,内存消耗高。频繁闭项集(简称FCI)是FCI的一种无损而简洁的表示。挖掘fci不仅可以大大减少执行时间和内存消耗,而且可以保留所有fci的完整信息。虽然许多研究提出了不同的挖掘FCI算法,但能够有效地从FCI中提取所有FCI的方法尚不多见。基于此,本研究提出了一种新的高效算法DFI-List,用于从fci中高效地提取FIS。该算法采用深度优先搜索和分而治之的方法,从fci中导出所有的fci。DFI-List通过称为Cid List的垂直索引结构有效地派生出所有FI,并使用SC Table快速查找派生出的FI的支持计数。实验结果表明,采用该策略的算法在执行速度和内存消耗方面均优于现有算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using Random Forests and Decision Trees to Predict Viewing Game Live Streaming via Viewers’ Comments [Title page iii] An Automatic Response System based on Multi-layer Perceptual Neural Network and Web Crawler MLNN: A Novel Network Intrusion Detection Based on Multilayer Neural Network A Hybrid Deep Learning Network for Long-Term Travel Time Prediction in Freeways
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1