M. Sinthuja , S. Pravinthraja , B K Dhanalakshmi , H L Gururaj , Vinayakumar Ravi , G Jyothish Lal
{"title":"An efficient and resilience linear prefix approach for mining maximal frequent itemset using clustering","authors":"M. Sinthuja , S. Pravinthraja , B K Dhanalakshmi , H L Gururaj , Vinayakumar Ravi , G Jyothish Lal","doi":"10.1016/j.jnlssr.2024.08.001","DOIUrl":null,"url":null,"abstract":"<div><div>The numerous volumes of data generated every day necessitate the deployment of new technologies capable of dealing with massive amounts of data efficiently. This is the case with Association Rules, a tool for unsupervised data mining that extracts information in the form of IF-THEN patterns. Although various approaches for extracting frequent itemset (prior step before mining association rules) in extremely large databases have been presented, the high computational cost and shortage of memory remain key issues to be addressed while processing enormous data. The objective of this research is to discover frequent itemset by using clustering for preprocessing and adopting the linear prefix tree algorithm for mining the maximal frequent itemset. The performance of the proposed CL-LP-MAX-tree was evaluated by comparing it with the existing FP-max algorithm. Experimentation was performed with the three different standard datasets to record evidence to prove that the proposed CL-LP-MAX-tree algorithm outperform the existing FP-max algorithm in terms of runtime and memory consumption.</div></div>","PeriodicalId":62710,"journal":{"name":"安全科学与韧性(英文)","volume":"6 1","pages":"Pages 93-104"},"PeriodicalIF":3.7000,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"安全科学与韧性(英文)","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666449624000689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
The numerous volumes of data generated every day necessitate the deployment of new technologies capable of dealing with massive amounts of data efficiently. This is the case with Association Rules, a tool for unsupervised data mining that extracts information in the form of IF-THEN patterns. Although various approaches for extracting frequent itemset (prior step before mining association rules) in extremely large databases have been presented, the high computational cost and shortage of memory remain key issues to be addressed while processing enormous data. The objective of this research is to discover frequent itemset by using clustering for preprocessing and adopting the linear prefix tree algorithm for mining the maximal frequent itemset. The performance of the proposed CL-LP-MAX-tree was evaluated by comparing it with the existing FP-max algorithm. Experimentation was performed with the three different standard datasets to record evidence to prove that the proposed CL-LP-MAX-tree algorithm outperform the existing FP-max algorithm in terms of runtime and memory consumption.