{"title":"Frequent Itemset Mining techniques — A technical review","authors":"Tushar M. Chaure, Kavita R. Singh","doi":"10.1109/STARTUP.2016.7583968","DOIUrl":null,"url":null,"abstract":"Frequent Itemset Mining is one of the most popular techniques to extract knowledge from data. However, these mining methods become more problematic when they are applied to Big Data. Fortunately, recent improvements in the field of parallel programming provide many tools to tackle this problem. However, these tools come with their own technical challenges such as balanced data distribution and inter-communication costs. In this paper, we are presenting a detailed survey of Hadoop, which helps in storing data and parallel processing in distributed environment. Here we have explored various Frequent Itemset Mining techniques on parallel and distributed environment. The aim of this paper is to present a comparison of different frequent itemset mining techniques and help to develop efficient and scalable frequent itemset mining techniques.","PeriodicalId":355852,"journal":{"name":"2016 World Conference on Futuristic Trends in Research and Innovation for Social Welfare (Startup Conclave)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 World Conference on Futuristic Trends in Research and Innovation for Social Welfare (Startup Conclave)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STARTUP.2016.7583968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Frequent Itemset Mining is one of the most popular techniques to extract knowledge from data. However, these mining methods become more problematic when they are applied to Big Data. Fortunately, recent improvements in the field of parallel programming provide many tools to tackle this problem. However, these tools come with their own technical challenges such as balanced data distribution and inter-communication costs. In this paper, we are presenting a detailed survey of Hadoop, which helps in storing data and parallel processing in distributed environment. Here we have explored various Frequent Itemset Mining techniques on parallel and distributed environment. The aim of this paper is to present a comparison of different frequent itemset mining techniques and help to develop efficient and scalable frequent itemset mining techniques.