An Improved Association Rules Mining Algorithm Based on Power Set and Hadoop
Authors: W. Mao, Weibin Guo
Published in: 2013 International Conference on Information Science and Cloud Computing Companion
Publication date: 2013-12-07
DOI: 10.1109/ISCC-C.2013.39 (https://doi.org/10.1109/ISCC-C.2013.39)
Citations: 4
Abstract
Association rule mining plays a very important role in data mining. As datasets grow rapidly, the memory required increases sharply and operating efficiency declines rapidly. Cloud computing provides efficient and inexpensive means to analyze and implement association rule mining algorithms in parallel. This paper proposes an improved association rule mining algorithm based on the power set and the MapReduce programming model, which can process massive datasets on a cluster of machines running the Hadoop platform. The results of numerical experiments show that the proposed algorithm achieves higher efficiency in association rule mining.
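The abstract does not spell out the algorithm, so the following is only a minimal single-process sketch of the general idea it names: enumerating subsets (the power set) of each transaction in a map step and summing the counts in a reduce step. The function names, the `max_size` cap on subset size, and the `min_support` threshold are assumptions made for illustration. They are not taken from the paper, and no Hadoop API is used here.

```python
from collections import Counter
from itertools import combinations


def map_transaction(transaction, max_size=3):
    """Map step (illustrative): emit (itemset, 1) for each non-empty subset.

    Subsets are capped at max_size items because the full power set of a
    transaction with n items has 2**n - 1 non-empty members.
    """
    items = sorted(set(transaction))
    for k in range(1, min(max_size, len(items)) + 1):
        for subset in combinations(items, k):
            yield subset, 1


def reduce_counts(pairs):
    """Reduce step (illustrative): sum the counts emitted for each itemset."""
    counts = Counter()
    for itemset, n in pairs:
        counts[itemset] += n
    return counts


def frequent_itemsets(transactions, min_support, max_size=3):
    """Return itemsets that appear in at least min_support transactions."""
    pairs = (p for t in transactions for p in map_transaction(t, max_size))
    counts = reduce_counts(pairs)
    return {s: c for s, c in counts.items() if c >= min_support}


if __name__ == "__main__":
    baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b"]]
    for itemset, count in sorted(frequent_itemsets(baskets, 2).items()):
        print(itemset, count)
```

In an actual Hadoop job the map and reduce functions would run on separate machines over partitions of the transaction file. Here they are chained in one process only to show the data flow.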