Amna Tehreem, S. G. Khawaja, M. Akram, S. Khan, Muhammad G. Ali
{"title":"使用fp增长实现频繁项集挖掘的并行架构","authors":"Amna Tehreem, S. G. Khawaja, M. Akram, S. Khan, Muhammad G. Ali","doi":"10.1109/ICSIGSYS.2017.7967077","DOIUrl":null,"url":null,"abstract":"Frequent itemset mining is a fundamental step in analysis of big data where correlation among the raw data in deemed necessary. In modern era the amount of data available for processing has grown exponentially, making it a stepper task for mining algorithms to provide solution in a timely manner. The software implementations are normally not efficient in handling such datasets thus focus on parallel architecture seems imminent. In this paper we propose a Multi-Processor based sequentially unfolded architecture for implementation of FP-Growth algorithm. The proposed framework exploits the inherent parallelism available in the FP-Growth algorithm such that N-processing entities (PEs) can work in a collaborative environment. The processing entities work in an independent manner in parallel and largely interchange data at the close of each iteration. The overall architecture is modular which permits scalability of the design with regards to the number of parallel processing entities. The performance of the framework is evaluated using benchmark datasets and their results show a linear increase in the speedup of our proposed framework with increase in PEs.","PeriodicalId":212068,"journal":{"name":"2017 International Conference on Signals and Systems (ICSigSys)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Parallel architecture for implementation of frequent itemset mining using FP-growth\",\"authors\":\"Amna Tehreem, S. G. Khawaja, M. Akram, S. Khan, Muhammad G. Ali\",\"doi\":\"10.1109/ICSIGSYS.2017.7967077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Frequent itemset mining is a fundamental step in analysis of big data where correlation among the raw data in deemed necessary. In modern era the amount of data available for processing has grown exponentially, making it a stepper task for mining algorithms to provide solution in a timely manner. The software implementations are normally not efficient in handling such datasets thus focus on parallel architecture seems imminent. In this paper we propose a Multi-Processor based sequentially unfolded architecture for implementation of FP-Growth algorithm. The proposed framework exploits the inherent parallelism available in the FP-Growth algorithm such that N-processing entities (PEs) can work in a collaborative environment. The processing entities work in an independent manner in parallel and largely interchange data at the close of each iteration. The overall architecture is modular which permits scalability of the design with regards to the number of parallel processing entities. The performance of the framework is evaluated using benchmark datasets and their results show a linear increase in the speedup of our proposed framework with increase in PEs.\",\"PeriodicalId\":212068,\"journal\":{\"name\":\"2017 International Conference on Signals and Systems (ICSigSys)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Signals and Systems (ICSigSys)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIGSYS.2017.7967077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Signals and Systems (ICSigSys)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIGSYS.2017.7967077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Parallel architecture for implementation of frequent itemset mining using FP-growth
Frequent itemset mining is a fundamental step in analysis of big data where correlation among the raw data in deemed necessary. In modern era the amount of data available for processing has grown exponentially, making it a stepper task for mining algorithms to provide solution in a timely manner. The software implementations are normally not efficient in handling such datasets thus focus on parallel architecture seems imminent. In this paper we propose a Multi-Processor based sequentially unfolded architecture for implementation of FP-Growth algorithm. The proposed framework exploits the inherent parallelism available in the FP-Growth algorithm such that N-processing entities (PEs) can work in a collaborative environment. The processing entities work in an independent manner in parallel and largely interchange data at the close of each iteration. The overall architecture is modular which permits scalability of the design with regards to the number of parallel processing entities. The performance of the framework is evaluated using benchmark datasets and their results show a linear increase in the speedup of our proposed framework with increase in PEs.