{"title":"Data Mining Association Rule Algorithm Based on Hadoop","authors":"Huang Suyu","doi":"10.1109/ICICTA.2015.94","DOIUrl":null,"url":null,"abstract":"This paper proposes a kind of speculated task scheduling based on data locality, aimed on the current not high optimization of task scheduling algorithm on Hadoop platform. This algorithm combined the local features of data at different nodes, through the length proportion of Map and Reduce task on computing nodes, adopts more accurate task scheduling detection mode than current algorithm to find out fast or slow node, and backup for backward task with longest remaining start time at fast node, use mobile computing instead of mobile data. It conducts experiment in Hadoop environment, the result demonstrates that this algorithm has shorten the task average operation time than current algorithm, while speed up the task execution efficiency.","PeriodicalId":231694,"journal":{"name":"2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICTA.2015.94","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
This paper proposes a kind of speculated task scheduling based on data locality, aimed on the current not high optimization of task scheduling algorithm on Hadoop platform. This algorithm combined the local features of data at different nodes, through the length proportion of Map and Reduce task on computing nodes, adopts more accurate task scheduling detection mode than current algorithm to find out fast or slow node, and backup for backward task with longest remaining start time at fast node, use mobile computing instead of mobile data. It conducts experiment in Hadoop environment, the result demonstrates that this algorithm has shorten the task average operation time than current algorithm, while speed up the task execution efficiency.