{"title":"mapreduce/hadoop工作负载的性能建模和分析","authors":"Xiao Yu, Wei Li","doi":"10.1109/LANMAN.2015.7114723","DOIUrl":null,"url":null,"abstract":"Data center is the infrastructure in big data processing, which constructs computing platform by distributed computer. The paper aims to investigate the analytical model by adopting queueing theory in data center of big data. The new queueing model developed fits the MapReduce programming model accurately and discovers the nature of the programming model. The utilizations and mean waiting times of Mapper and Reducer are obtained respectively. The effect of workload (and number of Mapper slots) on the system performance (i.e., utilization) is exposed. The significance of this paper is it explores the theoretical insight of the MapReduce programming model and provides the optimal parameter recommendation for computing resource configuration.","PeriodicalId":193630,"journal":{"name":"The 21st IEEE International Workshop on Local and Metropolitan Area Networks","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Performance modelling and analysis of mapreduce/hadoop workloads\",\"authors\":\"Xiao Yu, Wei Li\",\"doi\":\"10.1109/LANMAN.2015.7114723\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data center is the infrastructure in big data processing, which constructs computing platform by distributed computer. The paper aims to investigate the analytical model by adopting queueing theory in data center of big data. The new queueing model developed fits the MapReduce programming model accurately and discovers the nature of the programming model. The utilizations and mean waiting times of Mapper and Reducer are obtained respectively. The effect of workload (and number of Mapper slots) on the system performance (i.e., utilization) is exposed. The significance of this paper is it explores the theoretical insight of the MapReduce programming model and provides the optimal parameter recommendation for computing resource configuration.\",\"PeriodicalId\":193630,\"journal\":{\"name\":\"The 21st IEEE International Workshop on Local and Metropolitan Area Networks\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 21st IEEE International Workshop on Local and Metropolitan Area Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/LANMAN.2015.7114723\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 21st IEEE International Workshop on Local and Metropolitan Area Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LANMAN.2015.7114723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance modelling and analysis of mapreduce/hadoop workloads
Data center is the infrastructure in big data processing, which constructs computing platform by distributed computer. The paper aims to investigate the analytical model by adopting queueing theory in data center of big data. The new queueing model developed fits the MapReduce programming model accurately and discovers the nature of the programming model. The utilizations and mean waiting times of Mapper and Reducer are obtained respectively. The effect of workload (and number of Mapper slots) on the system performance (i.e., utilization) is exposed. The significance of this paper is it explores the theoretical insight of the MapReduce programming model and provides the optimal parameter recommendation for computing resource configuration.