{"title":"A novel performance aware real-time data handling for big data platforms on Lambda architecture","authors":"Rizwan Patan, M. Babu","doi":"10.1504/IJCAET.2018.10012354","DOIUrl":null,"url":null,"abstract":"Big data is becoming a popular technology for analytics. But, its techniques and tools are very limited to solve the energy aware real time data handling problems. The real time data handling can be in one of the two computing areas: 1) batch computing; 2) stream computing. Stream computing environment uses round robin algorithm as default scheduling strategy whereas batch process uses distributed scheduling for allocation of its resources. But these computing are not considered proper energy aware distributed scheduling policies for allocation of its resources. This paper presents development of management policies that reduces the energy for the allocation of resources. The big data fusion has been used to improve the efficiency for handing different data types: Batch data, online data, and real-time data. A hybrid computational model has been applied to improve the performance further through Lambda architecture. Finally, experimental results have shown 20% performance improvement.","PeriodicalId":346646,"journal":{"name":"Int. J. Comput. Aided Eng. Technol.","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Comput. Aided Eng. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCAET.2018.10012354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Big data is becoming a popular technology for analytics. But, its techniques and tools are very limited to solve the energy aware real time data handling problems. The real time data handling can be in one of the two computing areas: 1) batch computing; 2) stream computing. Stream computing environment uses round robin algorithm as default scheduling strategy whereas batch process uses distributed scheduling for allocation of its resources. But these computing are not considered proper energy aware distributed scheduling policies for allocation of its resources. This paper presents development of management policies that reduces the energy for the allocation of resources. The big data fusion has been used to improve the efficiency for handing different data types: Batch data, online data, and real-time data. A hybrid computational model has been applied to improve the performance further through Lambda architecture. Finally, experimental results have shown 20% performance improvement.