Nemouchi Warda Ismahene, Souheila Boudouda, N. Zarour
{"title":"An Auto Scaling Energy Efficient Approach in Apache Hadoop","authors":"Nemouchi Warda Ismahene, Souheila Boudouda, N. Zarour","doi":"10.1109/ICAASE51408.2020.9380109","DOIUrl":null,"url":null,"abstract":"Cloud Computing has emerged as revolutionary paradigm for large-scale data intensive analysis over the last decade. In addition, Map Reduce and its implementation Hadoop have been successful at developing and running Big Data Distributed computations. However, their effect on datacenters energy efficiency has become significant; some of the servers are run without being used actively on daily basis. Making use of Cloud Computing advantages such as elasticity and scalability along with Hadoop’s powerful distributed architecture has been an important research axis. The ability of managing resources (adding/removing nodes that run Map Reduce jobs to the cluster) automatically based on workloads without affecting time response has been investigated. This paper presents an approach of auto-scaling in the Hadoop framework, we have focused on separating nodes to core/computation to avoid data loss and guarantee the ability to remove nodes smoothly and instantly.","PeriodicalId":405638,"journal":{"name":"2020 International Conference on Advanced Aspects of Software Engineering (ICAASE)","volume":"291 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Advanced Aspects of Software Engineering (ICAASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAASE51408.2020.9380109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Cloud Computing has emerged as revolutionary paradigm for large-scale data intensive analysis over the last decade. In addition, Map Reduce and its implementation Hadoop have been successful at developing and running Big Data Distributed computations. However, their effect on datacenters energy efficiency has become significant; some of the servers are run without being used actively on daily basis. Making use of Cloud Computing advantages such as elasticity and scalability along with Hadoop’s powerful distributed architecture has been an important research axis. The ability of managing resources (adding/removing nodes that run Map Reduce jobs to the cluster) automatically based on workloads without affecting time response has been investigated. This paper presents an approach of auto-scaling in the Hadoop framework, we have focused on separating nodes to core/computation to avoid data loss and guarantee the ability to remove nodes smoothly and instantly.