Title: TPC-H benchmarking of Pig Latin on a Hadoop cluster
Author: Rim Moussa
Venue: 2012 International Conference on Communications and Information Technology (ICCIT)
Publication date: 2012-06-26
DOI: https://doi.org/10.1109/ICCITECHNOL.2012.6285848
Citations: 8
Abstract
Several companies report success stories after migrating from relational database management systems to NoSQL (Not only SQL) systems. The latter appear to be taking over in most data storage fields. To deliver real benefits, these technologies must be used properly, and businesses must be aware of the limitations of NoSQL. Pig Latin is a high-level language for expressing data analysis programs, which are executed as MapReduce jobs on top of the Hadoop Distributed File System. This paper benchmarks Pig Latin using the well-known TPC-H benchmark, a decision support system benchmark, and reports performance results for different settings on GRID5000 clusters.