{"title":"Metocean Prediction using Hadoop, Spark & R","authors":"Sumayema Kabir Ricky, L. Rahim","doi":"10.1109/ICCOINS49721.2021.9497204","DOIUrl":null,"url":null,"abstract":"This project is the development of an analysis system for historical Metocean Data. It is a single page reactive web application with shiny web UI package of R containing forecasting model, ARIMA and two ML algorithms, Linear Regression and H2O AutoML developed with R for the variables of Metocean data stored in HDFS of a virtual Hadoop cluster and spark is integrated to make the computations happen in-memory. The predictions is compared to the actual data to see its correctness with RMSE. Performance difference of the application deployed on desktop and on the server is also discussed. The application performs better when running in the server than on desktop.","PeriodicalId":245662,"journal":{"name":"2021 International Conference on Computer & Information Sciences (ICCOINS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer & Information Sciences (ICCOINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCOINS49721.2021.9497204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This project is the development of an analysis system for historical Metocean Data. It is a single page reactive web application with shiny web UI package of R containing forecasting model, ARIMA and two ML algorithms, Linear Regression and H2O AutoML developed with R for the variables of Metocean data stored in HDFS of a virtual Hadoop cluster and spark is integrated to make the computations happen in-memory. The predictions is compared to the actual data to see its correctness with RMSE. Performance difference of the application deployed on desktop and on the server is also discussed. The application performs better when running in the server than on desktop.
本项目是开发一个历史海洋气象数据分析系统。它是一个单页响应式web应用程序,具有闪亮的R web UI包,包含预测模型,ARIMA和两种ML算法,线性回归和H2O AutoML,用R开发,用于存储在虚拟Hadoop集群的HDFS中的Metocean数据的变量,并集成spark使计算发生在内存中。将预测与实际数据进行比较,以查看其与RMSE的正确性。还讨论了部署在桌面和服务器上的应用程序的性能差异。应用程序在服务器上运行时比在桌面上运行时性能更好。