{"title":"Performance Prediction of Jupyter Notebook in JupyterHub using Machine Learning","authors":"Pariwat Prathanrat, Chantri Polprasert","doi":"10.1109/ICIIBMS.2018.8550030","DOIUrl":null,"url":null,"abstract":"In this paper, we employ machine learning to predict the performance of Jupyter notebook on JupyterHub. We show that the notebook's CPU profile, the notebook's RAM profile, number of users and average delay between cells are crucial features that impact the performance of the machine learning models to accurately predict the performance of Jupyter notebook in term of the response time. We characterize the performance of our model to predict the notebook's response time in terms of the mean absolute error (MAE) and mean absolute percentage error (MAPE). Results show that the random forest model yields strongest performance to predict the performance of Jupyter notebook with MAPE equal to 9.849% and MAE equal to 13.768 seconds. with r-square equal to 0.93.","PeriodicalId":430326,"journal":{"name":"2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","volume":"70 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIBMS.2018.8550030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
In this paper, we employ machine learning to predict the performance of Jupyter notebook on JupyterHub. We show that the notebook's CPU profile, the notebook's RAM profile, number of users and average delay between cells are crucial features that impact the performance of the machine learning models to accurately predict the performance of Jupyter notebook in term of the response time. We characterize the performance of our model to predict the notebook's response time in terms of the mean absolute error (MAE) and mean absolute percentage error (MAPE). Results show that the random forest model yields strongest performance to predict the performance of Jupyter notebook with MAPE equal to 9.849% and MAE equal to 13.768 seconds. with r-square equal to 0.93.