{"title":"Software for Prediction of Task Start Moment in Computer Cluster by Statistical Analysis of Jobs Queue History","authors":"I. Yashchenko, A. Salnikov","doi":"10.1134/S1063779624030924","DOIUrl":null,"url":null,"abstract":"<p>Predicting the start time of a task without placing the task in a task queue is very difficult in modern data centers. The article proposes to improve the previously created forecasting tool through the use of more modern software interfaces to the SchedMD queuing system (Slurm), as well as the use of various mathematical forecasting methods to predict the moment of task launch. To compare and evaluate machine learning models, we selected a SWF file with task execution history data on the University of Luxembourg computing cluster with 59 715 tasks. The following methods were investigated: linear regression with L2 regularization, support vector machine, random forest, LightGBM, CatBoost, LightGBM with parameter optimization, CatBoost with parameter optimization. To check the correctness of the prediction, a test bench was built based on the Slurm simulator (SUNY Center for Computational Research at the University at Buffalo, USA), which works based on executing tasks and organizing a queue of saved logs.</p>","PeriodicalId":729,"journal":{"name":"Physics of Particles and Nuclei","volume":"55 3","pages":"427 - 429"},"PeriodicalIF":0.5000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics of Particles and Nuclei","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1134/S1063779624030924","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSICS, PARTICLES & FIELDS","Score":null,"Total":0}
引用次数: 0
Abstract
Predicting the start time of a task without placing the task in a task queue is very difficult in modern data centers. The article proposes to improve the previously created forecasting tool through the use of more modern software interfaces to the SchedMD queuing system (Slurm), as well as the use of various mathematical forecasting methods to predict the moment of task launch. To compare and evaluate machine learning models, we selected a SWF file with task execution history data on the University of Luxembourg computing cluster with 59 715 tasks. The following methods were investigated: linear regression with L2 regularization, support vector machine, random forest, LightGBM, CatBoost, LightGBM with parameter optimization, CatBoost with parameter optimization. To check the correctness of the prediction, a test bench was built based on the Slurm simulator (SUNY Center for Computational Research at the University at Buffalo, USA), which works based on executing tasks and organizing a queue of saved logs.
期刊介绍:
The journal Fizika Elementarnykh Chastits i Atomnogo Yadr of the Joint Institute for Nuclear Research (JINR, Dubna) was founded by Academician N.N. Bogolyubov in August 1969. The Editors-in-chief of the journal were Academician N.N. Bogolyubov (1970–1992) and Academician A.M. Baldin (1992–2001). Its English translation, Physics of Particles and Nuclei, appears simultaneously with the original Russian-language edition. Published by leading physicists from the JINR member states, as well as by scientists from other countries, review articles in this journal examine problems of elementary particle physics, nuclear physics, condensed matter physics, experimental data processing, accelerators and related instrumentation ecology and radiology.