Hung-Ming Chen, Shih-Ying Chen, Sheng-Hsien Hsueh, Sheng-Kai Wang
{"title":"在Kubernetes上设计改进的ML任务调度机制","authors":"Hung-Ming Chen, Shih-Ying Chen, Sheng-Hsien Hsueh, Sheng-Kai Wang","doi":"10.1109/IS3C57901.2023.00024","DOIUrl":null,"url":null,"abstract":"As the fields related to machine learning (ML)/deep learning (DL) continue to mature, the MLOps machine learning automation process is also gradually emerging. Then, many open-source MLOps frameworks based on Kubernetes have begun to be proposed. Currently, most Kubernetes-based MLOps frameworks aim to establish a common and easy-to-use ML pipeline environment for users to use based on ML containerized tasks. However, Kubernetes’ default container scheduler only considers the resource conditions of individual containerized tasks, rather than considering the scheduling of the entire containerized ML task composition. Such a situation may lead to the system resources not being utilized properly. Therefore, this study designs an improved ML task mechanism based on the Kubernetes-based platform to replace the Kubernetes default scheduler. The scheduling strategy in Kubernetes can be modified to better suit the needs of the machine learning development environment.","PeriodicalId":142483,"journal":{"name":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing an Improved ML Task Scheduling Mechanism on Kubernetes\",\"authors\":\"Hung-Ming Chen, Shih-Ying Chen, Sheng-Hsien Hsueh, Sheng-Kai Wang\",\"doi\":\"10.1109/IS3C57901.2023.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the fields related to machine learning (ML)/deep learning (DL) continue to mature, the MLOps machine learning automation process is also gradually emerging. Then, many open-source MLOps frameworks based on Kubernetes have begun to be proposed. Currently, most Kubernetes-based MLOps frameworks aim to establish a common and easy-to-use ML pipeline environment for users to use based on ML containerized tasks. However, Kubernetes’ default container scheduler only considers the resource conditions of individual containerized tasks, rather than considering the scheduling of the entire containerized ML task composition. Such a situation may lead to the system resources not being utilized properly. Therefore, this study designs an improved ML task mechanism based on the Kubernetes-based platform to replace the Kubernetes default scheduler. The scheduling strategy in Kubernetes can be modified to better suit the needs of the machine learning development environment.\",\"PeriodicalId\":142483,\"journal\":{\"name\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IS3C57901.2023.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS3C57901.2023.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Designing an Improved ML Task Scheduling Mechanism on Kubernetes
As the fields related to machine learning (ML)/deep learning (DL) continue to mature, the MLOps machine learning automation process is also gradually emerging. Then, many open-source MLOps frameworks based on Kubernetes have begun to be proposed. Currently, most Kubernetes-based MLOps frameworks aim to establish a common and easy-to-use ML pipeline environment for users to use based on ML containerized tasks. However, Kubernetes’ default container scheduler only considers the resource conditions of individual containerized tasks, rather than considering the scheduling of the entire containerized ML task composition. Such a situation may lead to the system resources not being utilized properly. Therefore, this study designs an improved ML task mechanism based on the Kubernetes-based platform to replace the Kubernetes default scheduler. The scheduling strategy in Kubernetes can be modified to better suit the needs of the machine learning development environment.