Wang Simin , Kang Yifei , Xu Yixuan , Ma Chunmiao , Wang Haitao , Wu Weiguo
{"title":"基于两阶段自愈温度预测模型的数据中心温度预测与管理","authors":"Wang Simin , Kang Yifei , Xu Yixuan , Ma Chunmiao , Wang Haitao , Wu Weiguo","doi":"10.1016/j.simpat.2023.102883","DOIUrl":null,"url":null,"abstract":"<div><p><span><span><span><span>While providing efficient and convenient cloud services, data center<span><span> also brings great pressure to energy consumption and environment. The rise of server temperature not only increases the refrigeration cost, but also seriously affects the operation safety of the data center. Effective analysis and prediction of data center temperature is not only conducive to preventing server overheating and shutdown, but also crucial to data center task scheduling, resource allocation optimization and energy efficiency improvement of data center. Therefore, this article proposes a Two-stage </span>Gated Recurrent Unit (GRU) temperature </span></span>prediction algorithm<span> with self-healing mechanism. The algorithm establishes a prediction model for the important parameters affecting temperature prediction - CPU utilization, and takes the output of the model as the input parameter of the server temperature prediction model, which fits the changes of each parameter more accurate. To avoid the decrease in prediction accuracy caused by new operating conditions that have not been learned before and changes in physical environmental factors during the operation of the model, a self-healing mechanism is proposed to ensure the prediction accuracy of the model. Experiments show that our prediction model can accurately predict the inlet temperature evolution of the server with </span></span>dynamic workload. It reduces the prediction error (RSME) to 0.280, and the average prediction temperature difference is only 0.675, which is 10 % higher than the single stage prediction accuracy. The use of Two-stage prediction methods in other </span>machine learning methods can also improve prediction accuracy. Based on the prediction model, this paper proposes a task </span>scheduling algorithm that minimizes temperature difference. The algorithm can make the temperature between servers more balanced after task allocation, effectively reducing the number of servers running at high and low temperatures in the data center, avoiding refrigeration waste, and achieving energy conservation in the data center.</p></div>","PeriodicalId":49518,"journal":{"name":"Simulation Modelling Practice and Theory","volume":null,"pages":null},"PeriodicalIF":3.5000,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data center temperature prediction and management based on a Two-stage self-healing model\",\"authors\":\"Wang Simin , Kang Yifei , Xu Yixuan , Ma Chunmiao , Wang Haitao , Wu Weiguo\",\"doi\":\"10.1016/j.simpat.2023.102883\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span><span><span><span>While providing efficient and convenient cloud services, data center<span><span> also brings great pressure to energy consumption and environment. The rise of server temperature not only increases the refrigeration cost, but also seriously affects the operation safety of the data center. Effective analysis and prediction of data center temperature is not only conducive to preventing server overheating and shutdown, but also crucial to data center task scheduling, resource allocation optimization and energy efficiency improvement of data center. Therefore, this article proposes a Two-stage </span>Gated Recurrent Unit (GRU) temperature </span></span>prediction algorithm<span> with self-healing mechanism. The algorithm establishes a prediction model for the important parameters affecting temperature prediction - CPU utilization, and takes the output of the model as the input parameter of the server temperature prediction model, which fits the changes of each parameter more accurate. To avoid the decrease in prediction accuracy caused by new operating conditions that have not been learned before and changes in physical environmental factors during the operation of the model, a self-healing mechanism is proposed to ensure the prediction accuracy of the model. Experiments show that our prediction model can accurately predict the inlet temperature evolution of the server with </span></span>dynamic workload. It reduces the prediction error (RSME) to 0.280, and the average prediction temperature difference is only 0.675, which is 10 % higher than the single stage prediction accuracy. The use of Two-stage prediction methods in other </span>machine learning methods can also improve prediction accuracy. Based on the prediction model, this paper proposes a task </span>scheduling algorithm that minimizes temperature difference. The algorithm can make the temperature between servers more balanced after task allocation, effectively reducing the number of servers running at high and low temperatures in the data center, avoiding refrigeration waste, and achieving energy conservation in the data center.</p></div>\",\"PeriodicalId\":49518,\"journal\":{\"name\":\"Simulation Modelling Practice and Theory\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2023-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Simulation Modelling Practice and Theory\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569190X23001600\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simulation Modelling Practice and Theory","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569190X23001600","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Data center temperature prediction and management based on a Two-stage self-healing model
While providing efficient and convenient cloud services, data center also brings great pressure to energy consumption and environment. The rise of server temperature not only increases the refrigeration cost, but also seriously affects the operation safety of the data center. Effective analysis and prediction of data center temperature is not only conducive to preventing server overheating and shutdown, but also crucial to data center task scheduling, resource allocation optimization and energy efficiency improvement of data center. Therefore, this article proposes a Two-stage Gated Recurrent Unit (GRU) temperature prediction algorithm with self-healing mechanism. The algorithm establishes a prediction model for the important parameters affecting temperature prediction - CPU utilization, and takes the output of the model as the input parameter of the server temperature prediction model, which fits the changes of each parameter more accurate. To avoid the decrease in prediction accuracy caused by new operating conditions that have not been learned before and changes in physical environmental factors during the operation of the model, a self-healing mechanism is proposed to ensure the prediction accuracy of the model. Experiments show that our prediction model can accurately predict the inlet temperature evolution of the server with dynamic workload. It reduces the prediction error (RSME) to 0.280, and the average prediction temperature difference is only 0.675, which is 10 % higher than the single stage prediction accuracy. The use of Two-stage prediction methods in other machine learning methods can also improve prediction accuracy. Based on the prediction model, this paper proposes a task scheduling algorithm that minimizes temperature difference. The algorithm can make the temperature between servers more balanced after task allocation, effectively reducing the number of servers running at high and low temperatures in the data center, avoiding refrigeration waste, and achieving energy conservation in the data center.
期刊介绍:
The journal Simulation Modelling Practice and Theory provides a forum for original, high-quality papers dealing with any aspect of systems simulation and modelling.
The journal aims at being a reference and a powerful tool to all those professionally active and/or interested in the methods and applications of simulation. Submitted papers will be peer reviewed and must significantly contribute to modelling and simulation in general or use modelling and simulation in application areas.
Paper submission is solicited on:
• theoretical aspects of modelling and simulation including formal modelling, model-checking, random number generators, sensitivity analysis, variance reduction techniques, experimental design, meta-modelling, methods and algorithms for validation and verification, selection and comparison procedures etc.;
• methodology and application of modelling and simulation in any area, including computer systems, networks, real-time and embedded systems, mobile and intelligent agents, manufacturing and transportation systems, management, engineering, biomedical engineering, economics, ecology and environment, education, transaction handling, etc.;
• simulation languages and environments including those, specific to distributed computing, grid computing, high performance computers or computer networks, etc.;
• distributed and real-time simulation, simulation interoperability;
• tools for high performance computing simulation, including dedicated architectures and parallel computing.