Lin Li, Shuxiang Guo, Lingshuai Meng, Haibin Zhai, Z. Hui, Bingnan Ma, Shijun Shen
2020 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS), published 2020-12-11. DOI: 10.1109/TOCS50858.2020.9339699 (https://doi.org/10.1109/TOCS50858.2020.9339699)
An Improved Q-Learning for System Power Optimization with Temperature, Performance and Energy Constraint Modeling
Machine-learning-based power management for embedded systems has drawn increasing attention. High-level software power management and optimization have gradually become important techniques for controlling the power dissipation of computer systems. In this paper, we employ an improved power optimization management technique that uses a Q-learning algorithm based on temperature, performance, and energy. The improved Q-learning is used to handle the uncertain states of the running system and can effectively select a rational policy under multiple parameter constraints. Because hardware and application data can be effectively collected and modeled at run time, the power management framework can readily explore an ideal policy through the value function of the Q-learning algorithm.
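The abstract does not spell out how the Q-learning is improved, so the following is only a minimal sketch of standard tabular Q-learning with a reward that combines performance, energy, and temperature. The action set, the reward weights, the temperature limit, and the function names are all hypothetical and are not taken from the paper.

```python
import random

# Hypothetical action set: three frequency levels (low, mid, high).
ACTIONS = [0, 1, 2]


def reward(temp_c, perf, energy_j,
           temp_limit=80.0, w_perf=1.0, w_energy=0.5, w_temp=2.0):
    """Scalar reward (assumed form): favor performance, penalize energy use
    and any temperature above the limit. Weights are illustrative only."""
    over_temp = max(0.0, temp_c - temp_limit)
    return w_perf * perf - w_energy * energy_j - w_temp * over_temp


def q_update(q, state, action, r, next_state, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a dict-backed Q-table."""
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (r + gamma * best_next - old)
    return q[(state, action)]


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon,
    otherwise pick the action with the highest current Q-value."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


if __name__ == "__main__":
    rng = random.Random(0)
    q_table = {}
    # One illustrative step: states are hypothetical (temperature, load) buckets.
    state, next_state = ("warm", "busy"), ("hot", "busy")
    action = choose_action(q_table, state, epsilon=0.1, rng=rng)
    r = reward(temp_c=85.0, perf=1.0, energy_j=1.0)
    q_update(q_table, state, action, r, next_state)
    print(action, r, q_table)
```

In a real power manager the state would come from measured sensor and workload data, and the weights would have to be tuned to the platform's constraints; the sketch only shows how the three quantities named in the abstract could enter a single value-function update.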