OMBM-ML: An Efficient Memory Bandwidth Management for Ensuring QoS and Improving Server Utilization

Min Jeesoo, Sung Hanul, Eom Hyeonsang
{"title":"OMBM-ML: An Efficient Memory Bandwidth Management for Ensuring QoS and Improving Server Utilization","authors":"Min Jeesoo, Sung Hanul, Eom Hyeonsang","doi":"10.1109/FAS-W.2018.00028","DOIUrl":null,"url":null,"abstract":"As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resourse utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8x on average.","PeriodicalId":164903,"journal":{"name":"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FAS-W.2018.00028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resourse utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8x on average.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
OMBM-ML:保证服务质量和提高服务器利用率的高效内存带宽管理
随着云数据中心的急剧增长,由于维护和硬件资源的成本优势,各种应用程序被转移到云数据中心。但是,延迟关键型工作负载在完全实现成本效益方面存在一些问题。延迟关键型工作负载应该以稳定的方式显示延迟,以预测延迟,以严格满足qos。但是,如果它们与其他工作负载一起执行以节省成本,则由于争用与协同定位工作负载共享的硬件资源,它们会遇到QoS冲突。为了保证qos和提高硬件资源利用率,我们提出了一种基于机器学习的有效预测模型的内存带宽管理方法。预测模型根据REP决策树估计将分配给延迟关键工作负载的内存带宽量。为了构建这个模型,我们首先收集数据并用数据训练模型。生成的模型可以估计满足延迟关键型工作负载的SLO所需的内存带宽量,而不管并置了什么批处理工作负载。使用我们的方法可以实现高达99%的SLO保证,并将服务器利用率平均提高到6.8倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Towards Self-Adaptive Systems with Hierarchical Decentralised Control DymGPU: Dynamic Memory Management for Sharing GPUs in Virtualized Clouds Reactive and Adaptive Security Monitoring in Cloud Computing Aspects of Measuring and Evaluating the Integration Status of a (Sub-)System at Runtime Efficient Classification of Application Characteristics by Using Hardware Performance Counters with Data Mining
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1