Flexible Computing: A New Framework for Improving Resource Allocation and Scheduling in Elastic Computing

IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-10-31 DOI:10.1109/TSC.2024.3489433
Weipeng Cao;Jiongjiong Gu;Zhong Ming;Zhiyuan Cai;Yuzhao Wang;Changping Ji;Zhijiao Xiao;Yuhong Feng;Ye Liu;Liang-Jie Zhang
{"title":"Flexible Computing: A New Framework for Improving Resource Allocation and Scheduling in Elastic Computing","authors":"Weipeng Cao;Jiongjiong Gu;Zhong Ming;Zhiyuan Cai;Yuzhao Wang;Changping Ji;Zhijiao Xiao;Yuhong Feng;Ye Liu;Liang-Jie Zhang","doi":"10.1109/TSC.2024.3489433","DOIUrl":null,"url":null,"abstract":"Since the advent of cloud computing, Elastic Computing (EC) has become the standard architecture for resource allocation and scheduling. EC typically allocates computing resources based on predefined specifications, such as virtual machine or container flavors. However, these flavors are often constrained by fixed CPU-to-memory ratios, which frequently fail to match the actual resource needs of applications. As a result, cloud providers experience high resource allocation rates nearing saturation (<inline-formula><tex-math>$&gt; $</tex-math></inline-formula>80%) but with low utilization (<inline-formula><tex-math>$&lt; $</tex-math></inline-formula>25%). This study introduces Flexible Computing (FC), a novel approach to resource allocation and scheduling. Unlike EC, FC allocates resources based on an application resource usage profile, derived from the historical resource consumption of workloads, rather than relying on fixed specifications. Additionally, FC incorporates a real-time performance degradation detection mechanism to address performance issues caused by the noisy-neighbor effect when colocated workloads interfere with each other. FC dynamically adjusts resource allocation according to actual usage, ensuring that application performance meets Service Level Agreements (SLAs), while preventing resource waste and performance degradation from improper resource over-commitment. Large-scale experimental validations conducted on the FC architecture within Huawei Cloud data centers demonstrate that, compared to EC, FC can reduce computing resource consumption by over 33% while managing the same workloads. Furthermore, FC's real-time performance degradation detection model achieves a prediction error of less than 5% across various testing environments, highlighting its commercial viability.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"18 1","pages":"198-211"},"PeriodicalIF":5.8000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10740339/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Since the advent of cloud computing, Elastic Computing (EC) has become the standard architecture for resource allocation and scheduling. EC typically allocates computing resources based on predefined specifications, such as virtual machine or container flavors. However, these flavors are often constrained by fixed CPU-to-memory ratios, which frequently fail to match the actual resource needs of applications. As a result, cloud providers experience high resource allocation rates nearing saturation ($> $80%) but with low utilization ($< $25%). This study introduces Flexible Computing (FC), a novel approach to resource allocation and scheduling. Unlike EC, FC allocates resources based on an application resource usage profile, derived from the historical resource consumption of workloads, rather than relying on fixed specifications. Additionally, FC incorporates a real-time performance degradation detection mechanism to address performance issues caused by the noisy-neighbor effect when colocated workloads interfere with each other. FC dynamically adjusts resource allocation according to actual usage, ensuring that application performance meets Service Level Agreements (SLAs), while preventing resource waste and performance degradation from improper resource over-commitment. Large-scale experimental validations conducted on the FC architecture within Huawei Cloud data centers demonstrate that, compared to EC, FC can reduce computing resource consumption by over 33% while managing the same workloads. Furthermore, FC's real-time performance degradation detection model achieves a prediction error of less than 5% across various testing environments, highlighting its commercial viability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
灵活计算:改进弹性计算中资源分配和调度的新框架
自云计算出现以来,弹性计算(Elastic computing, EC)已经成为资源分配和调度的标准架构。EC通常根据预定义的规范(如虚拟机或容器类型)分配计算资源。然而,这些风格通常受到固定的cpu -内存比率的限制,这些比率经常无法匹配应用程序的实际资源需求。因此,云提供商的资源分配率接近饱和($>;$80%),但利用率较低($<;美元的25%)。本文介绍了一种新的资源分配和调度方法——柔性计算(FC)。与EC不同,FC根据应用程序资源使用配置文件分配资源,该配置文件来自工作负载的历史资源消耗,而不是依赖于固定的规范。此外,FC还集成了实时性能下降检测机制,以解决由共存的工作负载相互干扰时的噪声邻居效应引起的性能问题。FC根据实际使用情况动态调整资源分配,保证应用性能满足sla (Service Level Agreements)的要求,同时避免因资源过度使用导致的资源浪费和性能下降。在华为云数据中心对FC架构进行的大规模实验验证表明,与EC相比,FC在管理相同工作负载的情况下,可以减少33%以上的计算资源消耗。此外,FC的实时性能退化检测模型在各种测试环境下的预测误差小于5%,突出了其商业可行性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Services Computing
IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING
CiteScore
11.50
自引率
6.20%
发文量
278
审稿时长
>12 weeks
期刊介绍: IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.
期刊最新文献
LIMR: Intent-Aware Mashup API Recommendation via LLM-Augmented Multi-Scale Fusion Enhancing Real-Time Services in Edge Cloud Data Centers: A Novel Lightweight Virtual Machine Scheduling Approach InfSquad: SLO-aware Serverless Machine Learning Inference with Wasm-assisted Hybrid Functions Container-State-Aware Energy Consumption Minimization for Serverless Functions Interactive Fast Computation Offloading and Resource Allocation: A Joint Optimization Approach for Metaverse Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1