DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-Based Clusters

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-07-25 DOI:10.1109/TSC.2024.3433388

Haoyu Bai;Minxian Xu;Kejiang Ye;Rajkumar Buyya;Chengzhong Xu

{"title":"DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-Based Clusters","authors":"Haoyu Bai;Minxian Xu;Kejiang Ye;Rajkumar Buyya;Chengzhong Xu","doi":"10.1109/TSC.2024.3433388","DOIUrl":null,"url":null,"abstract":"Microservices have transformed monolithic applications into lightweight, self-contained, and isolated application components, establishing themselves as a dominant paradigm for application development and deployment in public clouds such as Google and Alibaba. Autoscaling emerges as an efficient strategy for managing resources allocated to microservices’ replicas. However, the dynamic and intricate dependencies within microservice chains present challenges to the effective management of scaled microservices. Additionally, the centralized autoscaling approach can encounter scalability issues, especially in the management of large-scale microservice-based clusters. To address these challenges and enhance scalability, we propose an innovative distributed resource provisioning approach for microservices based on the Twin Delayed Deep Deterministic Policy Gradient algorithm. This approach enables effective autoscaling decisions and decentralizes responsibilities from a central node to distributed nodes. Comparative results with state-of-the-art approaches, obtained from a realistic testbed and traces, indicate that our approach reduces the average response time by 15% and the number of failed requests by 24%, validating improved scalability as the number of requests increases.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"3473-3484"},"PeriodicalIF":5.8000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10609536/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Microservices have transformed monolithic applications into lightweight, self-contained, and isolated application components, establishing themselves as a dominant paradigm for application development and deployment in public clouds such as Google and Alibaba. Autoscaling emerges as an efficient strategy for managing resources allocated to microservices’ replicas. However, the dynamic and intricate dependencies within microservice chains present challenges to the effective management of scaled microservices. Additionally, the centralized autoscaling approach can encounter scalability issues, especially in the management of large-scale microservice-based clusters. To address these challenges and enhance scalability, we propose an innovative distributed resource provisioning approach for microservices based on the Twin Delayed Deep Deterministic Policy Gradient algorithm. This approach enables effective autoscaling decisions and decentralizes responsibilities from a central node to distributed nodes. Comparative results with state-of-the-art approaches, obtained from a realistic testbed and traces, indicate that our approach reduces the average response time by 15% and the number of failed requests by 24%, validating improved scalability as the number of requests increases.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DRPC：基于容器的集群中可扩展资源调配的分布式强化学习方法

微服务已经将单一的应用程序转变为轻量级的、自包含的、独立的应用程序组件，并将自己确立为谷歌和阿里巴巴等公共云中应用程序开发和部署的主导范例。自动伸缩作为一种有效的策略出现，用于管理分配给微服务副本的资源。然而，微服务链中动态和复杂的依赖关系对有效管理规模化微服务提出了挑战。此外，集中式自动伸缩方法可能会遇到可伸缩性问题，特别是在管理基于微服务的大规模集群时。为了应对这些挑战并增强可扩展性，我们提出了一种基于双延迟深度确定性策略梯度算法的微服务分布式资源配置方法。这种方法支持有效的自动伸缩决策，并将责任从中心节点分散到分布式节点。从实际的测试平台和跟踪中获得的最先进的方法的比较结果表明，我们的方法将平均响应时间减少了15%，失败请求的数量减少了24%，随着请求数量的增加，验证了改进的可伸缩性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

11.50

自引率

6.20%

发文量

278

审稿时长

>12 weeks

期刊介绍： IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.