{"title":"评估 Google Cloud Run 上的无服务器机器学习性能","authors":"Prerana Khatiwada, Pranjal Dhakal","doi":"arxiv-2406.16250","DOIUrl":null,"url":null,"abstract":"End-users can get functions-as-a-service from serverless platforms, which\npromise lower hosting costs, high availability, fault tolerance, and dynamic\nflexibility for hosting individual functions known as microservices. Machine\nlearning tools are seen to be reliably useful, and the services created using\nthese tools are in increasing demand on a large scale. The serverless platforms\nare uniquely suited for hosting these machine learning services to be used for\nlarge-scale applications. These platforms are well known for their cost\nefficiency, fault tolerance, resource scaling, robust APIs for communication,\nand global reach. However, machine learning services are different from the\nweb-services in that these serverless platforms were originally designed to\nhost web services. We aimed to understand how these serverless platforms handle\nmachine learning workloads with our study. We examine machine learning\nperformance on one of the serverless platforms - Google Cloud Run, which is a\nGPU-less infrastructure that is not designed for machine learning application\ndeployment.","PeriodicalId":501333,"journal":{"name":"arXiv - CS - Operating Systems","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating Serverless Machine Learning Performance on Google Cloud Run\",\"authors\":\"Prerana Khatiwada, Pranjal Dhakal\",\"doi\":\"arxiv-2406.16250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"End-users can get functions-as-a-service from serverless platforms, which\\npromise lower hosting costs, high availability, fault tolerance, and dynamic\\nflexibility for hosting individual functions known as microservices. Machine\\nlearning tools are seen to be reliably useful, and the services created using\\nthese tools are in increasing demand on a large scale. The serverless platforms\\nare uniquely suited for hosting these machine learning services to be used for\\nlarge-scale applications. These platforms are well known for their cost\\nefficiency, fault tolerance, resource scaling, robust APIs for communication,\\nand global reach. However, machine learning services are different from the\\nweb-services in that these serverless platforms were originally designed to\\nhost web services. We aimed to understand how these serverless platforms handle\\nmachine learning workloads with our study. We examine machine learning\\nperformance on one of the serverless platforms - Google Cloud Run, which is a\\nGPU-less infrastructure that is not designed for machine learning application\\ndeployment.\",\"PeriodicalId\":501333,\"journal\":{\"name\":\"arXiv - CS - Operating Systems\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Operating Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.16250\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Operating Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.16250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
终端用户可以从无服务器平台上获得功能即服务(functions-as-a-service),这些平台承诺较低的托管成本、高可用性、容错性和动态灵活性,以托管被称为微服务(microservices)的单个功能。机器学习工具被认为是可靠有用的,使用这些工具创建的服务在大规模需求中日益增多。无服务器平台非常适合托管这些用于大规模应用的机器学习服务。这些平台以其成本效益、容错、资源扩展、强大的通信 API 和全球覆盖而闻名。然而,机器学习服务不同于网络服务,因为这些无服务器平台最初是为托管网络服务而设计的。我们的研究旨在了解这些无服务器平台如何处理机器学习工作负载。我们研究了无服务器平台之一--谷歌云运行(Google Cloud Run)上的机器学习性能,这是一种无 GPU 的基础设施,并非为机器学习应用部署而设计。
Evaluating Serverless Machine Learning Performance on Google Cloud Run
End-users can get functions-as-a-service from serverless platforms, which
promise lower hosting costs, high availability, fault tolerance, and dynamic
flexibility for hosting individual functions known as microservices. Machine
learning tools are seen to be reliably useful, and the services created using
these tools are in increasing demand on a large scale. The serverless platforms
are uniquely suited for hosting these machine learning services to be used for
large-scale applications. These platforms are well known for their cost
efficiency, fault tolerance, resource scaling, robust APIs for communication,
and global reach. However, machine learning services are different from the
web-services in that these serverless platforms were originally designed to
host web services. We aimed to understand how these serverless platforms handle
machine learning workloads with our study. We examine machine learning
performance on one of the serverless platforms - Google Cloud Run, which is a
GPU-less infrastructure that is not designed for machine learning application
deployment.