Living on the Edge: Serverless Computing and the Cost of Failure Resiliency

2019 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN) Pub Date : 2019-07-01 DOI:10.1109/LANMAN.2019.8846970

Sameer G. Kulkarni, Guyue Liu, K. Ramakrishnan, Timothy Wood

{"title":"Living on the Edge: Serverless Computing and the Cost of Failure Resiliency","authors":"Sameer G. Kulkarni, Guyue Liu, K. Ramakrishnan, Timothy Wood","doi":"10.1109/LANMAN.2019.8846970","DOIUrl":null,"url":null,"abstract":"Serverless computing platforms have gained popularity because they allow easy deployment of services in a highly scalable and cost-effective manner. By enabling just-in-time startup of container-based services, these platforms can achieve good multiplexing and automatically respond to traffic growth, making them particularly desirable for edge cloud data centers where resources are scarce. Edge cloud data centers are also gaining attention because of their promise to provide responsive, low-latency shared computing and storage resources. Bringing serverless capabilities to edge cloud data centers must continue to achieve the goals of low latency and reliability. The reliability guarantees provided by serverless computing however are weak, with node failures causing requests to be dropped or executed multiple times. Thus serverless computing only provides a best effort infrastructure, leaving application developers responsible for implementing stronger reliability guarantees at a higher level. Current approaches for providing stronger semantics such as “exactly once” guarantees could be integrated into serverless platforms, but they come at high cost in terms of both latency and resource consumption. As edge cloud services move towards applications such as autonomous vehicle control that require strong guarantees for both reliability and performance, these approaches may no longer be sufficient. In this paper we evaluate the latency, throughput, and resource costs of providing different reliability guarantees, with a focus on these emerging edge cloud platforms and applications.","PeriodicalId":214356,"journal":{"name":"2019 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LANMAN.2019.8846970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Serverless computing platforms have gained popularity because they allow easy deployment of services in a highly scalable and cost-effective manner. By enabling just-in-time startup of container-based services, these platforms can achieve good multiplexing and automatically respond to traffic growth, making them particularly desirable for edge cloud data centers where resources are scarce. Edge cloud data centers are also gaining attention because of their promise to provide responsive, low-latency shared computing and storage resources. Bringing serverless capabilities to edge cloud data centers must continue to achieve the goals of low latency and reliability. The reliability guarantees provided by serverless computing however are weak, with node failures causing requests to be dropped or executed multiple times. Thus serverless computing only provides a best effort infrastructure, leaving application developers responsible for implementing stronger reliability guarantees at a higher level. Current approaches for providing stronger semantics such as “exactly once” guarantees could be integrated into serverless platforms, but they come at high cost in terms of both latency and resource consumption. As edge cloud services move towards applications such as autonomous vehicle control that require strong guarantees for both reliability and performance, these approaches may no longer be sufficient. In this paper we evaluate the latency, throughput, and resource costs of providing different reliability guarantees, with a focus on these emerging edge cloud platforms and applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

生活在边缘:无服务器计算和故障恢复的成本

无服务器计算平台越来越受欢迎，因为它们允许以高度可伸缩和经济高效的方式轻松部署服务。通过支持基于容器的服务的及时启动，这些平台可以实现良好的多路复用并自动响应流量增长，这使得它们特别适合资源稀缺的边缘云数据中心。边缘云数据中心也因其承诺提供响应迅速、低延迟的共享计算和存储资源而备受关注。将无服务器功能引入边缘云数据中心必须继续实现低延迟和可靠性的目标。然而，无服务器计算提供的可靠性保证很弱，节点故障会导致请求被丢弃或多次执行。因此，无服务器计算只提供了尽力而为的基础设施，而让应用程序开发人员负责在更高的级别上实现更强的可靠性保证。目前提供更强语义(如“只一次”保证)的方法可以集成到无服务器平台中，但它们在延迟和资源消耗方面的成本都很高。随着边缘云服务转向自动驾驶汽车控制等需要可靠性和性能强有力保证的应用，这些方法可能不再足够。在本文中，我们评估了提供不同可靠性保证的延迟、吞吐量和资源成本，重点关注这些新兴的边缘云平台和应用程序。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN)

自引率

0.00%

发文量

期刊最新文献

LANMAN 2019 Copyright Page H2NDN: Supporting Connected Vehicle Applications with Hierarchical Hyperbolic NDN Resource optimization in Visible Light Communication for Internet of Things Managing Background Traffic in Cellular Networks Living on the Edge: Serverless Computing and the Cost of Failure Resiliency