Proceedings of the 2nd Workshop on High Performance Serverless Computing最新文献

英文中文

PONCHO: Dynamic Package Synthesis for Distributed and Serverless Python Applications PONCHO:分布式和无服务器Python应用程序的动态包合成

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3526060.3535459

Barry Sly-Delgado, Nick Locascio, David Simonetti, B. Wiseman, Benjamín Tovar, D. Thain

An increasing number of distributed applications operate by dispatching function invocations across the nodes of a distributed system. To operate correctly, the code and data dependencies of the function must be distributed along with the invocations in some way. When translating applications to work on large scale distributed systems, managing these dependencies becomes challenging: delivery must be scalable to thousands of nodes; the dependencies must be consistent across the system; and the method must be usable by an unprivileged developer. As a solution, in this paper we present PONCHO, which is a lightweight Python based toolkit which allows users to discover, package, and deploy dependencies as an integral part of distributed applications. PONCHO encapsulates a set of commands to be executed within an environment. PONCHO offers a lightweight solution to create and manage environments increasing the portability of scientific applications as well as reproducibility. In this paper, we evaluate PONCHO with real-world applications in the fields of physics, computational chemistry, and hyperparameter optimization, We observe the challenges that arise when creating and distributing an environment and measure the overheads that emerge as a result.

越来越多的分布式应用程序通过在分布式系统的节点之间调度函数调用来运行。为了正确操作，函数的代码和数据依赖关系必须以某种方式与调用一起分发。当将应用程序转换为在大型分布式系统上工作时，管理这些依赖关系变得具有挑战性:交付必须可扩展到数千个节点;依赖关系必须在整个系统中保持一致;并且该方法必须可供非特权开发人员使用。作为解决方案，在本文中，我们提出了PONCHO，这是一个轻量级的基于Python的工具包，它允许用户发现、打包和部署依赖项，作为分布式应用程序的一个组成部分。PONCHO封装了一组要在环境中执行的命令。PONCHO提供了一种轻量级的解决方案来创建和管理环境，提高了科学应用的可移植性和可重复性。在本文中，我们评估了PONCHO在物理、计算化学和超参数优化领域的实际应用，我们观察了在创建和分配环境时出现的挑战，并测量了由此产生的开销。

{"title":"PONCHO: Dynamic Package Synthesis for Distributed and Serverless Python Applications","authors":"Barry Sly-Delgado, Nick Locascio, David Simonetti, B. Wiseman, Benjamín Tovar, D. Thain","doi":"10.1145/3526060.3535459","DOIUrl":"https://doi.org/10.1145/3526060.3535459","url":null,"abstract":"An increasing number of distributed applications operate by dispatching function invocations across the nodes of a distributed system. To operate correctly, the code and data dependencies of the function must be distributed along with the invocations in some way. When translating applications to work on large scale distributed systems, managing these dependencies becomes challenging: delivery must be scalable to thousands of nodes; the dependencies must be consistent across the system; and the method must be usable by an unprivileged developer. As a solution, in this paper we present PONCHO, which is a lightweight Python based toolkit which allows users to discover, package, and deploy dependencies as an integral part of distributed applications. PONCHO encapsulates a set of commands to be executed within an environment. PONCHO offers a lightweight solution to create and manage environments increasing the portability of scientific applications as well as reproducibility. In this paper, we evaluate PONCHO with real-world applications in the fields of physics, computational chemistry, and hyperparameter optimization, We observe the challenges that arise when creating and distributing an environment and measure the overheads that emerge as a result.","PeriodicalId":223581,"journal":{"name":"Proceedings of the 2nd Workshop on High Performance Serverless Computing","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123433240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Session details: Keynote Talk 会议详情:主题演讲

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3545163

I. Foster

引用次数: 0

Scalable and Cost-effective Serverless Architecture for Information Extraction Workflows 用于信息提取工作流的可扩展且经济高效的无服务器架构

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3526060.3535458

Dheeraj Chahal, S. Palepu, Rekha Singhal

Information extraction from an image or scanned document is a complex and challenging process since it involves recognizing various visual structures such as tables, boxes, logos, text, charts, etc. Hence, the content extraction applications contain a pipeline of multiple computer vision algorithms, APIs, and models. Deploying such applications for document processing requires a resilient system to deliver high performance. Such applications can be deployed on cloud to leverage the flexible infrastructure and multiple supporting services available there. In this paper, we discuss a scalable and high performance architecture using a serverless platform for deploying information extraction workflows consisting of multiple APIs and computer vision models. Our experiments show that the use of a serverless platform results in a scalable, cost-effective, and low latency deployment of such workflows. Moreover, we discuss the performance and cost trade-offs while choosing cloud services and their configuration. We also show that the use of workload characterization-based performance and cost models to find the optimal serverless instance configuration results in a significant deployment cost reduction.

从图像或扫描文档中提取信息是一个复杂而具有挑战性的过程，因为它涉及识别各种视觉结构，如表格、框、徽标、文本、图表等。因此，内容提取应用程序包含多种计算机视觉算法、api和模型的管道。为文档处理部署这样的应用程序需要一个弹性系统来提供高性能。这样的应用程序可以部署在云上，以利用灵活的基础设施和多种可用的支持服务。在本文中，我们讨论了一个可扩展的高性能架构，使用无服务器平台来部署由多个api和计算机视觉模型组成的信息提取工作流。我们的实验表明，使用无服务器平台可以实现此类工作流的可伸缩、经济高效和低延迟部署。此外，我们还讨论了在选择云服务及其配置时的性能和成本权衡。我们还展示了使用基于工作负载特征的性能和成本模型来找到最佳的无服务器实例配置可以显著降低部署成本。

{"title":"Scalable and Cost-effective Serverless Architecture for Information Extraction Workflows","authors":"Dheeraj Chahal, S. Palepu, Rekha Singhal","doi":"10.1145/3526060.3535458","DOIUrl":"https://doi.org/10.1145/3526060.3535458","url":null,"abstract":"Information extraction from an image or scanned document is a complex and challenging process since it involves recognizing various visual structures such as tables, boxes, logos, text, charts, etc. Hence, the content extraction applications contain a pipeline of multiple computer vision algorithms, APIs, and models. Deploying such applications for document processing requires a resilient system to deliver high performance. Such applications can be deployed on cloud to leverage the flexible infrastructure and multiple supporting services available there. In this paper, we discuss a scalable and high performance architecture using a serverless platform for deploying information extraction workflows consisting of multiple APIs and computer vision models. Our experiments show that the use of a serverless platform results in a scalable, cost-effective, and low latency deployment of such workflows. Moreover, we discuss the performance and cost trade-offs while choosing cloud services and their configuration. We also show that the use of workload characterization-based performance and cost models to find the optimal serverless instance configuration results in a significant deployment cost reduction.","PeriodicalId":223581,"journal":{"name":"Proceedings of the 2nd Workshop on High Performance Serverless Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127645604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Session details: Workshop Presentations 会议详情:研讨会演讲

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3545164

K. Chard

引用次数: 0

Enabling Stateful Functions for Stream Processing in the Programmable Data Plane 开启可编程数据平面流处理的有状态功能

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3526060.3535461

Sabra Ossen, Lucas R. B. Brasilino, Luke Dalessandro, D. M. Swany

Sensor-rich environments are crucial components of the Internet of Things ecosystem and benefit from real-time applications. Many applications perform real-time analytics on these IoT workloads by performing continuous stream processing for a window of sequence data elements. However, executing light-weight stateful functions on server CPUs adds to the communication latency of each small message in a high data rate environment, primarily due to messages traveling through a complex network stack to reach the CPU. Thus, we present an in-network function deployment architecture with low latency and low resource footprint by introducing a new compute layer. We propose an FPGA-based Switch/NIC prototype with a compute layer utilizing RISC-V soft cores and High-Level Synthesis modules. We evaluate the design for two microbenchmarks on a Zynq 7000 FPGA each, achieving less than 10 μs in latency and consuming less than 6 % of resources.

传感器丰富的环境是物联网生态系统的关键组成部分，并受益于实时应用。许多应用程序通过对序列数据元素窗口执行连续流处理来对这些物联网工作负载进行实时分析。但是，在服务器CPU上执行轻量级的有状态函数会增加高数据速率环境中每个小消息的通信延迟，这主要是由于消息要通过复杂的网络堆栈才能到达CPU。因此，我们通过引入新的计算层，提出了一种具有低延迟和低资源占用的网络内功能部署体系结构。我们提出了一个基于fpga的交换机/网卡原型，其计算层利用RISC-V软核和高级合成模块。我们在Zynq 7000 FPGA上分别对两个微基准测试进行了评估，实现了小于10 μs的延迟和小于6%的资源消耗。

引用次数: 2

Benchmarking the Data Layer Across Serverless Platforms 跨无服务器平台的数据层基准测试

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3526060.3535460

S. Palepu, Dheeraj Chahal, M. Ramesh, Rekha Singhal

The use of highly scalable serverless platforms for web microservices and IoT applications is well known. However, their use for data-intensive applications is restricted due to the stateless nature of serverless functions. Any data retrieval, storage, and the peer-to-peer communication requirement of an application in serverless deployment is satisfied using cloud storage services such as object storage, database, cache, etc. The heterogeneous cloud storage services offered by different cloud service providers have unique deliverable performance. One key challenge is to find the maximum achievable data transfer rate from serverless platforms to cloud storage services. In this work, we evaluate the performance of storage systems available for use with serverless platforms from multiple cloud vendors. Additionally, we examine the effect of cloud service characteristics and configurations on accelerating the data transfer between them. The experimental data provides some key insights to assist with designing application architecture using serverless platforms and cloud storage services.

在web微服务和物联网应用中使用高度可扩展的无服务器平台是众所周知的。然而，由于无服务器功能的无状态特性，它们在数据密集型应用程序中的使用受到限制。在无服务器部署中，应用程序的任何数据检索、存储和点对点通信需求都可以使用云存储服务(如对象存储、数据库、缓存等)来满足。不同云服务提供商提供的异构云存储服务具有独特的可交付性能。一个关键的挑战是找到从无服务器平台到云存储服务的最大可实现数据传输速率。在这项工作中，我们评估了可用于多个云供应商的无服务器平台的存储系统的性能。此外，我们还研究了云服务特征和配置对加速它们之间数据传输的影响。实验数据为使用无服务器平台和云存储服务设计应用程序架构提供了一些关键的见解。

引用次数: 1

Bringing Serverless Computing to the HPC Community 将无服务器计算带入高性能计算社区

Proceedings of the 2nd Workshop on High Performance Serverless Computing

Pub Date : 2022-06-30 DOI: 10.1145/3526060.3536352

Devesh Tiwari

The next wave of cloud computing - the serverless computing model - is enjoying adoption at scale by different cloud computing vendors. The serverless computing model is already rapidly accelerating the development and deployment of enterprise applications. Unfortunately, the HPC community appears to be left behind in the revolution. The widespread assumption or opinion is that the serverless computing model is not beneficial for the HPC community because the serverless computing model was not primarily designed for scientific computing and users. In this talk, I promise to do three things. First, I'll show why existing popular opinion about HPC and serverless is pre-mature and inaccurate. Second, I'll demonstrate how HPC users and programmers can leverage the serverless computing model for mitigating some of the long-standing challenges and causes of frustration in HPC resource management. In the spirit of transparency, I admit that the path to effectively leveraging serverless for traditional HPC is challenging and has long stretches of darkness. I will identify and discuss some lamp posts on this not-so-well-lit path. Finally, I'll share my perspective on what new lampposts we should build as a community and where we should put them.

云计算的下一波浪潮——无服务器计算模型——正在被不同的云计算供应商大规模采用。无服务器计算模型已经在迅速加速企业应用程序的开发和部署。不幸的是，HPC社区似乎在这场革命中落在了后面。普遍的假设或观点是，无服务器计算模型对HPC社区没有好处，因为无服务器计算模型主要不是为科学计算和用户设计的。在这次演讲中，我承诺做三件事。首先，我将说明为什么现有的关于HPC和无服务器的流行观点是不成熟和不准确的。其次，我将演示HPC用户和程序员如何利用无服务器计算模型来减轻HPC资源管理中一些长期存在的挑战和令人沮丧的原因。本着透明的精神，我承认有效利用传统HPC的无服务器之路是具有挑战性的，并且有很长一段时间的黑暗。我将在这条光线不太好的道路上识别并讨论一些灯柱。最后，我将分享我的观点，关于我们应该建立什么样的新灯柱，以及我们应该把它们放在哪里。

{"title":"Bringing Serverless Computing to the HPC Community","authors":"Devesh Tiwari","doi":"10.1145/3526060.3536352","DOIUrl":"https://doi.org/10.1145/3526060.3536352","url":null,"abstract":"The next wave of cloud computing - the serverless computing model - is enjoying adoption at scale by different cloud computing vendors. The serverless computing model is already rapidly accelerating the development and deployment of enterprise applications. Unfortunately, the HPC community appears to be left behind in the revolution. The widespread assumption or opinion is that the serverless computing model is not beneficial for the HPC community because the serverless computing model was not primarily designed for scientific computing and users. In this talk, I promise to do three things. First, I'll show why existing popular opinion about HPC and serverless is pre-mature and inaccurate. Second, I'll demonstrate how HPC users and programmers can leverage the serverless computing model for mitigating some of the long-standing challenges and causes of frustration in HPC resource management. In the spirit of transparency, I admit that the path to effectively leveraging serverless for traditional HPC is challenging and has long stretches of darkness. I will identify and discuss some lamp posts on this not-so-well-lit path. Finally, I'll share my perspective on what new lampposts we should build as a community and where we should put them.","PeriodicalId":223581,"journal":{"name":"Proceedings of the 2nd Workshop on High Performance Serverless Computing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124711823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 2nd Workshop on High Performance Serverless Computing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀