{"title":"Scalable and Cost-effective Serverless Architecture for Information Extraction Workflows","authors":"Dheeraj Chahal, S. Palepu, Rekha Singhal","doi":"10.1145/3526060.3535458","DOIUrl":null,"url":null,"abstract":"Information extraction from an image or scanned document is a complex and challenging process since it involves recognizing various visual structures such as tables, boxes, logos, text, charts, etc. Hence, the content extraction applications contain a pipeline of multiple computer vision algorithms, APIs, and models. Deploying such applications for document processing requires a resilient system to deliver high performance. Such applications can be deployed on cloud to leverage the flexible infrastructure and multiple supporting services available there. In this paper, we discuss a scalable and high performance architecture using a serverless platform for deploying information extraction workflows consisting of multiple APIs and computer vision models. Our experiments show that the use of a serverless platform results in a scalable, cost-effective, and low latency deployment of such workflows. Moreover, we discuss the performance and cost trade-offs while choosing cloud services and their configuration. We also show that the use of workload characterization-based performance and cost models to find the optimal serverless instance configuration results in a significant deployment cost reduction.","PeriodicalId":223581,"journal":{"name":"Proceedings of the 2nd Workshop on High Performance Serverless Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd Workshop on High Performance Serverless Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3526060.3535458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Information extraction from an image or scanned document is a complex and challenging process since it involves recognizing various visual structures such as tables, boxes, logos, text, charts, etc. Hence, the content extraction applications contain a pipeline of multiple computer vision algorithms, APIs, and models. Deploying such applications for document processing requires a resilient system to deliver high performance. Such applications can be deployed on cloud to leverage the flexible infrastructure and multiple supporting services available there. In this paper, we discuss a scalable and high performance architecture using a serverless platform for deploying information extraction workflows consisting of multiple APIs and computer vision models. Our experiments show that the use of a serverless platform results in a scalable, cost-effective, and low latency deployment of such workflows. Moreover, we discuss the performance and cost trade-offs while choosing cloud services and their configuration. We also show that the use of workload characterization-based performance and cost models to find the optimal serverless instance configuration results in a significant deployment cost reduction.