{"title":"PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices","authors":"Haoqiong Bian, Dongyang Geng, Haoyang Li, Anastasia Ailamaki","doi":"arxiv-2405.19784","DOIUrl":null,"url":null,"abstract":"Serverless query processing has become increasingly popular due to its\nadvantages, including automated hardware and software management, high\nelasticity, and pay-as-you-go pricing. For users who are not system experts,\nserverless query processing greatly reduces the cost of owning a data analytic\nsystem. However, it is still a significant challenge for non-expert users to\ntransform their complex and evolving data analytic needs into proper SQL\nqueries and select a serverless query engine that delivers satisfactory\nperformance and price for each type of query. This paper presents PixelsDB, an open-source data analytic system that allows\nusers who lack system or SQL expertise to explore data efficiently. It allows\nusers to generate and debug SQL queries using a natural language interface\npowered by fine-tuned language models. The queries are then executed by a\nserverless query engine that offers varying prices for different service levels\non query urgency. The service levels are natively supported by dedicated\narchitecture design and heterogeneous resource scheduling that can apply\ncost-efficient resources to process non-urgent queries. We envision that the\ncombination of a serverless paradigm, a natural-language-aided interface, and\nflexible service levels and prices will substantially improve the user\nexperience in data analysis.","PeriodicalId":501541,"journal":{"name":"arXiv - CS - Human-Computer Interaction","volume":"58 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Human-Computer Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.19784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Serverless query processing has become increasingly popular due to its
advantages, including automated hardware and software management, high
elasticity, and pay-as-you-go pricing. For users who are not system experts,
serverless query processing greatly reduces the cost of owning a data analytic
system. However, it is still a significant challenge for non-expert users to
transform their complex and evolving data analytic needs into proper SQL
queries and select a serverless query engine that delivers satisfactory
performance and price for each type of query. This paper presents PixelsDB, an open-source data analytic system that allows
users who lack system or SQL expertise to explore data efficiently. It allows
users to generate and debug SQL queries using a natural language interface
powered by fine-tuned language models. The queries are then executed by a
serverless query engine that offers varying prices for different service levels
on query urgency. The service levels are natively supported by dedicated
architecture design and heterogeneous resource scheduling that can apply
cost-efficient resources to process non-urgent queries. We envision that the
combination of a serverless paradigm, a natural-language-aided interface, and
flexible service levels and prices will substantially improve the user
experience in data analysis.