PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices

Haoqiong Bian, Dongyang Geng, Haoyang Li, Anastasia Ailamaki
{"title":"PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices","authors":"Haoqiong Bian, Dongyang Geng, Haoyang Li, Anastasia Ailamaki","doi":"arxiv-2405.19784","DOIUrl":null,"url":null,"abstract":"Serverless query processing has become increasingly popular due to its\nadvantages, including automated hardware and software management, high\nelasticity, and pay-as-you-go pricing. For users who are not system experts,\nserverless query processing greatly reduces the cost of owning a data analytic\nsystem. However, it is still a significant challenge for non-expert users to\ntransform their complex and evolving data analytic needs into proper SQL\nqueries and select a serverless query engine that delivers satisfactory\nperformance and price for each type of query. This paper presents PixelsDB, an open-source data analytic system that allows\nusers who lack system or SQL expertise to explore data efficiently. It allows\nusers to generate and debug SQL queries using a natural language interface\npowered by fine-tuned language models. The queries are then executed by a\nserverless query engine that offers varying prices for different service levels\non query urgency. The service levels are natively supported by dedicated\narchitecture design and heterogeneous resource scheduling that can apply\ncost-efficient resources to process non-urgent queries. We envision that the\ncombination of a serverless paradigm, a natural-language-aided interface, and\nflexible service levels and prices will substantially improve the user\nexperience in data analysis.","PeriodicalId":501541,"journal":{"name":"arXiv - CS - Human-Computer Interaction","volume":"58 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Human-Computer Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.19784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Serverless query processing has become increasingly popular due to its advantages, including automated hardware and software management, high elasticity, and pay-as-you-go pricing. For users who are not system experts, serverless query processing greatly reduces the cost of owning a data analytic system. However, it is still a significant challenge for non-expert users to transform their complex and evolving data analytic needs into proper SQL queries and select a serverless query engine that delivers satisfactory performance and price for each type of query. This paper presents PixelsDB, an open-source data analytic system that allows users who lack system or SQL expertise to explore data efficiently. It allows users to generate and debug SQL queries using a natural language interface powered by fine-tuned language models. The queries are then executed by a serverless query engine that offers varying prices for different service levels on query urgency. The service levels are natively supported by dedicated architecture design and heterogeneous resource scheduling that can apply cost-efficient resources to process non-urgent queries. We envision that the combination of a serverless paradigm, a natural-language-aided interface, and flexible service levels and prices will substantially improve the user experience in data analysis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PixelsDB:服务等级和价格灵活的无服务器和自然语言辅助数据分析技术
无服务器查询处理因其自动化软硬件管理、高弹性和现收现付等优势而越来越受欢迎。对于不是系统专家的用户来说,无服务器查询处理大大降低了拥有数据分析系统的成本。然而,对于非专业用户来说,如何将复杂且不断变化的数据分析需求转化为适当的 SQL 查询,并为每种类型的查询选择一个性能和价格都令人满意的无服务器查询引擎,仍然是一个巨大的挑战。本文介绍的 PixelsDB 是一个开源数据分析系统,它允许缺乏系统或 SQL 专业知识的用户高效地探索数据。它允许用户使用由微调语言模型支持的自然语言界面生成和调试 SQL 查询。然后,查询由无服务器查询引擎执行,该引擎根据查询的紧急程度为不同的服务级别提供不同的价格。这些服务级别由专用架构设计和异构资源调度提供本机支持,可将具有成本效益的资源用于处理非紧急查询。我们认为,无服务器范式、自然语言辅助界面以及灵活的服务级别和价格的组合将大大改善数据分析的使用体验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Equimetrics -- Applying HAR principles to equestrian activities AI paintings vs. Human Paintings? Deciphering Public Interactions and Perceptions towards AI-Generated Paintings on TikTok From Data Stories to Dialogues: A Randomised Controlled Trial of Generative AI Agents and Data Storytelling in Enhancing Data Visualisation Comprehension Exploring Gaze Pattern in Autistic Children: Clustering, Visualization, and Prediction Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1