首页 > 最新文献

ACM SIGMOD Record最新文献

英文 中文
Enterprise Platform and Integration Concepts Research at HPI HPI企业平台与集成概念研究
Pub Date : 2023-01-09 DOI: 10.1145/3582302.3582322
M. Perscheid, H. Plattner, Daniel Ritter, R. Schlosser, Ralf Teusner
The Hasso Plattner Institute (HPI), academically structured as the independent Faculty of Digital Engineering at the University of Potsdam, unites computer science research and teaching with the advantages of a privately financed institute and a tuition-free study program. Founder and namesake of the institute is the SAP co-founder Hasso Plattner, who also heads the Enterprise Platform and Integration Concepts (EPIC) research center which focuses on the technical aspects of business software with a vision to provide the fastest way to get insights out of enterprise data. Founded in 2006, the EPIC combines three research groups comprising autonomous data management, enterprise software engineering, and data-driven decision support.
哈索普拉特纳研究所(HPI)是波茨坦大学独立的数字工程学院,将计算机科学研究和教学与私人资助机构和免学费学习计划的优势结合起来。该研究所的创始人和同名创始人是SAP的联合创始人哈索•普拉特纳(Hasso Plattner),他还领导着企业平台和集成概念(EPIC)研究中心,该中心专注于商业软件的技术方面,旨在提供从企业数据中获得洞察力的最快方法。EPIC成立于2006年,由三个研究小组组成,包括自主数据管理、企业软件工程和数据驱动决策支持。
{"title":"Enterprise Platform and Integration Concepts Research at HPI","authors":"M. Perscheid, H. Plattner, Daniel Ritter, R. Schlosser, Ralf Teusner","doi":"10.1145/3582302.3582322","DOIUrl":"https://doi.org/10.1145/3582302.3582322","url":null,"abstract":"The Hasso Plattner Institute (HPI), academically structured as the independent Faculty of Digital Engineering at the University of Potsdam, unites computer science research and teaching with the advantages of a privately financed institute and a tuition-free study program. Founder and namesake of the institute is the SAP co-founder Hasso Plattner, who also heads the Enterprise Platform and Integration Concepts (EPIC) research center which focuses on the technical aspects of business software with a vision to provide the fastest way to get insights out of enterprise data. Founded in 2006, the EPIC combines three research groups comprising autonomous data management, enterprise software engineering, and data-driven decision support.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129297675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chenggang Wu Speaks Out on his ACM SIGMOD Jim Gray Dissertation Award, Rejection, Believing in Your Work, and More 吴成刚谈到了他的ACM SIGMOD吉姆·格雷博士论文奖,拒绝,相信你的工作等等
Pub Date : 2023-01-09 DOI: 10.1145/3582302.3582318
Chenggang Wu
Welcome to this installment of ACM SIGMOD Record's series of interviews with distinguished members of the database community. I'm Marianne Winslett, and today we are on Zoom with Chenggang Wu, co-founder and CTO of Aqueduct. Chenggang received the 2022 ACM SIGMOD Jim Gray Dissertation Award for his thesis entitled The Design of Any-scale Serverless Infrastructure with Rich Consistency Guarantees. His PhD is from UC Berkeley. So, Chenggang, welcome!
欢迎来到ACM SIGMOD Record对数据库社区杰出成员的系列访谈的这一期。我是Marianne Winslett,今天我们和Aqueduct的联合创始人兼首席技术官吴成刚一起上Zoom。程刚博士的论文《具有丰富一致性保证的任意规模无服务器基础设施的设计》获得了2022年ACM SIGMOD Jim Gray博士论文奖。他的博士学位来自加州大学伯克利分校。所以,成钢,欢迎你!
{"title":"Chenggang Wu Speaks Out on his ACM SIGMOD Jim Gray Dissertation Award, Rejection, Believing in Your Work, and More","authors":"Chenggang Wu","doi":"10.1145/3582302.3582318","DOIUrl":"https://doi.org/10.1145/3582302.3582318","url":null,"abstract":"Welcome to this installment of ACM SIGMOD Record's series of interviews with distinguished members of the database community. I'm Marianne Winslett, and today we are on Zoom with Chenggang Wu, co-founder and CTO of Aqueduct. Chenggang received the 2022 ACM SIGMOD Jim Gray Dissertation Award for his thesis entitled The Design of Any-scale Serverless Infrastructure with Rich Consistency Guarantees. His PhD is from UC Berkeley. So, Chenggang, welcome!","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121636289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative Data Science using Scalable Homoiconicity 使用可扩展同象性的协同数据科学
Pub Date : 2023-01-09 DOI: 10.1145/3582302.3582316
H. Pirk
Motivation: Data science is increasingly collaborative. On the one hand, results need to be distributed, e.g., as interactive visualizations. On the other, collaboration in the data development process improves quality and timeliness. This can take many forms: partitioning a problem and working on aspects in parallel, exploring different solutions or reviewing someone else's work.
动机:数据科学的协作性越来越强。一方面,结果需要分发,例如,作为交互式可视化。另一方面,数据开发过程中的协作提高了质量和及时性。这可以采取多种形式:划分问题并并行处理各个方面,探索不同的解决方案或审查其他人的工作。
{"title":"Collaborative Data Science using Scalable Homoiconicity","authors":"H. Pirk","doi":"10.1145/3582302.3582316","DOIUrl":"https://doi.org/10.1145/3582302.3582316","url":null,"abstract":"Motivation: Data science is increasingly collaborative. On the one hand, results need to be distributed, e.g., as interactive visualizations. On the other, collaboration in the data development process improves quality and timeliness. This can take many forms: partitioning a problem and working on aspects in parallel, exploring different solutions or reviewing someone else's work.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130267924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Management of Machine Learning Lifecycle Artifacts 机器学习生命周期工件的管理
Pub Date : 2023-01-09 DOI: 10.1145/3582302.3582306
Marius Schlegel, K. Sattler
The explorative and iterative nature of developing and operating ML applications leads to a variety of artifacts, such as datasets, features, models, hyperparameters, metrics, software, configurations, and logs. In order to enable comparability, reproducibility, and traceability of these artifacts across the ML lifecycle steps and iterations, systems and tools have been developed to support their collection, storage, and management. It is often not obvious what precise functional scope such systems offer so that the comparison and the estimation of synergy effects between candidates are quite challenging. In this paper, we aim to give an overview of systems and platforms which support the management of ML lifecycle artifacts. Based on a systematic literature review, we derive assessment criteria and apply them to a representative selection of more than 60 systems and platforms.
开发和操作ML应用程序的探索性和迭代性导致了各种各样的工件,例如数据集、特征、模型、超参数、度量、软件、配置和日志。为了在ML生命周期步骤和迭代中实现这些工件的可比性、再现性和可追溯性,已经开发了系统和工具来支持它们的收集、存储和管理。这些系统提供的精确功能范围通常并不明显,因此比较和估计候选系统之间的协同效应相当具有挑战性。在本文中,我们旨在概述支持机器学习生命周期工件管理的系统和平台。基于系统的文献回顾,我们得出评估标准,并将其应用于60多个系统和平台的代表性选择。
{"title":"Management of Machine Learning Lifecycle Artifacts","authors":"Marius Schlegel, K. Sattler","doi":"10.1145/3582302.3582306","DOIUrl":"https://doi.org/10.1145/3582302.3582306","url":null,"abstract":"The explorative and iterative nature of developing and operating ML applications leads to a variety of artifacts, such as datasets, features, models, hyperparameters, metrics, software, configurations, and logs. In order to enable comparability, reproducibility, and traceability of these artifacts across the ML lifecycle steps and iterations, systems and tools have been developed to support their collection, storage, and management. It is often not obvious what precise functional scope such systems offer so that the comparison and the estimation of synergy effects between candidates are quite challenging. In this paper, we aim to give an overview of systems and platforms which support the management of ML lifecycle artifacts. Based on a systematic literature review, we derive assessment criteria and apply them to a representative selection of more than 60 systems and platforms.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123753362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
PDQ 2.0 PDQ 2.0
Pub Date : 2023-01-09 DOI: 10.1145/3582302.3582308
M. Benedikt, Fergus Cooper, Stefano Germano, Gabor Gyorkei, Efthymia Tsamoura, Brandon Moore, Camilo Ortiz
Reasoning-based query planning has been explored in many contexts, including relational data integration, the SemanticWeb, and query reformulation. But infrastructure to build reasoning-based optimization in the relational context has been slow to develop. We overview PDQ 2.0, a platform supporting a number of reasoningenhanced querying tasks. We focus on a major goal of PDQ 2.0: obtaining a more modular and flexible architecture for reasoning-based query optimization.
基于推理的查询计划已经在许多上下文中进行了探索,包括关系数据集成、语义web和查询重新表述。但是在关系环境中构建基于推理的优化的基础设施发展缓慢。我们概述PDQ 2.0,这是一个支持许多推理增强查询任务的平台。我们主要关注PDQ 2.0的一个主要目标:为基于推理的查询优化获得更加模块化和灵活的体系结构。
{"title":"PDQ 2.0","authors":"M. Benedikt, Fergus Cooper, Stefano Germano, Gabor Gyorkei, Efthymia Tsamoura, Brandon Moore, Camilo Ortiz","doi":"10.1145/3582302.3582308","DOIUrl":"https://doi.org/10.1145/3582302.3582308","url":null,"abstract":"Reasoning-based query planning has been explored in many contexts, including relational data integration, the SemanticWeb, and query reformulation. But infrastructure to build reasoning-based optimization in the relational context has been slow to develop. We overview PDQ 2.0, a platform supporting a number of reasoningenhanced querying tasks. We focus on a major goal of PDQ 2.0: obtaining a more modular and flexible architecture for reasoning-based query optimization.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"323 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122709335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The World of Graph Databases from An Industry Perspective 从行业角度看图数据库的世界
Pub Date : 2022-11-23 DOI: 10.1145/3582302.3582320
Yuanyuan Tian
Rapidly growing social networks and other graph data have created a high demand for graph technologies in the market. A plethora of graph databases, systems, and solutions have emerged, as a result. On the other hand, graph has long been a well studied area in the database research community. Despite the numerous surveys on various graph research topics, there is a lack of survey on graph technologies from an industry perspective. The purpose of this paper is to provide the research community with an industrial perspective on the graph database landscape, so that graph researcher can better understand the industry trend and the challenges that the industry is facing, and work on solutions to help address these problems.
快速增长的社交网络和其他图形数据在市场上对图形技术产生了很高的需求。因此,出现了大量的图形数据库、系统和解决方案。另一方面,图长期以来一直是数据库研究界研究的一个很好的领域。尽管对各种图形研究主题的调查很多,但从行业角度对图形技术的调查还很缺乏。本文的目的是为研究社区提供图形数据库景观的行业视角,以便图形研究人员能够更好地了解行业趋势和行业面临的挑战,并致力于解决这些问题的解决方案。
{"title":"The World of Graph Databases from An Industry Perspective","authors":"Yuanyuan Tian","doi":"10.1145/3582302.3582320","DOIUrl":"https://doi.org/10.1145/3582302.3582320","url":null,"abstract":"Rapidly growing social networks and other graph data have created a high demand for graph technologies in the market. A plethora of graph databases, systems, and solutions have emerged, as a result. On the other hand, graph has long been a well studied area in the database research community. Despite the numerous surveys on various graph research topics, there is a lack of survey on graph technologies from an industry perspective. The purpose of this paper is to provide the research community with an industrial perspective on the graph database landscape, so that graph researcher can better understand the industry trend and the challenges that the industry is facing, and work on solutions to help address these problems.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115885408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
The Formidable Mid-Career Crisis 可怕的职业中期危机
Pub Date : 2022-11-21 DOI: 10.1145/3572751.3572761
A. Ailamaki
My high school grades were top except for one subject: composition. Free text was (and still is) my absolute nightmare. After high school I only had to do technical writing, which is much easier: it boils down to math. Fact, supporting evidence, implication, which leads to another fact, repeat. So, when Tamer asked me to write a piece about mid-career challenges, I was excited at first, and then I was terrified. I wrote five outlines and veto'ed them all. "I am not good at this," I wanted to say, "ask somebody else!" But, then I remembered - this happens every time I get into unknown territory.
我的高中成绩除了一门功课外都是最好的,那就是作文。免费文本过去是(现在仍然是)我的噩梦。高中毕业后,我只需要做技术写作,这要容易得多:它归结为数学。事实,支持证据,暗示,导致另一个事实,重复。所以,当Tamer让我写一篇关于职业中期挑战的文章时,我一开始很兴奋,然后就害怕了。我写了五个大纲,都否决了。“我不擅长这个,”我想说,“问别人吧!”但是,后来我想起来了——每次我进入未知领域都会发生这种情况。
{"title":"The Formidable Mid-Career Crisis","authors":"A. Ailamaki","doi":"10.1145/3572751.3572761","DOIUrl":"https://doi.org/10.1145/3572751.3572761","url":null,"abstract":"My high school grades were top except for one subject: composition. Free text was (and still is) my absolute nightmare. After high school I only had to do technical writing, which is much easier: it boils down to math. Fact, supporting evidence, implication, which leads to another fact, repeat. So, when Tamer asked me to write a piece about mid-career challenges, I was excited at first, and then I was terrified. I wrote five outlines and veto'ed them all. \"I am not good at this,\" I wanted to say, \"ask somebody else!\" But, then I remembered - this happens every time I get into unknown territory.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115044726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VLDB Scalable Data Science Category VLDB可扩展数据科学类别
Pub Date : 2022-11-21 DOI: 10.1145/3572751.3572769
Arun C. S. Kumar
As part of the International Conference on Very Large Data Bases (VLDB) 2021 / Proceedings of the VLDB Endowment Volume 14, a new Research Track category named Scalable Data Science (SDS) was launched [2, 6]. The goal of SDS is to attract cutting-edge and impactful real-world work in the scalable data science arena to enhance the impact and visibility of the VLDB community on data science practice, spur new technical connections, and inspire new follow-on research. The inaugural year proved to be successful, with numerous interesting papers from a wide cross section of both industry and academia, spanning several data science topics, and originating from several countries around the world. In this report, we reflect on the inaugural year of SDS with some statistics on both submissions and accepted papers, SDS invited talks, and our observations, lessons, and tips as inaugural Associate Editors for SDS. We hope this article is helpful to future authors, reviewers, and organizers of SDS, as well as other interested members of the wider database / data management community and beyond.
作为超大型数据库国际会议(VLDB) 2021 / VLDB捐赠文集第14卷的一部分,推出了一个名为可扩展数据科学(SDS)的新研究轨道类别[2,6]。SDS的目标是在可扩展的数据科学领域吸引前沿和有影响力的现实世界工作,以增强VLDB社区对数据科学实践的影响和可见性,促进新的技术联系,并激发新的后续研究。首届大会是成功的,来自工业界和学术界的许多有趣的论文,涵盖了几个数据科学主题,来自世界各地的几个国家。在这份报告中,我们回顾了SDS的第一年,包括提交和接受的论文的一些统计数据,SDS邀请的演讲,以及我们作为SDS首任副编辑的观察、教训和建议。我们希望本文对未来的作者、审稿人和SDS的组织者,以及更广泛的数据库/数据管理社区的其他感兴趣的成员有所帮助。
{"title":"VLDB Scalable Data Science Category","authors":"Arun C. S. Kumar","doi":"10.1145/3572751.3572769","DOIUrl":"https://doi.org/10.1145/3572751.3572769","url":null,"abstract":"As part of the International Conference on Very Large Data Bases (VLDB) 2021 / Proceedings of the VLDB Endowment Volume 14, a new Research Track category named Scalable Data Science (SDS) was launched [2, 6]. The goal of SDS is to attract cutting-edge and impactful real-world work in the scalable data science arena to enhance the impact and visibility of the VLDB community on data science practice, spur new technical connections, and inspire new follow-on research. The inaugural year proved to be successful, with numerous interesting papers from a wide cross section of both industry and academia, spanning several data science topics, and originating from several countries around the world. In this report, we reflect on the inaugural year of SDS with some statistics on both submissions and accepted papers, SDS invited talks, and our observations, lessons, and tips as inaugural Associate Editors for SDS. We hope this article is helpful to future authors, reviewers, and organizers of SDS, as well as other interested members of the wider database / data management community and beyond.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"336 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133015941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Query Optimizer as a Service 查询优化器即服务
Pub Date : 2022-11-21 DOI: 10.1145/3572751.3572767
Alekh Jindal, Jyoti Leeka
Query optimization is a critical technology that is common across all modern data processing systems. However, it is traditionally implemented in silos and is deeply embedded in different systems. Furthermore, over the years, query optimizers have become less understood and rarely touched pieces of code that are brittle to changes and very expensive to maintain, thus slowing down the pace of innovation. In this paper, we argue that it is time to think of query optimizer as a service in modern cloud architectures. Such a design can help build a common set of well-maintained optimizations that are externalized from the query engines and that could be learned and improved using the large workloads present in modern clouds. We present, Oasis, a reference architecture for query optimizer as a service and describe our success in deploying the early version of it in Cosmos. Finally, we discuss the risks and responsibilities involved with Oasis to ensure it is a win-win for everyone.
查询优化是一项关键技术,在所有现代数据处理系统中都很常见。然而,它传统上是在筒仓中实现的,并且深深嵌入到不同的系统中。此外,多年来,查询优化器变得越来越不为人所理解,并且很少触及那些易受更改影响且维护成本非常高的代码片段,从而减缓了创新的步伐。在本文中,我们认为是时候将查询优化器视为现代云架构中的一种服务了。这样的设计可以帮助构建一组维护良好的公共优化,这些优化从查询引擎外部化,并且可以使用现代云中的大型工作负载来学习和改进。我们介绍了查询优化器作为服务的参考架构Oasis,并描述了我们在Cosmos中成功部署它的早期版本。最后,我们讨论了Oasis所涉及的风险和责任,以确保它对每个人都是双赢的。
{"title":"Query Optimizer as a Service","authors":"Alekh Jindal, Jyoti Leeka","doi":"10.1145/3572751.3572767","DOIUrl":"https://doi.org/10.1145/3572751.3572767","url":null,"abstract":"Query optimization is a critical technology that is common across all modern data processing systems. However, it is traditionally implemented in silos and is deeply embedded in different systems. Furthermore, over the years, query optimizers have become less understood and rarely touched pieces of code that are brittle to changes and very expensive to maintain, thus slowing down the pace of innovation. In this paper, we argue that it is time to think of query optimizer as a service in modern cloud architectures. Such a design can help build a common set of well-maintained optimizations that are externalized from the query engines and that could be learned and improved using the large workloads present in modern clouds. We present, Oasis, a reference architecture for query optimizer as a service and describe our success in deploying the early version of it in Cosmos. Finally, we discuss the risks and responsibilities involved with Oasis to ensure it is a win-win for everyone.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127690523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Counting the Answers to a Query 统计查询的答案
Pub Date : 2022-11-21 DOI: 10.1145/3572751.3572753
M. Arenas, L. A. Croquevielle, Rajesh Jayaram, Cristian Riveros
Counting the answers to a query is a fundamental problem in databases, with several applications in the evaluation, optimization, and visualization of queries. Unfortunately, counting query answers is a #P-hard problem in most cases, so it is unlikely to be solvable in polynomial time. Recently, new results on approximate counting have been developed, specifically by showing that some problems in automata theory admit fully polynomial-time randomized approximation schemes. These results have several implications for the problem of counting the answers to a query; in particular, for graph and conjunctive queries. In this work, we present the main ideas of these approximation results, by using labeled DAGs instead of automata to simplify the presentation. In addition, we review how to apply these results to count query answers in different areas of databases.
计算查询的答案是数据库中的一个基本问题,在查询的评估、优化和可视化中有几个应用程序。不幸的是,在大多数情况下,统计查询答案是一个#P-hard问题,因此不太可能在多项式时间内解决。最近,关于近似计数的一些新结果得到了发展,特别是通过表明自动机理论中的一些问题允许完全多项式时间随机逼近方案。这些结果对统计查询答案的问题有几个含义;特别是对于图查询和连接查询。在这项工作中,我们提出了这些近似结果的主要思想,通过使用标记dag而不是自动机来简化表示。此外,我们还回顾了如何将这些结果应用于统计数据库中不同区域的查询答案。
{"title":"Counting the Answers to a Query","authors":"M. Arenas, L. A. Croquevielle, Rajesh Jayaram, Cristian Riveros","doi":"10.1145/3572751.3572753","DOIUrl":"https://doi.org/10.1145/3572751.3572753","url":null,"abstract":"Counting the answers to a query is a fundamental problem in databases, with several applications in the evaluation, optimization, and visualization of queries. Unfortunately, counting query answers is a #P-hard problem in most cases, so it is unlikely to be solvable in polynomial time. Recently, new results on approximate counting have been developed, specifically by showing that some problems in automata theory admit fully polynomial-time randomized approximation schemes. These results have several implications for the problem of counting the answers to a query; in particular, for graph and conjunctive queries. In this work, we present the main ideas of these approximation results, by using labeled DAGs instead of automata to simplify the presentation. In addition, we review how to apply these results to count query answers in different areas of databases.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114589032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
ACM SIGMOD Record
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1