首页 > 最新文献

ACM SIGMOD Record最新文献

英文 中文
Deciding What Not to Do 决定不做什么
Pub Date : 2022-07-29 DOI: 10.1145/3552490.3552501
David Maier
I recall an early conversation with my advisor, a couple years after I completed my PhD. I was worried about not having been invited onto any program committees when others in my cohort were getting such opportunities. He assured me that it would come in time, though I was still anxious. He was right; after another year or so, the invitations started coming. At this point, I need to decline most of them, or I'd spend all my time reviewing.
我记得在我完成博士学位几年后,我与导师的一次早期对话。我很担心自己没有被邀请到任何项目委员会,而我的同龄人却有这样的机会。他向我保证它会及时到来,尽管我仍然很着急。他是对的;又过了一年左右,邀请接踵而来。在这一点上,我需要拒绝其中的大多数,否则我会把所有的时间都花在审查上。
{"title":"Deciding What Not to Do","authors":"David Maier","doi":"10.1145/3552490.3552501","DOIUrl":"https://doi.org/10.1145/3552490.3552501","url":null,"abstract":"I recall an early conversation with my advisor, a couple years after I completed my PhD. I was worried about not having been invited onto any program committees when others in my cohort were getting such opportunities. He assured me that it would come in time, though I was still anxious. He was right; after another year or so, the invitations started coming. At this point, I need to decline most of them, or I'd spend all my time reviewing.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114657006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Science Through the Looking Glass 透视镜子中的数据科学
Pub Date : 2022-07-29 DOI: 10.1145/3552490.3552496
Fotis Psallidas, Yiwen Zhu, Bojan Karlas, Jordan Henkel, Matteo Interlandi, Subru Krishnan, Brian Kroth, Venkatesh Emani, Wentao Wu, Ce Zhang, Markus Weimer, A. Floratou, C. Curino, Konstantinos Karanasos
The recent success of machine learning (ML) has led to an explosive growth of systems and applications built by an ever-growing community of system builders and data science (DS) practitioners. This quickly shifting panorama, however, is challenging for system builders and practitioners alike to follow. In this paper, we set out to capture this panorama through a wide-angle lens, performing the largest analysis of DS projects to date, focusing on questions that can advance our understanding of the field and determine investments. Specifically, we download and analyze (a) over 8M notebooks publicly available on GITHUB and (b) over 2M enterprise ML pipelines developed within Microsoft. Our analysis includes coarse-grained statistical characterizations, finegrained analysis of libraries and pipelines, and comparative studies across datasets and time. We report a large number of measurements for our readers to interpret and draw actionable conclusions on (a) what system builders should focus on to better serve practitioners and (b) what technologies should practitioners rely on.
最近机器学习(ML)的成功导致了系统和应用程序的爆炸式增长,这些系统和应用程序由不断增长的系统构建者和数据科学(DS)从业者组成。然而,这种快速变化的全景对于系统构建者和实践者来说都是具有挑战性的。在本文中,我们开始通过广角镜头捕捉这一全景,对迄今为止最大规模的DS项目进行分析,重点关注可以促进我们对该领域的理解并决定投资的问题。具体来说,我们下载并分析了(a) GITHUB上公开提供的800多万台笔记本电脑和(b)微软开发的200多万台企业机器学习管道。我们的分析包括粗粒度的统计特征,对库和管道的细粒度分析,以及跨数据集和时间的比较研究。我们报告了大量的测量结果,以供读者解释并得出可操作的结论(a)系统构建者应该关注哪些方面以更好地为从业者服务,以及(b)从业者应该依赖哪些技术。
{"title":"Data Science Through the Looking Glass","authors":"Fotis Psallidas, Yiwen Zhu, Bojan Karlas, Jordan Henkel, Matteo Interlandi, Subru Krishnan, Brian Kroth, Venkatesh Emani, Wentao Wu, Ce Zhang, Markus Weimer, A. Floratou, C. Curino, Konstantinos Karanasos","doi":"10.1145/3552490.3552496","DOIUrl":"https://doi.org/10.1145/3552490.3552496","url":null,"abstract":"The recent success of machine learning (ML) has led to an explosive growth of systems and applications built by an ever-growing community of system builders and data science (DS) practitioners. This quickly shifting panorama, however, is challenging for system builders and practitioners alike to follow. In this paper, we set out to capture this panorama through a wide-angle lens, performing the largest analysis of DS projects to date, focusing on questions that can advance our understanding of the field and determine investments. Specifically, we download and analyze (a) over 8M notebooks publicly available on GITHUB and (b) over 2M enterprise ML pipelines developed within Microsoft. Our analysis includes coarse-grained statistical characterizations, finegrained analysis of libraries and pipelines, and comparative studies across datasets and time. We report a large number of measurements for our readers to interpret and draw actionable conclusions on (a) what system builders should focus on to better serve practitioners and (b) what technologies should practitioners rely on.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128749630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Reminiscences on Influential Papers 对有影响的论文的回忆
Pub Date : 2022-07-29 DOI: 10.1145/3552490.3552499
M. Tamer Özsu
This column was established by Richard Snodgrass in 1998 and was continued by Ken Ross from 1999 to 2005. It celebrated one of the key aspects that makes us grow as a research community: the papers that influence us. At each issue, different members of the data management community wrote anecdotes about a paper that had a unique impact in their career. The anecdotes highlighted that impact can come in many forms. A paper's value is not only in its citation count, but also in the way it influences individuals who in turn influence other individuals that make up our community. Such impact is not countable. When the SIGMOD Record's editor-in-chief Rada Chirkova approached me to revive this column last year, I was immediately excited. I would like to thank Rada Chirkova, Richard Snodgrass, and Ken Ross for this opportunity. I am delighted to present the three invited contributions for this issue. Hope you enjoy reading them as much as I did. While I will keep inviting members of the data management community, and neighboring communities, to contribute to this column, I also welcome unsolicited contributions. Please contact me if you are interested.
本专栏于1998年由理查德·斯诺德格拉斯创立,1999年至2005年由肯·罗斯担任主编。它庆祝了使我们成长为一个研究团体的关键方面之一:影响我们的论文。在每期杂志上,数据管理社区的不同成员都写了一篇对他们职业生涯产生独特影响的论文的轶事。这些轶事突出表明,影响可以有多种形式。一篇论文的价值不仅在于它的引用次数,还在于它影响个体的方式,而这些个体反过来又影响了构成我们社区的其他个体。这种影响是不可估量的。去年,当SIGMOD Record的总编辑Rada Chirkova找到我,让我恢复这个专栏时,我立刻兴奋起来。我要感谢Rada Chirkova、Richard Snodgrass和Ken Ross给我这次机会。我很高兴向大家介绍本期特邀的三位撰稿人。希望你能像我一样喜欢它们。虽然我将继续邀请数据管理社区和邻近社区的成员为本专栏撰稿,但我也欢迎不请自来的投稿。如果你感兴趣,请联系我。
{"title":"Reminiscences on Influential Papers","authors":"M. Tamer Özsu","doi":"10.1145/3552490.3552499","DOIUrl":"https://doi.org/10.1145/3552490.3552499","url":null,"abstract":"This column was established by Richard Snodgrass in 1998 and was continued by Ken Ross from 1999 to 2005. It celebrated one of the key aspects that makes us grow as a research community: the papers that influence us. At each issue, different members of the data management community wrote anecdotes about a paper that had a unique impact in their career. The anecdotes highlighted that impact can come in many forms. A paper's value is not only in its citation count, but also in the way it influences individuals who in turn influence other individuals that make up our community. Such impact is not countable. When the SIGMOD Record's editor-in-chief Rada Chirkova approached me to revive this column last year, I was immediately excited. I would like to thank Rada Chirkova, Richard Snodgrass, and Ken Ross for this opportunity. I am delighted to present the three invited contributions for this issue. Hope you enjoy reading them as much as I did. While I will keep inviting members of the data management community, and neighboring communities, to contribute to this column, I also welcome unsolicited contributions. Please contact me if you are interested.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129915780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Passive, Migration-Free, Standardized, Long-Term Database Archival 面向被动的、无迁移的、标准化的、长期的数据库归档
Pub Date : 2022-07-29 DOI: 10.1145/3552490.3552506
Raja Appuswamy
"How would you archive databases for the next 60 years such that they incur no migration cost, and they remain usable in 2080?" This was an open challenge raised by digital preservation experts from the Landesarchiv of Baden-W¨urttemberg [12], who, similar to other memory institutions (archives, museums, libraries, etc.), have faced several challenges in archiving culturally significant, historic data stored in digital databases since early 1960s.
“你如何在未来60年存档数据库,使它们不产生迁移成本,并且在2080年仍然可用?”这是巴登-符腾堡州州档案馆的数字保存专家提出的一个公开挑战[12],与其他记忆机构(档案馆、博物馆、图书馆等)类似,自20世纪60年代初以来,他们在将具有文化意义的历史数据存储在数字数据库中时面临着一些挑战。
{"title":"Towards Passive, Migration-Free, Standardized, Long-Term Database Archival","authors":"Raja Appuswamy","doi":"10.1145/3552490.3552506","DOIUrl":"https://doi.org/10.1145/3552490.3552506","url":null,"abstract":"\"How would you archive databases for the next 60 years such that they incur no migration cost, and they remain usable in 2080?\" This was an open challenge raised by digital preservation experts from the Landesarchiv of Baden-W¨urttemberg [12], who, similar to other memory institutions (archives, museums, libraries, etc.), have faced several challenges in archiving culturally significant, historic data stored in digital databases since early 1960s.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124944360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Technical Perspective 技术的角度来看
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542720
David P. Woodruff
Model counting is the problem of approximately counting the number |Sol(Φ)| of satisfying assignments to a given model Φ, which could, for example, be a formula in conjunctive normal form (CNF) or a formula in disjunctive normal form (DNF).
模型计数是对给定模型Φ的满足赋值的近似计数|Sol(Φ)|的问题,例如,它可以是合取范式(CNF)或析取范式(DNF)的公式。
{"title":"Technical Perspective","authors":"David P. Woodruff","doi":"10.1145/3542700.3542720","DOIUrl":"https://doi.org/10.1145/3542700.3542720","url":null,"abstract":"Model counting is the problem of approximately counting the number |Sol(Φ)| of satisfying assignments to a given model Φ, which could, for example, be a formula in conjunctive normal form (CNF) or a formula in disjunctive normal form (DNF).","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131927292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Technical Perspective 技术的角度来看
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542718
M. Yannakakis
The paper Structure and Complexity of Bag Consistency by Albert Atserias and Phokion Kolaitis [1] studies fundamental structural and algorithmic questions on the global consistency of databases in the context of bag semantics. A collection D of relations is called globally consistent if there is a (so-called "universal") relation over all the attributes that appear in all the relations of D whose projections yield the relations in D. The basic algorithmic problem for consistency is: given a database D, determine whether D is globally consistent. An obvious necessary condition for global consistency is local (or pairwise) consistency: every pair of relations in D must be consistent. This condition is not sufficient however: It is possible that every pair is consistent, but there is no single global relation over all the attributes whose projections yield the relations in D. A natural structural question is: Which database schemas have the property that every locally consistent database over the schema is also globally consistent?
Albert Atserias和Phokion Kolaitis的论文《Bag Consistency的结构和复杂性》(Structure and Complexity of Bag Consistency)研究了Bag语义背景下数据库全局一致性的基本结构和算法问题。如果在D的所有关系中出现的所有属性上存在一个(所谓的“全称”)关系,则关系集合D被称为全局一致,其投影产生D中的关系。一致性的基本算法问题是:给定数据库D,确定D是否全局一致。全局一致性的一个明显的必要条件是局部(或成对)一致性:D中的每对关系都必须是一致的。然而,这个条件还不够充分:有可能每一对都是一致的,但是在所有属性上没有一个全局关系,这些属性的投影产生d中的关系。一个自然的结构问题是:哪些数据库模式具有这样的属性,即该模式上的每个局部一致的数据库也是全局一致的?
{"title":"Technical Perspective","authors":"M. Yannakakis","doi":"10.1145/3542700.3542718","DOIUrl":"https://doi.org/10.1145/3542700.3542718","url":null,"abstract":"The paper Structure and Complexity of Bag Consistency by Albert Atserias and Phokion Kolaitis [1] studies fundamental structural and algorithmic questions on the global consistency of databases in the context of bag semantics. A collection D of relations is called globally consistent if there is a (so-called \"universal\") relation over all the attributes that appear in all the relations of D whose projections yield the relations in D. The basic algorithmic problem for consistency is: given a database D, determine whether D is globally consistent. An obvious necessary condition for global consistency is local (or pairwise) consistency: every pair of relations in D must be consistent. This condition is not sufficient however: It is possible that every pair is consistent, but there is no single global relation over all the attributes whose projections yield the relations in D. A natural structural question is: Which database schemas have the property that every locally consistent database over the schema is also globally consistent?","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128906731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Making Learned Query Optimization Practical 使学到的查询优化实用
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542702
V. Markl
Query optimization has been a challenging problem ever since the relational data model had been proposed. The role of the query optimizer in a database system is to compute an execution plan for a (relational) query expression comprised of physical operators whose implementations correspond to the operations of the (relational) algebra. There are many degrees of freedom for selecting a physical plan, in particular due to the laws of associativity, commutativity, and distributivity among the operators in the (relational) algebra, which necessitates our taking the order of operations into consideration. In addition, there are many alternative access paths to a dataset and a multitude of physical implementations for operations, such as relational joins (e.g., merge-join, nestedloop join, hash-join). Thus, when seeking to determine the best (or even a sufficiently good) execution plan there is a huge search space.
自从关系数据模型被提出以来,查询优化一直是一个具有挑战性的问题。查询优化器在数据库系统中的作用是为由物理运算符组成的(关系)查询表达式计算执行计划,这些运算符的实现与(关系)代数的操作相对应。选择物理计划有许多自由度,特别是由于(关系)代数中运算符之间的结合律、交换律和分配律,这就要求我们考虑操作的顺序。此外,数据集有许多可选的访问路径和许多操作的物理实现,例如关系连接(例如,合并连接、嵌套循环连接、哈希连接)。因此,在寻求确定最佳(甚至是足够好的)执行计划时,存在巨大的搜索空间。
{"title":"Making Learned Query Optimization Practical","authors":"V. Markl","doi":"10.1145/3542700.3542702","DOIUrl":"https://doi.org/10.1145/3542700.3542702","url":null,"abstract":"Query optimization has been a challenging problem ever since the relational data model had been proposed. The role of the query optimizer in a database system is to compute an execution plan for a (relational) query expression comprised of physical operators whose implementations correspond to the operations of the (relational) algebra. There are many degrees of freedom for selecting a physical plan, in particular due to the laws of associativity, commutativity, and distributivity among the operators in the (relational) algebra, which necessitates our taking the order of operations into consideration. In addition, there are many alternative access paths to a dataset and a multitude of physical implementations for operations, such as relational joins (e.g., merge-join, nestedloop join, hash-join). Thus, when seeking to determine the best (or even a sufficiently good) execution plan there is a huge search space.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127901195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Bao
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542703
Ryan Marcus, Parimarjan Negi, Hongzi Mao, Nesime Tatbul, Mohammad Alizadeh, Tim Kraska
Recent efforts applying machine learning techniques to query optimization have shown few practical gains due to substantive training overhead, inability to adapt to changes, and poor tail performance. Motivated by these difficulties, we introduce Bao (the Bandit optimizer). Bao takes advantage of the wisdom built into existing query optimizers by providing per-query optimization hints. Bao combines modern tree convolutional neural networks with Thompson sampling, a well-studied reinforcement learning algorithm. As a result, Bao automatically learns from its mistakes and adapts to changes in query workloads, data, and schema. Experimentally, we demonstrate that Bao can quickly learn strategies that improve end-to-end query execution performance, including tail latency, for several workloads containing longrunning queries. In cloud environments, we show that Bao can offer both reduced costs and better performance compared with a commercial system.
最近将机器学习技术应用于查询优化的努力表明,由于大量的训练开销、无法适应变化和较差的尾部性能,实际收益很少。在这些困难的激励下,我们引入了Bao (Bandit优化器)。Bao通过提供每个查询优化提示,利用了现有查询优化器中内置的智慧。Bao将现代树卷积神经网络与汤普森采样(一种经过充分研究的强化学习算法)相结合。因此,Bao会自动从错误中学习,并适应查询工作负载、数据和模式的变化。通过实验,我们证明Bao可以快速学习提高端到端查询执行性能的策略,包括包含长时间运行查询的几个工作负载的尾部延迟。在云环境中,我们展示了与商业系统相比,Bao可以提供更低的成本和更好的性能。
{"title":"Bao","authors":"Ryan Marcus, Parimarjan Negi, Hongzi Mao, Nesime Tatbul, Mohammad Alizadeh, Tim Kraska","doi":"10.1145/3542700.3542703","DOIUrl":"https://doi.org/10.1145/3542700.3542703","url":null,"abstract":"Recent efforts applying machine learning techniques to query optimization have shown few practical gains due to substantive training overhead, inability to adapt to changes, and poor tail performance. Motivated by these difficulties, we introduce Bao (the Bandit optimizer). Bao takes advantage of the wisdom built into existing query optimizers by providing per-query optimization hints. Bao combines modern tree convolutional neural networks with Thompson sampling, a well-studied reinforcement learning algorithm. As a result, Bao automatically learns from its mistakes and adapts to changes in query workloads, data, and schema. Experimentally, we demonstrate that Bao can quickly learn strategies that improve end-to-end query execution performance, including tail latency, for several workloads containing longrunning queries. In cloud environments, we show that Bao can offer both reduced costs and better performance compared with a commercial system.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116935702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relative Error Streaming Quantiles 相对错误流分位数
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542717
Graham Cormode, Zohar S. Karnin, Edo Liberty, J. Thaler, P. Veselý
Estimating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of n items from a data universe equipped with a total order, the task is to compute a sketch (data structure) of size polylogarithmic in n. Given the sketch and a query item y, one should be able to approximate its rank in the stream, i.e., the number of stream elements smaller than or equal to y.
估计流数据的等级、分位数和分布是数据分析和监控的中心任务。给定一个带有总顺序的数据宇宙的n项流,任务是计算大小为n的多对数的草图(数据结构)。给定草图和查询项y,应该能够近似其在流中的排名,即小于或等于y的流元素的数量。
{"title":"Relative Error Streaming Quantiles","authors":"Graham Cormode, Zohar S. Karnin, Edo Liberty, J. Thaler, P. Veselý","doi":"10.1145/3542700.3542717","DOIUrl":"https://doi.org/10.1145/3542700.3542717","url":null,"abstract":"Estimating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of n items from a data universe equipped with a total order, the task is to compute a sketch (data structure) of size polylogarithmic in n. Given the sketch and a query item y, one should be able to approximate its rank in the stream, i.e., the number of stream elements smaller than or equal to y.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127496770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Technical Perspective - No PANE, No Gain 从技术角度看——没有窗格就没有收获
Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542710
A. Hogan
The machine learning community has traditionally been proactive in developing techniques for diverse types of data, such as text, audio, images, videos, time series, and, of course, matrices, tensors, etc. "But what about graphs?" some of us graph enthusiasts may have asked ourselves, dejectedly, before transforming our beautiful graph into a brutalistic table of numbers that bore little resemblance to its parent, nor the phenomena it represented, but could at least be shovelled into the machine learning frameworks of the time. Thankfully those days are coming to an end.
传统上,机器学习社区一直积极主动地为各种类型的数据开发技术,例如文本、音频、图像、视频、时间序列,当然还有矩阵、张量等。“那么图形呢?”我们中的一些图形爱好者可能会沮丧地问自己,然后把我们美丽的图形转换成一个野蛮的数字表,与它的母体没有什么相似之处,也没有它所代表的现象,但至少可以被纳入当时的机器学习框架。谢天谢地,这样的日子即将结束。
{"title":"Technical Perspective - No PANE, No Gain","authors":"A. Hogan","doi":"10.1145/3542700.3542710","DOIUrl":"https://doi.org/10.1145/3542700.3542710","url":null,"abstract":"The machine learning community has traditionally been proactive in developing techniques for diverse types of data, such as text, audio, images, videos, time series, and, of course, matrices, tensors, etc. \"But what about graphs?\" some of us graph enthusiasts may have asked ourselves, dejectedly, before transforming our beautiful graph into a brutalistic table of numbers that bore little resemblance to its parent, nor the phenomena it represented, but could at least be shovelled into the machine learning frameworks of the time. Thankfully those days are coming to an end.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128565436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM SIGMOD Record
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1