ACM SIGMOD Record最新文献

英文中文

Technical perspective: DFI: The Data Flow Interface for High-Speed Networks 技术观点:DFI:高速网络的数据流接口

ACM SIGMOD Record

Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542704

G. Alonso

Optimizing data movement has always been one of the key ways to get a data processing system to perform efficiently. Appearing under different disguises as computers evolved over the years, the issue is today as relevant as ever. With the advent of the cloud, data movement has become the bottleneck to address in any data processing system. In the cloud, compute and storage are typically disaggregated, with a network in between. In addition, cloud systems are scale-out, i.e., performance is obtained by parallelizing across machines, which also involves network communication. And while it is possible to use machines with large amounts of memory, the pricing models and the virtualized nature of the cloud tends to favor clusters of smaller computing nodes. Nowadays, the problem of optimizing data movement has become the problem of using the network as efficiently as possible.

优化数据移动一直是使数据处理系统高效运行的关键方法之一。随着计算机多年来的发展，这个问题以不同的形式出现，今天与以往一样重要。随着云计算的出现，数据移动已经成为任何数据处理系统需要解决的瓶颈。在云计算中，计算和存储通常是分开的，中间有一个网络。此外，云系统是向外扩展的，即通过跨机器并行化来获得性能，这也涉及到网络通信。虽然有可能使用具有大量内存的机器，但定价模型和云的虚拟化特性倾向于支持较小计算节点的集群。如今，优化数据移动的问题已经变成了尽可能高效地利用网络的问题。

引用次数: 1

Technical Perspective 技术的角度来看

ACM SIGMOD Record

Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542706

A. Kemper

With the emergence of (geographically) distributed data mangement in cloud infrastructures the key value systems were promoted as so-called NoSQL systems. In order to achieve maximum availability and performance these KV stores sacrificed the "holy grail" of database consistency and relied on relaxed consistency models, such as eventual consistency.

随着云基础设施中(地理上)分布式数据管理的出现，键值系统被称为所谓的NoSQL系统。为了获得最大的可用性和性能，这些KV存储牺牲了数据库一致性的“圣杯”，而依赖于宽松的一致性模型，例如最终一致性。

引用次数: 0

Model Counting Meets Distinct Elements in a Data Stream 模型计数满足数据流中的不同元素

ACM SIGMOD Record

Pub Date : 2022-05-31 DOI: 10.1145/3542700.3542721

A. Pavan, N. V. Vinodchandran, Arnab Bhattacharyya, Kuldeep S. Meel

Constraint satisfaction problems (CSPs) and data stream models are two powerful abstractions to capture a wide variety of problems arising in different domains of computer science. Developments in the two communities have mostly occurred independently and with little interaction between them. In this work, we seek to investigate whether bridging the seeming communication gap between the two communities may pave the way to richer fundamental insights. To this end, we focus on two foundational problems: model counting for CSPs and computation of zeroth frequency moments (F0) for data streams.

约束满足问题(csp)和数据流模型是捕获计算机科学不同领域中出现的各种问题的两个强大的抽象。两个社区的发展大多是独立发生的，它们之间很少相互作用。在这项工作中，我们试图调查弥合两个社区之间表面上的沟通差距是否可以为更丰富的基本见解铺平道路。为此，我们关注两个基本问题:csp的模型计数和数据流的零频率矩(F0)计算。

引用次数: 0

INODE 索引节点

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516436

S. Amer-Yahia, G. Koutrika, Martin Braschler, Diego Calvanese, D. Lanti, Hendrik Lücke-Tieke, A. Mosca, Tarcisio Mendes de Farias, D. Papadopoulos, Yogendra Patil, Guillem Rull, Ellery Smith, Dimitrios Skoutas, S. Subramanian, Kurt Stockinger

A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise. We introduce INODE - an end-to-end data exploration system - that leverages, on the one hand, Machine Learning and, on the other hand, semantics for the purpose of Data Management (DM). Our vision is to develop a classic unified, comprehensive platform that provides extensive access to open datasets, and we demonstrate it in three significant use cases in the fields of Cancer Biomarker Research, Research and Innovation Policy Making, and Astrophysics. INODE offers sustainable services in (a) data modeling and linking, (b) integrated query processing using natural language, (c) guidance, and (d) data exploration through visualization, thus facilitating the user in discovering new insights. We demonstrate that our system is uniquely accessible to a wide range of users from larger scientific communities to the public. Finally, we briefly illustrate how this work paves the way for new research opportunities in DM.

一个成熟的数据探索系统必须结合不同的访问模式和引导用户探索过程的强大概念，通过对数据发现和数据链接的反应和预测。这样的系统为我们的社区提供了一个真正的机会，以迎合具有不同领域和数据科学专业知识的用户。我们介绍INODE——一个端到端数据探索系统——它一方面利用机器学习，另一方面利用语义来实现数据管理(DM)。我们的愿景是开发一个经典的、统一的、全面的平台，提供对开放数据集的广泛访问，我们在癌症生物标志物研究、研究与创新政策制定和天体物理学领域的三个重要用例中展示了它。INODE在以下方面提供可持续的服务:(a)数据建模和链接，(b)使用自然语言的集成查询处理，(c)引导，(d)通过可视化进行数据探索，从而促进用户发现新的见解。我们证明了我们的系统对从较大的科学团体到公众的广泛用户是唯一可访问的。最后，我们简要说明了这项工作如何为DM的新研究机会铺平道路。

{"title":"INODE","authors":"S. Amer-Yahia, G. Koutrika, Martin Braschler, Diego Calvanese, D. Lanti, Hendrik Lücke-Tieke, A. Mosca, Tarcisio Mendes de Farias, D. Papadopoulos, Yogendra Patil, Guillem Rull, Ellery Smith, Dimitrios Skoutas, S. Subramanian, Kurt Stockinger","doi":"10.1145/3516431.3516436","DOIUrl":"https://doi.org/10.1145/3516431.3516436","url":null,"abstract":"A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise. We introduce INODE - an end-to-end data exploration system - that leverages, on the one hand, Machine Learning and, on the other hand, semantics for the purpose of Data Management (DM). Our vision is to develop a classic unified, comprehensive platform that provides extensive access to open datasets, and we demonstrate it in three significant use cases in the fields of Cancer Biomarker Research, Research and Innovation Policy Making, and Astrophysics. INODE offers sustainable services in (a) data modeling and linking, (b) integrated query processing using natural language, (c) guidance, and (d) data exploration through visualization, thus facilitating the user in discovering new insights. We demonstrate that our system is uniquely accessible to a wide range of users from larger scientific communities to the public. Finally, we briefly illustrate how this work paves the way for new research opportunities in DM.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121383294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Accelerating Video Analytics 加速视频分析

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516442

Joy Arulraj

MOTIVATION. The advent of inexpensive, high-quality cameras has led to a rapid increase in the volume of generated video data [19, 16]. It is now feasible to automatically analyze these video datasets at scale due to two developments over the last decade. First, researchers have designed complex, computationally-intensive deep learning (DL) models that capture the contents of a given set of video frames (e.g., objects present in a particular frame [11]) [15]. Second, the computational capabilities of hardware accelerators for evaluating these DL models have increased over the last decade (e.g., TPUs) [8]. We anticipate that automated analysis of videos will reduce the labor cost of analyzing video

动机。廉价、高质量摄像机的出现导致了视频数据量的快速增长[19,16]。由于过去十年的两个发展，现在可以大规模地自动分析这些视频数据集。首先，研究人员设计了复杂的、计算密集型的深度学习(DL)模型，用于捕获给定视频帧集的内容(例如，特定帧中存在的对象[11])[15]。其次，用于评估这些深度学习模型的硬件加速器的计算能力在过去十年中有所提高(例如，tpu)[8]。我们预计视频的自动化分析将减少分析视频的人工成本

引用次数: 2

Juliana Freire Speaks Out on Reproducibility and Hard Changes 朱莉安娜·弗莱雷就可重复性和艰难的变化发表了看法

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516444

M. Winslett, V. Braganholo

Welcome to ACM SIGMOD Record's series of interviews with distinguished members of the database community. I am Marianne Winslett, and today I have here with me Juliana Freire, who is a professor at New York University. Juliana is an ACM Fellow, and she has a Google Faculty Research Award, an IBM Faculty Award, and an NSF Career Award. She is also the chair of SIGMOD, and her term of office ends in just a few days. Juliana's Ph.D. is from Stony Brook. So, Juliana, welcome!

欢迎来到ACM SIGMOD Record对数据库社区杰出成员的系列访谈。我是玛丽安·温斯莱特，今天和我在一起的是朱莉安娜·弗莱雷，她是纽约大学的教授。朱莉安娜是美国计算机协会研究员，她曾获得谷歌学院研究奖、IBM学院奖和美国国家科学基金会职业奖。她也是SIGMOD的主席，她的任期将在几天后结束。朱莉安娜的博士学位来自石溪分校。朱莉安娜，欢迎你!

引用次数: 0

How Inclusive are We? 我们有多包容?

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516438

A. Bonifati, Michael J. Mior, Felix Naumann, Nele Sina Noack

ACM SIGMOD, VLDB and other database organizations have committed to fostering an inclusive and diverse community, as do many other scientific organizations. Recently, different measures have been taken to advance these goals, especially for underrepresented groups. One possible measure is double-blind reviewing, which aims to hide gender, ethnicity, and other properties of the authors. We report the preliminary results of a gender diversity analysis of publications of the database community across several peer-reviewed venues, and also compare women's authorship percentages in both single-blind and double-blind venues along the years. We also obtained a cross comparison of the obtained results in data management with other relevant areas in Computer Science.

ACM SIGMOD、VLDB和其他数据库组织致力于培养一个包容和多样化的社区，正如许多其他科学组织一样。最近，采取了不同的措施来推进这些目标，特别是为代表性不足的群体。一种可能的措施是双盲审查，其目的是隐藏作者的性别、种族和其他属性。我们报告了数据库社区在几个同行评议场所的出版物性别多样性分析的初步结果，并比较了多年来单盲和双盲场所的女性作者百分比。我们还将数据管理与计算机科学的其他相关领域的结果进行了交叉比较。

引用次数: 4

Congratulations! You Have Become a Senior Researcher. Now What? 恭喜你!你已经成为一名高级研究员。现在怎么办呢?

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516440

M. Balazinska

It probably seems like yesterday that you were starting at your first post-PhD position, but with this latest promotion, whether it is tenure or promotion to a senior level at your company, you can no longer call yourself "junior". You are now stepping into the shoes of a senior researcher. Congratulations! This is a tremendous accomplishment, and you should celebrate. The road was long and often uphill. You finally made it.

你刚刚从博士毕业后的第一个职位开始工作，这似乎就像昨天一样，但随着最近的晋升，无论是终身职位还是晋升到公司的高级职位，你都不能再称自己为“初级”了。你现在正接替一位高级研究员的工作。恭喜你!这是一个巨大的成就，你们应该庆祝一下。这条路很长，而且经常上坡。你终于来了。

引用次数: 0

VLDB 2021

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516447

Philippe Bonnet, Xin Dong, Felix Naumann, Pinar Tözün

The 47th International Conference on Very Large Databases (VLDB'21) was held on August 16-20, 2021 as a hybrid conference. It attracted 180 in-person attendees in Copenhagen and 840 remote attendees. In this paper, we describe our key decisions as general chairs and program committee chairs and share the lessons we learned.

第47届超大型数据库国际会议(VLDB'21)于2021年8月16日至20日举行。它在哥本哈根吸引了180名现场与会者和840名远程与会者。在本文中，我们描述了我们作为总主席和项目委员会主席的关键决策，并分享了我们学到的经验教训。

引用次数: 0

Current Trends in Data Summaries 数据摘要的当前趋势

ACM SIGMOD Record

Pub Date : 2022-01-31 DOI: 10.1145/3516431.3516433

Graham Cormode, AI Meta

The research area of data summarization seeks to find small data structures that can be updated flexibly, and answer certain queries on the input accurately. Summaries are widely used across the area of data management, and are studied from both theoretical and practical perspectives. They are the subject of ongoing research to improve their performance and broaden their applicability. In this column, recent developments in data summarization are surveyed, with the intent of inspiring further advances.

数据摘要的研究领域是寻找可以灵活更新的小型数据结构，并准确地回答对输入的某些查询。摘要在数据管理领域被广泛使用，并从理论和实践两个角度进行研究。他们是正在进行的研究的主题，以提高他们的性能和扩大他们的适用性。在本专栏中，将对数据汇总的最新发展进行调查，以期激发进一步的进展。

引用次数: 4

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ACM SIGMOD Record

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀