Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data最新文献

英文中文

Low-latency geo-replicated state machines with guaranteed writes 具有保证写入的低延迟地理复制状态机

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393686

M. Eischer, Benedikt Straßner, T. Distler

When deployed in geo-distributed environments, existing state-machine replication protocols require at least one widearea communication step for establishing a total order on client requests. For use cases in which clients are not interested in the actual result of a request, but just need a guarantee that the request will be processed eventually, this property usually incurs unnecessarily high response times. To address this problem we present Weave, a cloud-based georeplication protocol that relies on replica groups in multiple geographic regions to efficiently assign stable sequence numbers to incoming requests. This approach enables Weave to offer guaranteed writes which in the absence of faults only wait for communication within a client's local replica group to produce an execution guarantee for a particular sequence number. Our experiments with a distributed queue and a replicated log show that guaranteed writes can significantly improve response times of geo-replicated applications.

当部署在地理分布式环境中时，现有的状态机复制协议需要至少一个广域通信步骤来建立客户端请求的总顺序。对于客户端对请求的实际结果不感兴趣，而只是需要保证请求最终将被处理的用例，此属性通常会导致不必要的高响应时间。为了解决这个问题，我们提出了Weave，一种基于云的地理复制协议，它依赖于多个地理区域的副本组来有效地为传入请求分配稳定的序列号。这种方法使Weave能够提供有保证的写操作，在没有故障的情况下，只等待客户端的本地复制组之间的通信，从而为特定序列号生成执行保证。我们对分布式队列和复制日志的实验表明，保证写可以显著改善地理复制应用程序的响应时间。

引用次数: 6

Preserving reciprocal consistency in distributed graph databases 保持分布式图数据库的互反一致性

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393675

Jack Waudby, P. Ezhilchelvan, J. Webber, I. Mitrani

Our earlier work identifies reciprocal consistency as an important property that must be preserved in distributed graph databases. It also demonstrates that a failure to do so seriously undermines the integrity of the database itself in the long term. Reciprocal consistency can be maintained as a part of enforcing any known isolation guarantee and such an enforcement is also known to lead to reduction in performance. Therefore, in practice, distributed graph databases are often built atop BASE databases with no isolation guarantees, benefiting from good performance but leaving them susceptible to corruption due to violations of reciprocal consistency. This paper designs and presents a lightweight, locking-free protocol and then evaluates the protocol's abilities to preserve reciprocal consistency and also offer good throughput. Our evaluations establish that the protocol can offer both integrity guarantees and sound performance when the value of its parameter is chosen appropriately.

我们早期的工作将互反一致性确定为分布式图数据库中必须保留的重要属性。它还表明，从长远来看，不这样做会严重破坏数据库本身的完整性。相互一致性可以作为实施任何已知隔离保证的一部分来维护，而且这种强制也会导致性能降低。因此，在实践中，分布式图数据库通常构建在没有隔离保证的BASE数据库之上，这得益于良好的性能，但由于违反互惠一致性而使它们容易损坏。本文设计并提出了一个轻量级的、无锁的协议，并评估了该协议在保持相互一致性和提供良好吞吐量方面的能力。我们的评估表明，当选择适当的参数值时，协议可以提供完整性保证和良好的性能。

引用次数: 3

The intrinsic cost of causal consistency 因果一致性的内在代价

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393674

A. Linde, Pedro Fouto, J. Leitao, Nuno M. Preguiça

In the last few years, causal consistency has become a popular consistency model for geo-replicated databases. The algorithms proposed to enforce causal consistency typically associate with each operation some metadata, which is used to guarantee that an operation is not executed if its execution would break causality. This may lead to the impression that causal consistency is intrinsically costly and non scalable. In this paper, we analyze the metadata costs of enforcing causal consistency and put these costs in perspective, considering the metadata that is necessary to enforce reliability. We show that by wisely ordering the propagation of operations it is possible to enforce causal consistency without any additional metadata other than the already necessary to enforce reliability.

在过去几年中，因果一致性已经成为地理复制数据库的一种流行的一致性模型。强制因果一致性的算法通常与每个操作关联一些元数据，这些元数据用于保证如果操作的执行会破坏因果关系，则不会执行操作。这可能会给人一种印象，即因果一致性本质上是昂贵的，不可扩展的。在本文中，我们分析了强制因果一致性的元数据成本，并考虑了强制可靠性所必需的元数据，对这些成本进行了透视。我们表明，通过明智地对操作的传播进行排序，可以在没有任何额外元数据的情况下强制执行因果一致性，而不是强制执行可靠性所必需的元数据。

引用次数: 2

On combining fault tolerance and partial replication with causal consistency 论容错、部分复制与因果一致性的结合

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393684

A. Linde, Diogo Serra, J. Leitao, Nuno M. Preguiça

The purpose of this paper is to discuss the limitations imposed by introducing fault-tolerance in a partial replication system which aims to provide causal consistency. The general outcome is that, to provide support for indefinite replica-failure, the notion of partial in partial replication becomes not-really-partial-at-all. We prove that to implement causal consistency when indefinite replica-failures are possible, it is impossible to respect the concept of genuine partial replication -- not storing or propagating operations which are on objects a given replica does not replicate locally. In our initial approach to tackle this issue client replicas need only to replicate the operations they depend on which have not yet been marked as durable by the centralised component. We discuss remaining limitations and expected improvements in future work.

本文的目的是讨论在旨在提供因果一致性的部分复制系统中引入容错所施加的限制。一般的结果是，为了提供对不确定复制失败的支持，部分复制中的部分概念变得根本不是真正的部分。我们证明，要实现因果一致性，当不确定的复制失败是可能的，它是不可能尊重真正的部分复制的概念-不存储或传播的对象上的操作，一个给定的副本不复制本地。在我们解决这个问题的最初方法中，客户端副本只需要复制它们所依赖的操作，这些操作尚未被集中式组件标记为持久。我们讨论了仍然存在的局限性和未来工作的预期改进。

引用次数: 0

Moving elements in list CRDTs 移动列表crdt中的元素

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393677

Martin Kleppmann

Conflict-free Replicated Data Types (CRDTs) for lists allow multiple users to concurrently insert and delete elements in a shared list object. However, existing algorithms behave poorly when users concurrently move list elements to a new position (i.e. reorder the elements in the list). We demonstrate the need for such a move operation, and describe an algorithm that extends a list CRDT with an explicit move operation. Our algorithm can be used in conjunction with any existing list CRDT algorithm. In addition to moving a single list element, we also discuss the open problem of moving ranges of elements.

列表的无冲突复制数据类型(crdt)允许多个用户并发地插入和删除共享列表对象中的元素。然而，当用户同时将列表元素移动到一个新的位置(即重新排序列表中的元素)时，现有的算法表现不佳。我们演示了这种移动操作的必要性，并描述了一种用显式移动操作扩展列表CRDT的算法。我们的算法可以与任何现有的列表CRDT算法结合使用。除了移动单个列表元素外，我们还讨论了移动元素范围的开放问题。

引用次数: 9

Composing and decomposing op-based CRDTs with semidirect products: (summary) 半直接积基于op的crdt的组成与分解:(摘要)

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393687

M. Weidner, Heather Miller, Christopher S. Meiklejohn

Operation-based Conflict-free Replicated Data Types (CRDTs) are eventually consistent replicated data types that automatically resolve conflicts between concurrent operations. Opbased CRDTs must be designed differently for each data type, and current designs use ad-hoc techniques to handle concurrent operations that do not naturally commute.We present a new construction, the semidirect product of op-based CRDTs, which combines the operations of two CRDTs into one while handling conflicts between their concurrent operations in a uniform way. We demonstrate the construction's utility by decomposing several existing CRDTs as semidirect products of simpler CRDTs, as well as by using it to construct novel CRDTs. Although it reproduces common CRDT semantics, the semidirect product can be viewed as a restricted kind of operational transformation, thus forming a bridge between the two fields.

基于操作的无冲突复制数据类型(crdt)是最终一致的复制数据类型，可自动解决并发操作之间的冲突。基于操作的crdt必须针对每种数据类型进行不同的设计，目前的设计使用特殊技术来处理不能自然交换的并发操作。我们提出了一种新的结构，即基于op的半直接积的crdt，它将两个crdt的操作合并为一个，同时以统一的方式处理它们并发操作之间的冲突。我们通过将几个现有的crdt分解为更简单的crdt的半直接产物，以及使用它来构建新的crdt来证明该结构的实用性。虽然它再现了常见的CRDT语义，但半直积可以被视为一种受限的操作变换，从而形成了两个领域之间的桥梁。

引用次数: 3

A low-cost set CRDT based on causal lengths 基于因果长度的低成本集CRDT

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-27 DOI: 10.1145/3380787.3393678

Weihai Yu, Sigbjørn Rostad

CRDTs, or Conflict-free Replicated Data Types, are data abstractions that guarantee convergence for replicated data. Set is one of the most fundamental and widely used data types. Existing general-purpose set CRDTs associate every element in the set with causal contexts as meta data. Manipulation of causal contexts can be complicated and costly. We present a new set CRDT, CLSet (causal-length set), where the meta data associated with an element is simply a natural number (called causal length). We compare CLSet with existing general purpose CRDTs in terms of semantics and performance.

crdt，即无冲突复制数据类型，是保证复制数据收敛的数据抽象。Set是最基本和最广泛使用的数据类型之一。现有的通用集crdt将集合中的每个元素与因果上下文作为元数据关联起来。对因果语境的操纵既复杂又昂贵。我们提出了一个新的集CRDT, CLSet(因果长度集)，其中与元素相关联的元数据只是一个自然数(称为因果长度)。我们将CLSet与现有的通用crdt在语义和性能方面进行比较。

引用次数: 4

Paxos vs Raft: have we reached consensus on distributed consensus? Paxos vs Raft:我们在分布式共识上达成共识了吗?

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

Pub Date : 2020-04-10 DOI: 10.1145/3380787.3393681

H. Howard, R. Mortier

Distributed consensus is a fundamental primitive for constructing fault-tolerant, strongly-consistent distributed systems. Though many distributed consensus algorithms have been proposed, just two dominate production systems: Paxos, the traditional, famously subtle, algorithm; and Raft, a more recent algorithm positioned as a more understandable alternative to Paxos. In this paper, we consider the question of which algorithm, Paxos or Raft, is the better solution to distributed consensus? We analyse both to determine exactly how they differ by describing a simplified Paxos algorithm using Raft's terminology and pragmatic abstractions. We find that both Paxos and Raft take a very similar approach to distributed consensus, differing only in their approach to leader election. Most notably, Raft only allows servers with up-to-date logs to become leaders, whereas Paxos allows any server to be leader provided it then updates its log to ensure it is up-to-date. Raft's approach is surprisingly efficient given its simplicity as, unlike Paxos, it does not require log entries to be exchanged during leader election. We surmise that much of the understandability of Raft comes from the paper's clear presentation rather than being fundamental to the underlying algorithm being presented.

分布式共识是构建容错、强一致的分布式系统的基本原语。尽管已经提出了许多分布式共识算法，但只有两种占主导地位的生产系统:Paxos，一种传统的、以微妙著称的算法;Raft是一种较新的算法，定位为Paxos的更容易理解的替代品。在本文中，我们考虑的问题是，Paxos和Raft哪种算法是分布式共识的更好解决方案?通过使用Raft的术语和实用抽象描述一个简化的Paxos算法，我们对两者进行了分析，以确定它们究竟有何不同。我们发现Paxos和Raft都采用了非常相似的分布式共识方法，只是在领导者选举的方法上有所不同。最值得注意的是，Raft只允许具有最新日志的服务器成为leader，而Paxos允许任何服务器成为leader，只要它更新日志以确保它是最新的。Raft的方法非常简单，与Paxos不同的是，它不需要在领导人选举期间交换日志条目。我们推测，Raft的大部分可理解性来自于论文的清晰呈现，而不是所呈现的底层算法的基础。

{"title":"Paxos vs Raft: have we reached consensus on distributed consensus?","authors":"H. Howard, R. Mortier","doi":"10.1145/3380787.3393681","DOIUrl":"https://doi.org/10.1145/3380787.3393681","url":null,"abstract":"Distributed consensus is a fundamental primitive for constructing fault-tolerant, strongly-consistent distributed systems. Though many distributed consensus algorithms have been proposed, just two dominate production systems: Paxos, the traditional, famously subtle, algorithm; and Raft, a more recent algorithm positioned as a more understandable alternative to Paxos. In this paper, we consider the question of which algorithm, Paxos or Raft, is the better solution to distributed consensus? We analyse both to determine exactly how they differ by describing a simplified Paxos algorithm using Raft's terminology and pragmatic abstractions. We find that both Paxos and Raft take a very similar approach to distributed consensus, differing only in their approach to leader election. Most notably, Raft only allows servers with up-to-date logs to become leaders, whereas Paxos allows any server to be leader provided it then updates its log to ensure it is up-to-date. Raft's approach is surprisingly efficient given its simplicity as, unlike Paxos, it does not require log entries to be exchanged during leader election. We surmise that much of the understandability of Raft comes from the paper's clear presentation rather than being fundamental to the underlying algorithm being presented.","PeriodicalId":115452,"journal":{"name":"Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126819477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 20

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀