Consistency issue and related trade-offs in distributed replicated systems and databases: a review

Q3 Computer Science Radioelectronic and Computer Systems Pub Date : 2023-05-25 DOI:10.32620/reks.2023.2.14
Jaafar Ahmed, Andrii Karpenko, Olga Tarasyuk, Anatoliy Gorbenko, Akbar Sheikh-Akbari
{"title":"Consistency issue and related trade-offs in distributed replicated systems and databases: a review","authors":"Jaafar Ahmed, Andrii Karpenko, Olga Tarasyuk, Anatoliy Gorbenko, Akbar Sheikh-Akbari","doi":"10.32620/reks.2023.2.14","DOIUrl":null,"url":null,"abstract":"Distributed replicated databases play a crucial role in modern computer systems enabling scalable, fault-tolerant, and high-performance data management. However, achieving these qualities requires resolving a number of trade-offs between various properties during system design and operation. This paper reviews trade-offs in distributed replicated databases and provides a survey of recent research papers studying distributed data storage. The paper first discusses a compromise between consistency and latency that appears in distributed replicated data storages and directly follows from CAP and PACELC theorems. Consistency refers to the guarantee that all clients in a distributed system observe the same data at the same time. To ensure strong consistency, distributed systems typically employ coordination mechanisms and synchronization protocols that involve communication and agreement among distributed replicas. These mechanisms introduce additional overhead and latency and can dramatically increase the time taken to complete operations when replicas are globally distributed across the Internet. In addition, we study trade-offs between other system properties including availability, durability, cost, energy consumption, read and write latency, etc. In this paper we also provide a comprehensive review and classification of recent research works in distributed replicated databases. Reviewed papers showcase several major areas of research, ranging from performance evaluation and comparison of various NoSQL databases to suggest new strategies for data replication and putting forward new consistency models. In particular, we observed a shift towards exploring hybrid consistency models of causal consistency and eventual consistency with causal ordering due to their ability to strike a balance between operations ordering guarantees and high performance. Researchers have also proposed various consistency control algorithms and consensus quorum protocols to coordinate distributed replicas. Insights from this review can empower practitioners to make informed decisions in designing and managing distributed data storage systems as well as help identify existing gaps in the body of knowledge and suggest further research directions.","PeriodicalId":36122,"journal":{"name":"Radioelectronic and Computer Systems","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radioelectronic and Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32620/reks.2023.2.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1

Abstract

Distributed replicated databases play a crucial role in modern computer systems enabling scalable, fault-tolerant, and high-performance data management. However, achieving these qualities requires resolving a number of trade-offs between various properties during system design and operation. This paper reviews trade-offs in distributed replicated databases and provides a survey of recent research papers studying distributed data storage. The paper first discusses a compromise between consistency and latency that appears in distributed replicated data storages and directly follows from CAP and PACELC theorems. Consistency refers to the guarantee that all clients in a distributed system observe the same data at the same time. To ensure strong consistency, distributed systems typically employ coordination mechanisms and synchronization protocols that involve communication and agreement among distributed replicas. These mechanisms introduce additional overhead and latency and can dramatically increase the time taken to complete operations when replicas are globally distributed across the Internet. In addition, we study trade-offs between other system properties including availability, durability, cost, energy consumption, read and write latency, etc. In this paper we also provide a comprehensive review and classification of recent research works in distributed replicated databases. Reviewed papers showcase several major areas of research, ranging from performance evaluation and comparison of various NoSQL databases to suggest new strategies for data replication and putting forward new consistency models. In particular, we observed a shift towards exploring hybrid consistency models of causal consistency and eventual consistency with causal ordering due to their ability to strike a balance between operations ordering guarantees and high performance. Researchers have also proposed various consistency control algorithms and consensus quorum protocols to coordinate distributed replicas. Insights from this review can empower practitioners to make informed decisions in designing and managing distributed data storage systems as well as help identify existing gaps in the body of knowledge and suggest further research directions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分布式复制系统和数据库中的一致性问题和相关权衡:综述
分布式复制数据库在现代计算机系统中起着至关重要的作用,它支持可伸缩、容错和高性能的数据管理。然而,要实现这些品质,需要在系统设计和操作期间解决各种特性之间的许多权衡。本文回顾了分布式复制数据库的利弊,并对最近关于分布式数据存储的研究论文进行了综述。本文首先讨论了分布式复制数据存储中出现的一致性和延迟之间的折衷,并直接遵循CAP和PACELC定理。一致性是指保证分布式系统中的所有客户端同时观察到相同的数据。为了确保强一致性,分布式系统通常采用协调机制和同步协议,这些机制和协议涉及分布式副本之间的通信和协议。这些机制引入了额外的开销和延迟,并且当副本在Internet上全局分布时,可能会大大增加完成操作所需的时间。此外,我们还研究了其他系统属性之间的权衡,包括可用性、持久性、成本、能耗、读写延迟等。本文还对分布式复制数据库的最新研究成果进行了综述和分类。回顾的论文展示了几个主要的研究领域,从性能评估和各种NoSQL数据库的比较,到提出新的数据复制策略和提出新的一致性模型。特别是,我们观察到一种向探索因果一致性的混合一致性模型和因果排序的最终一致性的转变,因为它们能够在操作排序保证和高性能之间取得平衡。研究人员还提出了各种一致性控制算法和共识仲裁协议来协调分布式副本。本综述的见解可以使从业者在设计和管理分布式数据存储系统时做出明智的决策,并有助于识别知识体系中存在的差距,并建议进一步的研究方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Radioelectronic and Computer Systems
Radioelectronic and Computer Systems Computer Science-Computer Graphics and Computer-Aided Design
CiteScore
3.60
自引率
0.00%
发文量
50
审稿时长
2 weeks
期刊最新文献
Risk and uncertainty assessment in software project management: integrating decision trees and Monte Carlo modeling Advanced file carving: ontology, models and methods Modeling the mindfulness people's function based on the recognition of biometric parameters by artificial intelligence elements Influence of the number system in residual classes on the fault tolerance of the computer system A method for extracting the semantic features of speech signal recognition based on empirical wavelet transform
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1