Data Warehouse Failover Cluster for Analytical Queries in Banking

V. V. Sivov, V. Bogatyrev
Journal: International Journal of Advanced Engineering Research and Science
Published: 2023-04-17
DOI: https://doi.org/10.23947/2687-1653-2023-23-1-76-84

Abstract

Introduction. Data storage is a high priority for the banking sector because it is critical to business operations, and the volume of data in this area is growing steadily. As more data has to be stored, processed, and analyzed, selecting a suitable data storage solution and developing the required architecture becomes critically important. This research aims to fill the gap in existing knowledge about database management systems (DBMS) suitable for the banking sector and to suggest ways to build a fault-tolerant data storage cluster. The purpose of the work is to analyze the key DBMS for analytical queries, determine the DBMS priorities of the banking sector, and develop a fault-tolerant data storage cluster. A data storage solution with a fault-tolerant architecture is proposed that meets the performance and scalability requirements of the banking sector.

Materials and Methods. Domain analysis was used to compile a set of characteristics that a DBMS for analytical queries (online analytical processing, OLAP) should satisfy, to compare several popular OLAP DBMS, and to propose a fault-tolerant cluster configuration written in XML and supported by the ClickHouse DBMS. Deployment was automated with Ansible playbooks integrated with the GitLab version control system and Jinja templates, which made it possible to deploy the configuration rapidly on all nodes of the cluster.

Results. Criteria for OLAP databases were developed, and several popular systems were compared against them. As a result, a reliable cluster configuration that meets the requirements of analytical queries is proposed for the banking industry. The deployment process was automated to increase the reliability and scalability of the DBMS. Detailed diagrams of the cluster configuration are also provided.

Discussion and Conclusions. The compiled criteria for OLAP DBMS help an organization determine whether it needs such a solution. Organizations can use the comparison of popular DBMS to minimize costs when selecting a solution. The proposed configuration of the data warehouse cluster for analytical queries in the banking sector will improve the reliability of the DBMS and meet the requirements for subsequent scalability. Automating cluster deployment by templating configuration files in Ansible playbooks makes it possible to configure a ready-made cluster on new servers in minutes.
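
The abstract does not reproduce the cluster configuration itself. As a rough illustration of what templating a ClickHouse cluster definition can look like, the following minimal sketch generates a `remote_servers` section from a list of shards and replicas. It uses Python's standard library as a stand-in for the Ansible and Jinja templating described above. The cluster name, host names, shard layout, and the helper `build_remote_servers` are all hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET


def build_remote_servers(cluster_name, shards, port=9000):
    """Return a ClickHouse <remote_servers> section as an XML string.

    shards is a list of shards; each shard is a list of replica host names.
    All names passed in are illustrative, not the paper's actual topology.
    """
    root = ET.Element("clickhouse")
    remote = ET.SubElement(root, "remote_servers")
    cluster = ET.SubElement(remote, cluster_name)
    for replicas in shards:
        shard = ET.SubElement(cluster, "shard")
        # Delegate copying between replicas to the replicated table engine.
        ET.SubElement(shard, "internal_replication").text = "true"
        for host in replicas:
            replica = ET.SubElement(shard, "replica")
            ET.SubElement(replica, "host").text = host
            ET.SubElement(replica, "port").text = str(port)
    if hasattr(ET, "indent"):  # pretty-printing is available on Python 3.9+
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical layout: two shards, each with two replicas.
    layout = [["ch-s1-r1", "ch-s1-r2"], ["ch-s2-r1", "ch-s2-r2"]]
    print(build_remote_servers("analytics", layout))
```

Keeping two replicas per shard means a single node failure leaves every shard readable, which is the kind of property a fault-tolerant layout aims for; the actual number of shards and replicas used in the paper is not stated in the abstract.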