Optimization of Columnar NoSQL Data Warehouse Model with Clarans Clustering Algorithm

IF 0.7 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Computing and Informatics Pub Date : 2023-01-01 DOI:10.31577/cai_2023_3_762
N. Soussi
{"title":"Optimization of Columnar NoSQL Data Warehouse Model with Clarans Clustering Algorithm","authors":"N. Soussi","doi":"10.31577/cai_2023_3_762","DOIUrl":null,"url":null,"abstract":". In order to perfectly meet the needs of business leaders, decision-makers have resorted to the integration of external sources (such as Linked Open Data) in the decision-making system in order to enrich their existing data warehouses with new concepts contributing to bring added value to their organizations, enhance its productivity and retain its customers. However, the traditional data warehouse environment is not suitable to support external Big Data. To deal with this new challenge, several researches are oriented towards the direct conversion of classical relational data warehouse to a columnar NoSQL data warehouse, whereas the existing advanced works based on clustering algorithms are very limited and have several shortcomings. In this context, our paper proposes a new solution that conceives an optimized columnar data warehouse based on CLARANS clustering algorithm that has proven its effectiveness in generating optimal column families. Experimental re-sults improve the validity of our system by performing a detailed comparative study between the existing advanced approaches and our proposed optimized method.","PeriodicalId":55215,"journal":{"name":"Computing and Informatics","volume":"2 1","pages":"762-780"},"PeriodicalIF":0.7000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computing and Informatics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.31577/cai_2023_3_762","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

. In order to perfectly meet the needs of business leaders, decision-makers have resorted to the integration of external sources (such as Linked Open Data) in the decision-making system in order to enrich their existing data warehouses with new concepts contributing to bring added value to their organizations, enhance its productivity and retain its customers. However, the traditional data warehouse environment is not suitable to support external Big Data. To deal with this new challenge, several researches are oriented towards the direct conversion of classical relational data warehouse to a columnar NoSQL data warehouse, whereas the existing advanced works based on clustering algorithms are very limited and have several shortcomings. In this context, our paper proposes a new solution that conceives an optimized columnar data warehouse based on CLARANS clustering algorithm that has proven its effectiveness in generating optimal column families. Experimental re-sults improve the validity of our system by performing a detailed comparative study between the existing advanced approaches and our proposed optimized method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Clarans聚类算法的柱状NoSQL数据仓库模型优化
. 为了完美地满足商业领袖的需求,决策者在决策系统中采用了外部资源(如Linked Open Data)的集成,以便用新的概念丰富他们现有的数据仓库,从而为他们的组织带来附加价值,提高其生产力并保留其客户。然而,传统的数据仓库环境并不适合支持外部大数据。为了应对这一新的挑战,一些研究面向将经典关系数据仓库直接转换为列式NoSQL数据仓库,而现有的基于聚类算法的先进工作非常有限,并且存在一些不足。在此背景下,本文提出了一种新的解决方案,即基于CLARANS聚类算法构想一个优化的列数据仓库,该算法已被证明在生成最优列族方面是有效的。实验结果通过对现有的先进方法和本文提出的优化方法进行了详细的对比研究,提高了系统的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computing and Informatics
Computing and Informatics 工程技术-计算机:人工智能
CiteScore
1.60
自引率
14.30%
发文量
19
审稿时长
9 months
期刊介绍: Main Journal Topics: COMPUTER ARCHITECTURES AND NETWORKING PARALLEL AND DISTRIBUTED COMPUTING THEORETICAL FOUNDATIONS SOFTWARE ENGINEERING KNOWLEDGE AND INFORMATION ENGINEERING Apart from the main topics given above, the Editorial Board welcomes papers from other areas of computing and informatics.
期刊最新文献
Attribute-Based Access Control Policy Generation Approach from Access Logs Based on the CatBoost Classification of Sentiment Using Optimized Hybrid Deep Learning Model BERTDom: Protein Domain Boundary Prediction Using BERT Adaptive Evolutionary Multitasking to Solve Inter-Domain Path Computation Under Node-Defined Domain Uniqueness Constraint: New Solution Encoding Scheme mTreeIllustrator: A Mixed-Initiative Framework for Visual Exploratory Analysis of Multidimensional Hierarchical Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1