Big Data Warehouse: Building Columnar NoSQL OLAP Cubes

IF 0.6 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Decision Support System Technology Pub Date : 2020-01-01 DOI:10.4018/ijdsst.2020010101
Khaled Dehdouh, Omar Boussaïd, F. Bentayeb
{"title":"Big Data Warehouse: Building Columnar NoSQL OLAP Cubes","authors":"Khaled Dehdouh, Omar Boussaïd, F. Bentayeb","doi":"10.4018/ijdsst.2020010101","DOIUrl":null,"url":null,"abstract":"In the Big Data warehouse context, a column-oriented NoSQL database system is considered as the storage model which is highly adapted to data warehouses and online analysis. Indeed, the use of NoSQL models allows data scalability easily and the columnar store is suitable for storing and managing massive data, especially for decisional queries. However, the column-oriented NoSQL DBMS do not offer online analysis operators (OLAP). To build OLAP cubes corresponding to the analysis contexts, the most common way is to integrate other software such as HIVE or Kylin which has a CUBE operator to build data cubes. By using that, the cube is built according to the row-oriented approach and does not allow to fully obtain the benefits of a column-oriented approach. In this article, the focus is to define a cube operator called MC-CUBE (MapReduce Columnar CUBE), which allows building columnar NoSQL cubes according to the columnar approach by taking into account the non-relational and distributed aspects when data warehouses are stored.","PeriodicalId":42414,"journal":{"name":"International Journal of Decision Support System Technology","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Decision Support System Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijdsst.2020010101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 11

Abstract

In the Big Data warehouse context, a column-oriented NoSQL database system is considered as the storage model which is highly adapted to data warehouses and online analysis. Indeed, the use of NoSQL models allows data scalability easily and the columnar store is suitable for storing and managing massive data, especially for decisional queries. However, the column-oriented NoSQL DBMS do not offer online analysis operators (OLAP). To build OLAP cubes corresponding to the analysis contexts, the most common way is to integrate other software such as HIVE or Kylin which has a CUBE operator to build data cubes. By using that, the cube is built according to the row-oriented approach and does not allow to fully obtain the benefits of a column-oriented approach. In this article, the focus is to define a cube operator called MC-CUBE (MapReduce Columnar CUBE), which allows building columnar NoSQL cubes according to the columnar approach by taking into account the non-relational and distributed aspects when data warehouses are stored.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
大数据仓库:构建柱状NoSQL OLAP立方体
在大数据仓库环境下,面向列的NoSQL数据库系统被认为是高度适应数据仓库和在线分析的存储模型。实际上,使用NoSQL模型可以很容易地扩展数据,并且列式存储适合存储和管理大量数据,特别是对于决策查询。然而,面向列的NoSQL DBMS不提供在线分析操作符(OLAP)。要构建与分析上下文相对应的OLAP多维数据集,最常见的方法是集成其他软件,如HIVE或Kylin,它们有一个CUBE操作符来构建数据多维数据集。通过使用这种方法,多维数据集是根据面向行方法构建的,并且不允许完全获得面向列方法的优点。在本文中,重点是定义一个名为MC-CUBE (MapReduce Columnar cube)的多维数据集操作符,它允许在存储数据仓库时考虑非关系和分布式方面,根据列式方法构建列式NoSQL多维数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Decision Support System Technology
International Journal of Decision Support System Technology COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
2.20
自引率
18.20%
发文量
40
期刊最新文献
A Novel Query Method for Spatial Database Based on Improved K-Nearest Neighbor Algorithm Analysis and Evaluation of Roadblocks Hindering Lean-Green and Industry 4.0 Practices in Indian Manufacturing Industries Developing Fuzzy-AHP-Integrated Hybrid MCDM System of COPRAS-ARAS for Solving an Industrial Robot Selection Problem Generalized Parametric Intuitionistic Fuzzy Measures Based on Trigonometric Functions for Improved Decision-Making Problem An Efficient Method to Decide the Malicious Traffic
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1