DATA QUALITY DIMENSIONS, METRICS, AND IMPROVEMENT TECHNIQUES

Menna Ibrahim Gabr, Y. Helmy, Doaa S. Elzanfaly
{"title":"DATA QUALITY DIMENSIONS, METRICS, AND IMPROVEMENT TECHNIQUES","authors":"Menna Ibrahim Gabr, Y. Helmy, Doaa S. Elzanfaly","doi":"10.54623/fue.fcij.6.1.3","DOIUrl":null,"url":null,"abstract":"Achieving high level of data quality is considered one of the most important assets for any small, medium and large size organizations. Data quality is the main hype for both practitioners and researchers who deal with traditional or big data. The level of data quality is measured through several quality dimensions. High percentage of the current studies focus on assessing and applying data quality on traditional data. As we are in the era of big data, the attention should be paid to the tremendous volume of generated and processed data in which 80% of all the generated data is unstructured. However, the initiatives for creating big data quality evaluation models are still under development. This paper investigates the data quality dimensions that are mostly used in both traditional and big data to figure out the metrics and techniques that are used to measure and handle each dimension. A complete definition for each traditional and big data quality dimension, metrics and handling techniques are presented in this paper. Many data quality dimensions can be applied to both traditional and big data, while few number of quality dimensions are either applied to traditional data or big data. Few number of data quality metrics and barely handling techniques are presented in the current works.","PeriodicalId":100561,"journal":{"name":"Future Computing and Informatics Journal","volume":"7 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Computing and Informatics Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54623/fue.fcij.6.1.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Achieving high level of data quality is considered one of the most important assets for any small, medium and large size organizations. Data quality is the main hype for both practitioners and researchers who deal with traditional or big data. The level of data quality is measured through several quality dimensions. High percentage of the current studies focus on assessing and applying data quality on traditional data. As we are in the era of big data, the attention should be paid to the tremendous volume of generated and processed data in which 80% of all the generated data is unstructured. However, the initiatives for creating big data quality evaluation models are still under development. This paper investigates the data quality dimensions that are mostly used in both traditional and big data to figure out the metrics and techniques that are used to measure and handle each dimension. A complete definition for each traditional and big data quality dimension, metrics and handling techniques are presented in this paper. Many data quality dimensions can be applied to both traditional and big data, while few number of quality dimensions are either applied to traditional data or big data. Few number of data quality metrics and barely handling techniques are presented in the current works.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数据质量维度、度量和改进技术
实现高水平的数据质量被认为是任何小型、中型和大型组织最重要的资产之一。数据质量是处理传统数据或大数据的从业者和研究人员的主要炒作。数据质量水平是通过几个质量维度来衡量的。目前有很大比例的研究侧重于传统数据的数据质量评估和应用。我们正处于大数据时代,需要注意的是产生和处理的数据量巨大,其中80%的生成数据是非结构化的。然而,创建大数据质量评估模型的举措仍在发展中。本文研究了传统数据和大数据中最常用的数据质量维度,以找出用于测量和处理每个维度的度量和技术。本文给出了每个传统和大数据质量维度、度量和处理技术的完整定义。许多数据质量维度可以同时适用于传统数据和大数据,而少数质量维度既适用于传统数据又适用于大数据。在目前的工作中,很少有数据质量度量和数据处理技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Relationship between E-CRM, Service Quality, Customer Satisfaction, Trust, and Loyalty in banking Industry Enhancing query processing on stock market cloud-based database Crow search algorithm with time varying flight length Strategies for feature selection A Framework to Enhance the International Competitive Advantage of Information Technology Graduates A Literature Review on Agile Methodologies Quality, eXtreme Programming and SCRUM
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1