Data Preprocessing: The Techniques for Preparing Clean and Quality Data for Data Analytics Process

Ashish P. Joshi, B. Patel
{"title":"Data Preprocessing: The Techniques for Preparing Clean and Quality Data for Data Analytics Process","authors":"Ashish P. Joshi, B. Patel","doi":"10.13005/ojcst13.0203.03","DOIUrl":null,"url":null,"abstract":"The model and pattern for real time data mining have an important role for decision making. The meaningful real time data mining is basically depends on the quality of data while row or rough data available at warehouse. The data available at warehouse can be in any format, it may huge or it may unstructured. These kinds of data require some process to enhance the efficiency of data analysis. The process to make it ready to use is called data preprocessing. There can be many activities for data preprocessing such as data transformation, data cleaning, data integration, data optimization and data conversion which are use to converting the rough data to quality data. The data preprocessing techniques are the vital step for the data mining. The analyzed result will be good as far as data quality is good. This paper is about the different data preprocessing techniques which can be use for preparing the quality data for the data analysis for the available rough data.","PeriodicalId":270258,"journal":{"name":"Oriental journal of computer science and technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oriental journal of computer science and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.13005/ojcst13.0203.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

The model and pattern for real time data mining have an important role for decision making. The meaningful real time data mining is basically depends on the quality of data while row or rough data available at warehouse. The data available at warehouse can be in any format, it may huge or it may unstructured. These kinds of data require some process to enhance the efficiency of data analysis. The process to make it ready to use is called data preprocessing. There can be many activities for data preprocessing such as data transformation, data cleaning, data integration, data optimization and data conversion which are use to converting the rough data to quality data. The data preprocessing techniques are the vital step for the data mining. The analyzed result will be good as far as data quality is good. This paper is about the different data preprocessing techniques which can be use for preparing the quality data for the data analysis for the available rough data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数据预处理:为数据分析过程准备干净和高质量数据的技术
实时数据挖掘的模型和模式对决策具有重要作用。有意义的实时数据挖掘基本上取决于数据的质量,而行数据或粗数据在仓库中可用。仓库中可用的数据可以是任何格式,它可能是巨大的,也可能是非结构化的。这些类型的数据需要一些流程来提高数据分析的效率。使数据准备好使用的过程称为数据预处理。数据预处理包括数据转换、数据清洗、数据集成、数据优化和数据转换等多种活动,用于将粗糙数据转换为高质量数据。数据预处理技术是数据挖掘的关键环节。只要数据质量好,分析结果就是好的。本文介绍了不同的数据预处理技术,这些技术可用于为现有的粗糙数据分析准备高质量的数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Logical Foundations for Reasoning in Cyber - Physical Systems A Reinforcement Learning Paradigm for Cybersecurity Education and Training Physical Distancing Detection System with Distance Sensor for Covid-19 Prevention A Comparison Between Position-Based and Image-Based Multi-Layer Graphical user Authentication System Arduino Uno Based Child Tracking System Using GPS and GSM
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1