A Primer of Data Cleaning in Quantitative Research: Handling Missing Values and Outliers

Journal of Advanced Nursing (IF 3.4; CAS Zone 3, Medicine; JCR Q1, Nursing). Pub Date: 2025-03-27. DOI: 10.1111/jan.16908
Amir Masoud Sharifnia, Daniel Edem Kpormegbey, Deependra Kaji Thapa, Michelle Cleary
{"title":"A Primer of Data Cleaning in Quantitative Research: Handling Missing Values and Outliers","authors":"Amir Masoud Sharifnia,&nbsp;Daniel Edem Kpormegbey,&nbsp;Deependra Kaji Thapa,&nbsp;Michelle Cleary","doi":"10.1111/jan.16908","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Aims</h3>\n \n <p>This paper discusses data errors and offers guidance on data cleaning techniques, with a particular focus on handling missing values and outliers in quantitative datasets.</p>\n </section>\n \n <section>\n \n <h3> Design and Methods</h3>\n \n <p>Methodological discussion.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>This paper provides an overview of various techniques for identifying and addressing data anomalies, which can arise from incomplete, noisy, and inconsistent data. These anomalies can significantly affect data quality, leading to biased model parameter estimates and evidence-based decisions. Data cleaning, particularly the appropriate handling of missing values and outliers, is essential to improving data quality before analysis. Data cleaning includes screening for anomalies, diagnosing errors, and applying appropriate corrective measures.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Proper handling of missing values and the identification and correction of outliers are crucial aspects of data cleaning in ensuring data quality and the reliability of statistical analyses. Effective data cleaning enhances the validity and accuracy of research findings for evidence-based decision making that leads to optimal patient outcomes.</p>\n </section>\n \n <section>\n \n <h3> Implications for the Profession</h3>\n \n <p>The quality of study results depends on how a dataset and its complexities are processed or handled before the analysis. 
Nursing researchers must use a framework to identify and address important data anomalies and produce reliable results.</p>\n </section>\n \n <section>\n \n <h3> Impact</h3>\n \n <p>This paper describes data cleaning, often overlooked during the data mining process, as a crucial step before conducting data analysis. By addressing missing values and outliers, identifying and fixing data anomalies, and enhancing data quality prior to analysis, data cleaning techniques can produce precise research findings for evidence-based decision making.</p>\n </section>\n \n <section>\n \n <h3> Reporting Method</h3>\n \n <p>In this methodological paper, no new data were generated.</p>\n </section>\n \n <section>\n \n <h3> Patient or Public Contribution</h3>\n \n <p>No patient or public contribution.</p>\n </section>\n </div>","PeriodicalId":54897,"journal":{"name":"Journal of Advanced Nursing","volume":"82 1","pages":"970-975"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jan.16908","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Nursing","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jan.16908","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}
Citations: 0

Abstract

Aims

This paper discusses data errors and offers guidance on data cleaning techniques, with a particular focus on handling missing values and outliers in quantitative datasets.

Design and Methods

Methodological discussion.

Results

This paper provides an overview of techniques for identifying and addressing data anomalies, which can arise from incomplete, noisy, and inconsistent data. These anomalies can significantly degrade data quality, biasing model parameter estimates and undermining evidence-based decisions. Data cleaning, particularly the appropriate handling of missing values and outliers, is essential to improving data quality before analysis. It comprises three steps: screening for anomalies, diagnosing errors, and applying appropriate corrective measures.
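The screening, diagnosing, and correcting steps described above can be illustrated with a minimal Python sketch. The abstract does not name specific techniques, so everything here is an assumption chosen for illustration: the 1.5 × IQR rule for flagging outliers, median imputation for missing values, the helper names (`screen_missing`, `iqr_outliers`, `impute_median`), and the hypothetical systolic blood pressure readings. It is not the authors' method.

```python
import math
import statistics

def screen_missing(values):
    """Screening: return indices of missing entries (None or NaN)."""
    return [i for i, v in enumerate(values)
            if v is None or (isinstance(v, float) and math.isnan(v))]

def iqr_outliers(values, k=1.5):
    """Screening: return indices of observed values outside
    [Q1 - k*IQR, Q3 + k*IQR]. The k=1.5 fence is a common convention,
    assumed here for illustration."""
    missing = set(screen_missing(values))
    observed = [v for i, v in enumerate(values) if i not in missing]
    q1, _, q3 = statistics.quantiles(observed, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values)
            if i not in missing and not (low <= v <= high)]

def impute_median(values, exclude=()):
    """Correction: fill missing entries with the median of the observed
    values, ignoring indices in `exclude` (e.g. flagged outliers).
    Flagged outliers themselves are left unchanged for manual diagnosis."""
    missing = set(screen_missing(values))
    skip = missing | set(exclude)
    fill = statistics.median(v for i, v in enumerate(values) if i not in skip)
    return [fill if i in missing else v for i, v in enumerate(values)]

# Hypothetical systolic blood pressure readings (mmHg); 410 is likely an entry error.
sbp = [118, 122, None, 130, 125, 410, 119, float("nan"), 127, 121]

missing_idx = screen_missing(sbp)        # positions with no recorded value
outlier_idx = iqr_outliers(sbp)          # positions needing diagnosis
cleaned = impute_median(sbp, exclude=outlier_idx)

print("missing:", missing_idx)
print("outliers:", outlier_idx)
print("cleaned:", cleaned)
```

The sketch deliberately flags the outlier rather than overwriting it, since deciding whether an extreme value is an error or a genuine observation is the diagnostic step and usually requires checking the source record. Median imputation is only one simple option; whether it is appropriate depends on why the data are missing.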

Conclusion

Proper handling of missing values and the identification and correction of outliers are crucial aspects of data cleaning that help ensure data quality and the reliability of statistical analyses. Effective data cleaning enhances the validity and accuracy of research findings, supporting evidence-based decision making that leads to optimal patient outcomes.

Implications for the Profession

The quality of study results depends on how a dataset and its complexities are processed or handled before the analysis. Nursing researchers must use a framework to identify and address important data anomalies and produce reliable results.

Impact

This paper describes data cleaning, a step often overlooked during the data mining process, as crucial before conducting data analysis. By addressing missing values and outliers, identifying and fixing data anomalies, and enhancing data quality prior to analysis, data cleaning techniques can yield more precise research findings for evidence-based decision making.

Reporting Method

In this methodological paper, no new data were generated.

Patient or Public Contribution

No patient or public contribution.

Source journal

CiteScore: 6.40
Self-citation rate: 7.90%
Articles published: 369
Review time: 3 months

Journal description: The Journal of Advanced Nursing (JAN) contributes to the advancement of evidence-based nursing, midwifery and healthcare by disseminating high quality research and scholarship of contemporary relevance and with potential to advance knowledge for practice, education, management or policy. All JAN papers are required to have a sound scientific, evidential, theoretical or philosophical base and to be critical, questioning and scholarly in approach. As an international journal, JAN promotes diversity of research and scholarship in terms of culture, paradigm and healthcare context. For JAN’s worldwide readership, authors are expected to make clear the wider international relevance of their work and to demonstrate sensitivity to cultural considerations and differences.