Handling Outliers and Missing Data in Regression Models Using R: Simulation Examples

M. Abonazel
{"title":"Handling Outliers and Missing Data in Regression Models Using R: Simulation Examples","authors":"M. Abonazel","doi":"10.32861/ajams.68.187.203","DOIUrl":null,"url":null,"abstract":"This paper has reviewed two important problems in regression analysis (outliers and missing data), as well as some handling methods for these problems. Moreover, two applications have been introduced to understand and study these methods by R-codes. Practical evidence was provided to researchers to deal with those problems in regression modeling with R. Finally, we created a Monte Carlo simulation study to compare different handling methods of missing data in the regression model. Simulation results indicate that, under our simulation factors, the k-nearest neighbors method is the best method to estimate the missing values in regression models.","PeriodicalId":375032,"journal":{"name":"Academic Journal of Applied Mathematical Sciences","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Journal of Applied Mathematical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32861/ajams.68.187.203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

This paper has reviewed two important problems in regression analysis (outliers and missing data), as well as some handling methods for these problems. Moreover, two applications have been introduced to understand and study these methods by R-codes. Practical evidence was provided to researchers to deal with those problems in regression modeling with R. Finally, we created a Monte Carlo simulation study to compare different handling methods of missing data in the regression model. Simulation results indicate that, under our simulation factors, the k-nearest neighbors method is the best method to estimate the missing values in regression models.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用R处理回归模型中的异常值和缺失数据:模拟示例
本文综述了回归分析中的两个重要问题(异常值和缺失数据),以及这些问题的处理方法。此外,还介绍了两个应用程序来理解和研究这些方法。为研究人员使用r进行回归建模时处理这些问题提供了实践依据。最后,我们创建了蒙特卡罗模拟研究,比较了回归模型中缺失数据的不同处理方法。仿真结果表明,在我们的模拟因素下,k近邻法是估计回归模型缺失值的最佳方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Special Spirals are Produced by the ROTASE Galactic Spiral Equations with the Sequential Prime Numbers On Bivariate Modeling of the COVID-19 Data with a New Type I Half-Logistic Inverse Weibull Distribution Stable Numerical Differentiation Algorithms Based on the Fourier Transform in Frequency Domain Rotation Equation of a Point in Air and its Solution Linear Programming on Bread Production Using Uncertainty Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1