稳健相关标度主成分回归

IF 0.7 4区 数学 Q2 MATHEMATICS Hacettepe Journal of Mathematics and Statistics Pub Date : 2023-03-31 DOI:10.15672/hujms.1122113
Aiman Tahi̇r, Dr. Maryam Ilyas
{"title":"稳健相关标度主成分回归","authors":"Aiman Tahi̇r, Dr. Maryam Ilyas","doi":"10.15672/hujms.1122113","DOIUrl":null,"url":null,"abstract":"In multiple regression, different techniques are available to deal with the situation where the predictors are large in number, and multicollinearity exists among them. Some of these approaches rely on correlation and others depend on principal components. To cope with the influential observations (outliers, leverage, or both) in the data matrix for regression purposes, two techniques are proposed in this paper. These are Robust Correlation Based Regression (RCBR) and Robust Correlation Scaled Principal Component Regression (RCSPCR). These proposed methods are compared with the existing methods, i.e., traditional Principal Component Regression (PCR), Correlation Scaled Principal Component Regression (CSPCR), and Correlation Based Regression (CBR). Also, Macro (Missingness and cellwise and row-wise outliers) RCSPCR is proposed to cope with the problem of multicollinearity, the high dimensionality of the dataset, outliers, and missing observations simultaneously. The proposed techniques are assessed by considering several simulated scenarios with appropriate levels of contamination. The results indicate that the suggested techniques seem to be more reliable for analyzing the data with missingness and outlyingness. Additionally, real-life data applications are also used to illustrate the performance of the proposed methods.","PeriodicalId":55078,"journal":{"name":"Hacettepe Journal of Mathematics and Statistics","volume":"17 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust correlation scaled principal component regression\",\"authors\":\"Aiman Tahi̇r, Dr. Maryam Ilyas\",\"doi\":\"10.15672/hujms.1122113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In multiple regression, different techniques are available to deal with the situation where the predictors are large in number, and multicollinearity exists among them. Some of these approaches rely on correlation and others depend on principal components. To cope with the influential observations (outliers, leverage, or both) in the data matrix for regression purposes, two techniques are proposed in this paper. These are Robust Correlation Based Regression (RCBR) and Robust Correlation Scaled Principal Component Regression (RCSPCR). These proposed methods are compared with the existing methods, i.e., traditional Principal Component Regression (PCR), Correlation Scaled Principal Component Regression (CSPCR), and Correlation Based Regression (CBR). Also, Macro (Missingness and cellwise and row-wise outliers) RCSPCR is proposed to cope with the problem of multicollinearity, the high dimensionality of the dataset, outliers, and missing observations simultaneously. The proposed techniques are assessed by considering several simulated scenarios with appropriate levels of contamination. The results indicate that the suggested techniques seem to be more reliable for analyzing the data with missingness and outlyingness. Additionally, real-life data applications are also used to illustrate the performance of the proposed methods.\",\"PeriodicalId\":55078,\"journal\":{\"name\":\"Hacettepe Journal of Mathematics and Statistics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Hacettepe Journal of Mathematics and Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.15672/hujms.1122113\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hacettepe Journal of Mathematics and Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.15672/hujms.1122113","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 1

摘要

在多元回归中,有不同的技术来处理预测因子数量较多,且预测因子之间存在多重共线性的情况。其中一些方法依赖于相关性,另一些依赖于主成分。为了处理回归目的的数据矩阵中的有影响的观测值(异常值、杠杆或两者),本文提出了两种技术。它们是稳健相关回归(RCBR)和稳健相关尺度主成分回归(RCSPCR)。并与传统主成分回归(PCR)、相关尺度主成分回归(CSPCR)和基于相关的回归(CBR)等方法进行了比较。此外,为了同时解决多重共线性、高维数据集、异常值和缺失观测值的问题,提出了宏(缺失、单元和行异常值)RCSPCR。通过考虑几种具有适当污染水平的模拟情景来评估所建议的技术。结果表明,所提出的方法对于缺失性和离群性数据的分析更为可靠。此外,还使用实际数据应用来说明所提出方法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Robust correlation scaled principal component regression
In multiple regression, different techniques are available to deal with the situation where the predictors are large in number, and multicollinearity exists among them. Some of these approaches rely on correlation and others depend on principal components. To cope with the influential observations (outliers, leverage, or both) in the data matrix for regression purposes, two techniques are proposed in this paper. These are Robust Correlation Based Regression (RCBR) and Robust Correlation Scaled Principal Component Regression (RCSPCR). These proposed methods are compared with the existing methods, i.e., traditional Principal Component Regression (PCR), Correlation Scaled Principal Component Regression (CSPCR), and Correlation Based Regression (CBR). Also, Macro (Missingness and cellwise and row-wise outliers) RCSPCR is proposed to cope with the problem of multicollinearity, the high dimensionality of the dataset, outliers, and missing observations simultaneously. The proposed techniques are assessed by considering several simulated scenarios with appropriate levels of contamination. The results indicate that the suggested techniques seem to be more reliable for analyzing the data with missingness and outlyingness. Additionally, real-life data applications are also used to illustrate the performance of the proposed methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.70
自引率
0.00%
发文量
100
审稿时长
6-12 weeks
期刊介绍: Hacettepe Journal of Mathematics and Statistics covers all aspects of Mathematics and Statistics. Papers on the interface between Mathematics and Statistics are particularly welcome, including applications to Physics, Actuarial Sciences, Finance and Economics. We strongly encourage submissions for Statistics Section including current and important real world examples across a wide range of disciplines. Papers have innovations of statistical methodology are highly welcome. Purely theoretical papers may be considered only if they include popular real world applications.
期刊最新文献
Reduced-Order Modeling for Heston Stochastic Volatility Model Deferred statistical order convergence in Riesz spaces A numerical approach for a dynamical system of fractional infectious disease problem Generalized product-type operators between Bloch-type spaces Finite-time property of a mechanical viscoelastic system with nonlinear boundary conditions on corner-Sobolev spaces
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1