增加样本量对多层次非线性模型多重共线性的影响

S. B. Atoyebi, Titi Obilade
{"title":"增加样本量对多层次非线性模型多重共线性的影响","authors":"S. B. Atoyebi, Titi Obilade","doi":"10.9734/ajpas/2024/v26i1587","DOIUrl":null,"url":null,"abstract":"A multilevel logistic regression model demonstrating high correlations among predictor variables is susceptible to multi-collinearity. Multi-collinearity significantly impacts the robustness and interpretability of multilevel non-linear models. In multilevel non-linear models, the effects of multi-collinearity can be amplified, leading to distorted parameter estimates and inflated standard errors. This phenomenon contributes to an escalation in the variances of parameter estimates, thereby resulting in inaccurate inferences regarding the relationships between the response and explanatory factors. The primary objective of this study is to investigate the impact of multi-collinearity on multilevel non-linear models. The research aims to assess whether the quantity of independent variables influences the Multilevel Variance Inflation Factor and to explore the effect of altering the correlation degree at one level on multi-collinearity within a multilevel non-linear model. Additionally, the research seeks to determine how multi-collinearity affects the standard errors of parameters in a multilevel non-linear model. In a 2-level logistic regression, a binary variable was the dependent variable, while pre-established standard variables functioned as regressors. The Monte Carlo analysis incorporated three distinct correlation strengths (0.2, 0.5, and 0.9) and sample sizes (500, 100, and 30). The Multilevel Variance Inflation Factor was employed for multi-collinearity diagnosis. The outcomes revealed that, within the logistic multilevel regression model, an increase in sample size correlated with a reduction in multi-collinearity. Notably, the influence of multi-collinearity on standard errors in a multilevel non-linear model was more pronounced. It was observed that increasing the sample size remains an effective strategy to mitigate multi-collinearity errors in a multilevel non-linear model. This approach is particularly crucial due to the reliance on maximum likelihood estimation in logistic regression, as opposed to ordinary least squares (OLS) regression, which contrasts with the methodology of OLS regression.","PeriodicalId":8532,"journal":{"name":"Asian Journal of Probability and Statistics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effect of Increasing Sample Size on Multi-Collinearity in Multilevel Non-Linear Model\",\"authors\":\"S. B. Atoyebi, Titi Obilade\",\"doi\":\"10.9734/ajpas/2024/v26i1587\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A multilevel logistic regression model demonstrating high correlations among predictor variables is susceptible to multi-collinearity. Multi-collinearity significantly impacts the robustness and interpretability of multilevel non-linear models. In multilevel non-linear models, the effects of multi-collinearity can be amplified, leading to distorted parameter estimates and inflated standard errors. This phenomenon contributes to an escalation in the variances of parameter estimates, thereby resulting in inaccurate inferences regarding the relationships between the response and explanatory factors. The primary objective of this study is to investigate the impact of multi-collinearity on multilevel non-linear models. The research aims to assess whether the quantity of independent variables influences the Multilevel Variance Inflation Factor and to explore the effect of altering the correlation degree at one level on multi-collinearity within a multilevel non-linear model. Additionally, the research seeks to determine how multi-collinearity affects the standard errors of parameters in a multilevel non-linear model. In a 2-level logistic regression, a binary variable was the dependent variable, while pre-established standard variables functioned as regressors. The Monte Carlo analysis incorporated three distinct correlation strengths (0.2, 0.5, and 0.9) and sample sizes (500, 100, and 30). The Multilevel Variance Inflation Factor was employed for multi-collinearity diagnosis. The outcomes revealed that, within the logistic multilevel regression model, an increase in sample size correlated with a reduction in multi-collinearity. Notably, the influence of multi-collinearity on standard errors in a multilevel non-linear model was more pronounced. It was observed that increasing the sample size remains an effective strategy to mitigate multi-collinearity errors in a multilevel non-linear model. This approach is particularly crucial due to the reliance on maximum likelihood estimation in logistic regression, as opposed to ordinary least squares (OLS) regression, which contrasts with the methodology of OLS regression.\",\"PeriodicalId\":8532,\"journal\":{\"name\":\"Asian Journal of Probability and Statistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Probability and Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9734/ajpas/2024/v26i1587\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Probability and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/ajpas/2024/v26i1587","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

多层次逻辑回归模型的预测变量之间存在高度相关性,容易产生多重共线性。多重共线性会严重影响多层次非线性模型的稳健性和可解释性。在多层次非线性模型中,多重共线性的影响会被放大,从而导致参数估计失真和标准误差膨胀。这种现象会导致参数估计值的方差增大,从而导致对反应因素和解释因素之间关系的推断不准确。本研究的主要目的是调查多重共线性对多层次非线性模型的影响。研究旨在评估自变量的数量是否会影响多层次方差膨胀因子,并探讨在多层次非线性模型中改变一个层次的相关程度对多重共线性的影响。此外,研究还试图确定多重共线性如何影响多层次非线性模型中参数的标准误差。在 2 级逻辑回归中,二元变量是因变量,而预先确定的标准变量则是回归变量。蒙特卡罗分析采用了三种不同的相关强度(0.2、0.5 和 0.9)和样本量(500、100 和 30)。多层次方差膨胀因子用于多重共线性诊断。结果显示,在逻辑多层次回归模型中,样本量的增加与多重共线性的减少相关。值得注意的是,在多层次非线性模型中,多重共线性对标准误差的影响更为明显。据观察,在多层次非线性模型中,增加样本量仍然是减少多重共线性误差的有效策略。由于逻辑回归依赖最大似然估计,而不是普通最小二乘法(OLS)回归,这与 OLS 回归的方法截然不同,因此这种方法尤为重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Effect of Increasing Sample Size on Multi-Collinearity in Multilevel Non-Linear Model
A multilevel logistic regression model demonstrating high correlations among predictor variables is susceptible to multi-collinearity. Multi-collinearity significantly impacts the robustness and interpretability of multilevel non-linear models. In multilevel non-linear models, the effects of multi-collinearity can be amplified, leading to distorted parameter estimates and inflated standard errors. This phenomenon contributes to an escalation in the variances of parameter estimates, thereby resulting in inaccurate inferences regarding the relationships between the response and explanatory factors. The primary objective of this study is to investigate the impact of multi-collinearity on multilevel non-linear models. The research aims to assess whether the quantity of independent variables influences the Multilevel Variance Inflation Factor and to explore the effect of altering the correlation degree at one level on multi-collinearity within a multilevel non-linear model. Additionally, the research seeks to determine how multi-collinearity affects the standard errors of parameters in a multilevel non-linear model. In a 2-level logistic regression, a binary variable was the dependent variable, while pre-established standard variables functioned as regressors. The Monte Carlo analysis incorporated three distinct correlation strengths (0.2, 0.5, and 0.9) and sample sizes (500, 100, and 30). The Multilevel Variance Inflation Factor was employed for multi-collinearity diagnosis. The outcomes revealed that, within the logistic multilevel regression model, an increase in sample size correlated with a reduction in multi-collinearity. Notably, the influence of multi-collinearity on standard errors in a multilevel non-linear model was more pronounced. It was observed that increasing the sample size remains an effective strategy to mitigate multi-collinearity errors in a multilevel non-linear model. This approach is particularly crucial due to the reliance on maximum likelihood estimation in logistic regression, as opposed to ordinary least squares (OLS) regression, which contrasts with the methodology of OLS regression.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Bayesian Sequential Updation and Prediction of Currency in Circulation Using a Weighted Prior Assessment of Required Sample Sizes for Estimating Proportions Rainfall Pattern in Kenya: Bayesian Non-parametric Model Based on the Normalized Generalized Gamma Process Advancing Retail Predictions: Integrating Diverse Machine Learning Models for Accurate Walmart Sales Forecasting Common Fixed-Point Theorem for Expansive Mappings in Dualistic Partial Metric Spaces
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1