High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods

IF 6.5 2区 社会学 Q1 SOCIAL SCIENCES, MATHEMATICAL METHODS Sociological Methods & Research Pub Date : 2023-09-16 DOI:10.1177/00491241231200194
Edoardo Costantini, Kyle M. Lang, Tim Reeskens, Klaas Sijtsma
{"title":"High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods","authors":"Edoardo Costantini, Kyle M. Lang, Tim Reeskens, Klaas Sijtsma","doi":"10.1177/00491241231200194","DOIUrl":null,"url":null,"abstract":"Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been limited research on their relative performance. In this study, we investigated a wide range of extant high-dimensional MI techniques that can handle a large number of predictors in the imputation models and general missing data patterns. We assessed the relative performance of seven high-dimensional MI methods with a Monte Carlo simulation study and a resampling study based on real survey data. The performance of the methods was defined by the degree to which they facilitate unbiased and confidence-valid estimates of the parameters of complete data analysis models. We found that using lasso penalty or forward selection to select the predictors used in the MI model and using principal component analysis to reduce the dimensionality of auxiliary data produce the best results.","PeriodicalId":21849,"journal":{"name":"Sociological Methods & Research","volume":"5 1","pages":"0"},"PeriodicalIF":6.5000,"publicationDate":"2023-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sociological Methods & Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/00491241231200194","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}
引用次数: 3

Abstract

Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been limited research on their relative performance. In this study, we investigated a wide range of extant high-dimensional MI techniques that can handle a large number of predictors in the imputation models and general missing data patterns. We assessed the relative performance of seven high-dimensional MI methods with a Monte Carlo simulation study and a resampling study based on real survey data. The performance of the methods was defined by the degree to which they facilitate unbiased and confidence-valid estimates of the parameters of complete data analysis models. We found that using lasso penalty or forward selection to select the predictors used in the MI model and using principal component analysis to reduce the dimensionality of auxiliary data produce the best results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
社会科学的高维归算:最新方法的比较
在多重归算(MI)过程的归算模型中包含大量预测因子是归算者面临的最具挑战性的任务之一。各种高维MI技术可以提供帮助,但对其相对性能的研究有限。在这项研究中,我们研究了广泛的现有高维人工智能技术,这些技术可以处理大量的预测因子和一般缺失的数据模式。我们通过蒙特卡罗模拟研究和基于真实调查数据的重新抽样研究评估了七种高维MI方法的相对性能。这些方法的性能是由它们对完整数据分析模型的参数进行无偏和置信有效估计的程度来定义的。我们发现,使用套索惩罚或正向选择来选择MI模型中使用的预测因子,并使用主成分分析来降低辅助数据的维数,可以产生最好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
16.30
自引率
3.20%
发文量
40
期刊介绍: Sociological Methods & Research is a quarterly journal devoted to sociology as a cumulative empirical science. The objectives of SMR are multiple, but emphasis is placed on articles that advance the understanding of the field through systematic presentations that clarify methodological problems and assist in ordering the known facts in an area. Review articles will be published, particularly those that emphasize a critical analysis of the status of the arts, but original presentations that are broadly based and provide new research will also be published. Intrinsically, SMR is viewed as substantive journal but one that is highly focused on the assessment of the scientific status of sociology. The scope is broad and flexible, and authors are invited to correspond with the editors about the appropriateness of their articles.
期刊最新文献
The Potential for Using a Shortened Version of the Everyday Discrimination Scale in Population Research with Young Adults: A Construct Validation Investigation. Occupational Percentile Rank: A New Method for Constructing a Socioeconomic Index of Occupational Status Marginal and Conditional Confounding Using Logits. High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods Current and Future Debates in Video Data Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1