Practical strategies for handling breakdown of multiple imputation procedures.

Emerging Themes in Epidemiology (IF 3.6, JCR Q1, Public, Environmental & Occupational Health). Pub Date: 2021-04-01. DOI: 10.1186/s12982-021-00095-3
Cattram D Nguyen, John B Carlin, Katherine J Lee
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017730/pdf/
Citations: 0

Abstract

Multiple imputation is a recommended method for handling incomplete data problems. One of the barriers to its successful use is the breakdown of the multiple imputation procedure, often due to numerical problems with the algorithms used within the imputation process. These problems frequently occur when imputation models contain large numbers of variables, especially with the popular approach of multivariate imputation by chained equations. This paper describes common causes of failure of the imputation procedure including perfect prediction and collinearity, focusing on issues when using Stata software. We outline a number of strategies for addressing these issues, including imputation of composite variables instead of individual components, introducing prior information and changing the form of the imputation model. These strategies are illustrated using a case study based on data from the Longitudinal Study of Australian Children.
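
As an illustration only, the two causes of failure named in the abstract can be screened for before running an imputation model. The sketch below is not the authors' code (the paper works in Stata), and the function names and toy data are hypothetical. It flags predictor levels in which a binary variable takes only one value (perfect prediction) and checks whether a set of predictors is linearly dependent (collinearity), for example when a composite score is included alongside all of its components.

```python
from fractions import Fraction


def perfectly_predicted_levels(predictor, outcome):
    """Return predictor levels within which the outcome takes a single value.

    Pairs with a missing value (None) in either variable are skipped.
    """
    seen = {}
    for x, y in zip(predictor, outcome):
        if x is None or y is None:
            continue
        seen.setdefault(x, set()).add(y)
    return sorted(level for level, ys in seen.items() if len(ys) == 1)


def matrix_rank(rows):
    """Rank of a matrix (list of rows) by Gaussian elimination, using exact fractions."""
    m = [[Fraction(v) for v in row] for row in rows]
    n_cols = len(m[0]) if m else 0
    rank = 0
    for col in range(n_cols):
        # Find a row at or below the current rank with a non-zero entry in this column.
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[rank])]
        rank += 1
    return rank


def is_collinear(rows):
    """True if the columns of the design matrix are linearly dependent."""
    return matrix_rank(rows) < len(rows[0])


# Toy data (hypothetical): every record with exposure == 1 has outcome == 1.
exposure = [0, 0, 0, 1, 1, 1]
outcome = [0, 1, 0, 1, 1, 1]
print(perfectly_predicted_levels(exposure, outcome))  # [1]

# Columns: intercept, item_a, item_b, total, where total = item_a + item_b.
design = [
    [1, 2, 3, 5],
    [1, 1, 4, 5],
    [1, 0, 2, 2],
    [1, 3, 1, 4],
    [1, 5, 0, 5],
]
print(is_collinear(design))                         # True
print(is_collinear([row[:3] for row in design]))    # False
```

A check of this kind only locates a problem. Which remedy to apply (imputing the composite instead of its components, introducing prior information, or changing the form of the imputation model) is the subject of the paper itself.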

Source journal: Emerging Themes in Epidemiology
Subject area: Medicine-Epidemiology
CiteScore: 4.40
Self-citation rate: 4.30%
Articles published: 9
Review time: 28 weeks
About the journal: Emerging Themes in Epidemiology is an open access, peer-reviewed, online journal that aims to promote debate and discussion on practical and theoretical aspects of epidemiology. Combining statistical approaches with an understanding of the biology of disease, epidemiologists seek to elucidate the social, environmental and host factors related to adverse health outcomes. Although research findings from epidemiologic studies abound in traditional public health journals, little publication space is devoted to discussion of the practical and theoretical concepts that underpin them. Because of its immediate impact on public health, an openly accessible forum is needed in the field of epidemiology to foster such discussion.
Latest articles from this journal:
Explaining biological differences between men and women by gendered mechanisms.
Population cause of death estimation using verbal autopsy methods in large-scale field trials of maternal and child health: lessons learned from a 20-year research collaboration in Central Ghana.
Dynamics of COVID-19 progression and the long-term influences of measures on pandemic outcomes.
Effect size quantification for interrupted time series analysis: implementation in R and analysis for Covid-19 research.
Geographical clustering and geographically weighted regression analysis of home delivery and its determinants in developing regions of Ethiopia: a spatial analysis.