Standard multiple imputation of survey data didn't perform better than simple substitution in enhancing an administrative dataset: the example of self-rated health in England.

IF 3.6 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Emerging Themes in Epidemiology Pub Date : 2021-07-24 DOI:10.1186/s12982-021-00099-z
Frank Popham, Elise Whitley, Oarabile Molaodi, Linsay Gray
{"title":"Standard multiple imputation of survey data didn't perform better than simple substitution in enhancing an administrative dataset: the example of self-rated health in England.","authors":"Frank Popham,&nbsp;Elise Whitley,&nbsp;Oarabile Molaodi,&nbsp;Linsay Gray","doi":"10.1186/s12982-021-00099-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Health surveys provide a rich array of information but on relatively small numbers of individuals and evidence suggests that they are becoming less representative as response levels fall. Routinely collected administrative data offer more extensive population coverage but typically comprise fewer health topics. We explore whether data combination and multiple imputation of health variables from survey data is a simple and robust way of generating these variables in the general population.</p><p><strong>Methods: </strong>We use the UK Integrated Household Survey and the English 2011 population census both of which included self-rated general health. Setting aside the census self-rated health data we multiply imputed self-rated health responses for the census using the survey data and compared these with the actual census results in 576 unique groups defined by age, sex, housing tenure and geographic region.</p><p><strong>Results: </strong>Compared with original census data across the groups, multiply imputed proportions of bad or very bad self-rated health were not a markedly better fit than those simply derived from the survey proportions.</p><p><strong>Conclusion: </strong>While multiple imputation may have the potential to augment population data with information from surveys, further testing and refinement is required.</p>","PeriodicalId":39896,"journal":{"name":"Emerging Themes in Epidemiology","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2021-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s12982-021-00099-z","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emerging Themes in Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12982-021-00099-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 1

Abstract

Background: Health surveys provide a rich array of information but on relatively small numbers of individuals and evidence suggests that they are becoming less representative as response levels fall. Routinely collected administrative data offer more extensive population coverage but typically comprise fewer health topics. We explore whether data combination and multiple imputation of health variables from survey data is a simple and robust way of generating these variables in the general population.

Methods: We use the UK Integrated Household Survey and the English 2011 population census both of which included self-rated general health. Setting aside the census self-rated health data we multiply imputed self-rated health responses for the census using the survey data and compared these with the actual census results in 576 unique groups defined by age, sex, housing tenure and geographic region.

Results: Compared with original census data across the groups, multiply imputed proportions of bad or very bad self-rated health were not a markedly better fit than those simply derived from the survey proportions.

Conclusion: While multiple imputation may have the potential to augment population data with information from surveys, further testing and refinement is required.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在加强管理数据集方面,对调查数据进行标准的多重代入并不比简单的代入表现得更好:以英国的自评健康为例。
背景:健康调查提供了丰富的信息,但涉及的个人数量相对较少,有证据表明,随着应对水平的下降,这些调查的代表性正在下降。常规收集的行政数据提供了更广泛的人口覆盖范围,但通常包含较少的健康主题。我们探讨了从调查数据中对健康变量进行数据组合和多重代入是否是在一般人群中生成这些变量的一种简单而稳健的方法。方法:我们使用英国综合住户调查和英国2011年人口普查,这两项调查都包括自我评定的一般健康状况。撇开人口普查自评健康数据不谈,我们使用调查数据将人口普查的自评健康回答乘以,并将其与按年龄、性别、住房保有和地理区域定义的576个独特群体的实际人口普查结果进行比较。结果:与各群体的原始人口普查数据相比,多重估算的不良或非常不良自评健康的比例并不比单纯从调查比例中得出的比例明显更好。结论:虽然多重插值可能有潜力利用调查信息增加人口数据,但需要进一步的测试和改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Emerging Themes in Epidemiology
Emerging Themes in Epidemiology Medicine-Epidemiology
CiteScore
4.40
自引率
4.30%
发文量
9
审稿时长
28 weeks
期刊介绍: Emerging Themes in Epidemiology is an open access, peer-reviewed, online journal that aims to promote debate and discussion on practical and theoretical aspects of epidemiology. Combining statistical approaches with an understanding of the biology of disease, epidemiologists seek to elucidate the social, environmental and host factors related to adverse health outcomes. Although research findings from epidemiologic studies abound in traditional public health journals, little publication space is devoted to discussion of the practical and theoretical concepts that underpin them. Because of its immediate impact on public health, an openly accessible forum is needed in the field of epidemiology to foster such discussion.
期刊最新文献
Explaining biological differences between men and women by gendered mechanisms. Population cause of death estimation using verbal autopsy methods in large-scale field trials of maternal and child health: lessons learned from a 20-year research collaboration in Central Ghana. Dynamics of COVID-19 progression and the long-term influences of measures on pandemic outcomes. Effect size quantification for interrupted time series analysis: implementation in R and analysis for Covid-19 research. Geographical clustering and geographically weighted regression analysis of home delivery and its determinants in developing regions of Ethiopia: a spatial analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1