Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey.

D. D. Ingram, C. Moriarity, John F. O'Hare, Joan L. Turek
{"title":"Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey.","authors":"D. D. Ingram, C. Moriarity, John F. O'Hare, Joan L. Turek","doi":"10.1037/e414732008-001","DOIUrl":null,"url":null,"abstract":"OBJECTIVES Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. METHODS Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. RESULTS The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.","PeriodicalId":23577,"journal":{"name":"Vital and health statistics. Series 2, Data evaluation and methods research","volume":"144 1","pages":"1-50"},"PeriodicalIF":0.0000,"publicationDate":"2008-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vital and health statistics. Series 2, Data evaluation and methods research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1037/e414732008-001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 2

Abstract

OBJECTIVES Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. METHODS Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. RESULTS The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
1996年3月当期人口调查与1995年全国健康访问调查的统计匹配。
目的统计匹配是一种用于组合两个文件的方法,当一个文件上的个人不太可能也在另一个文件上时。本报告的目的是记录和评价1996年3月当前人口调查(CPS)和1995年全国健康访谈调查(NHIS)的统计匹配情况,并提出改进今后匹配情况的建议。CPS-NHIS匹配的动机是需要一套包含卫生措施和家庭资源数据的数据集,以便用于政策分析。方法对1996年3月全国人口统计系统与1995年全国人口统计系统进行了三次统计匹配。所有三个匹配都使用带有分区的人级约束匹配和预测平均匹配算法来链接两个文件上的记录。在其中的两个配对中,CPS作为宿主文件,NHIS作为供体文件;对于第三个匹配,NHIS是主机文件,CPS是供体文件。结果1996年3月CPS和1995年NHIS的约束预测平均匹配成功地结合了两个文件的部分信息,但匹配文件中某些Host和Donor变量之间的关系可能存在扭曲。对匹配的评估表明,在匹配之前用于划分宿主和供体文件的变量以及涉及预测平均匹配的变量在确定匹配文件上的变量之间的关系是否正确地表示总体中这些变量之间的关系方面发挥了重要作用。评估还表明,对小群体的估计可能特别容易出错。结果表明,在统计匹配的文件上探索宿主和供体变量之间的关系时,需要谨慎行事。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
13.20
自引率
0.00%
发文量
0
期刊介绍: Studies of new statistical methodology including experimental tests of new survey methods, studies of vital statistics collection methods, new analytical techniques, objective evaluations of reliability of collected data, and contributions to statistical theory. Studies also include comparison of U.S. methodology with those of other countries.
期刊最新文献
Calibration Weighting Methods for the National Center for Health Statistics Research and Development Survey. Assessing Linkage Eligibility Bias in the National Health Interview Survey. Assessing Linkage Eligibility Bias in the National Health Interview Survey. An Investigation of Nonresponse Bias and Survey Location Variability in the 2017-2018 National Health and Nutrition Examination Survey. National Health and Nutrition Examination Survey, 2015-2018: Sample Design and Estimation Procedures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1