Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Bureau Survey and SSA Administrative Data

J. Abowd, Martha Harrison Stinson
{"title":"Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Bureau Survey and SSA Administrative Data","authors":"J. Abowd, Martha Harrison Stinson","doi":"10.2139/ssrn.1894690","DOIUrl":null,"url":null,"abstract":"We quantify sources of variation in annual job earnings data collected by the Survey of Income and Program Participation (SIPP) to determine how much of the variation is the result of measurement error. Jobs reported in the SIPP are linked to jobs reported in an administrative database, the Detailed Earnings Records (DER) drawn from the Social Security Administration’s Master Earnings File, a universe file of all earnings reported on W-2 tax forms. As a result of the match, each job potentially has two earnings observations per year: survey and administrative. Unlike previous validation studies, both of these earnings measures are viewed as noisy measures of some underlying true amount of annual earnings. While the existence of survey error resulting from respondent mistakes or misinterpretation is widely accepted, the idea that administrative data are also error-prone is new. Possible sources of employer reporting error, employee under-reporting of compensation such as tips, and general differences between how earnings may be reported on tax forms and in surveys, necessitates the discarding of the assumption that administrative data are a true measure of the quantity that the survey was designed to collect. In addition, errors in matching SIPP and DER jobs, a necessary task in any use of administrative data, also contribute to measurement error in both earnings variables. We begin by comparing SIPP and DER earnings for different demographic and education groups of SIPP respondents. We also calculate different measures of changes in earnings for individuals switching jobs. We estimate a standard earnings equation model using SIPP and DER earnings and compare the resulting coefficients. Finally exploiting the presence of individuals with multiple jobs and shared employers over time, we estimate an econometric model that includes random person and firm effects, a common error component shared by SIPP and DER earnings, and two independent error components that represent the variation unique to each earnings measure. We compare the variance components from this model and consider how the DER and SIPP differ across unobservable components.","PeriodicalId":92154,"journal":{"name":"U.S. Census Bureau Center for Economic Studies research paper series","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"46","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"U.S. Census Bureau Center for Economic Studies research paper series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.1894690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 46

Abstract

We quantify sources of variation in annual job earnings data collected by the Survey of Income and Program Participation (SIPP) to determine how much of the variation is the result of measurement error. Jobs reported in the SIPP are linked to jobs reported in an administrative database, the Detailed Earnings Records (DER) drawn from the Social Security Administration’s Master Earnings File, a universe file of all earnings reported on W-2 tax forms. As a result of the match, each job potentially has two earnings observations per year: survey and administrative. Unlike previous validation studies, both of these earnings measures are viewed as noisy measures of some underlying true amount of annual earnings. While the existence of survey error resulting from respondent mistakes or misinterpretation is widely accepted, the idea that administrative data are also error-prone is new. Possible sources of employer reporting error, employee under-reporting of compensation such as tips, and general differences between how earnings may be reported on tax forms and in surveys, necessitates the discarding of the assumption that administrative data are a true measure of the quantity that the survey was designed to collect. In addition, errors in matching SIPP and DER jobs, a necessary task in any use of administrative data, also contribute to measurement error in both earnings variables. We begin by comparing SIPP and DER earnings for different demographic and education groups of SIPP respondents. We also calculate different measures of changes in earnings for individuals switching jobs. We estimate a standard earnings equation model using SIPP and DER earnings and compare the resulting coefficients. Finally exploiting the presence of individuals with multiple jobs and shared employers over time, we estimate an econometric model that includes random person and firm effects, a common error component shared by SIPP and DER earnings, and two independent error components that represent the variation unique to each earnings measure. We compare the variance components from this model and consider how the DER and SIPP differ across unobservable components.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SIPP年度工作收入测量误差估计:人口普查局调查与SSA行政数据的比较
我们量化了收入和计划参与调查(SIPP)收集的年度工作收入数据的变化来源,以确定有多少变化是测量误差的结果。SIPP中报告的工作与行政数据库中报告的工作相关联,该数据库是来自社会保障局总收入档案的详细收入记录(DER),该档案是W-2税表上报告的所有收入的通用文件。由于这种匹配,每个工作每年都可能有两次收入观察:调查和行政。与之前的验证研究不同,这两种收益指标都被视为一些潜在真实年度收益的嘈杂指标。虽然由于应答者的错误或误解而导致的调查错误的存在被广泛接受,但行政数据也容易出错的想法是新的。雇主报告错误的可能来源,雇员少报小费等报酬,以及在报税表和调查中报告收入的方式之间的普遍差异,都需要抛弃行政数据是调查旨在收集的数量的真实衡量标准的假设。此外,匹配SIPP和DER工作的错误(任何使用行政数据的必要任务)也会导致两个收入变量的测量误差。我们首先比较SIPP调查对象中不同人口和教育群体的SIPP和DER收入。我们还计算了个人换工作时收入变化的不同衡量标准。我们使用SIPP和DER收益估计标准收益方程模型,并比较所得系数。最后,利用长期存在的多份工作和共同雇主的个体,我们估计了一个计量经济模型,该模型包括随机个人和企业效应、SIPP和DER收入共有的一个常见误差成分,以及代表每个收入测量独特变化的两个独立误差成分。我们比较了该模型的方差成分,并考虑了DER和SIPP在不可观测成分之间的差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Creditor Control Rights and Resource Allocation Within Firms Examining Multi-Level Correlates of Suicide by Merging NVDRS and ACS Data. Do Firms Mitigate or Magnify Capital Misallocation? Evidence from Plant-Level Data Going Entrepreneurial? IPOs and New Firm Creation Examining Multi-Level Correlates of Suicide by Merging NVDRS and ACS Data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1