Misclassification and Bias in Predictions of Individual Ethnicity from Administrative Records

IF 5.9 1区 社会学 Q1 POLITICAL SCIENCE American Political Science Review Pub Date : 2023-05-15 DOI:10.1017/s0003055423000229
Lisa P. Argyle, Michael J. Barber
{"title":"Misclassification and Bias in Predictions of Individual Ethnicity from Administrative Records","authors":"Lisa P. Argyle, Michael J. Barber","doi":"10.1017/s0003055423000229","DOIUrl":null,"url":null,"abstract":"We show that a common method of predicting individuals’ race in administrative records, Bayesian Improved Surname Geocoding (BISG), produces misclassification errors that are strongly correlated with demographic and socioeconomic factors. In addition to the high error rates for some racial subgroups, the misclassification rates are correlated with the political and economic characteristics of a voter’s neighborhood. Racial and ethnic minorities who live in wealthy, highly educated, and politically active areas are most likely to be misclassified as white by BISG. Inferences about the relationship between sociodemographic factors and political outcomes, like voting, are likely to be biased in models using BISG to infer race. We develop an improved method in which the BISG estimates are incorporated into a machine learning model that accounts for class imbalance and incorporates individual and neighborhood characteristics. Our model decreases the misclassification rates among non-white individuals, in some cases by as much as 50%.","PeriodicalId":48451,"journal":{"name":"American Political Science Review","volume":" ","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2023-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Political Science Review","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/s0003055423000229","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}
引用次数: 2

Abstract

We show that a common method of predicting individuals’ race in administrative records, Bayesian Improved Surname Geocoding (BISG), produces misclassification errors that are strongly correlated with demographic and socioeconomic factors. In addition to the high error rates for some racial subgroups, the misclassification rates are correlated with the political and economic characteristics of a voter’s neighborhood. Racial and ethnic minorities who live in wealthy, highly educated, and politically active areas are most likely to be misclassified as white by BISG. Inferences about the relationship between sociodemographic factors and political outcomes, like voting, are likely to be biased in models using BISG to infer race. We develop an improved method in which the BISG estimates are incorporated into a machine learning model that accounts for class imbalance and incorporates individual and neighborhood characteristics. Our model decreases the misclassification rates among non-white individuals, in some cases by as much as 50%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
行政记录中个人种族预测的错误分类和偏差
我们发现,在行政记录中预测个人种族的一种常见方法,贝叶斯改进姓氏地理编码(BISG),会产生与人口统计和社会经济因素密切相关的错误分类错误。除了一些种族亚组的高错误率外,错误分类率还与选民所在社区的政治和经济特征有关。生活在富裕、受过高等教育和政治活跃地区的种族和少数民族最有可能被BISG错误地归类为白人。在使用BISG推断种族的模型中,对社会人口因素和政治结果(如投票)之间关系的推断可能存在偏见。我们开发了一种改进的方法,在该方法中,BISG估计被纳入机器学习模型,该模型考虑了类不平衡,并纳入了个体和邻域特征。我们的模型降低了非白人个体的错误分类率,在某些情况下高达50%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.80
自引率
5.90%
发文量
119
期刊介绍: American Political Science Review is political science''s premier scholarly research journal, providing peer-reviewed articles and review essays from subfields throughout the discipline. Areas covered include political theory, American politics, public policy, public administration, comparative politics, and international relations. APSR has published continuously since 1906. American Political Science Review is sold ONLY as part of a joint subscription with Perspectives on Politics and PS: Political Science & Politics.
期刊最新文献
Survivorship analysis of CAD-CAM total shoulder replacement. Descriptive Representation and Party Building: Evidence from Municipal Governments in Brazil Domestic Distributional Roots of National Interest My History or Our History? Historical Revisionism and Entitlement to Lead Don’t Look Back in Anger: Cooperation Despite Conflicting Historical Narratives
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1