An intersectional framework for counterfactual fairness in risk prediction.

IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Biostatistics Pub Date : 2024-07-01 DOI:10.1093/biostatistics/kxad021
Solvejg Wastvedt, Jared D Huling, Julian Wolfson
{"title":"An intersectional framework for counterfactual fairness in risk prediction.","authors":"Solvejg Wastvedt, Jared D Huling, Julian Wolfson","doi":"10.1093/biostatistics/kxad021","DOIUrl":null,"url":null,"abstract":"<p><p>Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing \"algorithmic fairness\" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value (\"u-value\"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":"702-717"},"PeriodicalIF":1.8000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biostatistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biostatistics/kxad021","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Along with the increasing availability of health data has come the rise of data-driven models to inform decision making and policy. These models have the potential to benefit both patients and health care providers but can also exacerbate health inequities. Existing "algorithmic fairness" methods for measuring and correcting model bias fall short of what is needed for health policy in two key ways. First, methods typically focus on a single grouping along which discrimination may occur rather than considering multiple, intersecting groups. Second, in clinical applications, risk prediction is typically used to guide treatment, creating distinct statistical issues that invalidate most existing techniques. We present novel unfairness metrics that address both challenges. We also develop a complete framework of estimation and inference tools for our metrics, including the unfairness value ("u-value"), used to determine the relative extremity of unfairness, and standard errors and confidence intervals employing an alternative to the standard bootstrap. We demonstrate application of our framework to a COVID-19 risk prediction model deployed in a major Midwestern health system.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
风险预测中反事实公平性的交叉框架。
随着健康数据的日益普及,为决策和政策提供信息的数据驱动模型也随之兴起。这些模型有可能使患者和医疗服务提供者受益,但也可能加剧健康不平等。现有的衡量和纠正模型偏差的 "算法公平性 "方法在两个关键方面无法满足卫生政策的需要。首先,这些方法通常只关注可能出现歧视的单一分组,而不是考虑多个交叉分组。其次,在临床应用中,风险预测通常用于指导治疗,这就产生了明显的统计问题,使大多数现有技术失效。我们提出了新的不公平度量方法来应对这两个挑战。我们还为我们的指标开发了一个完整的估算和推理工具框架,包括不公平值("u 值")(用于确定不公平的相对极值),以及标准误差和置信区间(采用标准自举法的替代方法)。我们展示了我们的框架在中西部一家大型医疗系统部署的 COVID-19 风险预测模型中的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biostatistics
Biostatistics 生物-数学与计算生物学
CiteScore
5.10
自引率
4.80%
发文量
45
审稿时长
6-12 weeks
期刊介绍: Among the important scientific developments of the 20th century is the explosive growth in statistical reasoning and methods for application to studies of human health. Examples include developments in likelihood methods for inference, epidemiologic statistics, clinical trials, survival analysis, and statistical genetics. Substantive problems in public health and biomedical research have fueled the development of statistical methods, which in turn have improved our ability to draw valid inferences from data. The objective of Biostatistics is to advance statistical science and its application to problems of human health and disease, with the ultimate goal of advancing the public''s health.
期刊最新文献
A semiparametric Gaussian mixture model for chest CT-based 3D blood vessel reconstruction. Simultaneous clustering and estimation of networks in multiple graphical models. A joint normal-ordinal (probit) model for ordinal and continuous longitudinal data. A modeling framework for detecting and leveraging node-level information in Bayesian network inference. A marginal structural model for normal tissue complication probability.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1