Targeted learning with an undersmoothed LASSO propensity score model for large-scale covariate adjustment in health-care database studies.

IF 5 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH American journal of epidemiology Pub Date : 2024-11-04 DOI:10.1093/aje/kwae023
Richard Wyss, Mark van der Laan, Susan Gruber, Xu Shi, Hana Lee, Sarah K Dutcher, Jennifer C Nelson, Sengwee Toh, Massimiliano Russo, Shirley V Wang, Rishi J Desai, Kueiyu Joshua Lin
{"title":"Targeted learning with an undersmoothed LASSO propensity score model for large-scale covariate adjustment in health-care database studies.","authors":"Richard Wyss, Mark van der Laan, Susan Gruber, Xu Shi, Hana Lee, Sarah K Dutcher, Jennifer C Nelson, Sengwee Toh, Massimiliano Russo, Shirley V Wang, Rishi J Desai, Kueiyu Joshua Lin","doi":"10.1093/aje/kwae023","DOIUrl":null,"url":null,"abstract":"<p><p>Least absolute shrinkage and selection operator (LASSO) regression is widely used for large-scale propensity score (PS) estimation in health-care database studies. In these settings, previous work has shown that undersmoothing (overfitting) LASSO PS models can improve confounding control, but it can also cause problems of nonoverlap in covariate distributions. It remains unclear how to select the degree of undersmoothing when fitting large-scale LASSO PS models to improve confounding control while avoiding issues that can result from reduced covariate overlap. Here, we used simulations to evaluate the performance of using collaborative-controlled targeted learning to data-adaptively select the degree of undersmoothing when fitting large-scale PS models within both singly and doubly robust frameworks to reduce bias in causal estimators. Simulations showed that collaborative learning can data-adaptively select the degree of undersmoothing to reduce bias in estimated treatment effects. Results further showed that when fitting undersmoothed LASSO PS models, the use of cross-fitting was important for avoiding nonoverlap in covariate distributions and reducing bias in causal estimates.</p>","PeriodicalId":7472,"journal":{"name":"American journal of epidemiology","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11538566/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/aje/kwae023","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Least absolute shrinkage and selection operator (LASSO) regression is widely used for large-scale propensity score (PS) estimation in health-care database studies. In these settings, previous work has shown that undersmoothing (overfitting) LASSO PS models can improve confounding control, but it can also cause problems of nonoverlap in covariate distributions. It remains unclear how to select the degree of undersmoothing when fitting large-scale LASSO PS models to improve confounding control while avoiding issues that can result from reduced covariate overlap. Here, we used simulations to evaluate the performance of using collaborative-controlled targeted learning to data-adaptively select the degree of undersmoothing when fitting large-scale PS models within both singly and doubly robust frameworks to reduce bias in causal estimators. Simulations showed that collaborative learning can data-adaptively select the degree of undersmoothing to reduce bias in estimated treatment effects. Results further showed that when fitting undersmoothed LASSO PS models, the use of cross-fitting was important for avoiding nonoverlap in covariate distributions and reducing bias in causal estimates.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在医疗保健数据库研究中使用下平滑拉索倾向得分模型进行有针对性的学习,以进行大规模协方差调整。
在医疗数据库研究中,Lasso 回归被广泛用于大规模倾向得分(PS)估计。以往的研究表明,在这些情况下,下平滑(过度拟合)Lasso PS 模型可以改善混杂控制,但也会造成协变量分布不重叠的问题。目前仍不清楚在拟合大规模 Lasso PS 模型时如何选择下平滑的程度,以改善混杂控制,同时避免协变量重叠减少可能导致的问题。在此,我们利用模拟评估了在单稳健和双稳健框架内拟合大规模 PS 模型时,使用协作控制的目标学习来数据适应性地选择下平滑程度,以减少因果估计器偏差的性能。模拟结果表明,协作学习可以根据数据自适应地选择下平滑程度,从而减少估计治疗效果的偏差。结果进一步表明,在拟合下平滑 Lasso PS 模型时,使用交叉拟合对于避免协变量分布的非重叠和减少因果估计的偏差非常重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
American journal of epidemiology
American journal of epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
7.40
自引率
4.00%
发文量
221
审稿时长
3-6 weeks
期刊介绍: The American Journal of Epidemiology is the oldest and one of the premier epidemiologic journals devoted to the publication of empirical research findings, opinion pieces, and methodological developments in the field of epidemiologic research. It is a peer-reviewed journal aimed at both fellow epidemiologists and those who use epidemiologic data, including public health workers and clinicians.
期刊最新文献
Assessing trends in internalizing symptoms among racialized and minoritized adolescents: results from the Monitoring the Future Study 2005-2020. Targeted learning with an undersmoothed LASSO propensity score model for large-scale covariate adjustment in health-care database studies. DNA methylation as a possible mechanism linking childhood adversity and health: results from a 2-sample mendelian randomization study. Invited commentary: it's not all about residual confounding-a plea for quantitative bias analysis for epidemiologic researchers and educators. Validation of algorithms in studies based on routinely collected health data: general principles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1