减少相互作用估计中的模型错误指定和偏差

IF 4.7 2区社会学 Q1 POLITICAL SCIENCE Political Analysis Pub Date : 2021-07-23 DOI:10.1017/pan.2021.19

M. Blackwell, Michael Olson

{"title":"减少相互作用估计中的模型错误指定和偏差","authors":"M. Blackwell, Michael Olson","doi":"10.1017/pan.2021.19","DOIUrl":null,"url":null,"abstract":"Abstract Analyzing variation in treatment effects across subsets of the population is an important way for social scientists to evaluate theoretical arguments. A common strategy in assessing such treatment effect heterogeneity is to include a multiplicative interaction term between the treatment and a hypothesized effect modifier in a regression model. Unfortunately, this approach can result in biased inferences due to unmodeled interactions between the effect modifier and other covariates, and including these interactions can lead to unstable estimates due to overfitting. In this paper, we explore the usefulness of machine learning algorithms for stabilizing these estimates and show how many off-the-shelf adaptive methods lead to two forms of bias: direct and indirect regularization bias. To overcome these issues, we use a post-double selection approach that utilizes several lasso estimators to select the interactions to include in the final model. We extend this approach to estimate uncertainty for both interaction and marginal effects. Simulation evidence shows that this approach has better performance than competing methods, even when the number of covariates is large. We show in two empirical examples that the choice of method leads to dramatically different conclusions about effect heterogeneity.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"30 1","pages":"495 - 514"},"PeriodicalIF":4.7000,"publicationDate":"2021-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/pan.2021.19","citationCount":"21","resultStr":"{\"title\":\"Reducing Model Misspecification and Bias in the Estimation of Interactions\",\"authors\":\"M. Blackwell, Michael Olson\",\"doi\":\"10.1017/pan.2021.19\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Analyzing variation in treatment effects across subsets of the population is an important way for social scientists to evaluate theoretical arguments. A common strategy in assessing such treatment effect heterogeneity is to include a multiplicative interaction term between the treatment and a hypothesized effect modifier in a regression model. Unfortunately, this approach can result in biased inferences due to unmodeled interactions between the effect modifier and other covariates, and including these interactions can lead to unstable estimates due to overfitting. In this paper, we explore the usefulness of machine learning algorithms for stabilizing these estimates and show how many off-the-shelf adaptive methods lead to two forms of bias: direct and indirect regularization bias. To overcome these issues, we use a post-double selection approach that utilizes several lasso estimators to select the interactions to include in the final model. We extend this approach to estimate uncertainty for both interaction and marginal effects. Simulation evidence shows that this approach has better performance than competing methods, even when the number of covariates is large. We show in two empirical examples that the choice of method leads to dramatically different conclusions about effect heterogeneity.\",\"PeriodicalId\":48270,\"journal\":{\"name\":\"Political Analysis\",\"volume\":\"30 1\",\"pages\":\"495 - 514\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2021-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1017/pan.2021.19\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Political Analysis\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1017/pan.2021.19\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"POLITICAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Political Analysis","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/pan.2021.19","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}

引用次数: 21

摘要

摘要分析不同人群治疗效果的差异是社会科学家评估理论论点的重要途径。评估这种治疗效果异质性的一种常见策略是在回归模型中包括治疗和假设效果修饰因子之间的乘法相互作用项。不幸的是，由于效应修饰符和其他协变量之间未建模的相互作用，这种方法可能会导致有偏差的推断，并且包括这些相互作用可能会由于过拟合而导致不稳定的估计。在本文中，我们探讨了机器学习算法在稳定这些估计方面的有用性，并展示了有多少现成的自适应方法会导致两种形式的偏差：直接和间接正则化偏差。为了克服这些问题，我们使用了后双重选择方法，该方法利用几个套索估计器来选择要包含在最终模型中的交互作用。我们扩展了这种方法来估计相互作用和边际效应的不确定性。仿真证据表明，即使协变量数量很大，这种方法也比竞争方法具有更好的性能。我们在两个实证例子中表明，方法的选择导致了关于效应异质性的截然不同的结论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Reducing Model Misspecification and Bias in the Estimation of Interactions

Abstract Analyzing variation in treatment effects across subsets of the population is an important way for social scientists to evaluate theoretical arguments. A common strategy in assessing such treatment effect heterogeneity is to include a multiplicative interaction term between the treatment and a hypothesized effect modifier in a regression model. Unfortunately, this approach can result in biased inferences due to unmodeled interactions between the effect modifier and other covariates, and including these interactions can lead to unstable estimates due to overfitting. In this paper, we explore the usefulness of machine learning algorithms for stabilizing these estimates and show how many off-the-shelf adaptive methods lead to two forms of bias: direct and indirect regularization bias. To overcome these issues, we use a post-double selection approach that utilizes several lasso estimators to select the interactions to include in the final model. We extend this approach to estimate uncertainty for both interaction and marginal effects. Simulation evidence shows that this approach has better performance than competing methods, even when the number of covariates is large. We show in two empirical examples that the choice of method leads to dramatically different conclusions about effect heterogeneity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Political Analysis POLITICAL SCIENCE-

CiteScore

8.80

自引率

3.70%

发文量

期刊介绍： Political Analysis chronicles these exciting developments by publishing the most sophisticated scholarship in the field. It is the place to learn new methods, to find some of the best empirical scholarship, and to publish your best research.